Nothing Special   »   [go: up one dir, main page]

CN108199901A - Hardware reports method, system, equipment, hardware management server and storage medium for repairment - Google Patents

Hardware reports method, system, equipment, hardware management server and storage medium for repairment Download PDF

Info

Publication number
CN108199901A
CN108199901A CN201810068181.XA CN201810068181A CN108199901A CN 108199901 A CN108199901 A CN 108199901A CN 201810068181 A CN201810068181 A CN 201810068181A CN 108199901 A CN108199901 A CN 108199901A
Authority
CN
China
Prior art keywords
server
hardware
error log
repairment
processing
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN201810068181.XA
Other languages
Chinese (zh)
Other versions
CN108199901B (en
Inventor
刘冰
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhengzhou Yunhai Information Technology Co Ltd
Original Assignee
Zhengzhou Yunhai Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhengzhou Yunhai Information Technology Co Ltd filed Critical Zhengzhou Yunhai Information Technology Co Ltd
Priority to CN201810068181.XA priority Critical patent/CN108199901B/en
Publication of CN108199901A publication Critical patent/CN108199901A/en
Application granted granted Critical
Publication of CN108199901B publication Critical patent/CN108199901B/en
Active legal-status Critical Current
Anticipated expiration legal-status Critical

Links

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06Management of faults, events, alarms or notifications
    • H04L41/069Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04LTRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00Network arrangements or protocols for supporting network services or applications
    • H04L67/01Protocols
    • H04L67/10Protocols in which an application is distributed across nodes in the network

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Debugging And Monitoring (AREA)

Abstract

This application discloses a kind of hardware to report method for repairment, applied to hardware management server, including:Obtain the SEL daily records of cluster server;SEL daily records are analyzed, judge whether error log occur;If so, obtain failed server corresponding with error log;Error log is sent to processing server so that processing server distributes error log to default processing terminal.Hardware management server automatically analyzes the SEL daily records of cluster server in the present invention, processing server is sent it to when finding error log, error log is assigned to default processing terminal by processing server, is preset processing terminal and is carried out corresponding measure to failed server.Hardware problem energy automatic repair-reporting in the present invention alleviates the work load of operation maintenance personnel, more efficient compared with artificial report for repairment, and the service efficiency of cluster server improves, and makes client cost lower.Disclosed herein as well is a kind of hardware repair reporting system, equipment, hardware service manager and readable storage medium storing program for executing.

Description

Hardware reports method, system, equipment, hardware management server and storage medium for repairment
Technical field
The present invention relates to equipment O&M field, more particularly to a kind of hardware reports method, system, equipment, hardware management clothes for repairment Business device and readable storage medium storing program for executing.
Background technology
It is well known that server is the core of whole network system and computing platform, with cloud computing and big data technology Fast development, the data center that the country is built is also more and more, and quantity exponentially other growth of server system is special It is not a large amount of buyings that cloud server system even more obtains the major Internet company in the whole world.Cloud server system is mainly by four big portions Part:Processor, memory, I/O equipment (storage devices such as including hard disk) composition, in addition also have other big and small various parts And component composition.There is mistake in one component of any of which, is likely to cause server system to delay and machine or restarts, particularly The possibility higher of system failure caused by above-mentioned four critical pieces.In face of the server of such vast number, such as What efficiently can quickly position and restore the server for hardware problem occur, which becomes server operation maintenance personnel and face A major challenge.
The X86 Cloud Servers management process of mainstream is server operation maintenance personnel based on personal experience oneself exploitation one at present Set is based on BMC (Baseboard Management Controller, onboard Management Controller) IPMI (Intelligent Platform Management Controller, Intelligent Platform Management Interface) agreement outband management software, the software is continuous Inspection Cloud Server BMC SEL daily records (System Event Log, System Event Log), find SEL daily records in it is wrong After daily record, alert notice operation maintenance personnel is generated.Operation maintenance personnel will check corresponding SEL error logs, if had in daily record bright It is all right (if problem component is replaceable component that as long as the problem of aobvious reason direction then replaces corresponding problem component Words).If not having the reason of clear and definite to be directed toward in SEL problem logs, operation maintenance personnel will be all BMC in failed server Syslog file in SEL daily records and OS is packaged, and is then sent to contact staff's requirement analysis problem of manufacturer server Reason.This transmission process may take up one day or even time a couple of days, less efficient.And the report of similar server problem Repair all be operation maintenance personnel artificial participation, increase the work load of operation maintenance personnel.
Invention content
In view of this, can method be reported for repairment the purpose of the present invention is to provide a kind of with the hardware of automatic repair-reporting, system, set Standby, hardware management server and readable storage medium storing program for executing.Its concrete scheme is as follows:
A kind of hardware reports method for repairment, applied to hardware management server, including:
Obtain the SEL daily records of cluster server;
The SEL daily records are analyzed, judge whether error log occur;
If so, obtain failed server corresponding with the error log in the cluster server;
The error log is sent to processing server so that the processing server by the error log distribute to Default processing terminal handles the failed server.
Preferably, the mistake for therefrom obtaining failed server corresponding with the error log in the cluster server Journey simultaneously, further includes:
The failed server is sent to O&M terminal, and wrong warning occurs.
Preferably, the hardware reports method for repairment and further includes:
The processing procedure to the failed server is sent to the O&M terminal.
Preferably, the process that the error log is sent to processing server, further includes:
The processing server will be sent to the relevant association daily record of the failed server in the SEL daily records, So that the processing server distributes the association daily record to the default processing terminal.
Preferably, it is described to be sent to processing with the relevant association daily record of the failed server in the SEL daily records Before the process of server, further include:
According to the error log and the failed server, the association daily record in the SEL daily records is obtained.
Preferably, the cluster server includes multiple Cloud Servers.
Correspondingly, the invention also discloses a kind of hardware repair reporting system, applied to hardware management server, including:
First acquisition module, for obtaining the SEL daily records of cluster server;
Judgment module for analyzing the SEL daily records, judges whether error log occur;If it is, triggering second obtains Modulus block;
Second acquisition module, for obtaining failed services corresponding with the error log in the cluster server Device;
Sending module, for the error log to be sent to processing server, so that the processing server is by described in Error log is distributed to default processing terminal.
Correspondingly, the invention also discloses a kind of hardware management server, including:
Memory, for storing computer program;
Processor, the step of realizing that hardware reports method for repairment as described above during for performing the computer program.
Correspondingly, the invention also discloses a kind of hardware to report equipment for repairment, including:
Hardware management server described above;
The error log that the hardware management server is sent is distributed to the processing server of default processing terminal.
Correspondingly, the invention also discloses a kind of readable storage medium storing program for executing, computer is stored on the readable storage medium storing program for executing Program realizes the step of hardware reports method for repairment as described above when the computer program is executed by processor.
The invention discloses a kind of hardware to report method for repairment, applied to hardware management server, including:Obtain cluster server SEL daily records;The SEL daily records are analyzed, judge whether error log occur;If so, obtain in the cluster server with The corresponding failed server of the error log;The error log is sent to processing server, so that the processing service Device distributes the error log to default processing terminal.Hardware management server is automatically to cluster server in the present invention SEL daily records are analyzed, and processing server is sent it to when finding error log, and processing server can be by error log point Default processing terminal is fitted on, default processing terminal corresponding failed server can carry out corresponding measure to error log.This hair Bright middle hardware problem can automatic repair-reporting, alleviate the work load of operation maintenance personnel, and more efficient compared with artificial report for repairment, The service efficiency of cluster server is substantially increased, makes client cost lower.
Description of the drawings
In order to illustrate more clearly about the embodiment of the present invention or technical scheme of the prior art, to embodiment or will show below There is attached drawing needed in technology description to be briefly described, it should be apparent that, the accompanying drawings in the following description is only this The embodiment of invention, for those of ordinary skill in the art, without creative efforts, can also basis The attached drawing of offer obtains other attached drawings.
Fig. 1 is the step flow chart that a kind of hardware reports method for repairment in the embodiment of the present invention;
Fig. 2 is a kind of structure distribution figure of hardware repair reporting system in the embodiment of the present invention.
Specific embodiment
Below in conjunction with the attached drawing in the embodiment of the present invention, the technical solution in the embodiment of the present invention is carried out clear, complete Site preparation describes, it is clear that described embodiment is only part of the embodiment of the present invention, instead of all the embodiments.It is based on Embodiment in the present invention, those of ordinary skill in the art are obtained every other without making creative work Embodiment shall fall within the protection scope of the present invention.
The embodiment of the invention discloses a kind of hardware to report method for repairment, applied to hardware management server (Hardware Management Server, HMS), it is shown in Figure 1, including:
S1:Obtain the SEL daily records of cluster server;
Wherein, cluster server includes multiple servers, such as Cloud Server etc..
It is understood that obtaining the frequency of SEL daily records can set, a certain period has both been can be set as, it can also be real When obtain and monitor.In view of the content of SEL daily records is more, it can also carry out having the acquisition of screening to include base according to preset condition The SEL daily records of plinth, important information.
S2:The SEL daily records are analyzed, judge whether error log occur;
Wherein, the operating status of cluster server has been recorded in SEL daily records, when wherein there is server failure, Mistake can also occur for corresponding SEL daily records, therefore SEL daily records are analyzed, once finding error log, that is, prove there is service Device breaks down, and operation maintenance personnel or manufacturer server is needed failed server is replaced or adjusted, general service Device failure includes two aspect of hardware and software, and hardware fault can replace faulty hardware, and software fault can change program etc..
S3:If so, obtain failed server corresponding with the error log in the cluster server;
If it is not, then continue to execute step S1 and S2.
S4:The error log is sent to processing server, so that the processing server divides the error log Default processing terminal is assigned to handle the failed server.
Wherein, processing server can carry out preliminary analysis error log and screening error log in assignment error daily record Type, then error log is distributed to corresponding default processing terminal, so as to quickly and efficiently solve failed services The problems in device.
The embodiment of the invention discloses a kind of hardware to report method for repairment, applied to hardware management server, including:Obtain cluster The SEL daily records of server;The SEL daily records are analyzed, judge whether error log occur;If so, obtain the cluster service Failed server corresponding with the error log in device;The error log is sent to processing server, so that the place Reason server distributes the error log to default processing terminal.Hardware management server is automatically to cluster service in the present invention The SEL daily records of device are analyzed, and send it to processing server when finding error log, processing server can be by wrong day Will is assigned to default processing terminal, and default processing terminal corresponding failed server can carry out corresponding measure to error log. In the present invention hardware problem can automatic repair-reporting, alleviate the work load of operation maintenance personnel, and the efficiency compared with artificial report for repairment Higher substantially increases the service efficiency of cluster server, makes client cost lower.
The embodiment of the invention discloses a kind of specific hardware to report method for repairment, relative to a upper embodiment, the present embodiment pair Technical solution has made further instruction and optimization.Specifically:
The error log described in step S3 is sent to the process of processing server, is further included:
The processing server will be sent to the relevant association daily record of the failed server in the SEL daily records, So that the processing server distributes the association daily record to the default processing terminal.
It is understood that since server system is interrelated, if SEL daily records do not have apparent questions and prospect to refer to To only being possible to not to be accurately judged to the failure cause of failed server according to error log, it is therefore desirable to will be taken with failure The relevant association daily record of business device is sent to processing server.
Correspondingly, described will be sent to processing in the SEL daily records with the relevant association daily record of the failed server Before the process of server, further include:
According to the error log and the failed server, the association daily record in the SEL daily records is obtained.
It is understood that association daily record here is may to imply event in ground SEL daily records corresponding with failed server Hinder the SEL daily records of server failure reason.Default processing terminal combination error log understands failed server with daily record is associated with Status information, more accurately judge so as to make, solve failure.
The embodiment of the invention discloses a kind of specific hardware to report method for repairment, relative to a upper embodiment, the present embodiment pair Technical solution has made further instruction and optimization.Specifically:
It is described in step s3 therefrom to obtain failed server corresponding with the error log in the cluster server Process simultaneously, further include:
The failed server is sent to O&M terminal, and wrong warning occurs.
It is understood that the operation maintenance personnel of O&M terminal is understood that the status information of failed server, at O&M end After termination receives warning, subsequent action can be carried out, such as closing fault server, restart failed services according to scheduled program Device preserves capsule information etc., as far as possible by the damage control that failed server breaks down in smaller range.
Further, the hardware reports method for repairment and can also include:
The processing procedure to the failed server is sent to the O&M terminal.
It is understood that processing server can record the processing procedure of failed server and its error log And O&M terminal is sent to, so that the operation maintenance personnel of O&M terminal is managed lookup.Specifically, processing server can pass through Hardware management server sends the processing progress of failed server to O&M terminal in real time, so as to the operation maintenance personnel energy of O&M terminal The action of enough arranged rational cluster servers.
Correspondingly, the embodiment of the invention also discloses a kind of hardware repair reporting system, applied to hardware management server, referring to Shown in Fig. 2, including:
First acquisition module 1, for obtaining the SEL daily records of cluster server;
Judgment module 2 for analyzing the SEL daily records, judges whether error log occur;If it is, triggering second Acquisition module 3;
Second acquisition module 3, for obtaining failure clothes corresponding with the error log in the cluster server Business device;
Sending module 4, for the error log to be sent to processing server, so that the processing server is by described in Error log is distributed to default processing terminal.
Hardware repair reporting system in the present embodiment has and the advantageous effect that report method for repairment identical of hardware in above-described embodiment.
Correspondingly, the embodiment of the invention also discloses a kind of hardware management server, including:
Memory, for storing computer program;
Processor realizes that the hardware as described in foregoing embodiments reports the step of method for repairment during for performing the computer program Suddenly.
Wherein, in the present embodiment the detail of hardware management server with reference to the hardware side of reporting for repairment related in above-described embodiment The description of method, is no longer repeated herein.
Hardware management server has and the advantageous effect that report method for repairment identical of hardware in above-described embodiment in the present embodiment.
Correspondingly, the invention also discloses a kind of hardware to report equipment for repairment, including:
Hardware management server described in foregoing embodiments;
The error log that the hardware management server is sent is distributed to the processing server of default processing terminal.
Wherein, in the present embodiment the detail of hardware management server with reference to the hardware side of reporting for repairment related in above-described embodiment The description of method, is no longer repeated herein.
Hardware reports equipment for repairment and has and the advantageous effect that report method for repairment identical of hardware in above-described embodiment in the present embodiment.
Correspondingly, the invention also discloses a kind of readable storage medium storing program for executing, computer is stored on the readable storage medium storing program for executing Program realizes the step of hardware reports method for repairment as described above when the computer program is executed by processor.
Wherein, the detail of readable storage medium storing program for executing reports method for repairment with reference to hardware related in above-described embodiment in the present embodiment Description, no longer repeated herein.
Readable storage medium storing program for executing has and the advantageous effect that report method for repairment identical of hardware in above-described embodiment in the present embodiment.
Finally, it is to be noted that, herein, relational terms such as first and second and the like be used merely to by One entity or operation are distinguished with another entity or operation, without necessarily requiring or implying these entities or operation Between there are any actual relationship or orders.Moreover, term " comprising ", "comprising" or its any other variant meaning Covering non-exclusive inclusion, so that process, method, article or equipment including a series of elements not only include that A little elements, but also including other elements that are not explicitly listed or further include for this process, method, article or The intrinsic element of equipment.In the absence of more restrictions, the element limited by sentence "including a ...", is not arranged Except also there are other identical elements in the process, method, article or apparatus that includes the element.
Method, system, equipment, hardware management server is reported for repairment to a kind of hardware provided by the present invention above to deposit with readable Storage media is described in detail, and specific case used herein is expounded the principle of the present invention and embodiment, The explanation of above example is only intended to facilitate the understanding of the method and its core concept of the invention;Meanwhile for the one of this field As technical staff, thought according to the present invention, there will be changes in specific embodiments and applications, to sum up institute It states, the content of the present specification should not be construed as limiting the invention.

Claims (10)

1. a kind of hardware reports method for repairment, which is characterized in that applied to hardware management server, including:
Obtain the SEL daily records of cluster server;
The SEL daily records are analyzed, judge whether error log occur;
If so, obtain failed server corresponding with the error log in the cluster server;
The error log is sent to processing server, so that the processing server distributes the error log to default Processing terminal handles the failed server.
2. hardware reports method for repairment according to claim 1, which is characterized in that it is described therefrom obtain in the cluster server with The process of the corresponding failed server of the error log simultaneously, further includes:
The failed server is sent to O&M terminal, and wrong warning occurs.
3. hardware reports method for repairment according to claim 2, which is characterized in that further includes:
The processing procedure to the failed server is sent to the O&M terminal.
4. hardware reports method for repairment according to claim 1, which is characterized in that described that the error log is sent to processing clothes The process of business device, further includes:
The processing server will be sent to the relevant association daily record of the failed server in the SEL daily records, so as to The processing server distributes the association daily record to the default processing terminal.
5. hardware reports method for repairment according to claim 4, which is characterized in that it is described by the SEL daily records with the failure The relevant association daily record of server is sent to before the process of processing server, is further included:
According to the error log and the failed server, the association daily record in the SEL daily records is obtained.
6. method is reported for repairment according to any one of claim 1 to 5 hardware, which is characterized in that the cluster server includes more A Cloud Server.
7. a kind of hardware repair reporting system, which is characterized in that applied to hardware management server, including:
First acquisition module, for obtaining the SEL daily records of cluster server;
Judgment module for analyzing the SEL daily records, judges whether error log occur;If it is, triggering second obtains mould Block;
Second acquisition module, for obtaining failed server corresponding with the error log in the cluster server;
Sending module, for the error log to be sent to processing server, so that the processing server is by the mistake Daily record is distributed to default processing terminal.
8. a kind of hardware management server, which is characterized in that including:
Memory, for storing computer program;
Processor, realizing that the hardware as described in any one of claim 1 to 6 reports method for repairment during for performing the computer program Step.
9. a kind of hardware reports equipment for repairment, which is characterized in that including:
Hardware management server as claimed in claim 8;
The error log that the hardware management server is sent is distributed to the processing server of default processing terminal.
10. a kind of readable storage medium storing program for executing, which is characterized in that computer program, the meter are stored on the readable storage medium storing program for executing The step of hardware as described in any one of claim 1 to 6 reports method for repairment is realized when calculation machine program is executed by processor.
CN201810068181.XA 2018-01-24 2018-01-24 Hardware repair reporting method, system, device, hardware management server and storage medium Active CN108199901B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201810068181.XA CN108199901B (en) 2018-01-24 2018-01-24 Hardware repair reporting method, system, device, hardware management server and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201810068181.XA CN108199901B (en) 2018-01-24 2018-01-24 Hardware repair reporting method, system, device, hardware management server and storage medium

Publications (2)

Publication Number Publication Date
CN108199901A true CN108199901A (en) 2018-06-22
CN108199901B CN108199901B (en) 2021-06-29

Family

ID=62590553

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201810068181.XA Active CN108199901B (en) 2018-01-24 2018-01-24 Hardware repair reporting method, system, device, hardware management server and storage medium

Country Status (1)

Country Link
CN (1) CN108199901B (en)

Cited By (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109086155A (en) * 2018-07-27 2018-12-25 郑州云海信息技术有限公司 Server failure localization method, device, equipment and computer readable storage medium
CN109144765A (en) * 2018-08-21 2019-01-04 平安科技(深圳)有限公司 Report form generation method, device, computer equipment and storage medium
CN110119325A (en) * 2019-05-10 2019-08-13 深圳前海微众银行股份有限公司 Server failure processing method, device, equipment and computer readable storage medium
CN112529223A (en) * 2020-12-24 2021-03-19 同盾控股有限公司 Equipment fault repair method and device, server and storage medium
CN112734052A (en) * 2019-10-15 2021-04-30 贵州白山云科技股份有限公司 Fault repair reporting method and system

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7188171B2 (en) * 2003-01-23 2007-03-06 Hewlett-Packard Development Company, L.P. Method and apparatus for software and hardware event monitoring and repair
CN103200050A (en) * 2013-04-12 2013-07-10 北京百度网讯科技有限公司 Server hardware state monitoring method and server hardware state monitoring system
CN104243216A (en) * 2014-09-28 2014-12-24 北京国双科技有限公司 Maintenance method and device of cluster server
CN104301136A (en) * 2014-09-11 2015-01-21 青岛海信电器股份有限公司 Method and equipment for reporting and processing fault information
CN105718354A (en) * 2016-01-20 2016-06-29 上海斐讯数据通信技术有限公司 Fault information reproducing method and device
CN107171873A (en) * 2017-07-21 2017-09-15 北京微影时代科技有限公司 A kind of method and apparatus of Message Processing

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US7188171B2 (en) * 2003-01-23 2007-03-06 Hewlett-Packard Development Company, L.P. Method and apparatus for software and hardware event monitoring and repair
CN103200050A (en) * 2013-04-12 2013-07-10 北京百度网讯科技有限公司 Server hardware state monitoring method and server hardware state monitoring system
CN104301136A (en) * 2014-09-11 2015-01-21 青岛海信电器股份有限公司 Method and equipment for reporting and processing fault information
CN104243216A (en) * 2014-09-28 2014-12-24 北京国双科技有限公司 Maintenance method and device of cluster server
CN105718354A (en) * 2016-01-20 2016-06-29 上海斐讯数据通信技术有限公司 Fault information reproducing method and device
CN107171873A (en) * 2017-07-21 2017-09-15 北京微影时代科技有限公司 A kind of method and apparatus of Message Processing

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN109086155A (en) * 2018-07-27 2018-12-25 郑州云海信息技术有限公司 Server failure localization method, device, equipment and computer readable storage medium
CN109144765A (en) * 2018-08-21 2019-01-04 平安科技(深圳)有限公司 Report form generation method, device, computer equipment and storage medium
CN109144765B (en) * 2018-08-21 2024-02-02 平安科技(深圳)有限公司 Report generation method, report generation device, computer equipment and storage medium
CN110119325A (en) * 2019-05-10 2019-08-13 深圳前海微众银行股份有限公司 Server failure processing method, device, equipment and computer readable storage medium
CN112734052A (en) * 2019-10-15 2021-04-30 贵州白山云科技股份有限公司 Fault repair reporting method and system
CN112734052B (en) * 2019-10-15 2024-01-30 贵州白山云科技股份有限公司 Fault repairing method and system
CN112529223A (en) * 2020-12-24 2021-03-19 同盾控股有限公司 Equipment fault repair method and device, server and storage medium

Also Published As

Publication number Publication date
CN108199901B (en) 2021-06-29

Similar Documents

Publication Publication Date Title
KR101513408B1 (en) Providing dynamic reliability and security in communications environments
CN108199901A (en) Hardware reports method, system, equipment, hardware management server and storage medium for repairment
US9311160B2 (en) Elastic cloud networking
US7574502B2 (en) Early warning of potential service level agreement violations
EP3311529B1 (en) Resilience as a service
Cotroneo et al. NFV-bench: A dependability benchmark for network function virtualization systems
CN107612787B (en) Cloud host fault detection method based on Openstack open source cloud platform
US11012461B2 (en) Network device vulnerability prediction
US20100318836A1 (en) Monitoring and healing a computing system
CN107239383A (en) A kind of failure monitoring method and device of OpenStack virtual machines
WO2013126300A1 (en) Method and apparatus for automatic migration of application service
CN110851320A (en) Server downtime supervision method, system, terminal and storage medium
CN103716173A (en) Storage monitoring system and monitoring alarm issuing method
WO2017080161A1 (en) Alarm information processing method and device in cloud computing
CN110134518A (en) A kind of method and system improving big data cluster multinode high application availability
De Carvalho et al. A cloud monitoring framework for self-configured monitoring slices based on multiple tools
CN108390907B (en) Management monitoring system and method based on Hadoop cluster
CN109947585A (en) The processing method and processing device of PCIE device failure
CN102902615A (en) Failure alarm method and system for Lustre parallel file system
US20170199800A1 (en) System and method for comprehensive performance and availability tracking using passive monitoring and intelligent synthetic transaction generation in a transaction processing system
US20150012647A1 (en) Router-based end-user performance monitoring
CN106789158A (en) Damage identification method and system are insured in a kind of cloud service
Huang et al. PDA: A Tool for Automated Problem Determination.
CN110389892A (en) A kind of fault filling method based on cloud platform historical failure data
CA3144664A1 (en) Determining problem dependencies in application dependency discovery, reporting, and management tool

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
GR01 Patent grant
GR01 Patent grant