CN107395379A - A kind of cluster cruising inspection system and method - Google Patents

Info

Publication number
CN107395379A
Authority
CN
China
Prior art keywords
node
analysis result
cluster
inspection
result
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201610320492.1A
Other languages
Chinese (zh)
Inventor
徐新坤
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Original Assignee
Beijing Jingdong Century Trading Co Ltd
Beijing Jingdong Shangke Information Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Jingdong Century Trading Co Ltd, Beijing Jingdong Shangke Information Technology Co Ltd
Priority to CN201610320492.1A
Publication of CN107395379A
Legal status: Pending

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0604 Management of faults, events, alarms or notifications using filtering, e.g. reduction of information by using priority, element types, position or time
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/069 Management of faults, events, alarms or notifications using logs of notifications; Post-processing of notifications

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Information Transfer Between Computers (AREA)

Abstract

The invention provides a cluster inspection system and method. The system comprises a center module, inspection execution modules, and a processing module, which work together with an information template to inspect a cluster. The center module distributes inspection tasks to multiple inspection execution modules, and each inspection execution module sends its task to the nodes connected to it for execution; this distributed design improves inspection efficiency. In addition, the inspection content is defined in the information template, so adding or changing inspection content only requires modifying the template for the change to take effect, which avoids the problems caused by upgrading every time. Furthermore, the processing module can notify users of analysis results by log, email, SMS, and similar means, and can also repair abnormal nodes automatically according to the analysis results.

Description

Cluster inspection system and method
Technical field
The invention belongs to the field of computer technology, and in particular relates to a cluster inspection system and method for inspecting multiple nodes in a cluster.
Background
A cluster is a group of mutually independent computers interconnected by a high-speed network; each computer in the cluster is called a node. For large clusters, such as server clusters, container clusters, or virtual machine clusters, the service state, resource utilization, logs, and so on of the nodes need to be checked regularly. Depending on the inspection results, the relevant personnel are notified or automated repair is carried out.
An existing cluster inspection system generally comprises a central server and agents. An agent program is installed on each server (node) to be checked; it periodically checks its own server according to a predetermined flow and reports the inspection result to the central server. This approach has two main problems:
1. When the cluster grows, all results must be reported to the central server, so inspection efficiency is limited by the central server.
2. When a check item is added, the agent programs on all nodes must be upgraded, and the upgrade cost is high.
Summary of the invention
(1) Technical problem to be solved
The object of the invention is to provide a cluster inspection system and method that can be extended easily as the cluster grows, so that inspection efficiency is maintained.
(2) Technical solution
The invention provides a cluster inspection system for inspecting multiple nodes in a cluster. The system comprises:
A center module, configured to determine the nodes to be inspected in the cluster, generate inspection tasks, and distribute the inspection tasks to multiple inspection execution modules, wherein each inspection execution module that is assigned a task is connected to at least one node;
An inspection execution module, configured to send the assigned inspection task to the nodes connected to it, drive those nodes to execute the task, obtain each node's execution result (which reflects the node's working state, such as service state and resource utilization), and then analyze the execution result to obtain an analysis result;
A processing module, configured to obtain the analysis result from the inspection execution module and process it.
Further, the system also includes an information template that stores the node addresses of the nodes; the center module determines the nodes to be inspected in the cluster according to the node addresses.
Further, the information template also stores driver information for the nodes; the inspection execution module drives a node to execute the inspection task according to the driver information.
Further, the information template also includes multiple alarm conditions with different alarm levels; the inspection execution module determines which alarm condition the execution result matches and generates an analysis result of the corresponding alarm level.
Further, the information template also includes a filter condition; the processing module screens the analysis results of different alarm levels according to the filter condition.
Further, the processing module processes the analysis result in one of the following ways:
Recording the analysis result in a log file;
Sending the analysis result by SMS;
Sending the analysis result by email.
Further, the processing module also repairs the abnormal nodes indicated in the analysis result according to processing mode information.
The invention also provides a cluster inspection method that uses the above cluster inspection system. The method comprises:
S1, determining the nodes to be inspected in the cluster and generating inspection tasks;
S2, sending the assigned inspection task to the nodes to be inspected, driving the nodes to execute the inspection task, obtaining the execution results of the nodes, and then analyzing the execution results to obtain analysis results;
S3, obtaining the analysis results and processing them.
Further, step S1 includes: determining the nodes to be inspected in the cluster according to node addresses, wherein the node addresses are stored in an information template.
Further, the information template also includes driver information for the nodes; in step S2, the nodes are driven to execute the inspection task according to the driver information.
Further, the information template also includes multiple alarm conditions with different alarm levels; analyzing the execution result in step S2 to obtain an analysis result includes: determining which alarm condition the execution result matches and generating an analysis result of the corresponding alarm level.
Further, the information template also includes a filter condition; step S3 also includes: screening the analysis results of different alarm levels according to the filter condition.
Further, in step S3, processing the analysis result includes, but is not limited to, one of the following ways:
Recording the analysis result in a log file;
Sending the analysis result by SMS;
Sending the analysis result by email.
Further, step S3 also includes: repairing the abnormal nodes indicated in the analysis result according to processing mode information.
(3) Beneficial effects
The invention has the following advantages:
(1) The center module distributes inspection tasks to multiple inspection execution modules, and each inspection execution module sends its task to the nodes connected to it for execution. This distributed inspection system improves inspection efficiency, removes the central-server bottleneck of conventional inspection systems, and is easy to scale horizontally.
(2) An information template is used, so the inspection content is defined by a template. When inspection content needs to be added or changed, only the template has to be modified for the change to take effect, which avoids the problems caused by upgrading every time.
(3) The processing module can notify users of analysis results by log, email, SMS, and similar means, and can also repair abnormal nodes automatically according to the analysis results, for example over HTTP or RPC.
Brief description of the drawings
Fig. 1 is a schematic diagram of the cluster inspection system provided by the invention.
Fig. 2 is a workflow diagram of the center module Core of the invention.
Fig. 3 is a workflow diagram of the inspection execution module Master of the invention.
Fig. 4 is a flow chart of how the inspection execution module Master analyzes an execution result in the invention.
Fig. 5 is a workflow diagram of the processing module Notifier of the invention.
Detailed description
To make the object, technical solution, and advantages of the invention clearer, the invention is described in more detail below with reference to specific embodiments and the accompanying drawings.
Fig. 1 is a schematic diagram of the cluster inspection system provided by the invention. As shown in Fig. 1, the system comprises a center module Core, inspection execution modules Master, and a processing module Notifier. The center module Core determines the nodes (node) to be inspected in the cluster, generates inspection tasks, and distributes them to multiple inspection execution modules Master. Each Master that is assigned a task is connected to multiple nodes. Note that, depending on actual needs, the total number of nodes to be inspected may be smaller than the total number of Masters, in which case some Masters are not assigned a task; these Masters do no work and are not connected to any node. An inspection execution module Master sends its assigned inspection task to the nodes connected to it, drives those nodes to execute the task, obtains their execution results, and then analyzes the execution results to obtain analysis results. The processing module Notifier obtains the analysis results from the inspection execution modules Master and processes them.
Besides the three functional modules above, the cluster inspection system also includes an information template, which the three modules use together to carry out cluster inspection. Specifically, the information template contains the following information:
1. Node addresses: nodes
nodes contains the address information of the nodes to be inspected, such as IP addresses. The center module reads the node addresses from the information template to determine the nodes to be inspected in the cluster.
2. Driver information: driver
The inspection execution module Master drives a node to execute the inspection task according to driver. The driver information includes the name of the driver, the parameters of the driver, and so on. Many kinds of driver are possible: for server inspection, an ssh driver can be used; for inspecting a specific service, such as a mysql service, a mysql driver can be used. Different drivers are developed for different inspection targets, so that users can configure and select them in the template.
3. Checkpoint list: checkpoints
checkpoints contains multiple checkpoints. Each checkpoint includes the fields module, name, desc, args, type, info, warn, alarm, and so on.
module indicates the module used once connected to the node. Different modules can be used for different tasks: for example, when the task is performed by running a command, the shell module is used; when a script needs to be run, the script module is used.
name is the name of the checkpoint;
desc is a more detailed description of the checkpoint;
args holds the parameters passed to module. For example, when the ssh module is used, args is the command to be executed over ssh;
type is the type of the returned result, such as integer, string, or floating-point number;
info/warn/alarm are user-defined alarm conditions, each with its own alarm level. The execution result of a node reflects the node's working state, such as service state and resource utilization. The worse the working state of the node, the higher the corresponding alarm level; the priority of the alarm levels is alarm > warn > info. The inspection execution module Master determines which alarm condition the execution result matches and generates an analysis result of the corresponding alarm level.
4. Filter condition filter and processing mode information type
The processing module Notifier screens the analysis results of different alarm levels according to filter. This makes it possible to mask analysis results of low alarm levels so that users can focus on analysis results of high alarm levels. The processing module Notifier also processes the selected analysis results according to the processing mode information type, for example by recording them in a log file, sending them by SMS, or sending them by email. For example, when filter in the information template is set to "alarm", Notifier selects the analysis results whose alarm level is "alarm"; if type in the information template is "send analysis results of alarm level by email", Notifier then sends the selected "alarm"-level analysis results to the user by email. Notifier also repairs the abnormal nodes indicated in the analysis results according to the processing mode information, for example when the processing mode information is set to "when an abnormal node is found, repair the node automatically by calling an interface over http".
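The four parts of the information template described above can be pictured as one small configuration object. The following is a minimal sketch in Python, not the patent's actual format: the field names (nodes, driver, checkpoints, module, name, desc, args, type, info, warn, alarm, filter) come from the description, while every value, the numeric thresholds used as alarm conditions, and the helper validate_template are assumptions made for illustration.

```python
# Hypothetical information template. Field names follow the description;
# all values (addresses, commands, thresholds) are invented for illustration.
TEMPLATE = {
    "nodes": ["10.0.0.11", "10.0.0.12"],
    "driver": {"name": "ssh", "params": {"port": 22, "user": "inspector"}},
    "checkpoints": [
        {
            "module": "shell",
            "name": "root_disk_usage",
            "desc": "Percentage of the root filesystem in use",
            "args": "df --output=pcent / | tail -n 1 | tr -dc '0-9'",
            "type": "int",
            # Assumption: alarm conditions are expressed as lower bounds.
            "info": 50,
            "warn": 75,
            "alarm": 90,
        }
    ],
    "filter": "alarm",  # which alarm level the Notifier keeps
    "type": "mail",     # processing mode for the kept results
}

TOP_LEVEL_FIELDS = ("nodes", "driver", "checkpoints", "filter", "type")
CHECKPOINT_FIELDS = ("module", "name", "desc", "args", "type",
                     "info", "warn", "alarm")


def validate_template(template):
    """Return a list of human-readable problems; an empty list means OK."""
    errors = []
    for key in TOP_LEVEL_FIELDS:
        if key not in template:
            errors.append(f"missing top-level field: {key}")
    for index, checkpoint in enumerate(template.get("checkpoints", [])):
        for field in CHECKPOINT_FIELDS:
            if field not in checkpoint:
                errors.append(f"checkpoint {index}: missing field {field}")
    return errors
```

Because all inspection content lives in this one object, adding a check item in such a design means appending one entry to checkpoints rather than touching the nodes.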
Fig. 2 is a workflow diagram of the center module Core of the invention. As shown in Fig. 2, the center module Core accesses the information template and determines the nodes to be inspected according to the node addresses nodes stored in it. Because each node is connected to a corresponding inspection execution module Master, Core can thereby determine which Masters take part in the inspection. It generates a matching number of inspection tasks according to the number of those Masters and sends each task to the corresponding Master.
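The Core workflow can be sketched as a grouping step: collect the nodes to inspect, look up the Master each node is connected to, and emit one task per participating Master. This is only an illustrative sketch; the function build_tasks, the node_to_master mapping, and the task dictionary layout are hypothetical and not taken from the patent.

```python
from collections import defaultdict


def build_tasks(node_addresses, node_to_master):
    """Group the nodes to inspect by their Master and build one task each.

    node_to_master is an assumed lookup table from node address to the
    identifier of the Master that the node is connected to.
    """
    grouped = defaultdict(list)
    for address in node_addresses:
        grouped[node_to_master[address]].append(address)
    # Masters with no node to inspect never appear in grouped, so they
    # receive no task, matching the idle Masters in the description.
    return [{"master": master, "nodes": nodes}
            for master, nodes in sorted(grouped.items())]
```

For example, with three nodes spread over two Masters and a third Master with no nodes to inspect, build_tasks yields two tasks and the third Master stays idle.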
Fig. 3 is a workflow diagram of the inspection execution module Master of the invention. As shown in Fig. 3, after the inspection execution module Master obtains the inspection task distributed by the center module Core, it accesses the information template, connects to the node according to the driver information driver stored in the template, and drives the node to execute the inspection task according to module: if module is configured as shell, a shell command is executed on the node; if it is configured as script, the script is transferred to the node and executed there. The Master obtains the node's execution result, analyzes it to obtain an analysis result, and then sends the analysis result to the processing module Notifier.
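The dispatch on module can be sketched as follows. This is a minimal illustration under stated assumptions: the driver interface with run_command and upload_and_run is hypothetical, and RecordingDriver is a stand-in that only records calls instead of opening a real ssh connection.

```python
class RecordingDriver:
    """Stand-in for a real driver (e.g. ssh); it contacts no node."""

    def __init__(self):
        self.calls = []

    def run_command(self, command):
        # A real driver would execute the command on the node.
        self.calls.append(("run_command", command))
        return "0"

    def upload_and_run(self, script_path):
        # A real driver would copy the script to the node, then run it.
        self.calls.append(("upload_and_run", script_path))
        return "0"


def execute_checkpoint(driver, checkpoint):
    """Run one checkpoint through the driver according to its module."""
    module = checkpoint["module"]
    if module == "shell":
        return driver.run_command(checkpoint["args"])
    if module == "script":
        return driver.upload_and_run(checkpoint["args"])
    raise ValueError(f"unsupported module: {module}")
```

Keeping the driver behind a small interface like this is one way to read the statement that new drivers can be developed for new inspection targets without changing the Master's control flow.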
Fig. 4 is a flow chart of how the inspection execution module Master analyzes an execution result in the invention. As shown in Fig. 4, after the Master obtains the execution result from the node, it judges whether the execution result is valid, that is, whether it was able to connect to the node normally, execute the task, and obtain a returned result. If the execution result is invalid, it generates an analysis result indicating that the execution result could not be obtained. If the execution result is valid, it generates an analysis result of the appropriate alarm level according to the alarm conditions in the information template. As shown in Fig. 4, the conditions are checked in order of alarm level from high to low. It first judges whether the execution result meets the alarm condition; if so, it generates an analysis result of alarm level. If not, it judges whether the execution result meets the warn condition; if so, it generates an analysis result of warn level. If not, it judges whether the execution result meets the info condition; if so, it generates an analysis result of info level. If none of the conditions is met, the execution result is discarded and an empty analysis result is generated.
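The decision flow of Fig. 4 amounts to testing the alarm conditions from highest to lowest level and stopping at the first match. The sketch below assumes that each alarm condition is a Python predicate and that an invalid execution result is represented by None; the level name "failed" and the result dictionary are likewise hypothetical.

```python
LEVELS = ("alarm", "warn", "info")  # checked from highest to lowest


def analyze(execution_result, conditions):
    """Classify one execution result.

    conditions maps a level name to a predicate over the result.
    Returns a dict for a match or a failure, and None for the
    "empty analysis result" case where no condition is met.
    """
    if execution_result is None:
        # Could not connect, execute, or obtain a returned result.
        return {"level": "failed", "value": None}
    for level in LEVELS:
        predicate = conditions.get(level)
        if predicate is not None and predicate(execution_result):
            return {"level": level, "value": execution_result}
    return None
```

Checking from the highest level down matters when conditions overlap: a disk usage of 95 satisfies "at least 50" as well as "at least 90", and the ordering ensures it is reported at alarm level rather than info.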
Fig. 5 is a workflow diagram of the processing module Notifier of the invention. As shown in Fig. 5, after the processing module Notifier obtains the analysis results, it accesses the information template and screens the analysis results of different alarm levels according to the filter condition filter in the template. Notifier then processes the selected analysis results according to the processing mode information type, for example by recording them in a log file, sending them by SMS, sending them by email, or repairing the abnormal nodes indicated in the analysis results by calling an interface over http.
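The Notifier step can be sketched as a filter followed by a handler lookup. In this illustration the exact-match filtering on one alarm level follows the "alarm" example in the description, while the function notify, the handlers registry, and the result dictionaries are assumptions; real handlers would write a log, send an SMS or email, or call a repair interface.

```python
def notify(analysis_results, filter_level, handlers, mode):
    """Keep the results at filter_level and pass each to the chosen handler.

    handlers maps a processing mode name (e.g. "log", "mail") to a
    callable; both the registry and the mode names are hypothetical.
    """
    selected = [result for result in analysis_results
                if result is not None and result["level"] == filter_level]
    handler = handlers[mode]
    for result in selected:
        handler(result)
    return selected
```

Registering handlers by name keeps the set of notification methods open: supporting a new method means adding one entry to the registry.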
In summary, the distributed inspection system provided by the invention improves inspection efficiency. Moreover, the inspection content is configured through the information template, so no agent program has to be installed on the nodes, which avoids the problems caused by upgrading every time. The inspection system supports the ssh driver and can be extended by developing new drivers. The processing module supports log, email, SMS, http, and similar methods, and its notification methods can be extended by developing new plug-ins.
The specific embodiments described above further explain the object, technical solution, and beneficial effects of the invention in detail. It should be understood that the foregoing is only a specific embodiment of the invention and is not intended to limit it; any modification, equivalent substitution, or improvement made within the spirit and principles of the invention shall fall within the scope of protection of the invention.

Claims (14)

  1. A cluster inspection system for inspecting multiple nodes in a cluster, characterized in that the system comprises:
    a center module, configured to determine the nodes to be inspected in the cluster, generate inspection tasks, and distribute the inspection tasks to multiple inspection execution modules, wherein each inspection execution module that is assigned a task is connected to at least one node;
    an inspection execution module, configured to send the assigned inspection task to the nodes connected to it, drive the nodes to execute the inspection task, obtain the execution results of the nodes, and then analyze the execution results to obtain an analysis result;
    a processing module, configured to obtain the analysis result from the inspection execution module and process the analysis result.
  2. The cluster inspection system according to claim 1, characterized in that it further comprises an information template, the information template includes the node addresses of the nodes, and the center module determines the nodes to be inspected in the cluster according to the node addresses.
  3. The cluster inspection system according to claim 2, characterized in that the information template further includes driver information for the nodes, and the inspection execution module drives the nodes to execute the inspection task according to the driver information.
  4. The cluster inspection system according to claim 2, characterized in that the information template further includes multiple alarm conditions, the multiple alarm conditions have different alarm levels, and the inspection execution module determines which alarm condition the execution result matches and generates an analysis result of the corresponding alarm level.
  5. The cluster inspection system according to claim 4, characterized in that the information template further includes a filter condition, and the processing module screens the analysis results of different alarm levels according to the filter condition.
  6. The cluster inspection system according to claim 5, characterized in that the information template further includes processing mode information, and the processing module processes the analysis result according to the processing mode information in one of the following ways:
    recording the analysis result in a log file;
    sending the analysis result by SMS;
    sending the analysis result by email.
  7. The cluster inspection system according to claim 1, characterized in that the processing module further repairs the abnormal nodes indicated in the analysis result according to processing mode information.
  8. A cluster inspection method, applied to the cluster inspection system of any one of claims 1-6, characterized in that the method comprises:
    S1, determining the nodes to be inspected in the cluster and generating inspection tasks;
    S2, sending the assigned inspection task to the nodes to be inspected, driving the nodes to execute the inspection task, obtaining the execution results of the nodes, and then analyzing the execution results to obtain an analysis result;
    S3, obtaining the analysis result and processing the analysis result.
  9. The cluster inspection method according to claim 8, characterized in that step S1 includes: determining the nodes to be inspected in the cluster according to node addresses, wherein the node addresses are stored in an information template.
  10. The cluster inspection method according to claim 9, characterized in that the information template further includes driver information for the nodes, and in step S2 the nodes are driven to execute the inspection task according to the driver information.
  11. The cluster inspection method according to claim 9, characterized in that the information template further includes multiple alarm conditions, the multiple alarm conditions have different alarm levels, and analyzing the execution result in step S2 to obtain an analysis result includes: determining which alarm condition the execution result matches and generating an analysis result of the corresponding alarm level.
  12. The cluster inspection method according to claim 11, characterized in that the information template further includes a filter condition, and step S3 further includes: screening the analysis results of different alarm levels according to the filter condition.
  13. The cluster inspection method according to claim 12, characterized in that the information template further includes processing mode information, and in step S3 the analysis result is processed according to the processing mode information in one of the following ways:
    recording the analysis result in a log file;
    sending the analysis result by SMS;
    sending the analysis result by email.
  14. The cluster inspection method according to claim 8, characterized in that step S3 further includes: repairing the abnormal nodes indicated in the analysis result according to processing mode information.
CN201610320492.1A 2016-05-16 2016-05-16 A kind of cluster cruising inspection system and method Pending CN107395379A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201610320492.1A CN107395379A (en) 2016-05-16 2016-05-16 A kind of cluster cruising inspection system and method

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201610320492.1A CN107395379A (en) 2016-05-16 2016-05-16 A kind of cluster cruising inspection system and method

Publications (1)

Publication Number Publication Date
CN107395379A true CN107395379A (en) 2017-11-24

Family

ID=60338649

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201610320492.1A Pending CN107395379A (en) 2016-05-16 2016-05-16 A kind of cluster cruising inspection system and method

Country Status (1)

Country Link
CN (1) CN107395379A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108959037A (en) * 2018-07-13 2018-12-07 山东汇贸电子口岸有限公司 A kind of data center's automatic detecting method and device
CN111722951A (en) * 2019-03-21 2020-09-29 北京京东尚科信息技术有限公司 Exception handling method and device and storage medium
CN112000539A (en) * 2020-07-17 2020-11-27 新华三大数据技术有限公司 Inspection method and device
CN112506612A (en) * 2020-12-10 2021-03-16 北京浪潮数据技术有限公司 Cluster inspection method, device and equipment and readable storage medium
CN113472577A (en) * 2021-06-30 2021-10-01 济南浪潮数据技术有限公司 Cluster inspection method, device and system
CN113507397A (en) * 2021-07-06 2021-10-15 北京容联七陌科技有限公司 Method for collecting terminal equipment state automatic inspection based on cloud operation and maintenance
CN114064289A (en) * 2021-11-24 2022-02-18 云知声智能科技股份有限公司 Cluster management system
CN117076185A (en) * 2023-10-16 2023-11-17 太平金融科技服务(上海)有限公司 Server inspection method, device, equipment and medium

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN1913452A (en) * 2006-08-18 2007-02-14 亿阳信通股份有限公司 Method and equipment of intelligent patrol detection for communication network
CN101043692A (en) * 2007-04-30 2007-09-26 华为技术有限公司 Patrol checking method and patrol checking server
CN102075358A (en) * 2010-12-31 2011-05-25 网宿科技股份有限公司 System and method for distributing and deploying content of large-scale server cluster
CN103200050A (en) * 2013-04-12 2013-07-10 北京百度网讯科技有限公司 Server hardware state monitoring method and server hardware state monitoring system
CN103220354A (en) * 2013-04-18 2013-07-24 广东宜通世纪科技股份有限公司 Method for achieving load balancing of server cluster
CN103227839A (en) * 2013-05-10 2013-07-31 网宿科技股份有限公司 Management system for regional autonomy of content distribution network server
CN103491165A (en) * 2013-09-22 2014-01-01 复旦大学 General distributed crawler system capable of automatically detecting shielding
CN103716182A (en) * 2013-12-12 2014-04-09 中国科学院信息工程研究所 Failure detection and fault tolerance method and failure detection and fault tolerance system for real-time cloud platform

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108959037A (en) * 2018-07-13 2018-12-07 山东汇贸电子口岸有限公司 A kind of data center's automatic detecting method and device
CN111722951A (en) * 2019-03-21 2020-09-29 北京京东尚科信息技术有限公司 Exception handling method and device and storage medium
CN111722951B (en) * 2019-03-21 2023-11-03 北京京东振世信息技术有限公司 Exception handling method and device and storage medium
CN112000539A (en) * 2020-07-17 2020-11-27 新华三大数据技术有限公司 Inspection method and device
CN112506612A (en) * 2020-12-10 2021-03-16 北京浪潮数据技术有限公司 Cluster inspection method, device and equipment and readable storage medium
CN113472577A (en) * 2021-06-30 2021-10-01 济南浪潮数据技术有限公司 Cluster inspection method, device and system
CN113472577B (en) * 2021-06-30 2023-07-25 济南浪潮数据技术有限公司 Cluster inspection method, device and system
CN113507397A (en) * 2021-07-06 2021-10-15 北京容联七陌科技有限公司 Method for collecting terminal equipment state automatic inspection based on cloud operation and maintenance
CN114064289A (en) * 2021-11-24 2022-02-18 云知声智能科技股份有限公司 Cluster management system
CN117076185A (en) * 2023-10-16 2023-11-17 太平金融科技服务(上海)有限公司 Server inspection method, device, equipment and medium
CN117076185B (en) * 2023-10-16 2024-01-05 太平金融科技服务(上海)有限公司 Server inspection method, device, equipment and medium

Similar Documents

Publication Publication Date Title
CN107395379A (en) A kind of cluster cruising inspection system and method
KR20210019564A (en) Operation maintenance system and method
CN108521339B (en) Feedback type node fault processing method and system based on cluster log
CN107508722B (en) Service monitoring method and device
KR20180108446A (en) System and method for management of ict infra
CN104796273A (en) Method and device for diagnosing root of network faults
CN111181800B (en) Test data processing method and device, electronic equipment and storage medium
CN106526376A (en) Simulation test system and simulation test method
CN111143167B (en) Alarm merging method, device, equipment and storage medium for multiple platforms
CN112507623B (en) Method and system for constructing algorithm middle station
CN110471945B (en) Active data processing method, system, computer equipment and storage medium
JP2016115352A (en) System and method for monitoring production system
CN114356499A (en) Kubernetes cluster alarm root cause analysis method and device
CN105871581A (en) Method and device for processing of alarm information in cloud calculation
JP2016115351A (en) Method and production system to configure control device for production system
CN105871957A (en) Monitoring framework design method, monitoring server, proxy unit and center control server
CN108108445A (en) A kind of data intelligence processing method and system
CN113505993A (en) Allocation center management method, device, equipment and storage medium
CN106920158A (en) Order real-time monitoring system based on Storm and Kafka technologies
CN115269438A (en) Automatic testing method and device for image processing algorithm
CN105844390A (en) Method and device for tracing data quality and hardware processor
CN109643311A (en) The sequence conjunctive query method that transactional unstructured data for distributed system drives
CN102546235B (en) Performance diagnosis method and system of web-oriented application under cloud computing environment
CN109818808A (en) Method for diagnosing faults, device and electronic equipment
CN107995026B (en) Management and control method, management node, managed node and system based on middleware

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication (application publication date: 20171124)