
CN112383435B - Fault processing method and device - Google Patents

Fault processing method and device

Info

Publication number
CN112383435B
Authority
CN
China
Prior art keywords
fault
information
cloud server
processing
determining
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Active
Application number
CN202011291320.9A
Other languages
Chinese (zh)
Other versions
CN112383435A (en)
Inventor
邓练兵
李皓
宋宇轩
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Zhuhai Dahengqin Technology Development Co Ltd
Original Assignee
Zhuhai Dahengqin Technology Development Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Zhuhai Dahengqin Technology Development Co Ltd
Priority to CN202011291320.9A
Publication of CN112383435A
Application granted
Publication of CN112383435B
Legal status: Active
Anticipated expiration

Classifications

    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L41/00 Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
    • H04L41/06 Management of faults, events, alarms or notifications
    • H04L41/0631 Management of faults, events, alarms or notifications using root cause analysis; using analysis of correlation between notifications, alarms or events based on decision criteria, e.g. hierarchy, tree or time analysis
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/10 Protocols in which an application is distributed across nodes in the network
    • H ELECTRICITY
    • H04 ELECTRIC COMMUNICATION TECHNIQUE
    • H04L TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
    • H04L67/00 Network arrangements or protocols for supporting network services or applications
    • H04L67/01 Protocols
    • H04L67/12 Protocols specially adapted for proprietary or special-purpose networking environments, e.g. medical networks, sensor networks, networks in vehicles or remote metering networks

Landscapes

  • Engineering & Computer Science (AREA)
  • Computer Networks & Wireless Communication (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Computing Systems (AREA)
  • General Health & Medical Sciences (AREA)
  • Medical Informatics (AREA)
  • Debugging And Monitoring (AREA)
  • Computer And Data Communications (AREA)

Abstract

The embodiment of the invention provides a method and a device for processing faults, wherein the method comprises the following steps: determining a failed cloud server from the plurality of cloud servers; acquiring processing task information corresponding to the fault cloud server, and determining priority information corresponding to the fault cloud server according to the processing task information; acquiring server state information corresponding to the fault cloud server, and generating fault result information according to the server state information; determining a fault processing mode according to the priority information and the fault result information; and according to the fault processing mode, carrying out fault processing on the fault cloud server. According to the embodiment of the invention, the fault of the cloud server is processed in time; by determining the fault processing mode according to the priority information and the fault result information, the most appropriate fault processing mode can be determined, and the fault processing efficiency is improved.

Description

Fault processing method and device
Technical Field
The present invention relates to the field of data processing technologies, and in particular, to a method and an apparatus for processing a fault.
Background
At present, although the development and application of big data are advancing rapidly, many problems remain, such as insufficient data opening and sharing, a narrow range of application fields, insufficient development and utilization of data resources, and serious disorderly abuse of data.
In the smart city construction, big data plays an important role, and according to the experience of smart city construction in the past, due to the lack of a unified development management platform, each city application can only be independently constructed, so that the problems of data barriers and application barriers exist among all the city applications, a large amount of information islands are formed, and the function and value of the big data cannot be played. Therefore, a unified cloud platform which can be popularized and used is urgently needed to be explored for breaking the stripe division among urban applications, eliminating the information gap and realizing the quality fusion of big data.
In the process of constructing the unified cloud platform, a regional Internet of things platform is established for managing equipment and equipment data. In the regional Internet of things platform, the cloud servers need to process a large number of tasks and are prone to faults. When a cloud server fails, if the fault is not processed in time or the fault processing efficiency is low, the tasks being processed by that cloud server may be shelved, so that service requests of users cannot be responded to in time.
Disclosure of Invention
In view of the above problems, a method and an apparatus for fault handling are proposed that overcome or at least partially solve these problems, comprising:
A fault handling method, applied to a regional Internet of things platform on which a plurality of cloud servers are deployed, the method comprising the following steps:
determining a failed cloud server from the plurality of cloud servers;
acquiring processing task information corresponding to the fault cloud server, and determining priority information corresponding to the fault cloud server according to the processing task information;
acquiring server state information corresponding to the fault cloud server, and generating fault result information according to the server state information;
determining a fault processing mode according to the priority information and the fault result information;
and according to the fault processing mode, carrying out fault processing on the fault cloud server.
Optionally, the determining, according to the processing task information, priority information corresponding to the cloud server with the fault includes:
determining task type information and task quantity information from the processing task information;
and determining priority information corresponding to the fault cloud server according to the task type information and the task quantity information.
Optionally, the number of the cloud servers with the fault is at least two, the obtaining of the server state information corresponding to the cloud server with the fault, and the generating of the fault result information according to the server state information include:
and sequentially acquiring server state information corresponding to the fault cloud server according to the priority information, and generating fault result information according to the server state information.
Optionally, the determining a fault processing mode according to the priority information and the fault result information includes:
determining a fault level according to the priority information and the fault result information;
and determining a fault processing mode corresponding to the fault level.
Optionally, the performing fault processing on the fault cloud server according to the fault processing manner includes:
and calling a preset fault processing component to perform fault processing on the fault cloud server.
Optionally, the performing fault processing on the fault cloud server according to the fault processing manner includes:
and sending the fault result to a maintenance terminal, calling a preset fault processing component, and performing fault processing on the fault cloud server.
Optionally, the performing fault processing on the fault cloud server according to the fault processing manner includes:
generating reminding information according to the fault result information;
and sending the fault result to a maintenance terminal, and sending the reminding information to a maintenance worker corresponding to the maintenance terminal.
A fault handling device, applied to a regional Internet of things platform on which a plurality of cloud servers are deployed, the device comprising:
a failure cloud server determination module, configured to determine a failure cloud server from the plurality of cloud servers;
the priority information determining module is used for acquiring processing task information corresponding to the fault cloud server and determining the priority information corresponding to the fault cloud server according to the processing task information;
the fault result information generating module is used for acquiring server state information corresponding to the fault cloud server and generating fault result information according to the server state information;
the fault processing mode determining module is used for determining a fault processing mode according to the priority information and the fault result information;
and the fault processing module is used for carrying out fault processing on the fault cloud server according to the fault processing mode.
A server comprising a processor, a memory and a computer program stored on the memory and capable of running on the processor, the computer program when executed by the processor implementing the method of fault handling as described above.
A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out the method of fault handling as described above.
The embodiment of the invention has the following advantages:
According to the embodiment of the invention, a fault cloud server is determined from the plurality of cloud servers; processing task information corresponding to the fault cloud server is obtained, and priority information corresponding to the fault cloud server is determined according to the processing task information; server state information corresponding to the fault cloud server is obtained, and fault result information is generated according to the server state information; a fault processing mode is determined according to the priority information and the fault result information; and fault processing is performed on the fault cloud server according to the fault processing mode, so that the fault of the cloud server is processed in time. Because the fault processing mode is determined from both the priority information and the fault result information, the most appropriate fault processing mode can be determined, and the fault processing efficiency is improved.
Drawings
In order to illustrate the technical solution of the present invention more clearly, the drawings needed in the description are briefly introduced below. The drawings in the following description show only some embodiments of the present invention, and those skilled in the art can obtain other drawings from them without inventive effort.
FIG. 1 is an overall architecture diagram of a cloud platform according to an embodiment of the present invention;
FIG. 2 is a flow chart illustrating steps of a method for fault handling according to an embodiment of the present invention;
FIG. 3 is a flow chart of steps in another method for fault handling provided by an embodiment of the present invention;
FIG. 4 is a schematic structural diagram of a fault handling apparatus according to an embodiment of the present invention.
Detailed Description
In order to make the aforementioned objects, features and advantages of the present invention comprehensible, embodiments accompanied with figures are described in further detail below. It is to be understood that the embodiments described are only a few embodiments of the present invention, and not all embodiments. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
In the construction of the smart city, a cross-domain multi-dimensional big data public service cloud platform with unified standards, unified entry, unified acquisition, unified management, unified service and unified data is built. A city-level unified data standard is established, data barriers are broken, the Internet of Things (IoT) and system data resources of the region are converged, all service systems of the smart city are carried, and a smart city ecosystem is created through open data sharing and open platform capabilities.
The construction target of the cross-domain multi-dimensional big data public service cloud platform is that various main bodies, all levels of business coordination mechanisms and intelligent application in various fields of a smart city are built by introducing advanced technologies such as cloud computing, big data, Internet of things, mobile interconnection and the like to form an open, interconnected and intelligent smart city ecological system, so that data sharing in various fields of city management, social civilian life, resource environment and economic industry is promoted, administrative efficiency, city management capability and resident life quality are improved, industry fusion development is promoted, industry transformation and upgrading are promoted, business modes are innovated, and popularization and application of the cross-domain multi-dimensional big data public service cloud platform are realized.
A cross-domain multi-dimensional big data public service cloud platform mainly relates to leading-edge IT information technologies such as cloud computing, big data, Internet of things and artificial intelligence:
1. Cloud computing technology: cloud computing mainly comprises six core components, namely elastic computing, network, storage, database, security and middleware, and provides elastic, fast, stable and secure resource and computing power services.
2. Big data technology: the data construction and management are taken as the core, and the capabilities of data communication, data integration, data management, data sharing and the like are provided through related components such as data calculation, data development, data analysis, data visualization and the like.
3. Internet of things technology: the Internet of things platform provides one-stop services such as equipment access, equipment management, monitoring operation and maintenance, and security assurance. As an important component of the space-time Internet of things engine, it provides basic Internet of things capability support and meets the intelligent management needs of future smart cities.
4. Artificial intelligence technology: an AI algorithm development platform is taken as a core, and a series of intelligent services are provided through related components such as a visual AI, text voice recognition, a Natural Language Processing (NLP) platform, a map service and the like.
As shown in FIG. 1, an Internet engine, a space-time Internet of things engine, a cross-domain multi-dimensional big data engine, a regional Internet of things sensing system, a fusion service sharing center, a fusion data innovation center, a regional application portal, a secure operation and maintenance system, an open operation system, and other structures are deployed in the cloud platform.
The following describes the details of the cloud platform:
(I) Space-time Internet of things engine
The space-time internet of things engine is composed of a Geographic Information System (GIS), a Building Information Model (BIM) and a regional internet of things platform, and is used for applying space data and a three-dimensional model to regional internet of things.
The geographic information system is a special and very important spatial information system, and can collect, store, manage, calculate, analyze, display and describe relevant geographic distribution data in the whole or part of space under the support of a computer hardware and software system.
The building information model is based on a three-dimensional digital technology, integrates engineering data models of various related information of a building engineering project, and the built model is in continuous deepening and changing along with the progress of the project.
The regional Internet of things platform provides the basic capability support of the Internet of things, and can meet the requirement of future urban management. The regional Internet of things platform comprises a sensing equipment management center, a sensing data open center and a regional event operation center, regional events are constructed by using a large amount of equipment data through the management of the equipment and the equipment data, and valuable data information is provided for users through the analysis and the processing of the regional events.
(II) Internet Engine
Cloud efficient (DevOps) and distributed middleware are deployed in the Internet engine and used for achieving efficient resource sharing and efficient function sharing of data.
DevOps is a portmanteau of Development and Operations. It is a collective name for a group of processes, methods and systems used to promote communication, collaboration and integration among development (application/software engineering), technical operations and quality assurance (QA) departments.
The distributed middleware is a kind of software between the application system and the system software, and links each part of the application system or different applications on the network by using the basic service or function provided by the system software, thereby achieving the purpose of resource sharing and function sharing.
(III) Cross-domain multidimensional big data engine
The cross-domain multi-dimensional big data engine is provided with a unified data management platform and a big data engine and used for realizing the unified management of cross-domain data.
(IV) regional Internet of things sensing system
The regional Internet of things sensing system is composed of sensing equipment such as pressure sensing equipment, humidity sensing equipment, camera sensing equipment, light source sensing equipment, infrared sensing equipment and temperature sensing equipment.
(V) converged service sharing center and converged data innovation center
A converged service sharing center and a converged data innovation center are deployed in the open service gateway. The fusion service sharing center may create different data sharing centers after fusing the data of each region according to service classification, for example: the system comprises a personal information center, a credit information center, a legal information center, a financial service center, a travel service center, a comprehensive treatment service center, a space-time service center and other sharing centers, and an Internet of things service center.
The fusion data innovation center realizes the innovative application of fusion data through a data fusion system and an AI algorithm system, wherein the AI algorithm system consists of a large-scale visual computation platform, a full-time global traffic perception engine and a progressive video search engine.
After fusing the data, the fusion service sharing center and the fusion data innovation center can present the processed data through the regional application portal.
(VI) regional application Portal
The regional application portal is mainly divided into blocks such as ecological environment protection, global tourism, property cities, enterprise intelligent services, electronic fences, intelligent communities, international talent islands, regional economic brains, cross-border e-commerce and cross-domain authentication. When a user enters a block through the regional application portal, the information that the processed data forms for that block can be acquired.
(VII) Secure operation and maintenance system
The secure operation and maintenance system comprises network, security assurance, multi-cloud management, regional cloud unified management, platform interfaces and the like, and is used for guaranteeing the secure operation of the whole cloud platform.
(VIII) Open operation system
The open operation system comprises a unified entry, capability opening, an operation platform and the like, and is used for establishing a unified entry for data and accessing the data of each region.
(IX) Other structures
The data are processed through a supercomputing cluster, a regional cloud computing platform and an OpenStack firmware cluster (OpenStack is an open-source cloud computing management platform project made up of a series of open-source software projects).
Referring to fig. 2, a flowchart of steps of a fault handling method provided in an embodiment of the present invention is shown, and is applied to a regional internet of things platform, where the regional internet of things platform is deployed with a plurality of cloud servers, where the method specifically includes the following steps:
Step 201, determining a fault cloud server from the plurality of cloud servers;
A plurality of cloud servers are deployed in the regional Internet of things platform and can be monitored in real time while they run. Once a cloud server fails, the failed cloud server can be determined from the plurality of cloud servers, which identifies the node where the fault occurred.
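As an illustration only, the monitoring in Step 201 could be sketched as a heartbeat check. The source does not say how servers are monitored; the `last_heartbeat` field, the `timeout` value and the function name below are all assumptions made for this sketch.

```python
from dataclasses import dataclass


@dataclass
class CloudServer:
    name: str
    last_heartbeat: float  # hypothetical monitoring signal, in seconds


def find_failed_servers(servers, now, timeout=30.0):
    """Return the servers whose last heartbeat is older than `timeout` seconds.

    Assumption: a stale heartbeat is treated as a fault; the patent only
    states that the servers are monitored in real time.
    """
    return [s for s in servers if now - s.last_heartbeat > timeout]


servers = [
    CloudServer("a", last_heartbeat=100.0),
    CloudServer("b", last_heartbeat=55.0),
]
failed = find_failed_servers(servers, now=110.0)
print([s.name for s in failed])  # ['b']
```

Any other health signal (failed probes, error rates) could replace the heartbeat without changing the rest of the flow.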
Step 202, acquiring processing task information corresponding to the fault cloud server, and determining priority information corresponding to the fault cloud server according to the processing task information;
After the fault cloud server is determined, the processing task information corresponding to it can be obtained, and the priority information corresponding to the fault cloud server can be determined according to the processing task information. In general, the more important the tasks being processed and the larger their number, the wider the influence of the fault, so a higher priority can be set for a fault cloud server whose failure has a wide influence.
Step 203, acquiring server state information corresponding to the fault cloud server, and generating fault result information according to the server state information;
After the priority information is determined, the server state information corresponding to the fault cloud server can be obtained, and fault result information is generated according to the server state information. The fault result information can be used to make a preliminary determination of whether the fault of the cloud server is easy to resolve.
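A minimal sketch of Step 203 follows, under stated assumptions: the source does not enumerate the server state fields or define what makes a fault easy to resolve, so the three boolean fields and the "simple"/"complex" rule below are hypothetical.

```python
from dataclasses import dataclass


@dataclass
class ServerState:
    # Hypothetical state fields; the source does not list them.
    service_running: bool
    disk_healthy: bool
    network_reachable: bool


def generate_fault_result(state):
    """Summarise the server state into fault result information."""
    problems = []
    if not state.service_running:
        problems.append("service_stopped")
    if not state.disk_healthy:
        problems.append("disk_fault")
    if not state.network_reachable:
        problems.append("network_unreachable")
    # Assumption: only a lone stopped service counts as a simple fault;
    # anything else, including an unknown cause, counts as complex.
    simple = problems == ["service_stopped"]
    return {"problems": problems, "complexity": "simple" if simple else "complex"}


result = generate_fault_result(ServerState(False, True, True))
print(result["complexity"])  # simple
```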
Step 204, determining a fault processing mode according to the priority information and the fault result information;
In practical application, the fault processing mode could be determined according to priority alone, where the higher the priority, the more complicated the fault processing flow. However, the priority relates only to the processing tasks of the cloud server and does not consider the fault itself. If a simple fault occurs on a high-priority cloud server and the processing mode is determined by priority alone, the simple fault must also go through a complex fault processing flow, which reduces the fault processing efficiency.
Therefore, after the fault result information is generated, the priority information and the fault result information are considered together to determine the fault processing mode, so that simple faults need not go through a complex processing flow, and the fault processing efficiency is improved.
In an embodiment of the present invention, the determining a fault processing mode according to the priority information and the fault result information includes:
determining a fault level according to the priority information and the fault result information; and determining a fault processing mode corresponding to the fault level.
In practical application, after the fault result information is generated, the fault level can be determined according to the priority information and the fault result information, and different fault levels correspond to different fault processing modes. Because the priority information and the fault result information can each take many values, setting a separate fault processing mode for every combination of the two would make the fault processing procedure complicated and inconvenient to operate. Grouping the combinations into a small number of fault levels, each with its own fault processing mode, avoids this.
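The grouping of many (priority, fault result) combinations into a few fault levels can be sketched as follows. This is not the patented mapping: the numeric priority score, the threshold of 5, the three level numbers and the mode names are all assumptions chosen for illustration, and the source does not say how a low-priority complex fault is treated (it is placed in level 3 here).

```python
HIGH_PRIORITY_THRESHOLD = 5  # hypothetical cut-off for "high priority"

# Hypothetical mode names, one per fault level.
HANDLING_MODE = {
    1: "auto_repair",
    2: "notify_and_auto_repair",
    3: "notify_and_remind",
}


def determine_fault_level(priority_score, complexity):
    """Collapse a priority score and a fault complexity into one fault level."""
    if complexity == "complex":
        return 3  # assumption: every complex fault needs manual handling
    return 2 if priority_score >= HIGH_PRIORITY_THRESHOLD else 1


level = determine_fault_level(priority_score=7, complexity="simple")
print(level, HANDLING_MODE[level])  # 2 notify_and_auto_repair
```

However many priority scores or fault descriptions exist, only three handling modes need to be maintained.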
Step 205, performing fault processing on the fault cloud server according to the fault processing mode.
After the fault processing mode is determined, fault processing is performed on the fault cloud server according to the fault processing mode.
In an embodiment of the present invention, the performing fault processing on the fault cloud server according to the fault processing manner includes:
and calling a preset fault processing component to perform fault processing on the fault cloud server.
When the fault cloud server has a low priority and the fault is simple, the regional Internet of things platform can store, in the preset fault processing component, the processing procedures for simple faults together with fault processing procedures that have previously succeeded. When the same or a similar fault occurs again, the preset fault processing component can be called directly to perform fault processing on the fault cloud server.
In an embodiment of the present invention, the performing fault processing on the fault cloud server according to the fault processing manner includes:
and sending the fault result to a maintenance terminal, calling a preset fault processing component, and performing fault processing on the fault cloud server.
When the fault cloud server has a high priority and the fault is simple, the preset fault processing component will probably resolve the fault. Because of the high priority, however, the possibility that the component fails must be covered, so the fault result is sent to the maintenance terminal while the preset fault processing component is called to perform fault processing on the fault cloud server.
In an embodiment of the present invention, the performing fault processing on the fault cloud server according to the fault processing manner includes:
generating reminding information according to the fault result information; and sending the fault result to a maintenance terminal, and sending the reminding information to a maintenance worker corresponding to the maintenance terminal.
When the fault cloud server has a high priority and the fault is complex, the preset fault processing component cannot handle it and manual intervention is required. To notify maintenance personnel in time, reminding information can be generated according to the fault result information. On the one hand, the fault result can be sent to the maintenance terminal to prompt maintenance personnel working at the terminal to process the fault in time; on the other hand, the reminding information can be sent at the same time to the maintenance personnel corresponding to the maintenance terminal, so that they do not miss the fault when they are away from the terminal.
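The three handling modes described in these embodiments can be sketched as one dispatch function. The level numbers, the callback parameters (`repair`, `send_to_terminal`, `remind_staff`) and the reminder text are hypothetical; the source names a preset fault processing component and a maintenance terminal but specifies no interface for them.

```python
def handle_fault(level, server, fault_result, repair, send_to_terminal, remind_staff):
    """Run the handling mode for a fault level (1, 2 or 3; numbering assumed)."""
    if level == 1:
        # Low priority, simple fault: call the preset component only.
        repair(server)
    elif level == 2:
        # High priority, simple fault: inform the terminal and repair as well.
        send_to_terminal(server, fault_result)
        repair(server)
    elif level == 3:
        # High priority, complex fault: manual handling; also remind the staff.
        reminder = f"Server {server} needs manual repair: {fault_result}"
        send_to_terminal(server, fault_result)
        remind_staff(reminder)
    else:
        raise ValueError(f"unknown fault level: {level}")


log = []
handle_fault(
    2, "srv-1", "service_stopped",
    repair=lambda s: log.append(("repair", s)),
    send_to_terminal=lambda s, r: log.append(("terminal", s)),
    remind_staff=lambda msg: log.append(("remind", msg)),
)
print(log)  # [('terminal', 'srv-1'), ('repair', 'srv-1')]
```

Passing the actions in as callbacks keeps the sketch independent of any particular repair tool or messaging channel.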
In the embodiment of the invention, the fault cloud server is determined from the plurality of cloud servers, the processing task information corresponding to the fault cloud server is obtained, the priority information corresponding to the fault cloud server is determined according to the processing task information, the server state information corresponding to the fault cloud server is obtained, the fault result information is generated according to the server state information, the fault processing mode is determined according to the priority information and the fault result information, the fault cloud server is subjected to fault processing according to the fault processing mode, the fault of the cloud server is processed in time, the most appropriate fault processing mode can be determined by determining the fault processing mode according to the priority information and the fault result information, and the fault processing efficiency is improved.
Referring to fig. 3, a flowchart illustrating steps of another fault handling method according to an embodiment of the present invention is shown, which may specifically include the following steps:
Step 301, determining a fault cloud server from the plurality of cloud servers;
Step 302, acquiring processing task information corresponding to the fault cloud server;
Step 303, determining task type information and task quantity information from the processing task information;
After the processing task information is acquired, task type information and task quantity information may be further determined.
In practical application, tasks can be divided by their real-time requirement into high real-time tasks, medium real-time tasks and non-real-time tasks, and the task type information records this division.
For example, a high real-time task has a high real-time requirement and may be a user login task, an equipment connection task, a data storage task, a data calculation task, an event notification task and the like.
A medium real-time task has a real-time requirement that is lower than that of a high real-time task and may be a registration task, a user management task, an account management task, an authority task and the like.
A non-real-time task has no real-time requirement and is mainly a management or configuration task, such as an installation task, a development task, a debugging task, a deployment task, a resource configuration task, a setting task and the like.
Step 304, determining priority information corresponding to the fault cloud server according to the task type information and the task quantity information;
After the task type information and the task quantity information are determined, the priority information corresponding to the faulty cloud server can be determined from them.
In practical applications, the higher the real-time requirement of the tasks, the higher their importance, and the larger their number, the higher the priority of the corresponding faulty cloud server.
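One possible way to turn task type and task quantity into a priority is a weighted count. The specification does not give a formula, so the weights, the type labels, and the function name `priority_score` below are all assumptions chosen only to illustrate the stated relationship.

```python
from collections import Counter
from collections.abc import Iterable

# Hypothetical weights: the description only states that stricter real-time
# requirements and larger task counts lead to a higher priority.
TYPE_WEIGHTS = {"high": 3, "medium": 2, "non": 1}


def priority_score(task_types: Iterable[str]) -> int:
    """Compute a priority score for one faulty cloud server.

    task_types holds one type label per task on the server, so the number
    of entries per label is the task quantity information.
    Raises KeyError for a label that is not in TYPE_WEIGHTS.
    """
    counts = Counter(task_types)
    return sum(TYPE_WEIGHTS[task_type] * n for task_type, n in counts.items())
```

With these assumed weights, a server running two high-real-time tasks and one non-real-time task scores 3 * 2 + 1 = 7, and adding any further task can only raise the score.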
Step 305, acquiring server state information corresponding to the faulty cloud server, and generating fault result information according to the server state information;
In an embodiment of the present invention, the number of faulty cloud servers is at least two, and the acquiring of the server state information corresponding to the faulty cloud server and the generating of the fault result information according to the server state information include:
sequentially acquiring the server state information corresponding to the faulty cloud servers according to the priority information, and generating the fault result information according to the server state information.
When there are two or more faulty cloud servers, the server state information can be acquired sequentially according to the priority information of the faulty cloud servers, and the fault result information is generated according to the server state information.
For example, if the priority information ranks cloud server A ahead of cloud server B, the server state information of cloud server A may be acquired first, the fault result information corresponding to cloud server A generated, and fault processing performed on cloud server A; the server state information of cloud server B is then acquired, the fault result information corresponding to cloud server B generated, and fault processing performed on cloud server B.
Alternatively, the server state information of faulty cloud server A and faulty cloud server B may be acquired simultaneously; the fault result information corresponding to A is generated first and fault processing is performed on faulty cloud server A, and then the fault result information corresponding to B is generated and fault processing is performed on faulty cloud server B.
By performing fault processing on the plurality of faulty cloud servers in order of priority, the task situation of each faulty cloud server is fully taken into account, and faulty cloud servers that carry important tasks or many associated tasks are processed first.
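The sequential, priority-ordered handling described above can be sketched as follows. The class `FailedServer`, the convention that a larger number means a higher priority, and the three callables passed in are assumptions for illustration; the specification does not define these interfaces.

```python
from dataclasses import dataclass
from typing import Any, Callable, Iterable


@dataclass
class FailedServer:
    name: str
    priority: int  # assumed convention: larger value is handled earlier


def handle_in_priority_order(
    servers: Iterable[FailedServer],
    get_state: Callable[[FailedServer], Any],
    make_result: Callable[[Any], Any],
    handle: Callable[[FailedServer, Any], None],
) -> list:
    """Handle faulty servers one at a time, highest priority first.

    Returns the server names in the order they were handled.
    """
    handled = []
    # sorted() is stable, so servers with equal priority keep their input order.
    for server in sorted(servers, key=lambda s: s.priority, reverse=True):
        state = get_state(server)      # acquire server state information
        result = make_result(state)    # generate fault result information
        handle(server, result)         # perform fault processing
        handled.append(server.name)
    return handled
```

For the example in the text, passing server B followed by server A, with A having the higher priority, would handle A completely before B is examined.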
Step 306, determining a fault processing mode according to the priority information and the fault result information;
Step 307, performing fault processing on the faulty cloud server according to the fault processing mode.
In this embodiment of the invention, a faulty cloud server is determined from the plurality of cloud servers; processing task information corresponding to the faulty cloud server is acquired, and priority information corresponding to the faulty cloud server is determined according to the task type information and the task quantity information in the processing task information; server state information corresponding to the faulty cloud server is acquired, in order of priority when at least two faulty cloud servers exist, and fault result information is generated according to the server state information; a fault processing mode is determined according to the priority information and the fault result information; and fault processing is performed on the faulty cloud server according to the fault processing mode. Faults of the cloud servers are thereby processed in a timely manner. Because the fault processing mode is determined according to both the priority information and the fault result information, the most appropriate fault processing mode can be selected, which improves fault processing efficiency.
It should be noted that, for simplicity of description, the method embodiments are described as a series of acts or combination of acts, but those skilled in the art will recognize that the present invention is not limited by the illustrated order of acts, as some steps may occur in other orders or concurrently in accordance with the embodiments of the present invention. Further, those skilled in the art will appreciate that the embodiments described in the specification are presently preferred and that no particular act is required to implement the invention.
Referring to Fig. 4, a schematic structural diagram of a fault processing apparatus according to an embodiment of the present invention is shown. The apparatus may specifically include the following modules:
a faulty cloud server determining module 401, configured to determine a faulty cloud server from the plurality of cloud servers;
a priority information determining module 402, configured to acquire processing task information corresponding to the faulty cloud server, and determine priority information corresponding to the faulty cloud server according to the processing task information;
In an embodiment of the present invention, the priority information determining module 402 may include:
a first determining submodule, configured to determine task type information and task quantity information from the processing task information;
a second determining submodule, configured to determine the priority information corresponding to the faulty cloud server according to the task type information and the task quantity information.
a fault result information generating module 403, configured to acquire server state information corresponding to the faulty cloud server, and generate fault result information according to the server state information;
In an embodiment of the present invention, the fault result information generating module 403 may include:
a fault result information generating submodule, configured to sequentially acquire the server state information corresponding to the faulty cloud servers according to the priority information, and generate the fault result information according to the server state information.
a fault processing mode determining module 404, configured to determine a fault processing mode according to the priority information and the fault result information;
In an embodiment of the present invention, the fault processing mode determining module 404 may include:
a fault level determining submodule, configured to determine a fault level according to the priority information and the fault result information;
a fault processing mode determining submodule, configured to determine the fault processing mode corresponding to the fault level.
a fault processing module 405, configured to perform fault processing on the faulty cloud server according to the fault processing mode.
In an embodiment of the present invention, the fault processing module 405 may include:
a first fault processing submodule, configured to call a preset fault processing assembly and perform fault processing on the faulty cloud server.
In another embodiment of the present invention, the fault processing module 405 may include:
a second fault processing submodule, configured to send the fault result to a maintenance terminal, call a preset fault processing assembly, and perform fault processing on the faulty cloud server.
In yet another embodiment of the present invention, the fault processing module 405 may include:
a reminding information generating submodule, configured to generate reminding information according to the fault result information;
a sending submodule, configured to send the fault result to a maintenance terminal and send the reminding information to a maintenance worker corresponding to the maintenance terminal.
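The relationship between the fault level determining submodule, the fault processing mode determining submodule, and the three fault processing submodules can be sketched as a lookup from level to mode. The 1 to 3 scales, the `max` rule for combining priority and severity, and the assignment of each mode to a level are assumptions; this passage of the specification does not state them.

```python
from enum import Enum


class ProcessingMode(Enum):
    """The three processing variants listed for module 405."""
    AUTO = "call the preset fault processing assembly"
    REPORT_AND_AUTO = "send the fault result to the maintenance terminal, then call the preset assembly"
    REPORT_AND_REMIND = "send the fault result to the maintenance terminal and remind the maintenance worker"


# Hypothetical mapping from fault level to processing mode.
MODE_BY_LEVEL = {
    1: ProcessingMode.AUTO,
    2: ProcessingMode.REPORT_AND_AUTO,
    3: ProcessingMode.REPORT_AND_REMIND,
}


def fault_level(priority: int, severity: int) -> int:
    """Combine priority (1-3) and fault severity (1-3) into a fault level (1-3).

    Taking the maximum is an assumed rule, used only for illustration.
    """
    if not (1 <= priority <= 3 and 1 <= severity <= 3):
        raise ValueError("priority and severity must be between 1 and 3")
    return max(priority, severity)


def processing_mode(priority: int, severity: int) -> ProcessingMode:
    """Return the fault processing mode that corresponds to the fault level."""
    return MODE_BY_LEVEL[fault_level(priority, severity)]
```

Keeping the level computation separate from the level-to-mode table mirrors the two submodules of module 404, so either part could be changed without touching the other.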
In this embodiment of the invention, a faulty cloud server is determined from the plurality of cloud servers; processing task information corresponding to the faulty cloud server is acquired, and priority information corresponding to the faulty cloud server is determined according to the processing task information; server state information corresponding to the faulty cloud server is acquired, and fault result information is generated according to the server state information; a fault processing mode is determined according to the priority information and the fault result information; and fault processing is performed on the faulty cloud server according to the fault processing mode. Faults of the cloud servers are thereby processed in a timely manner. Because the fault processing mode is determined according to both the priority information and the fault result information, the most appropriate fault processing mode can be selected, which improves fault processing efficiency.
An embodiment of the present invention also provides a server, which may include a processor, a memory, and a computer program stored on the memory and capable of running on the processor, and when executed by the processor, the computer program implements the method for fault handling as described above.
An embodiment of the present invention further provides a computer-readable storage medium on which a computer program is stored, the computer program implementing the above fault handling method when executed by a processor.
Since the apparatus embodiment is substantially similar to the method embodiment, it is described only briefly; for relevant details, refer to the corresponding description of the method embodiment.
The embodiments in this specification are described in a progressive manner. Each embodiment focuses on its differences from the other embodiments, and for parts that are the same or similar, the embodiments may be referred to one another.
As will be appreciated by one skilled in the art, embodiments of the present invention may be provided as a method, apparatus, or computer program product. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product embodied on one or more computer-usable storage media (including, but not limited to, disk storage, CD-ROM, optical storage, and the like) having computer-usable program code embodied therein.
Embodiments of the present invention are described with reference to flowchart illustrations and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the invention. It will be understood that each flow and/or block of the flow diagrams and/or block diagrams, and combinations of flows and/or blocks in the flow diagrams and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to a processor of a general purpose computer, special purpose computer, embedded processor, or other programmable data processing terminal to produce a machine, such that the instructions, which execute via the processor of the computer or other programmable data processing terminal, create means for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be stored in a computer-readable memory that can direct a computer or other programmable data processing terminal to function in a particular manner, such that the instructions stored in the computer-readable memory produce an article of manufacture including instruction means which implement the function specified in the flowchart flow or flows and/or block diagram block or blocks.
These computer program instructions may also be loaded onto a computer or other programmable data processing terminal to cause a series of operational steps to be performed on the computer or other programmable terminal to produce a computer implemented process such that the instructions which execute on the computer or other programmable terminal provide steps for implementing the functions specified in the flowchart flow or flows and/or block diagram block or blocks.
While preferred embodiments of the present invention have been described, additional variations and modifications of these embodiments may occur to those skilled in the art once they learn of the basic inventive concepts. Therefore, it is intended that the appended claims be interpreted as including preferred embodiments and all such alterations and modifications as fall within the scope of the embodiments of the invention.
Finally, it should also be noted that, herein, relational terms such as first and second, and the like may be used solely to distinguish one entity or action from another entity or action without necessarily requiring or implying any actual such relationship or order between such entities or actions. Also, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or terminal that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or terminal. Without further limitation, an element defined by the phrase "comprising a ..." does not exclude the presence of other like elements in a process, method, article, or terminal that comprises the element.
The fault handling method and device provided by the present invention have been described in detail above. Specific examples are used herein to illustrate the principle and implementation of the present invention, and the above description of the embodiments is intended only to help in understanding the method and core idea of the present invention. Meanwhile, a person skilled in the art may, according to the idea of the present invention, make changes to the specific embodiments and the scope of application. In summary, the content of this specification should not be construed as limiting the present invention.

Claims (10)

1. A fault handling method, applied to a regional Internet of things platform, wherein the regional Internet of things platform is deployed with a plurality of cloud servers, the method comprising:
determining a failed cloud server from the plurality of cloud servers;
acquiring processing task information corresponding to the fault cloud server, and determining priority information corresponding to the fault cloud server according to the processing task information;
acquiring server state information corresponding to the fault cloud server, and generating fault result information according to the server state information;
determining a fault processing mode according to the priority information and the fault result information;
and according to the fault processing mode, carrying out fault processing on the fault cloud server.
2. The method according to claim 1, wherein the determining priority information corresponding to the failed cloud server according to the processing task information includes:
determining task type information and task quantity information from the processing task information;
and determining priority information corresponding to the fault cloud server according to the task type information and the task quantity information.
3. The method according to claim 1 or 2, wherein the number of the faulty cloud servers is at least two, and the obtaining of the server state information corresponding to the faulty cloud server and the generating of the fault result information according to the server state information includes:
and sequentially acquiring server state information corresponding to the fault cloud server according to the priority information, and generating fault result information according to the server state information.
4. The method according to claim 1 or 2, wherein the determining a fault handling manner according to the priority information and the fault result information comprises:
determining a fault level according to the priority information and the fault result information;
and determining a fault processing mode corresponding to the fault level.
5. The method according to claim 1, wherein the performing fault processing on the faulty cloud server according to the fault processing manner includes:
and calling a preset fault processing assembly to process the fault of the fault cloud server.
6. The method according to claim 1, wherein the performing fault processing on the faulty cloud server according to the fault processing manner includes:
and sending the fault result to a maintenance terminal, calling a preset fault processing assembly, and carrying out fault processing on the fault cloud server.
7. The method according to claim 1, wherein the performing fault processing on the faulty cloud server according to the fault processing manner includes:
generating reminding information according to the fault result information;
and sending the fault result to a maintenance terminal, and sending the reminding information to a maintenance worker corresponding to the maintenance terminal.
8. A fault handling device, applied to a regional Internet of things platform, wherein the regional Internet of things platform is deployed with a plurality of cloud servers, the device comprising:
a faulty cloud server determining module, configured to determine a faulty cloud server from the plurality of cloud servers;
a priority information determining module, configured to acquire processing task information corresponding to the faulty cloud server, and determine priority information corresponding to the faulty cloud server according to the processing task information;
a fault result information generating module, configured to acquire server state information corresponding to the faulty cloud server, and generate fault result information according to the server state information;
a fault processing mode determining module, configured to determine a fault processing mode according to the priority information and the fault result information;
and a fault processing module, configured to perform fault processing on the faulty cloud server according to the fault processing mode.
9. A server comprising a processor, a memory, and a computer program stored on the memory and capable of running on the processor, the computer program, when executed by the processor, implementing a method of fault handling as claimed in any one of claims 1 to 7.
10. A computer-readable storage medium, on which a computer program is stored which, when being executed by a processor, carries out a method of fault handling according to any one of claims 1 to 7.
CN202011291320.9A 2020-11-17 2020-11-17 Fault processing method and device Active CN112383435B (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202011291320.9A CN112383435B (en) 2020-11-17 2020-11-17 Fault processing method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202011291320.9A CN112383435B (en) 2020-11-17 2020-11-17 Fault processing method and device

Publications (2)

Publication Number Publication Date
CN112383435A CN112383435A (en) 2021-02-19
CN112383435B true CN112383435B (en) 2022-03-29

Family

ID=74585724

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202011291320.9A Active CN112383435B (en) 2020-11-17 2020-11-17 Fault processing method and device

Country Status (1)

Country Link
CN (1) CN112383435B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN113472583B (en) * 2021-07-06 2023-04-28 北京奇艺世纪科技有限公司 Fault cause identification method, device and system
CN114237886A (en) * 2021-12-15 2022-03-25 平安证券股份有限公司 Task processing method and device, computer equipment and storage medium

Citations (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
WO2017005117A1 (en) * 2015-07-09 2017-01-12 阿里巴巴集团控股有限公司 Cloud platform-based fault handling method and device

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20070233865A1 (en) * 2006-03-30 2007-10-04 Garbow Zachary A Dynamically Adjusting Operating Level of Server Processing Responsive to Detection of Failure at a Server
CN101227315B (en) * 2007-01-17 2011-04-20 上海市医疗保险信息中心 Dynamic state server colony and control method thereof
CN102413186B (en) * 2011-12-02 2014-07-30 北京星网锐捷网络技术有限公司 Resource scheduling method and device based on private cloud computing, and cloud management server
US20140093231A1 (en) * 2012-10-02 2014-04-03 Kenneth Martin Fisher Procedure, apparatus, system, and computer program for network recovery
CN103200050B (en) * 2013-04-12 2016-12-28 北京百度网讯科技有限公司 The hardware state monitoring method and system of server

Also Published As

Publication number Publication date
CN112383435A (en) 2021-02-19

Similar Documents

Publication Publication Date Title
CN110335187B (en) Smart city operating system
CN112291367B (en) Data processing method and device
CN108039959B (en) Data situation perception method, system and related device
US20200042657A1 (en) Multi-dimensional event model generation
CN112383435B (en) Fault processing method and device
CN112398859B (en) Security control method, device, server and storage medium based on regional Internet of things platform
CN112668841A (en) Comprehensive traffic monitoring system and method based on data fusion
CN112382122B (en) Traffic information processing method and device
CN109636307B (en) River chang APP system
CN110913032A (en) Method and platform for realizing DNS (Domain name System) domain name request analysis by using power grid threat information
CN112332981B (en) Data processing method and device
CN112291264B (en) Security control method, device, server and storage medium
CN112291266B (en) Data processing method, device, server and storage medium
CN108777637A (en) A kind of data center's total management system and method for supporting server isomery
CN112383631A (en) Regional Internet of things platform and data processing method based on regional Internet of things platform
CN112182625A (en) Data sharing system for smart city
CN113706101A (en) Power grid project management intelligent system architecture and method
CN112035333A (en) Energy internet service system, energy internet application system, and electronic device
CN112258371A (en) Fault processing method and device
CN112258372A (en) Data processing method and device
CN117112702A (en) Service rapid processing system for long and large bridge tunneling scene
CN116644934A (en) Big data management system of hydrogen energy industry
CN112333199B (en) Data processing method and device
CN112468696A (en) Data processing method and device
CN112258373B (en) Data processing method and device

Legal Events

Date Code Title Description
PB01 Publication
SE01 Entry into force of request for substantive examination
GR01 Patent grant