CN109391524B - Fault positioning method and device - Google Patents
Fault positioning method and device Download PDFInfo
- Publication number
- CN109391524B CN109391524B CN201811184582.8A CN201811184582A CN109391524B CN 109391524 B CN109391524 B CN 109391524B CN 201811184582 A CN201811184582 A CN 201811184582A CN 109391524 B CN109391524 B CN 109391524B
- Authority
- CN
- China
- Prior art keywords
- service program
- information
- service
- unit
- call
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Images
Classifications
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/50—Testing arrangements
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L41/00—Arrangements for maintenance, administration or management of data switching networks, e.g. of packet switching networks
- H04L41/06—Management of faults, events, alarms or notifications
- H04L41/0677—Localisation of faults
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04L—TRANSMISSION OF DIGITAL INFORMATION, e.g. TELEGRAPHIC COMMUNICATION
- H04L43/00—Arrangements for monitoring or testing data switching networks
- H04L43/08—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters
- H04L43/0805—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability
- H04L43/0817—Monitoring or testing based on specific metrics, e.g. QoS, energy consumption or environmental parameters by checking availability by checking functioning
Landscapes
- Engineering & Computer Science (AREA)
- Computer Networks & Wireless Communication (AREA)
- Signal Processing (AREA)
- Environmental & Geological Engineering (AREA)
- Mobile Radio Communication Systems (AREA)
Abstract
The embodiment of the invention provides a fault positioning method and a fault positioning device, which are applied to a radio management integrated platform, wherein the method comprises the following steps: for each service program, obtaining a characteristic parameter reflecting the working performance of a unit of a target type according to the request information, the calling information and the execution information of the service program, comparing the characteristic parameter with a preset value, and determining whether the service program is an abnormal service program; for each abnormal service program, determining a unit for executing the abnormal service program; and searching the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determining the searched same unit belonging to the target type as a fault unit. Therefore, the function test can be carried out without establishing connection with the radio equipment by means of special detection equipment, the hardware cost is saved and the accuracy of fault location is improved on the basis of not influencing the daily service of the radio equipment.
Description
Technical Field
The invention relates to the field of radio, in particular to a fault positioning method and device.
Background
With the rapid increase of radio traffic, the daily use frequency of radio equipment increases, and the probability of radio equipment failure becomes high, such as frequent automatic shutdown, component loss, inaccurate parameters, and the like. The maintenance of the radio equipment is enhanced, the utilization rate of the radio equipment can be improved, the service life of the radio equipment is prolonged, and the working efficiency is improved.
Fault location as an important means of maintaining a radio device, a radio device is often functionally tested by establishing a connection with the radio device using special test equipment, during which the daily service of the radio device needs to be stopped.
Disclosure of Invention
In order to at least partially overcome the above-mentioned deficiencies in the prior art, embodiments of the present invention provide a fault location method and apparatus.
In order to achieve the above purpose, the embodiment of the present invention proposes the following technical solutions:
in a first aspect, an embodiment of the present invention provides a fault location method, which is applied to a radio management integration platform communicatively connected to a radio device, where the radio device includes multiple service programs and multiple types of units, and each service program is executed by at least one type of unit; the radio management integrated platform is used for responding to a request of a user terminal to a service program, calling the service program in the radio equipment, and recording request information, calling information and execution information of the service program; the method comprises the following steps:
for each service program, obtaining characteristic parameters reflecting the working performance of a unit of a target type according to the request information, the calling information and the execution information of the service program recorded by the radio management integrated platform, comparing the characteristic parameters of the service program with preset values, and determining whether the service program is an abnormal service program according to the comparison result;
for each determined exception service routine, determining a unit for executing the exception service routine;
and searching the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determining the searched same unit belonging to the target type as a fault unit.
In a second aspect, an embodiment of the present invention further provides a fault location apparatus, which is applied to a radio management integration platform in communication connection with a radio device, where the radio device includes multiple service programs and multiple types of units, and each service program is executed by at least one type of unit; the radio management integrated platform is used for responding to a request of a user terminal for a service program, calling the service program in the radio equipment, and recording request information, calling information and execution information of the service program; the device comprises:
an abnormal service program determining module, configured to, for each service program, obtain a characteristic parameter reflecting the working performance of a unit of a target type according to request information, call information, and execution information of the service program recorded by the radio management integrated platform, compare the characteristic parameter of the service program with a preset value, and determine whether the service program is an abnormal service program according to a comparison result;
an execution unit determination module, configured to determine, for each determined abnormal service program, a unit for executing the abnormal service program;
and the fault positioning module is used for searching the same unit belonging to the target type in the determined units for executing the abnormal service programs and determining the searched same unit belonging to the target type as a fault unit.
Compared with the prior art, the embodiment of the invention has the following beneficial effects:
the embodiment of the invention provides a fault positioning method and a fault positioning device, which are applied to a radio management integrated platform, wherein the method comprises the following steps: for each service program, acquiring characteristic parameters reflecting the working performance of a unit of a target type from request information, calling information and execution information of the service program, comparing the characteristic parameters with preset values, and determining whether the service program is an abnormal service program; for each abnormal service program, determining a unit for executing the abnormal service program; and searching the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determining the searched same unit belonging to the target type as a fault unit. In this way, the function test can be performed without establishing a connection with the radio device by means of a special detection device, and the influence of the function test process on the daily service of the radio device is avoided.
In addition, the design not only saves the hardware cost, but also improves the accuracy of fault location.
Drawings
In order to more clearly illustrate the technical solutions of the embodiments of the present invention, the drawings needed to be used in the embodiments will be briefly described below, it should be understood that the following drawings only illustrate some embodiments of the present invention and therefore should not be considered as limiting the scope, and for those skilled in the art, other related drawings can be obtained according to the drawings without inventive efforts.
Fig. 1 is an application scenario of a fault location method according to an embodiment of the present invention;
fig. 2 is a schematic flow chart of a fault location method according to an embodiment of the present invention;
FIG. 3 is a flowchart illustrating the sub-steps of step S210;
FIG. 4 is a schematic flow chart of the sub-steps of step S210;
FIG. 5 is a schematic flow chart of the sub-steps of step S210;
fig. 6 is a schematic partial flowchart of a fault location method according to an embodiment of the present invention;
fig. 7 is a schematic structural diagram of a server according to an embodiment of the present invention;
fig. 8 is a block diagram illustrating a fault location device according to an embodiment of the present invention.
Icon: 10-a server; 100-user terminal; 11-a processor; 12-a machine-readable storage medium; 200-radio management integrated platform; 300-a radio device; 400-fault locating means; 410-exception service routine determination module; 420-an execution unit determination module; 430-fault location module.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention.
It should be noted that: like reference numbers and letters refer to like items in the following figures, and thus, once an item is defined in one figure, it need not be further defined and explained in subsequent figures.
Referring to fig. 1, fig. 1 shows an application scenario of a fault location method according to an embodiment of the present invention. The application scenario includes a user terminal 100, a radio device 300, and a radio management integration platform 200 communicatively connected to the radio device 300. The radio device 300 and the user terminal 100 perform service communication through the integrated radio management platform 200.
Wherein the radio device 300 comprises a plurality of service programs and a plurality of types of units, the units being hardware modules in the radio device 300, each service program being executed by at least one type of unit. Each type of unit includes an unlimited number of units, and correspondingly, there may be one, two, or more units executing any service program, which is not limited in this embodiment of the present invention. In this embodiment, a user may select a service procedure to be invoked on the user terminal 100, the user terminal 100 initiates a request for invoking the corresponding service procedure according to the user's selection, and the radio integration 200 management platform is configured to respond to the request initiated by the user terminal 100 and invoke the service procedure in the radio device 300, so that at least one type of unit on the radio device 300 executes the service procedure, and records request information, invocation information, and execution information of the service procedure.
Alternatively, the radio management integration platform 200 may include a log file in which request information, call information, and execution information of the service program may be recorded.
In this embodiment, the request information, the call information, and the execution information of the service program may include: the type, number, address of the service procedure, the originator of the request (e.g., the user terminal 100), the time when the originator originates the request, whether the radio device 300 successfully invokes the service procedure, the time when the radio device 300 starts executing the service procedure and ends executing the service procedure, the total number of times the radio device 300 invokes the service procedure within a preset time interval, the number of times the radio device 300 successfully invokes the service procedure within a preset time interval, etc.
Fig. 2 is a schematic flow chart of a fault location method according to an embodiment of the present invention. In the embodiment of the present invention, the fault location method is applied to the radio management integrated platform 200 described above, so as to test the radio device without affecting the daily business of the radio device. The flow of the fault location method shown in fig. 2 is explained in detail below.
Step S210, for each service program, obtaining a characteristic parameter reflecting the working performance of the unit of the target type according to the request information, the call information and the execution information of the service program recorded by the radio management integration platform 200, comparing the characteristic parameter of the service program with a preset value, and determining whether the service program is an abnormal service program according to the comparison result.
The preset value is a preset value of the characteristic parameter in a normal working state of the service program and can be obtained through data statistics or testing. When the unit for executing the service program is in a normal working state, the service program is in the normal working state.
According to the request information, the call information and the execution information of the service program recorded by the radio management integration platform 200, different characteristic parameters can be obtained, and the different characteristic parameters are used for reflecting the working performance of units with different target types. The radio management integration platform 200 stores a mapping relationship between the characteristic parameter and the target type, and in the mapping relationship, it can be determined which target type of unit the obtained characteristic parameter is used for reflecting the working performance of the target type.
In a first alternative embodiment, the characteristic parameter may include a response duration, where the response duration refers to a time from when the request for the service procedure is issued from the user terminal 100 in one invocation until the service procedure is successfully invoked by the radio management integration platform 200. The integrated radio management platform 200 can calculate the response duration according to the recorded time when the user terminal 100 initiates the request and the time when the radio device 300 successfully invokes the service program. The response time length is used for reflecting the working performance of a data processing type unit, wherein the data processing type unit can be an audio data processing unit, a frequency spectrum data processing unit, an IQ (I: in-phase; Q: quadrature) data processing unit and the like. The shorter the response time, the better the performance of the data processing type unit.
Correspondingly, as shown in fig. 3, step S210 includes the following sub-steps:
step S2101, for each service program, obtaining a response duration of the service program in multiple calls according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200, obtaining multiple response durations;
step S2105, when all of the response durations exceed the preset response duration corresponding to the service program, or when an average of the response durations exceeds the preset response duration corresponding to the service program, determining that the service program is an abnormal service program.
In this embodiment, since the response duration of the service program in one call is accidental, the response condition of the service program is determined by the response duration of one service program in multiple calls, so that whether the service program is an abnormal service program can be more accurately determined.
In addition, for different service programs, the response time lengths in the normal operating state are different, so in this embodiment, a corresponding preset response time length is set for each service program.
In a second alternative embodiment, the characteristic parameter may comprise a maximum continuous execution time period. The maximum continuous execution time length refers to a maximum time length for which the radio device 300 continuously executes the service procedure within a preset time interval. The preset time interval may be one day, two days or one week, etc., depending on the actual need for equipment maintenance. The all-in-one radio management platform 200 may calculate the longest continuous execution time period according to the recorded time when the radio device 300 starts executing the service program and the recorded time when the radio device finishes executing the service program. The longest continuous execution time is used for reflecting the working performance of a unit of a long-time continuous working type, wherein the unit of the long-time continuous working type may be a Fast Fourier Transform (FFT) unit or the like. The longer the longest continuous execution time period is, the better the operation performance of the unit of the long-time continuous operation type is demonstrated.
Correspondingly, as shown in fig. 4, step S210 includes the following sub-steps:
step S2102, for each service program, obtaining the longest continuous execution duration of the service program according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200;
step S2106, when the longest continuous execution time of the service program is less than the preset longest continuous execution time, determining that the service program is an abnormal service program.
In a third alternative embodiment, the characteristic parameter includes a call success rate or a call failure rate. The call success rate refers to a success rate of the radio management integrated platform 200 calling a service program within a preset time interval, and the call failure rate refers to a failure rate of the radio management integrated platform 200 calling the service program within the preset time interval. The radio management integration platform 200 may calculate the call success rate or the call failure rate according to the recorded total number of times of calling the service program and the number of times of successfully calling the service program by the radio device 300 in the preset time interval. The call success rate and the call failure rate are used for reflecting the working performance of a control type unit, wherein the control type unit can be an intermediate frequency unit, a bandwidth unit and the like. The higher the call success rate (the lower the call failure rate), the better the performance of the control type unit.
Correspondingly, as shown in fig. 5a, step S210 includes the following sub-steps:
step S2103, for each service program, obtaining a call success rate of the service program according to the request information, call information, and execution information of the service program recorded by the radio management integrated platform 200;
step S2107, when the call success rate of the service program is lower than the preset call success rate, determining that the service program is an abnormal service program.
Alternatively, as shown in fig. 5b, step S210 includes the following sub-steps:
step S2104, for each service program, obtaining a call failure rate of the service program according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200;
in step S2108, when the call failure rate of the service program is higher than the preset call failure rate, it is determined that the service program is an abnormal service program.
It is understood that, in other embodiments, the characteristic parameter may further include other information, and step S210 may also determine whether the service program is an abnormal program based on the other information. Correspondingly, step 210 may include any combination of the above three embodiments and other embodiments.
In step S220, for each determined abnormal service program, a unit for executing the abnormal service program is determined.
In this embodiment, the integrated radio management platform 200 may store a mapping table of the service program and the unit executing the service program.
Through the above step S210, at least one abnormal service program may be determined, and for each abnormal service program in the at least one abnormal service program, a unit executing the abnormal service program may be looked up in the mapping relationship table.
In this embodiment, when at least one abnormal service program is determined by the characteristic parameter reflecting the working performance of a certain type of unit, it can only be preliminarily determined that there is a faulty unit in units belonging to the target type among the units executing the at least one service program. In this case, the range of the faulty unit can be further narrowed by the following step S230.
Step S230, searching the same unit belonging to the target type in the determined units for executing each abnormal service program, and determining the searched same unit belonging to the target type as a faulty unit.
For convenience of description, the set of units belonging to the target type in the units executing the abnormal service programs is respectively represented by An(the number n is used to represent different exception servers). For example, service programs 1, 2, 3 are determined, with A1A set representing the units belonging to said target type among the units executing the exception service routine 1, denoted A2A set of units of said target type, denoted A, among the units executing the exception service routine 23Represents a collection of units belonging to the target type among the units executing the exception service routine 3. In A1、A2、A3Is likely to be a failed cell, and the more sets (A) a cell belonging to the target type appears1、A2、A3Any of them), the greater the probability that the unit belonging to the target type will fail. Thus, simultaneously appear at A1、A2、A3Of the target typeThe unit is most likely to fail. In this way, the present embodiment further determines the found same cell belonging to the target type as a faulty cell through step S230.
Through the design, the fault positioning method disclosed by the embodiment of the invention can save unnecessary hardware cost, does not influence daily business of the radio equipment, and also improves the accuracy of fault positioning.
Optionally, in this embodiment, when the characteristic parameter includes the longest continuous execution duration and one of a call success rate and a call failure rate, the method may further include the following steps:
and analyzing the longest continuous execution time of each service program and the change trend of one of the call success rate and the call failure rate of the service program, and evaluating the working performance of the service program according to the analysis result.
Taking the characteristic parameters including the longest continuous execution duration and the call success rate as an example, for each service program, the integrated radio management platform 200 may record the longest continuous execution duration and the call success rate of the service program once a day, obtain the variation trend of the longest continuous execution duration and the call success rate of the service program over multiple consecutive days (e.g., one month), analyze the obtained variation trend, and evaluate the working performance of the service program according to the analysis result. The evaluating the working performance of the service program according to the analysis result comprises the following steps: when the longest continuous execution time of the service program and the call success rate are generally in a downward trend in a plurality of continuous days (such as one month), the service program is evaluated to have poor working performance. The case where the characteristic parameters include the longest continuous execution duration and the call failure rate may be analogized.
Optionally, in this embodiment, when the characteristic parameter includes one of a response duration, a longest continuous execution duration, and a call success rate and a call failure rate, as shown in fig. 6, the method may further include the following steps:
step S240, for each service program, obtaining a response duration of the service program in the latest call as a latest response duration according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200; obtaining the longest continuous execution time length of the service program at the ending current time as the current longest continuous execution time length; and obtaining the call success rate of the service program at the current moment as the current call success rate, or obtaining the call failure rate of the service program at the current moment as the current call failure rate.
In step S240, there is no limitation on the order of obtaining the latest response time, the current longest continuous execution time, and one of the current call success rate and the current call failure rate.
And step S250, calculating the minimum value of the latest response time length of each service program.
Step S260, calculating the minimum value of the current longest continuous execution time of each service program.
And step S270, calculating the minimum value of one of the current calling success rate and the current calling failure rate of each service program.
The steps S250 to S270 may be sequentially executed in sequence, or may be executed simultaneously, and this embodiment does not limit the order of execution of the steps S250 to S270.
In step S280, the current health index of the radio device 300 is calculated according to the calculated minimum value of the latest response time, the calculated minimum value of the current longest continuous execution time, and the calculated minimum value of one of the current call success rate and the current call failure rate.
In detail, step S280 may be implemented by the following calculation:
where H is the current health index of the radio 300, T is the minimum value of the calculated latest response time duration, ms is a time unit representing milliseconds, T is the minimum value of the calculated current longest continuous execution time duration, H is a time unit representing hours, and P is the minimum value of the calculated current call success rate.
In this embodiment, the radio management integration platform 200 may run on the server 10 shown in fig. 7, where the server 10 may be an independent server, or may be a server cluster deployed in a centralized manner or in a distributed manner.
As shown in fig. 7, the server 10 includes a radio management integration platform 200, a processor 11, and a machine-readable storage medium 12.
The elements of the processor 11 and the machine-readable storage medium 12 are electrically connected to each other directly or indirectly to realize data transmission or interaction. The radio management integration platform 200 includes at least one software functional module, which may be stored in a machine readable storage medium 12 in the form of software or firmware (firmware) or solidified in an Operating System (OS) of the server 10. The processor 11 is used for executing executable modules stored in the machine-readable storage medium 12, such as software functional modules and computer programs included in the radio management integration platform 200.
The machine-readable storage medium 12 may be, but is not limited to, a Random Access Memory (RAM), a Read Only Memory (ROM), a Programmable Read-Only Memory (PROM), an Erasable Read-Only Memory (EPROM), an electrically Erasable Read-Only Memory (EEPROM), and the like.
The processor 11 may be an integrated circuit chip having signal processing capabilities. The processor 11 may also be a general-purpose processor, such as a Central Processing Unit (CPU), a Network Processor (NP), a microprocessor, etc.; but may also be a Digital Signal Processor (DSP)), an Application Specific Integrated Circuit (ASIC), a Field Programmable Gate Array (FPGA) or other programmable logic device, discrete gate or transistor logic, discrete hardware components; the processor 11 may also be any conventional processor that may implement or perform the methods, steps and logic blocks disclosed in the embodiments of the present invention.
It should be understood that the configuration shown in fig. 7 is merely illustrative and that server 10 may have more or fewer components than shown in fig. 7 or a configuration that is completely different than that shown in fig. 7. Further, the components shown in FIG. 7 may be implemented in software, hardware, or a combination thereof.
Fig. 8 is a block diagram of a fault location apparatus 400 according to an embodiment of the present invention. The fault location device 400 is applied to the radio management integrated platform 200 shown in fig. 1. The fault location device 400 may be a sub-device of the integrated radio management platform 200.
As shown in fig. 8, the fault location apparatus 400 includes an abnormal service routine determination module 410, an execution unit determination module 420, and a fault location module 430.
The abnormal service program determining module 410 is configured to, for each service program, obtain a characteristic parameter reflecting the working performance of a unit of a target type according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200, compare the characteristic parameter of the service program with a preset value, and determine whether the service program is an abnormal service program according to a comparison result;
in this embodiment, the description about the abnormal service routine determining module 410 may refer to the detailed description about step S210 shown in fig. 2, that is, step S210 may be performed by the abnormal service routine determining module 410.
The execution unit determination module 420 is configured to determine, for each determined exception service, a unit to execute the exception service.
In this embodiment, the description about the execution unit determination module 420 may refer to the detailed description of step S220 shown in fig. 2, that is, step S220 may be executed by the execution unit determination module 420.
The fault location module 430 is configured to search for the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determine the same unit belonging to the target type as a fault unit.
In this embodiment, reference may be made to the detailed description of step S230 shown in fig. 2 for the description of the fault location module 430, that is, step S230 may be executed by the fault location module 430.
Optionally, the characteristic parameter may include a response time length reflecting an operation performance of the data processing type unit.
Correspondingly, the exception service routine determining module 410 includes a response duration obtaining sub-module and a first determining sub-module.
The response duration obtaining sub-module is configured to, for each service program, obtain response durations of the service program in multiple calls from the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200, so as to obtain multiple response durations.
In this embodiment, reference may be made to the detailed description of step S2101 shown in fig. 3 for the description of the response time length obtaining sub-module, that is, step S2101 may be executed by the response time length obtaining sub-module.
The first determining submodule is used for determining that the service program is an abnormal service program when the response time lengths all exceed the preset response time length corresponding to the service program or when the average value of the response time lengths exceeds the preset response time length corresponding to the service program.
In this embodiment, the description about the first determining sub-module may refer to the detailed description about step S2105 shown in fig. 3, that is, step S2105 may be performed by the first determining sub-module.
Alternatively, the characteristic parameter may include the maximum continuous execution time period of the operation performance of the unit reflecting the long-time continuous operation type.
Correspondingly, the exception service routine determining module 410 includes a longest continuous execution time obtaining sub-module and a second determining sub-module.
The longest continuous execution time obtaining sub-module is configured to, for each service program, obtain the longest continuous execution time of the service program from the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200;
in this embodiment, the detailed description of step S2102 shown in fig. 4 may be referred to for the description of the longest continuous execution time period acquisition sub-module, that is, step S2102 may be executed by the longest continuous execution time period acquisition sub-module.
The second determining submodule is used for determining that the service program is an abnormal service program when the longest continuous execution time length of the service program is less than the preset longest continuous execution time length.
In this embodiment, the detailed description of step S2106 shown in fig. 4 may be referred to for the description of the second determining sub-module, that is, step S2106 may be executed by the second determining sub-module.
Optionally, the characteristic parameter may include a call success rate or a call failure rate reflecting the operation performance of the control type unit.
Correspondingly, in one case, the exception handler determining module 410 may include a call success rate obtaining sub-module and a third determining sub-module.
The call success rate obtaining sub-module is configured to, for each service program, obtain a call success rate of the service program according to the request information, the call information, and the execution information of the service program recorded by the radio management integrated platform 200;
in this embodiment, reference may be made to the detailed description of step S2103 shown in fig. 5a for the description of the call success rate obtaining sub-module, that is, step S2103 may be performed by the call success rate obtaining sub-module.
And the third determining submodule is used for determining the service program as an abnormal service program when the calling success rate of the service program is lower than the preset calling success rate.
In this embodiment, the detailed description of step S2107 shown in fig. 5a may be referred to for the description of the third determining sub-module, that is, step S2107 may be executed by the third determining sub-module.
In another case, the exception handler determining module 410 may include a call failure rate obtaining sub-module and a fourth determining sub-module.
The call failure rate obtaining sub-module is configured to, for each service program, obtain a call failure rate of the service program according to the request information, the call information, and the execution information of the service program, which are recorded by the radio management integrated platform 200;
in this embodiment, reference may be made to the detailed description of step S2104 shown in fig. 5b for the description of the call failure rate obtaining sub-module, that is, step S2104 may be executed by the call failure rate obtaining sub-module.
And the fourth determining submodule is used for determining that the service program is an abnormal service program when the call failure rate of the service program is higher than the preset call failure rate.
In this embodiment, the description about the fourth determining sub-module may refer to the detailed description about step S2108 shown in fig. 5b, that is, step S2108 may be executed by the fourth determining sub-module.
In summary, an embodiment of the present invention provides a fault location method and an apparatus, which are applied to a radio management integrated platform, and the method includes: for each service program, acquiring characteristic parameters reflecting the working performance of a unit of a target type from request information, calling information and execution information of the service program, comparing the characteristic parameters with preset values, and determining whether the service program is an abnormal service program; for each abnormal service program, determining a unit for executing the abnormal service program; and searching the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determining the searched same unit belonging to the target type as a fault unit. Therefore, the function test is carried out without establishing connection with the radio equipment by special detection equipment, so that the hardware cost is saved, and the daily service of the radio equipment is not influenced. In addition, the accuracy of fault location is improved.
In the embodiments provided in the present invention, it should be understood that the disclosed apparatus and method can be implemented in other ways. The apparatus and method embodiments described above are illustrative only, as the flowcharts and block diagrams in the figures illustrate the architecture, functionality, and operation of possible implementations of apparatus, methods and computer program products according to various embodiments of the present invention. In this regard, each block in the flowchart or block diagrams may represent a module, segment, or portion of code, which comprises one or more executable instructions for implementing the specified logical function(s). It should also be noted that, in some alternative implementations, the functions noted in the block may occur out of the order noted in the figures. For example, two blocks shown in succession may, in fact, be executed substantially concurrently, or the blocks may sometimes be executed in the reverse order, depending upon the functionality involved. It will also be noted that each block of the block diagrams and/or flowchart illustration, and combinations of blocks in the block diagrams and/or flowchart illustration, can be implemented by special purpose hardware-based systems which perform the specified functions or acts, or combinations of special purpose hardware and computer instructions.
In addition, the functional modules in the embodiments of the present invention may be integrated together to form an independent part, or each module may exist separately, or two or more modules may be integrated to form an independent part.
The functions, if implemented in the form of software functional modules and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, an electronic device, or a network device) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes. It should be noted that, in this document, the terms "comprises," "comprising," or any other variation thereof, are intended to cover a non-exclusive inclusion, such that a process, method, article, or apparatus that comprises a list of elements does not include only those elements but may include other elements not expressly listed or inherent to such process, method, article, or apparatus. Without further limitation, an element defined by the phrase "comprising an … …" does not exclude the presence of other identical elements in a process, method, article, or apparatus that comprises the element. The terms "first," "second," "third," and the like are used solely to distinguish one from another and are not to be construed as indicating or implying relative importance.
The foregoing description is of selected embodiments of the present invention only, and is not intended to limit the present invention, which may be modified and varied by those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.
Claims (10)
1. A fault location method is applied to a radio management integration platform which is in communication connection with a user terminal and a radio device, wherein the radio device comprises a plurality of service programs and a plurality of types of units, and each service program is executed by at least one type of unit; the radio management integrated platform is used for responding to a request of the user terminal for a service program, calling the service program in the radio equipment, and recording request information, calling information and execution information of the service program; the method comprises the following steps:
for each service program, obtaining characteristic parameters reflecting the working performance of a unit of a target type according to the request information, the calling information and the execution information of the service program recorded by the radio management integrated platform, comparing the characteristic parameters of the service program with preset values, and determining whether the service program is an abnormal service program according to the comparison result;
for each determined exception service routine, determining a unit for executing the exception service routine;
and searching the same unit belonging to the target type in the determined units for executing the abnormal service programs, and determining the searched same unit belonging to the target type as a fault unit.
2. The fault localization method according to claim 1, wherein the characteristic parameter comprises a response time length reflecting an operation performance of a data processing type unit;
the step of obtaining, for each service program, a characteristic parameter reflecting the working performance of a unit of a target type according to request information, call information, and execution information of the service program recorded by the radio management integrated platform, comparing the characteristic parameter of the service program with a preset value, and determining whether the service program is an abnormal service program according to a comparison result includes:
for each service program, obtaining response time of the service program in multiple calls according to the request information, the calling information and the execution information of the service program recorded by the radio management integrated platform, and obtaining multiple response times;
and when the response time lengths all exceed the preset response time length corresponding to the service program, or when the average value of the response time lengths exceeds the preset response time length corresponding to the service program, determining that the service program is an abnormal service program.
3. The fault localization method according to claim 2, wherein the characteristic parameter includes a maximum continuous execution time period reflecting an operation performance of a unit of a long-duration operation type;
the step of obtaining, for each service program, a characteristic parameter reflecting the working performance of a unit of a target type according to request information, call information, and execution information of the service program recorded by the radio management integrated platform, comparing the characteristic parameter of the service program with a preset value, and determining whether the service program is an abnormal service program according to a comparison result includes:
for each service program, obtaining the longest continuous execution duration of the service program according to the request information, the calling information and the execution information of the service program recorded by the radio management integrated platform;
and when the longest continuous execution time length of the service program is less than the preset longest continuous execution time length, determining that the service program is an abnormal service program.
4. The fault location method according to claim 3, wherein the characteristic parameter includes a call success rate or a call failure rate reflecting the working performance of the control type unit;
the step of obtaining, for each service program, a characteristic parameter reflecting the working performance of a unit of a target type according to request information, call information, and execution information of the service program recorded by the radio management integrated platform, comparing the characteristic parameter of the service program with a preset value, and determining whether the service program is an abnormal service program according to a comparison result includes:
for each service program, obtaining the calling success rate of the service program according to the request information, calling information and execution information of the service program recorded by the radio management integrated platform, and determining the service program as an abnormal service program when the calling success rate of the service program is lower than a preset calling success rate; or the like, or, alternatively,
and for each service program, obtaining the call failure rate of the service program according to the request information, the call information and the execution information of the service program recorded by the integrated radio management platform, and determining the service program as an abnormal service program when the call failure rate of the service program is higher than a preset call failure rate.
5. The method of fault location according to claim 4, further comprising:
and analyzing the change trend of one of the call success rate and the call failure rate of each service program and the longest continuous execution time for each service program, and evaluating the working performance of the service program according to the analysis result.
6. The method of fault location according to claim 4, further comprising:
for each service program, obtaining the response time length of the service program in the latest call as the latest response time length according to the request information, the call information and the execution information of the service program recorded by the radio management integrated platform; obtaining the longest continuous execution time length of the service program at the ending current time as the current longest continuous execution time length; obtaining the call success rate of the service program at the current moment as the current call success rate, or obtaining the call failure rate of the service program at the current moment as the current call failure rate;
calculating the minimum value of the latest response time length of each service program;
calculating the minimum value of the current longest continuous execution time of each service program;
calculating the minimum value of one of the current calling success rate and the current calling failure rate of each service program;
and calculating the current health index of the radio equipment according to the calculated minimum value of the latest response time length, the minimum value of the current longest continuous execution time length and the minimum value of one of the current call success rate and the current call failure rate.
7. The method of claim 6, wherein the current health index of the radio device is calculated by the following calculation:
wherein H is the current health index of the radio device, T is the minimum value of the calculated latest response time, T is the minimum value of the calculated current longest continuous execution time, P is the minimum value of the calculated current call success rate, and ms and H are time units.
8. A fault location device is applied to a radio management integration platform in communication connection with a radio device, wherein the radio device comprises a plurality of service programs and a plurality of types of units, and each service program is executed by at least one type of unit; the radio management integrated platform is used for responding to a request of a user terminal for a service program, calling the service program in the radio equipment, and recording request information, calling information and execution information of the service program; the device comprises:
an abnormal service program determining module, configured to, for each service program, obtain a characteristic parameter reflecting the working performance of a unit of a target type according to request information, call information, and execution information of the service program recorded by the radio management integrated platform, compare the characteristic parameter of the service program with a preset value, and determine whether the service program is an abnormal service program according to a comparison result;
an execution unit determination module, configured to determine, for each determined abnormal service program, a unit for executing the abnormal service program;
and the fault positioning module is used for searching the same unit belonging to the target type in the determined units for executing the abnormal service programs and determining the searched same unit belonging to the target type as a fault unit.
9. The fault localization arrangement according to claim 8, characterized in that the characteristic parameter comprises a response time length reflecting an operating performance of the data processing type unit;
the exception service routine determination module comprises:
the response duration acquisition submodule is used for acquiring the response duration of each service program in multiple calls according to the request information, the call information and the execution information of the service program recorded by the radio management integrated platform, and acquiring multiple response durations;
the first determining submodule is used for determining that the service program is an abnormal service program when the response time lengths all exceed the preset response time length corresponding to the service program or when the average value of the response time lengths exceeds the preset response time length corresponding to the service program.
10. The fault locating device according to claim 8, wherein the characteristic parameter includes a maximum continuous execution time of the operation performance of the unit reflecting the long-time continuous operation type;
the exception service routine determination module comprises:
the longest continuous execution time length obtaining submodule is used for obtaining the longest continuous execution time length of each service program according to the request information, the calling information and the execution information of the service program recorded by the radio management integrated platform;
and the second determining submodule is used for determining the service program as an abnormal service program when the longest continuous execution time length of the service program is less than the preset longest continuous execution time length.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811184582.8A CN109391524B (en) | 2018-10-11 | 2018-10-11 | Fault positioning method and device |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201811184582.8A CN109391524B (en) | 2018-10-11 | 2018-10-11 | Fault positioning method and device |
Publications (2)
Publication Number | Publication Date |
---|---|
CN109391524A CN109391524A (en) | 2019-02-26 |
CN109391524B true CN109391524B (en) | 2020-10-20 |
Family
ID=65427510
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201811184582.8A Expired - Fee Related CN109391524B (en) | 2018-10-11 | 2018-10-11 | Fault positioning method and device |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109391524B (en) |
Families Citing this family (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN111488289B (en) * | 2020-04-26 | 2024-01-23 | 支付宝实验室(新加坡)有限公司 | Fault positioning method, device and equipment |
CN113392893B (en) * | 2021-06-08 | 2024-08-13 | 北京达佳互联信息技术有限公司 | Method, device, storage medium and computer program product for locating business fault |
Family Cites Families (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US5570373A (en) * | 1995-03-20 | 1996-10-29 | Lucent Technologies Inc. | Method and apparatus for testing a radio in a base station without using a radio test unit |
JP4562713B2 (en) * | 2006-10-05 | 2010-10-13 | ルネサスエレクトロニクス株式会社 | Fault location estimation system for multiple faults in logic circuit, fault location estimation method, and fault location estimation program |
JP4967088B2 (en) * | 2008-03-24 | 2012-07-04 | エヌイーシーコンピュータテクノ株式会社 | Information processing apparatus having failure analysis function, failure analysis method, and failure analysis program |
JP2012038368A (en) * | 2010-08-04 | 2012-02-23 | Toshiba Corp | Failure analysis device and failure analysis method |
CN105095052B (en) * | 2014-05-22 | 2018-08-31 | 阿里巴巴集团控股有限公司 | Fault detection method under SOA environment and device |
JP6365233B2 (en) * | 2014-10-24 | 2018-08-01 | 富士ゼロックス株式会社 | Failure prediction device, failure prediction system, and program |
CN106411615A (en) * | 2016-11-22 | 2017-02-15 | 北京奇虎科技有限公司 | Device used for cloud remediation of system application and method |
-
2018
- 2018-10-11 CN CN201811184582.8A patent/CN109391524B/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CN109391524A (en) | 2019-02-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN110502494B (en) | Log processing method and device, computer equipment and storage medium | |
KR20080055744A (en) | A telecommuncations-based link monitoring system | |
CN109391524B (en) | Fault positioning method and device | |
CN112783725A (en) | Index acquisition method and device | |
CN110297746A (en) | A kind of data processing method and system | |
CN113596078B (en) | Service problem positioning method and device | |
CN111970168A (en) | Method and device for monitoring full-link service node and storage medium | |
CN109271453B (en) | Method and device for determining database capacity | |
CN108270753B (en) | Method and device for logging out user account | |
CN109710552B (en) | Bus transmission quality evaluation method, system and computer storage medium | |
CN114091910A (en) | 5G user quality difference complaint source tracing analysis method and device | |
CN113220522A (en) | Automatic positioning method and device for service abnormity | |
CN112395155A (en) | Service monitoring method and device, storage medium and electronic device | |
US20230344933A1 (en) | Systems and methods for use in blocking of robocall and scam call phone numbers | |
CN112328463A (en) | Log monitoring method and device | |
CN109245943B (en) | Fault positioning method and device | |
CN109598525B (en) | Data processing method and device | |
CN110674171A (en) | Script generation method and device and data processing method and device | |
CN116662127A (en) | Method, system, equipment and medium for classifying and early warning equipment alarm information | |
CN116566873A (en) | ELK-based automatic log analysis method, system and storage medium | |
CN112261139B (en) | Service data acquisition method and device and electronic equipment | |
CN107678905B (en) | Monitoring method and device | |
CN108810299B (en) | Information analysis method, medium and equipment | |
CN110597572B (en) | Service call relation analysis method and computer system | |
CN113377627B (en) | Business server abnormality detection method, system, equipment and storage medium |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant | ||
CF01 | Termination of patent right due to non-payment of annual fee | ||
CF01 | Termination of patent right due to non-payment of annual fee |
Granted publication date: 20201020 Termination date: 20211011 |