Nothing Special   »   [go: up one dir, main page]

WO2019109771A1 - Power artificial-intelligence visual-analysis system on basis of multi-core heterogeneous parallel computing - Google Patents

Power artificial-intelligence visual-analysis system on basis of multi-core heterogeneous parallel computing Download PDF

Info

Publication number
WO2019109771A1
WO2019109771A1 PCT/CN2018/114389 CN2018114389W WO2019109771A1 WO 2019109771 A1 WO2019109771 A1 WO 2019109771A1 CN 2018114389 W CN2018114389 W CN 2018114389W WO 2019109771 A1 WO2019109771 A1 WO 2019109771A1
Authority
WO
WIPO (PCT)
Prior art keywords
layer
image
detection
module
parallel computing
Prior art date
Application number
PCT/CN2018/114389
Other languages
French (fr)
Chinese (zh)
Inventor
罗旺
吴超
冯敏
郝小龙
崔漾
樊强
彭启伟
赵高峰
夏源
张佩
余磊
Original Assignee
南京南瑞信息通信科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 南京南瑞信息通信科技有限公司 filed Critical 南京南瑞信息通信科技有限公司
Publication of WO2019109771A1 publication Critical patent/WO2019109771A1/en

Links

Images

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/94Hardware or software architectures specially adapted for image or video understanding
    • G06V10/955Hardware or software architectures specially adapted for image or video understanding using specific electronic processors
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06NCOMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
    • G06N3/00Computing arrangements based on biological models
    • G06N3/02Neural networks
    • G06N3/04Architecture, e.g. interconnection topology
    • G06N3/045Combinations of networks
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V10/00Arrangements for image or video recognition or understanding
    • G06V10/94Hardware or software architectures specially adapted for image or video understanding

Definitions

  • the present application relates to the field of signal and information processing, and in particular to a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing.
  • Artificial intelligence and deep learning are advanced scientific and technical methods in the field of automation. They are widely used in image processing and natural language recognition. At present, artificial intelligence analysis methods have been applied to many industries, including intelligent transportation, smart medical, smart home, automatic driving, intelligent hardware and so on.
  • various deep convolutional neural networks such as Lenet neural network, Alexnet neural network, VGG neural network, Resnet), Xnception neural network, etc. have emerged in an endless stream, and are widely used in computer vision fields such as image recognition and target detection.
  • front-end equipment in substation and transmission lines will collect a large amount of image and video data every day. Relevant business departments have urgent needs for image and video data analysis and identification. More mature deep learning images have appeared at home and abroad. Identification technology.
  • Heterogeneous parallel computing enables different types of computing devices to share the computational processes and results while continuously optimizing and accelerating the computational process for higher computational efficiency.
  • Heterogeneous parallel computing is developing rapidly at home and abroad.
  • the Central Processing Unit (CPU) combined with the GPU (Graphic Processing Unit) heterogeneous computing framework has become a research hotspot in recent years.
  • the GPU Graphic Processing Unit
  • the embodiment of the present application is intended to provide a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing, which can efficiently implement online or offline image data training, and form a lightweight and fast image classification model, which can realize the application of power internal and external network image services.
  • An embodiment of the present application provides a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing, the system comprising a multi-core heterogeneous parallel computing module and a business application module; the service application module and the multi-core heterogeneous parallel computing Data is transmitted between modules through a network service interface;
  • the multi-core heterogeneous parallel computing module includes a GPU computing node, a CPU storage management node, and a CPU computing node, and each node is connected through a switch; the GPU computing node is configured to perform model training to complete a first type of computing task; The CPU is configured as a data storage; the CPU computing node is configured to perform a second type of computing task, and assists the GPU computing node to perform a portion of the first type of computing task;
  • the business application module includes an image management module, an image annotation module, a model training module, and an algorithm application module; the image management module is configured to process an image service; and the image annotation module is configured to provide training data for a lightweight neural network model Set labeling information; the model training module configured to train a lightweight neural network model based on the multi-core heterogeneous parallel computing module; the algorithm application module surface configured to utilize the multi-core heterogeneous parallel computing module
  • the lightweight neural network model performs image analysis tasks.
  • the image management module communicates with other service platforms of the power industry through a network service interface, and is configured to send the processed target image and video data and related information to other service platforms of the power industry; Obtaining original image and video data and related information thereof; the other business platforms of the power industry include at least one of the following: a unified video monitoring platform, and a network for inspection, scheduling, marketing, and infrastructure power information.
  • the association information includes attributes of image and video data, including at least one of the following: a view category, a device tree, a scene tree, a label tree, a defect tree, a professional type, and a file source;
  • the view category includes an image category and a video data category; the device tree characterizes a front end device address of the captured image or video data; the scene tree characterizes a power scene of the captured image or video data; the tag tree characterizes the captured image or video The specific content of the data; the defect tree characterizing a defect in the captured image or video data; the professional type characterizing a power professional name of the captured image or video data; the file source characterizing means for capturing the image or video data.
  • the CPU stores a management node embedded relational database and a non-relational database; the relational database is configured to store an association relationship between image and video data, and the non-relational database is configured to store an image and Video data.
  • the lightweight neural network model comprises: a convolution layer, a pooling layer, and a fully connected layer;
  • the convolution layer is located at a front end of the model, and the pooling layer and the fully connected layer are located at a back end of the model;
  • the lightweight neural network model uses a 1 ⁇ 1 convolution kernel and a 1 ⁇ 3, 3 ⁇ 1 asymmetric convolution kernel.
  • the lightweight neural network model comprises:
  • the first layer is a convolution layer, the step size is 2, the input size is 224 ⁇ 224 ⁇ 3, using two concatenated convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the second layer is a convolution layer, the step size is 1, the input size is 112 ⁇ 112 ⁇ 32, using two concatenated convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the third layer is a convolution layer with a step size of 1, and the input size is 112 ⁇ 112 ⁇ 32, and a 1 ⁇ 1 convolution kernel is used;
  • the fourth layer is a convolution layer, the step size is 2, the input size is 112 ⁇ 112 ⁇ 64, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the fifth layer is a convolution layer, the step size is 1, the input size is 56 ⁇ 56 ⁇ 64, using two cascaded convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the sixth layer is a convolution layer with a step size of 1, and the input size is 56 ⁇ 56 ⁇ 128, and a 1 ⁇ 1 convolution kernel is used;
  • the seventh layer is a convolution layer, the step size is 2, the input size is 56 ⁇ 56 ⁇ 128, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the eighth layer is a convolution layer, the step size is 1, the input size is 28 ⁇ 28 ⁇ 128, using two cascaded convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the ninth layer is a convolution layer with a step size of 1, and the input size is 28 ⁇ 28 ⁇ 256, using a 1 ⁇ 1 convolution kernel;
  • the 10th layer is a convolutional layer with a step size of 2, the input size is 28 ⁇ 28 ⁇ 256, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 11th layer is a convolutional layer, the step size is 1, the input size is 14 ⁇ 14 ⁇ 256, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the 12th layer is a convolution layer with a step size of 1, and the input size is 14 ⁇ 14 ⁇ 512, using a 1 ⁇ 1 convolution kernel;
  • the 13th layer is a convolutional layer, the step size is 2, the input size is 14 ⁇ 14 ⁇ 512, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the 14th layer is a convolutional layer with a step size of 1, the input size is 7 ⁇ 7 ⁇ 512, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 15th layer is a convolutional layer with a step size of 1, and the input size is 7 ⁇ 7 ⁇ 1024, using a 1 ⁇ 1 convolution kernel;
  • the 16th layer is a convolutional layer with a step size of 2, the input size is 7 ⁇ 7 ⁇ 1024, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 17th layer is a convolutional layer, the step size is 1, the input size is 7 ⁇ 7 ⁇ 1024, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 18th layer is an average pooling layer with a step size of 1, the input size is 7 ⁇ 7 ⁇ 1024, and the pooling size is 7 ⁇ 7;
  • the 19th layer is a fully connected layer, and the input size is 1 ⁇ 1 ⁇ 1024, including 1000 neurons;
  • the 20th layer is the loss function layer, which can be adapted, using the softmax function as the loss function with single label classification, and the cross entropy function as the multi-label classification.
  • the image services are classified according to the following levels:
  • the first-level image service includes image common function tasks and image business application tasks
  • the secondary image service includes image deduplication, low-quality image rejection, video transcoding, video compression, fast browsing, and image service application tasks to the inspection task, scheduling-oriented tasks, and infrastructure-oriented Task, marketing-oriented tasks;
  • the three-level image service includes work vehicle detection, wire foreign object detection, tree and bamboo growth detection, wire icing detection, pyrotechnic detection, fitting corrosion detection, fitting loss detection, insulator crack detection, insulator loss detection, and insulator under the task of inspection.
  • Pollution flash detection substation meter digital identification, transformer oil sump oil leak detection, substation personnel abnormal behavior detection, substation personnel dress code detection, substation personnel access detection, pyrotechnic detection; also includes transformer switch state recognition and task-oriented tasks Isolation switch status recognition; also includes import and export vehicle detection, import and export license plate identification, import and export personnel detection, import and export personnel abnormal behavior detection, personnel dressing specification detection, open flame detection under the task of infrastructure construction; The quality of the business environment, the inspection of the service personnel to the post, the quality inspection of the service personnel, the quality of the service personnel, the customer behavior analysis and the abnormal identification.
  • the image annotation includes two types, one type is automatic labeling by the system, and the system automatically labels the image into the lightweight neural network model, and uses the output classification result as the annotation information; One type is the user's icon note.
  • the model training module uses the image and video data labeled by the image as a data set, and the machine learning task for image classification, target detection, and image segmentation, and performs model training through a multi-core heterogeneous parallel computing module;
  • the model training module supports two modes: online training and offline training; the offline training refers to performing model training tasks after one-time input of data; and the online training refers to inputting new models in the execution of model training tasks after the model training tasks are started.
  • the data is referred to performing model training tasks after one-time input of data.
  • the algorithm application module is configured to perform image analysis by the lightweight neural network model obtained by the model training module and/or other mature models built in the GPU computing node.
  • the embodiment of the present application adopts a multi-core heterogeneous parallel computing framework suitable for the system, wherein each node has clear division of labor, clear logic, and reasonable interaction, and the GPU computing node undertakes model training and intelligent task execution functions, completes intensive calculation;
  • CPU storage management The node undertakes the data storage function, embeds a relational database and a non-relational database, and the CPU compute node assumes the scientific computing function, and assists the GPU computing node to complete part of the intensive computing function;
  • the embodiment of the present application proposes a lightweight neural network model as the algorithm core of the system.
  • the network model can include 20 layers, adopting a small convolution kernel and an asymmetric convolution kernel structure, a residual structure, a waiver mechanism,
  • the batch standardization mechanism can effectively improve the classification accuracy rate, accelerate the convergence, enhance the network generalization ability, and maintain the accuracy rate while improving the training speed and algorithm execution speed.
  • the network provides normalization and enhancement functions for input data. Effectively augment the data set; the lightweight fast image classification model is based on existing power data, the parameters are trained based on the power data set, and are not based on a common data set, which is innovative and practical;
  • the system of the embodiment of the present application has scalability and universality, supports data transmission with other service platforms in the power industry, supports parallel computing of multi-core heterogeneous GPU nodes and CPU nodes, and has a scalable algorithm network model. Multiple power analysis scenarios for power scenarios, equipment, personnel, and event categories.
  • the system of the embodiment of the present application is based on a multi-core heterogeneous parallel computing framework, and can efficiently implement online or offline image data training to form a lightweight and fast image classification model, which can realize image application of power internal and external network images, and can maximize image data.
  • the value has a good application prospect.
  • FIG. 1 is a block diagram of a system logic architecture of an embodiment of the present application.
  • an embodiment of the present application provides a power artificial intelligence visual analysis system based on a multi-core heterogeneous parallel computing framework, where the system includes a multi-core heterogeneous parallel computing module and a business application module;
  • the multi-core heterogeneous parallel computing module includes a GPU computing node, a CPU storage management node, and a CPU computing node, and each node passes through a switch (for example, a switch of infiniband (a "conversion cable” technology supporting multiple concurrent links)) Connecting; the GPU computing node is configured to perform model training to complete the first type of computing task; the CPU storage management node is configured as data storage; the CPU computing node is configured to perform the second type of computing task, and assisting the GPU computing node to execute the portion The first type of computing task.
  • a switch for example, a switch of infiniband (a "conversion cable” technology supporting multiple concurrent links) Connecting
  • the GPU computing node is configured to perform model training to complete the first type of computing task
  • the CPU storage management node is configured as data storage
  • the CPU computing node is configured to perform the second type of computing task, and assisting the GPU computing node to execute the portion The first type of computing task.
  • the business application module includes an image management module, an image annotation module, a model training module, and an algorithm application module; the image management module is configured to manage the power internal and external network image services; and the image annotation module is configured to provide the training data set annotation for the lightweight neural network model.
  • Information the model training module is configured to train a lightweight neural network model based on the multi-core heterogeneous parallel computing module; the algorithm application module is directed to the power internal and external network image service, configured to utilize the light on the multi-core heterogeneous parallel computing module
  • the magnitude neural network model performs intelligent analysis tasks.
  • the service application module and the multi-core heterogeneous parallel computing module transmit data through a web service interface.
  • the image management module communicates with other service platforms of the power industry (for example, a unified video monitoring platform and a power information platform such as a physical inspection, scheduling, marketing, infrastructure, etc.) through a web service interface, and sends the information to other service platforms of the power industry.
  • the target image and the video data and the associated information processed by the system is further configured to acquire the original image and the video data and their associated relationship; on the one hand, the image management module may be from other service platforms of the power industry Obtaining original image and video data and related information thereof as a data source for subsequent analysis; on the other hand, the image management module can also receive locally uploaded image and video data and related information thereof as an effective supplement for subsequent analysis data sources. .
  • the CPU of the embodiment of the present application stores a management node embedded relational database and a non-relational database.
  • the image and video data are unstructured data
  • the associated information of the image and video data is structured data
  • the associated information includes attributes of the image and video data.
  • the unstructured data mentioned in this embodiment is equivalent to image and video data
  • the structured data is equivalent to the associated information of the image and video data.
  • the relational database is configured to store structured data, ie, an association relationship between stored images and video data, that is, attributes for storing image and video data
  • a non-relational database is configured to store unstructured data, that is, image and video data.
  • the image and video data of this embodiment are different in size, ranging from tens of kilobytes (KB) to hundreds of megabytes (MB), and a database supporting a storage unit size spanning a large interval is required.
  • a database supporting a storage unit size spanning a large interval is required.
  • an HBase database a distributed column storage system built on HDFS
  • the relational database can use the mySQL database (a relational database management system).
  • the associated information of the same image or video data is associated by a unique ID.
  • the GPU computing node of the embodiment of the present application includes grouped GPUs, each group containing 2 GPUs to complete parallel computing.
  • the first type of computing task is performed by using the GPU in the group
  • the first type of settlement task is, for example, model parallel computing and data parallel computing between groups
  • the two GPUs in the group each hold a lightweight neural network model.
  • Half of the parameters collaboratively complete the training of a single model.
  • the CPU compute node of the embodiment of the present application includes grouped CPUs, each group containing 2 CPUs to complete parallel computing.
  • two sets of CPU computing nodes are used to complete the second type of computing tasks, and the second type of computing tasks are scientific tasks, for example, including a retrieval task, a search task, etc., and the other two CPUs are effective supplements of the GPU computing nodes, and are used for
  • the first type of computing task is completed, and the in-group CPU is used for model parallel computing and the data is parallelized between groups.
  • the two CPUs in the group each hold half of the parameters of the lightweight neural network model, and cooperate to complete the training of the single model.
  • multi-core in this embodiment is mainly embodied in the number of physical nodes in the computing framework is extensible, "heterogeneous” is mainly reflected in the system node is composed of two categories of GPU and CPU, "parallel” is mainly reflected in the data Parallel computing, model parallel computing, and task parallel computing.
  • data parallel computing and model parallel computing can achieve perfect and efficient data transmission.
  • group 1 contains nodes GPU1 and GPU2
  • group 2 contains nodes.
  • Model parallel computing and data parallel computing of lightweight neural networks are based on grouping. GPUs in the group do model parallel computing, and GPUs in the group do data parallel computing.
  • Each node in the group holds half of the parameters of the lightweight neural network model, and cooperates to complete the training of a single model, that is, GPU1 and GPU2 hold half of the parameters of the model, and GPU3 and GPU4 hold the other half of the model.
  • Model parallel computing Parallel computing of data between groups is trained by synchronous random gradient descent, and topology is used to complete parameter exchange. That is, GPU1 and GPU2 in group 1 and GPU3 and GPU4 in group 2 complete data exchange. This process is called data parallel computing.
  • the training data is read from the disk, and the training data is preprocessed.
  • the lightweight neural network training respectively occupies the disk, CPU, and GPU resources, and both take a long time. Therefore, the pipeline is introduced, so that disk, CPU, and GPU resources can be utilized at the same time to improve overall performance.
  • Parallel processes that perform tasks are parallel to the parallel process of model training.
  • Parallel computing of tasks means that at the task management level, multiple tasks can be scheduled and executed concurrently, and each computing node is embodied as data parallel computing and model parallel computing.
  • the image data in the power field has its own professionalism, and the information categories such as equipment, personnel, and scenes usually do not exist in the common large data sets such as ImageNet and Pascal VOC.
  • the image service in the power field also has its own professionalism, involving multiple categories of image classification, multi-label image classification, target detection, etc., and has high requirements for real-time system.
  • this embodiment proposes a lightweight neural network model, which can adapt to a variety of algorithms required by the business, while the model calculation amount is moderate, the response speed is fast, the precision is high, and there is a strong Practical value.
  • the lightweight neural network model of the embodiment of the present application includes: a convolution layer, a pooling layer, and a fully connected layer;
  • the model comprises 20 layers, wherein the convolution layer has 17 layers, the pool layer has 1 layer, and the full connection layer has 1 layer, and the convolution layer is located at the front end of the model, the pool layer and the whole The connection layer is at the back end of the model.
  • the network structure is deep but the number of layers is moderate, which can effectively improve the classification accuracy rate;
  • the convolution kernel used in the lightweight neural network model is a 1 ⁇ 1 convolution kernel and a 1 ⁇ 3, 3 ⁇ 1 asymmetric convolution kernel, where 1 ⁇ 3, 3 ⁇ 1 asymmetric volumes
  • the product core can play the equivalent effect of the 3 ⁇ 3 convolution kernel, but the parameters are greatly reduced, and the network nonlinearity is increased while maintaining the network depth, reducing the amount of calculation and the number of parameters.
  • the lightweight neural network model has a residual structure, accelerates convergence while maintaining the depth of the network, and effectively avoids the gradient disappearance of the deep neural network.
  • the lightweight neural network model has optional configuration parameters of abandonment and batch standardization, and introduces a regularization mechanism to speed up the training while reducing over-fitting and enhancing network generalization ability.
  • the lightweight neural network model provides normalization and enhancement functions for input data, normalizing all input images to pixel values of 224 ⁇ 224, and providing image inversion, cropping, tone conversion, and the like.
  • a data enhancement method that expands the training data set, can reduce over-fitting and enhance network generalization ability.
  • the last layer of the lightweight neural network model ie the loss function layer, is adaptable, using a normalized exponential function (softmax function) as a loss function with a single label classification, using a cross entropy function as a multi-label Classification to meet a variety of power business needs.
  • a normalized exponential function softmax function
  • a cross entropy function as a multi-label Classification
  • the number of network model layers proposed in the embodiments of the present application is moderate, and the accuracy and the execution speed of the algorithm are improved while the training time and algorithm execution time are smaller than the commonly used convolutional neural networks such as ResNet and GoogLeNet. .
  • the lightweight neural network model of this embodiment includes:
  • the first layer is a convolution layer, the step size is 2, the input size is 224 ⁇ 224 ⁇ 3, using two concatenated convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the second layer is a convolution layer, the step size is 1, the input size is 112 ⁇ 112 ⁇ 32, using two cascaded convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the third layer is a convolution layer with a step size of 1, and the input size is 112 ⁇ 112 ⁇ 32, and a 1 ⁇ 1 convolution kernel is used;
  • the fourth layer is a convolution layer, the step size is 2, the input size is 112 ⁇ 112 ⁇ 64, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the fifth layer is a convolution layer, the step size is 1, the input size is 56 ⁇ 56 ⁇ 64, using two cascaded convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the sixth layer is a convolution layer with a step size of 1, and the input size is 56 ⁇ 56 ⁇ 128, and a 1 ⁇ 1 convolution kernel is used;
  • the seventh layer is a convolution layer, the step size is 2, the input size is 56 ⁇ 56 ⁇ 128, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the eighth layer is a convolution layer, the step size is 1, the input size is 28 ⁇ 28 ⁇ 128, using two cascaded convolution kernels 1 ⁇ 3 and 3 ⁇ 1;
  • the ninth layer is a convolution layer with a step size of 1, and the input size is 28 ⁇ 28 ⁇ 256, using a 1 ⁇ 1 convolution kernel;
  • the 10th layer is a convolutional layer with a step size of 2, the input size is 28 ⁇ 28 ⁇ 256, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 11th layer is a convolutional layer, the step size is 1, the input size is 14 ⁇ 14 ⁇ 256, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the 12th layer is a convolution layer with a step size of 1, and the input size is 14 ⁇ 14 ⁇ 512, using a 1 ⁇ 1 convolution kernel;
  • the 13th layer is a convolutional layer, the step size is 2, the input size is 14 ⁇ 14 ⁇ 512, and two concatenated convolution kernels are used, 1 ⁇ 3 and 3 ⁇ 1;
  • the 14th layer is a convolutional layer with a step size of 1, the input size is 7 ⁇ 7 ⁇ 512, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 15th layer is a convolutional layer with a step size of 1, and the input size is 7 ⁇ 7 ⁇ 1024, using a 1 ⁇ 1 convolution kernel;
  • the 16th layer is a convolutional layer with a step size of 2, the input size is 7 ⁇ 7 ⁇ 1024, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 17th layer is a convolutional layer, the step size is 1, the input size is 7 ⁇ 7 ⁇ 1024, and two concatenated convolution kernels are used 1 ⁇ 3 and 3 ⁇ 1;
  • the 18th layer is an average pooling layer with a step size of 1, the input size is 7 ⁇ 7 ⁇ 1024, and the pooling size is 7 ⁇ 7;
  • the 19th layer is a fully connected layer, and the input size is 1 ⁇ 1 ⁇ 1024, including 1000 neurons;
  • the 20th layer is the loss function layer, which can be adapted, using the softmax function as the loss function with single label classification, and the cross entropy function as the multi-label classification.
  • the training of lightweight neural network model adopts the training method of migration learning. Firstly, the pre-training model is trained on the large database ImageNet, and then the fine-tuning training is carried out based on the image and video data inside the system to obtain a lightweight and fast image classification model.
  • the classification accuracy of the image classification model after testing and training is 93.19%, which meets the needs of actual power production scenarios.
  • the image management module of the embodiment is configured to manage image services of the internal and external networks of the power, and provides basic functions such as file uploading of the power external network, file import of the power intranet, file retrieval, file downloading, and statistical display for the image service.
  • the attributes of the file that is, the associated information of the image and the video data, are utilized when the file is uploaded, imported, retrieved, downloaded, and displayed.
  • the association information of the image and video data includes at least one of the following: a view category, a device tree, a scene tree, a tag tree, a defect tree, a professional type, a file source, and the like.
  • the view category includes an image category and a video data category;
  • the device tree represents a front-end device address of the captured image or the video data;
  • the scene tree represents a power scene of the captured image or the video data;
  • the tag tree represents a specific content of the captured image or the video data,
  • the defect tree is optional and represents a defect in the captured image or video data, which can be determined by the system administrator;
  • the professional type characterizes the power professional name of the captured image or video data, such as transmission lines, substations, converter stations , computer room, infrastructure center, business hall;
  • document source means the means of shooting images or video data, such as drone shooting, fixed camera shooting, robot shooting, handheld terminal shooting.
  • the first-level image service includes image common function tasks and image business application tasks
  • the secondary image service includes image deduplication, low-quality image rejection, video transcoding, video compression, fast browsing, and image service application tasks to the inspection task, scheduling-oriented tasks, and infrastructure-oriented Task, marketing-oriented tasks;
  • the three-level image service includes work vehicle detection, wire foreign object detection, tree and bamboo growth detection, wire icing detection, pyrotechnic detection, fitting corrosion detection, fitting loss detection, insulator crack detection, insulator loss detection, and insulator under the task of inspection.
  • Pollution flash detection, substation meter digital identification, transformer oil pillow oil leakage detection, substation personnel abnormal behavior detection, substation personnel dress code detection, substation personnel access detection, pyrotechnic detection, etc. also includes transformer switching status recognition under scheduling tasks And isolation switch status recognition, etc.; also includes import and export vehicle detection, import and export license plate identification, import and export personnel inspection, import and export personnel abnormal behavior detection, personnel dressing specification detection, open flame detection, etc. for infrastructure-oriented tasks;
  • the image annotation module of the embodiment provides a semi-automatic image annotation tool configured to provide annotation information of the training data set for the lightweight neural network model.
  • the image annotation is divided into two types, one is automatic labeling by the system, that is, the input image is transferred to the lightweight neural network model, and the classification result output by the model is labeled information; the other is the user identification icon, that is, The user views the image data and manually labels the object category and the area contained in the image.
  • the user identification icon can be used as an effective supplement to the system's automatic labeling, which can filter out inaccurate information automatically marked by the system.
  • the model training module of this embodiment is configured to train a lightweight neural network model on a GPU computing node within the multi-core heterogeneous parallel computing module.
  • the image and video data marked by the image are used as data sets for machine learning tasks such as image classification, target detection and image segmentation, and the model training is carried out through the multi-core heterogeneous parallel computing module.
  • the training process can be suspended, revoked, and continued. If the training result is not ideal, the network parameters are modified, and the number of iterations continues to be trained until a satisfactory model is obtained.
  • the satisfactory model is stored in the GPU computing node, and the original model can be replaced.
  • the model training module of this embodiment supports two modes of online training and offline training.
  • Offline training means that all data is entered into the system once and the model training task is started.
  • Online training means that after the model training task is started, new data can be input to the system, and the model that is being iterated is added to continue training.
  • the algorithm application module of the embodiment is directed to the image service of the internal and external network of the power, and performs the intelligent analysis task by using the lightweight neural network model on the GPU computing node in the multi-core heterogeneous parallel computing module.
  • the system provides a variety of algorithm models for power scenarios, devices, people, and event categories to adapt to different intelligent analysis tasks.
  • the specific task name can be found in the image management module.
  • the algorithm application module performs an intelligent analysis task algorithm model has two sources, one is a lightweight neural network model obtained by the model training module, and the other is a mature model built in the GPU computing node.
  • the task can be suspended, revoked, and resumed during the execution of the task, and the task can be deleted after the task ends.
  • the user can manipulate the image and video data according to the results, based on the functions mentioned above, including but not limited to, deleting duplicate images and low quality images, compressing long videos, transcoding videos of different formats, and pairing images. Target classification in the target, detecting and segmenting targets in the image, etc.
  • the system is scalable and universal: the scalability of the system is reflected in:
  • the system is based on a multi-core heterogeneous parallel computing framework. Different CPU nodes or GPU nodes can access the system to achieve multi-core parallel computing and efficient data storage, and complete offline or online model training and intelligent task execution functions;
  • this embodiment provides a lightweight neural network, the network model in the algorithm can be extended, and different neural network models are adapted according to different task characteristics.
  • the structured data stored in the platform is in accordance with the national network standard unified coding format, and the data format can be transmitted between the power industry service platforms without conversion;
  • the system provides a variety of algorithms for power scenarios, devices, personnel, and event categories, and can be applied to a variety of power analysis scenarios.
  • Adopt multi-core heterogeneous parallel computing framework suitable for this system, in which each node has clear division of labor, clear logic and reasonable interaction; GPU computing node undertakes model training and intelligent task execution function, completes intensive calculation; CPU storage management node bears data The storage function embeds a relational database and a non-relational database. The CPU compute node assumes the scientific computing function and assists the GPU computing node to perform part of the intensive computing function;
  • a lightweight neural network model is proposed as the core of the system.
  • the neural network can contain 20 layers, using small convolution kernel and asymmetric convolution kernel structure, residual structure, abandonment mechanism and batch standardization mechanism. It can effectively improve the classification accuracy, accelerate the convergence, enhance the network generalization ability, and maintain the accuracy rate while improving the training speed and algorithm execution speed.
  • the network provides normalization and enhancement functions for input data, which can effectively expand the data set.
  • the lightweight fast image classification model is based on existing power data. The parameters are based on the power data set training. It is not based on a common data set and is innovative and practical.
  • the system has scalability and universality, supports data transmission with other business platforms in the power industry, supports parallel computing of multi-core heterogeneous GPU nodes and CPU nodes, and has a scalable algorithm network model for power scenarios and devices. Multiple power analysis scenarios for personnel and event categories;
  • the system is based on multi-core heterogeneous parallel computing framework, which can efficiently realize online or offline image data training, form a lightweight and fast image classification model, and can realize the application of power internal and external network image services, which can maximize the value of image data and have better performance. Application prospects.

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Software Systems (AREA)
  • General Physics & Mathematics (AREA)
  • Multimedia (AREA)
  • General Health & Medical Sciences (AREA)
  • General Engineering & Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Evolutionary Computation (AREA)
  • Biophysics (AREA)
  • Molecular Biology (AREA)
  • Computing Systems (AREA)
  • Computational Linguistics (AREA)
  • Biomedical Technology (AREA)
  • Mathematical Physics (AREA)
  • Artificial Intelligence (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Health & Medical Sciences (AREA)
  • Image Analysis (AREA)

Abstract

Disclosed in the present application is a power artificial-intelligence visual-analysis system on the basis of multi-core heterogeneous parallel computing. The system comprises a multi-core heterogeneous parallel computing module and a service application module. Data is transmitted between the service application module and the multi-core heterogeneous parallel computing module through a web service interface. The multi-core heterogeneous parallel computing module comprises a graphic processing unit (GPU) computing node, a central processing unit (CPU) storage management node, and a CPU computing node, and the nodes are connected by means of a switch. The service application module comprises an image management module, an image labeling module, a model training module, and an algorithm application module.

Description

基于多核异构并行计算的电力人工智能视觉分析系统Electric artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing
相关申请的交叉引用Cross-reference to related applications
本申请基于申请号为201711268417.6、申请日为2017年12月5日的中国专利申请提出,并要求该中国专利申请的优先权,该中国专利申请的全部内容在此以引入方式并入本申请。The present application is based on a Chinese patent application filed on Jan. 5, 2017, the entire disclosure of which is hereby incorporated by reference.
技术领域Technical field
本申请涉及信号与信息处理领域,具体涉及一种基于多核异构并行计算的电力人工智能视觉分析系统。The present application relates to the field of signal and information processing, and in particular to a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing.
背景技术Background technique
人工智能和深度学习是先进的自动化领域科学技术手段,目前在图像处理、自然语言识别等领域获得了广泛应用。当前,人工智能分析方法已经开始运用于诸多行业,包括智能交通、智慧医疗、智能家居、自动驾驶、智能硬件等。近年来各种深度卷积神经网络(如Lenet神经网络、Alexnet神经网络、VGG神经网络、残差网络(Resnet)、Xnception神经网络等)层出不穷,广泛应用于图像识别、目标检测等计算机视觉领域。在电力行业,变电站、输电线路等业务场景下的前端设备每天都会采集大量图像和视频数据,相关业务部门均存在迫切的图像和视频数据分析识别需求,国内外已经出现了较为成熟的深度学习图像识别技术。但是大部分深度学习图像识别技术计算消耗大,运行速度慢。异构并行计算可以让不同类型的计算设备能够共享计算的过程和结果,同时不断优化和加速计算的过程,使其具备更高的计算效能。异构并行计算在国内外正在迅猛发展,尤其是中央处理器(CPU,Central Processing Unit)结合图形处理器(GPU,Graphic  Processing Unit)异构计算框架近年来成为研究热点。但是,目前尚未见基于多核异构并行计算框架的轻量化神经网络及其在电力图像识别中的应用。Artificial intelligence and deep learning are advanced scientific and technical methods in the field of automation. They are widely used in image processing and natural language recognition. At present, artificial intelligence analysis methods have been applied to many industries, including intelligent transportation, smart medical, smart home, automatic driving, intelligent hardware and so on. In recent years, various deep convolutional neural networks (such as Lenet neural network, Alexnet neural network, VGG neural network, Resnet), Xnception neural network, etc. have emerged in an endless stream, and are widely used in computer vision fields such as image recognition and target detection. In the power industry, front-end equipment in substation and transmission lines will collect a large amount of image and video data every day. Relevant business departments have urgent needs for image and video data analysis and identification. More mature deep learning images have appeared at home and abroad. Identification technology. However, most deep learning image recognition techniques are computationally expensive and slow to run. Heterogeneous parallel computing enables different types of computing devices to share the computational processes and results while continuously optimizing and accelerating the computational process for higher computational efficiency. Heterogeneous parallel computing is developing rapidly at home and abroad. In particular, the Central Processing Unit (CPU) combined with the GPU (Graphic Processing Unit) heterogeneous computing framework has become a research hotspot in recent years. However, there is no lightweight neural network based on multi-core heterogeneous parallel computing framework and its application in power image recognition.
因此,如何解决电力行业采集数据存在的智能化业务应用问题,目前尚无有效解决方案。Therefore, how to solve the intelligent business application problems existing in the data collection in the power industry has no effective solution.
发明内容Summary of the invention
本申请实施例期望提供一种基于多核异构并行计算的电力人工智能视觉分析系统,可高效实现在线或离线图像数据训练,形成轻量化快速图像分类模型,可实现电力内外网图像业务应用。The embodiment of the present application is intended to provide a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing, which can efficiently implement online or offline image data training, and form a lightweight and fast image classification model, which can realize the application of power internal and external network image services.
为了实现上述目标,本申请实施例采用如下技术方案:In order to achieve the above objectives, the embodiment of the present application adopts the following technical solutions:
本申请实施例提供了一种基于多核异构并行计算的电力人工智能视觉分析系统,所述系统包括多核异构并行计算模块和业务应用模块;所述业务应用模块和所述多核异构并行计算模块之间通过网络服务接口传输数据;An embodiment of the present application provides a power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing, the system comprising a multi-core heterogeneous parallel computing module and a business application module; the service application module and the multi-core heterogeneous parallel computing Data is transmitted between modules through a network service interface;
所述多核异构并行计算模块包括GPU计算节点、CPU存储管理节点和CPU计算节点,各节点之间通过交换机连接;所述GPU计算节点配置为执行模型训练,完成第一类计算任务;所述CPU配置为数据存储;所述CPU计算节点配置为执行第二类计算任务,以及辅助所述GPU计算节点执行部分所述第一类计算任务;The multi-core heterogeneous parallel computing module includes a GPU computing node, a CPU storage management node, and a CPU computing node, and each node is connected through a switch; the GPU computing node is configured to perform model training to complete a first type of computing task; The CPU is configured as a data storage; the CPU computing node is configured to perform a second type of computing task, and assists the GPU computing node to perform a portion of the first type of computing task;
所述业务应用模块包括图像管理模块、图像标注模块、模型训练模块和算法应用模块;所述图像管理模块配置为处理图像业务;所述图像标注模块配置为为轻量级神经网络模型提供训练数据集的标注信息;所述模型训练模块配置为基于所述多核异构并行计算模块训练轻量级神经网络模型;所述算法应用模块面配置为在所述多核异构并行计算模块上利用所述轻量级神经网络模型执行图像分析任务。The business application module includes an image management module, an image annotation module, a model training module, and an algorithm application module; the image management module is configured to process an image service; and the image annotation module is configured to provide training data for a lightweight neural network model Set labeling information; the model training module configured to train a lightweight neural network model based on the multi-core heterogeneous parallel computing module; the algorithm application module surface configured to utilize the multi-core heterogeneous parallel computing module The lightweight neural network model performs image analysis tasks.
在一实施例中,所述图像管理模块通过网络服务接口与电力行业其他业务平台通信,配置为向所述电力行业其他业务平台发送处理后的目标图像和视频数据及其关联信息;还配置为获取原始图像和视频数据及其关联信息;所述电力行业其他业务平台包括以下至少之一:统一视频监控平台以及运检、调度、营销、基建电力信息化平台。In an embodiment, the image management module communicates with other service platforms of the power industry through a network service interface, and is configured to send the processed target image and video data and related information to other service platforms of the power industry; Obtaining original image and video data and related information thereof; the other business platforms of the power industry include at least one of the following: a unified video monitoring platform, and a network for inspection, scheduling, marketing, and infrastructure power information.
在一实施例中,所述关联信息包括图像和视频数据的属性,包括以下至少之一:视图类别、设备树、场景树、标签树、缺陷树、专业类型、文件来源;In an embodiment, the association information includes attributes of image and video data, including at least one of the following: a view category, a device tree, a scene tree, a label tree, a defect tree, a professional type, and a file source;
所述视图类别包括图像类别和视频数据类别;所述设备树表征拍摄图像或视频数据的前端设备地址;所述场景树表征拍摄图像或视频数据的电力场景;所述标签树表征拍摄图像或视频数据的具体内容;所述缺陷树表征拍摄图像或视频数据存在的缺陷问题;所述专业类型表征拍摄图像或视频数据的电力专业名称;所述文件来源表征拍摄图像或视频数据的手段。The view category includes an image category and a video data category; the device tree characterizes a front end device address of the captured image or video data; the scene tree characterizes a power scene of the captured image or video data; the tag tree characterizes the captured image or video The specific content of the data; the defect tree characterizing a defect in the captured image or video data; the professional type characterizing a power professional name of the captured image or video data; the file source characterizing means for capturing the image or video data.
在一实施例中,所述CPU存储管理节点内嵌关系型数据库和非关系型数据库;所述关系型数据库配置为存储图像和视频数据的关联关系,所述非关系型数据库配置为存储图像和视频数据。In an embodiment, the CPU stores a management node embedded relational database and a non-relational database; the relational database is configured to store an association relationship between image and video data, and the non-relational database is configured to store an image and Video data.
在一实施例中,所述轻量级神经网络模型,包括:卷积层、池化层和全连接层;In an embodiment, the lightweight neural network model comprises: a convolution layer, a pooling layer, and a fully connected layer;
所述卷积层位于模型前端,所述池化层和所述全连接层位于模型后端;The convolution layer is located at a front end of the model, and the pooling layer and the fully connected layer are located at a back end of the model;
所述轻量级神经网络模型采用1×1卷积核和1×3、3×1的非对称卷积核。The lightweight neural network model uses a 1×1 convolution kernel and a 1×3, 3×1 asymmetric convolution kernel.
在一实施例中,所述轻量级神经网络模型包括:In an embodiment, the lightweight neural network model comprises:
第1层为卷积层,步长为2,输入的大小为224×224×3,采用两个级联的卷积核1×3和3×1;The first layer is a convolution layer, the step size is 2, the input size is 224 × 224 × 3, using two concatenated convolution kernels 1 × 3 and 3 × 1;
第2层为卷积层,步长为1,输入的大小为112×112×32,采用两个 级联的卷积核1×3和3×1;The second layer is a convolution layer, the step size is 1, the input size is 112 × 112 × 32, using two concatenated convolution kernels 1 × 3 and 3 × 1;
第3层为卷积层,步长为1,输入的大小为112×112×32,采用1×1的卷积核;The third layer is a convolution layer with a step size of 1, and the input size is 112×112×32, and a 1×1 convolution kernel is used;
第4层为卷积层,步长为2,输入的大小为112×112×64,采用两个级联的卷积核1×3和3×1;The fourth layer is a convolution layer, the step size is 2, the input size is 112×112×64, and two concatenated convolution kernels are used, 1×3 and 3×1;
第5层为卷积层,步长为1,输入的大小为56×56×64,采用两个级联的卷积核1×3和3×1;The fifth layer is a convolution layer, the step size is 1, the input size is 56 × 56 × 64, using two cascaded convolution kernels 1 × 3 and 3 × 1;
第6层为卷积层,步长为1,输入的大小为56×56×128,采用1×1的卷积核;The sixth layer is a convolution layer with a step size of 1, and the input size is 56×56×128, and a 1×1 convolution kernel is used;
第7层为卷积层,步长为2,输入的大小为56×56×128,采用两个级联的卷积核1×3和3×1;The seventh layer is a convolution layer, the step size is 2, the input size is 56×56×128, and two concatenated convolution kernels are used, 1×3 and 3×1;
第8层为卷积层,步长为1,输入的大小为28×28×128,采用两个级联的卷积核1×3和3×1;The eighth layer is a convolution layer, the step size is 1, the input size is 28 × 28 × 128, using two cascaded convolution kernels 1 × 3 and 3 × 1;
第9层为卷积层,步长为1,输入的大小为28×28×256,采用1×1卷积核;The ninth layer is a convolution layer with a step size of 1, and the input size is 28×28×256, using a 1×1 convolution kernel;
第10层为卷积层,步长为2,输入的大小为28×28×256,采用两个级联的卷积核1×3和3×1;The 10th layer is a convolutional layer with a step size of 2, the input size is 28×28×256, and two concatenated convolution kernels are used 1×3 and 3×1;
第11层为卷积层,步长为1,输入的大小为14×14×256,采用两个级联的卷积核1×3和3×1;The 11th layer is a convolutional layer, the step size is 1, the input size is 14×14×256, and two concatenated convolution kernels are used, 1×3 and 3×1;
第12层为卷积层,步长为1,输入的大小为14×14×512,采用1×1卷积核;The 12th layer is a convolution layer with a step size of 1, and the input size is 14×14×512, using a 1×1 convolution kernel;
第13层为卷积层,步长为2,输入的大小为14×14×512,采用两个级联的卷积核1×3和3×1;The 13th layer is a convolutional layer, the step size is 2, the input size is 14×14×512, and two concatenated convolution kernels are used, 1×3 and 3×1;
第14层为卷积层,步长为1,输入的大小为7×7×512,采用两个级联的卷积核1×3和3×1;The 14th layer is a convolutional layer with a step size of 1, the input size is 7×7×512, and two concatenated convolution kernels are used 1×3 and 3×1;
第15层为卷积层,步长为1,输入的大小为7×7×1024,采用1×1卷积核;The 15th layer is a convolutional layer with a step size of 1, and the input size is 7×7×1024, using a 1×1 convolution kernel;
第16层为卷积层,步长为2,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 16th layer is a convolutional layer with a step size of 2, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
第17层为卷积层,步长为1,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 17th layer is a convolutional layer, the step size is 1, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
第18层为平均池化层,步长为1,输入的大小为7×7×1024,池化大小为7×7;The 18th layer is an average pooling layer with a step size of 1, the input size is 7×7×1024, and the pooling size is 7×7;
第19层为全连接层,输入的大小为1×1×1024,包含1000个神经元;The 19th layer is a fully connected layer, and the input size is 1×1×1024, including 1000 neurons;
第20层为损失函数层,可适配,使用softmax函数作为损失函数用单标签分类,使用交叉熵函数作为多标签分类。The 20th layer is the loss function layer, which can be adapted, using the softmax function as the loss function with single label classification, and the cross entropy function as the multi-label classification.
在一实施例中,所述图像业务按如下级别分类:In an embodiment, the image services are classified according to the following levels:
一级图像业务包括图像通用功能任务和图像业务应用任务;The first-level image service includes image common function tasks and image business application tasks;
二级图像业务包括图像通用功能任务下的图像去重、低质量图像剔除、视频转码、视频压缩、快速浏览,以及图像业务应用任务下面向运检的任务、面向调度的任务、面向基建的任务、面向营销的任务;The secondary image service includes image deduplication, low-quality image rejection, video transcoding, video compression, fast browsing, and image service application tasks to the inspection task, scheduling-oriented tasks, and infrastructure-oriented Task, marketing-oriented tasks;
三级图像业务包括面向运检的任务下的作业车辆检测、导线异物检测、树竹生长检测、导线覆冰检测、烟火检测、金具锈蚀检测、金具丢失检测、绝缘子破裂检测、绝缘子丢失检测、绝缘子污闪检测、变电站表计数字识别、变压器油枕漏油检测、变电站人员异常行为检测、变电站人员着装规范检测、变电站人员出入检测、烟火检测;还包括面向调度的任务下的变压器开关状态识别和隔离开关状态识别;还包括面向基建的任务下的进出口车辆检测、进出口车牌识别、进出口人员检测、进出口人员异常行为检测、人员着装规范检测、明火检测;还包括面向营销的任务下的营业环境质量检测、服务人员到岗离岗情况检测、服务人员仪容仪表质量检测、服 务人员工作行为质量检测、客户行为分析及异常识别。The three-level image service includes work vehicle detection, wire foreign object detection, tree and bamboo growth detection, wire icing detection, pyrotechnic detection, fitting corrosion detection, fitting loss detection, insulator crack detection, insulator loss detection, and insulator under the task of inspection. Pollution flash detection, substation meter digital identification, transformer oil sump oil leak detection, substation personnel abnormal behavior detection, substation personnel dress code detection, substation personnel access detection, pyrotechnic detection; also includes transformer switch state recognition and task-oriented tasks Isolation switch status recognition; also includes import and export vehicle detection, import and export license plate identification, import and export personnel detection, import and export personnel abnormal behavior detection, personnel dressing specification detection, open flame detection under the task of infrastructure construction; The quality of the business environment, the inspection of the service personnel to the post, the quality inspection of the service personnel, the quality of the service personnel, the customer behavior analysis and the abnormal identification.
在一实施例中,所述图像标注包括两种类型,一种类型是系统自动标注,所述系统自动标注为图像输入至轻量级神经网络模型后、以输出的分类结果为标注信息;另一种类型是用户识图标注。In an embodiment, the image annotation includes two types, one type is automatic labeling by the system, and the system automatically labels the image into the lightweight neural network model, and uses the output classification result as the annotation information; One type is the user's icon note.
在一实施例中,所述模型训练模块以图像标注的图像和视频数据为数据集,面向图像分类、目标检测、图像分割的机器学习任务,通过多核异构并行计算模块进行模型训练;所述模型训练模块支持在线训练和离线训练两种方式;所述离线训练是指将数据一次性输入后执行模型训练任务;所述在线训练指模型训练任务启动后,在模型训练任务执行过程中输入新的数据。In an embodiment, the model training module uses the image and video data labeled by the image as a data set, and the machine learning task for image classification, target detection, and image segmentation, and performs model training through a multi-core heterogeneous parallel computing module; The model training module supports two modes: online training and offline training; the offline training refers to performing model training tasks after one-time input of data; and the online training refers to inputting new models in the execution of model training tasks after the model training tasks are started. The data.
在一实施例中,所述算法应用模块,配置为通过所述模型训练模块得到的所述轻量级神经网络模型和/或内置于所述GPU计算节点的其他成熟模型执行图像分析。In an embodiment, the algorithm application module is configured to perform image analysis by the lightweight neural network model obtained by the model training module and/or other mature models built in the GPU computing node.
本申请实施例所达到的有益效果包括:The beneficial effects achieved by the embodiments of the present application include:
1.本申请实施例采用了适合本系统的多核异构并行计算框架,其中各节点分工明确,逻辑清晰,交互合理,GPU计算节点承担模型训练和智能任务执行功能,完成密集计算;CPU存储管理结点承担数据存储功能,内嵌一种关系型数据库和一种非关系型数据库,CPU计算节点承担科学计算功能,同时辅助GPU计算节点完成部分密集计算功能;1. The embodiment of the present application adopts a multi-core heterogeneous parallel computing framework suitable for the system, wherein each node has clear division of labor, clear logic, and reasonable interaction, and the GPU computing node undertakes model training and intelligent task execution functions, completes intensive calculation; CPU storage management The node undertakes the data storage function, embeds a relational database and a non-relational database, and the CPU compute node assumes the scientific computing function, and assists the GPU computing node to complete part of the intensive computing function;
2.本申请实施例提出了一种轻量级神经网络模型作为系统的算法核心,网络模型可包含20层,采用了小卷积核和非对称卷积核结构、残差结构、弃权机制、批标准化机制,能有效提高分类准确率,同时加速收敛,增强网络泛化能力,保持准确率的同时提升了训练速度和算法执行速度;同时网络提供针对输入数据的归一化和增强功能,能有效扩充数据集;轻量化快速图像分类模型基于现有的电力数据,参数是基于电力数据集训练 得到的,并非基于通用的数据集,具备创新性和实用价值;2. The embodiment of the present application proposes a lightweight neural network model as the algorithm core of the system. The network model can include 20 layers, adopting a small convolution kernel and an asymmetric convolution kernel structure, a residual structure, a waiver mechanism, The batch standardization mechanism can effectively improve the classification accuracy rate, accelerate the convergence, enhance the network generalization ability, and maintain the accuracy rate while improving the training speed and algorithm execution speed. At the same time, the network provides normalization and enhancement functions for input data. Effectively augment the data set; the lightweight fast image classification model is based on existing power data, the parameters are trained based on the power data set, and are not based on a common data set, which is innovative and practical;
3.本申请实施例的系统具备可扩展性和普适性,支持和电力行业其他业务平台的数据传递,支持多核异构的GPU节点和CPU节点并行计算,具备可扩展的算法网络模型,适用电力场景、设备、人员、事件类别的多种电力分析场合。3. The system of the embodiment of the present application has scalability and universality, supports data transmission with other service platforms in the power industry, supports parallel computing of multi-core heterogeneous GPU nodes and CPU nodes, and has a scalable algorithm network model. Multiple power analysis scenarios for power scenarios, equipment, personnel, and event categories.
基于此,本申请实施例的系统基于多核异构并行计算框架,可高效实现在线或离线图像数据训练,形成轻量化快速图像分类模型,可实现电力内外网图像业务应用,能最大化发挥图像数据的价值,具备较好的应用前景。Based on this, the system of the embodiment of the present application is based on a multi-core heterogeneous parallel computing framework, and can efficiently implement online or offline image data training to form a lightweight and fast image classification model, which can realize image application of power internal and external network images, and can maximize image data. The value has a good application prospect.
附图说明DRAWINGS
图1是本申请实施例的系统逻辑架构框图。FIG. 1 is a block diagram of a system logic architecture of an embodiment of the present application.
具体实施方式Detailed ways
下面结合附图对本申请实施例作进一步描述。以下实施例仅用于更加清楚地说明本发明的技术方案,而不能以此来限制本发明的保护范围。The embodiments of the present application are further described below in conjunction with the accompanying drawings. The following examples are only intended to more clearly illustrate the technical solutions of the present invention, and are not intended to limit the scope of the present invention.
如图1所示,本申请实施例提供了一种基于多核异构并行计算框架的电力人工智能视觉分析系统,所述系统包括多核异构并行计算模块和业务应用模块;As shown in FIG. 1 , an embodiment of the present application provides a power artificial intelligence visual analysis system based on a multi-core heterogeneous parallel computing framework, where the system includes a multi-core heterogeneous parallel computing module and a business application module;
多核异构并行计算模块包括GPU计算节点、CPU存储管理节点和CPU计算节点,各节点之间通过交换机(该交换机例如为infiniband(一种支持多并发链接的“转换线缆”技术)的交换机)连接;GPU计算节点配置为执行模型训练,完成第一类计算任务;CPU存储管理节点配置为数据存储;CPU计算节点配置为执行第二类计算任务,以及辅助所述GPU计算节点执行部分所述第一类计算任务。The multi-core heterogeneous parallel computing module includes a GPU computing node, a CPU storage management node, and a CPU computing node, and each node passes through a switch (for example, a switch of infiniband (a "conversion cable" technology supporting multiple concurrent links)) Connecting; the GPU computing node is configured to perform model training to complete the first type of computing task; the CPU storage management node is configured as data storage; the CPU computing node is configured to perform the second type of computing task, and assisting the GPU computing node to execute the portion The first type of computing task.
业务应用模块包括图像管理模块、图像标注模块、模型训练模块和算 法应用模块;图像管理模块配置为管理电力内外网图像业务;图像标注模块配置为为轻量级神经网络模型提供训练数据集的标注信息;模型训练模块配置为基于所述多核异构并行计算模块训练轻量级神经网络模型;算法应用模块面向电力内外网图像业务,配置为在所述多核异构并行计算模块上利用所述轻量级神经网络模型执行智能分析任务。The business application module includes an image management module, an image annotation module, a model training module, and an algorithm application module; the image management module is configured to manage the power internal and external network image services; and the image annotation module is configured to provide the training data set annotation for the lightweight neural network model. Information; the model training module is configured to train a lightweight neural network model based on the multi-core heterogeneous parallel computing module; the algorithm application module is directed to the power internal and external network image service, configured to utilize the light on the multi-core heterogeneous parallel computing module The magnitude neural network model performs intelligent analysis tasks.
其中,业务应用模块和多核异构并行计算模块之间通过网络服务(web service)接口传输数据。The service application module and the multi-core heterogeneous parallel computing module transmit data through a web service interface.
本实施例中,图像管理模块通过web service接口与电力行业其他业务平台(例如统一视频监控平台以及运检、调度、营销、基建等电力信息化平台)通信,向所述电力行业其他业务平台发送经本系统处理后的目标图像和视频数据及其关联信息;所述图像管理模块还配置为获取原始图像和视频数据及其关联关系;一方面,所述图像管理模块可从电力行业其他业务平台获取原始图像和视频数据及其关联信息,作为后续分析的数据源;另一方面,所述图像管理模块也可接收本地上传的图像和视频数据及其关联信息,作为后续分析数据源的有效补充。In this embodiment, the image management module communicates with other service platforms of the power industry (for example, a unified video monitoring platform and a power information platform such as a physical inspection, scheduling, marketing, infrastructure, etc.) through a web service interface, and sends the information to other service platforms of the power industry. The target image and the video data and the associated information processed by the system; the image management module is further configured to acquire the original image and the video data and their associated relationship; on the one hand, the image management module may be from other service platforms of the power industry Obtaining original image and video data and related information thereof as a data source for subsequent analysis; on the other hand, the image management module can also receive locally uploaded image and video data and related information thereof as an effective supplement for subsequent analysis data sources. .
本申请实施例的CPU存储管理节点内嵌关系型数据库和非关系型数据库。本实施例中,图像和视频数据为非结构化数据,图像和视频数据的关联信息为结构化数据,所述关联信息包括图像和视频数据的属性。本实施例中提到的非结构化数据等价于图像和视频数据,结构化数据等价于图像和视频数据的关联信息。关系型数据库配置为存储结构化数据,即存储图像和视频数据的关联关系,也即存储图像和视频数据的属性;非关系型数据库配置为存储非结构化数据,即图像和视频数据。本实施例的图像和视频数据大小不一,小则几十千字节(KB),大则上百兆字节(MB),需要一个支持存储单元大小跨越较大区间的数据库,作为一种示例,本实施例中可选用HBase数据库(构建在HDFS上的分布式列存储系统)存储。关 系型数据库可选用mySQL数据库(一个关系型数据库管理系统)。同一图像或视频数据的关联信息通过唯一ID关联。The CPU of the embodiment of the present application stores a management node embedded relational database and a non-relational database. In this embodiment, the image and video data are unstructured data, and the associated information of the image and video data is structured data, and the associated information includes attributes of the image and video data. The unstructured data mentioned in this embodiment is equivalent to image and video data, and the structured data is equivalent to the associated information of the image and video data. The relational database is configured to store structured data, ie, an association relationship between stored images and video data, that is, attributes for storing image and video data; a non-relational database is configured to store unstructured data, that is, image and video data. The image and video data of this embodiment are different in size, ranging from tens of kilobytes (KB) to hundreds of megabytes (MB), and a database supporting a storage unit size spanning a large interval is required. For example, in this embodiment, an HBase database (a distributed column storage system built on HDFS) may be used for storage. The relational database can use the mySQL database (a relational database management system). The associated information of the same image or video data is associated by a unique ID.
本申请实施例的GPU计算节点包含分组的GPU,每组包含2个GPU,以完成并行计算。本实施例采用组内GPU执行第一类计算任务,所述第一类结算任务例如做模型并行计算、组间做数据并行计算的方式,组内两GPU各持有轻量级神经网络模型的一半参数,协作完成单个模型的训练。The GPU computing node of the embodiment of the present application includes grouped GPUs, each group containing 2 GPUs to complete parallel computing. In this embodiment, the first type of computing task is performed by using the GPU in the group, and the first type of settlement task is, for example, model parallel computing and data parallel computing between groups, and the two GPUs in the group each hold a lightweight neural network model. Half of the parameters, collaboratively complete the training of a single model.
本申请实施例的CPU计算节点包含分组的CPU,每组包含2个CPU,以完成并行计算。本实施例采用2组CPU计算节点完成第二类计算任务,所述第二类计算任务为科学任务,例如包括检索任务、查找任务等,另外2组CPU是GPU计算节点的有效补充,用于完成第一类计算任务,采用组内CPU做模型并行计算、组间做数据并行计算的方式,组内两CPU各持有轻量级神经网络模型的一半参数,协作完成单个模型的训练。The CPU compute node of the embodiment of the present application includes grouped CPUs, each group containing 2 CPUs to complete parallel computing. In this embodiment, two sets of CPU computing nodes are used to complete the second type of computing tasks, and the second type of computing tasks are scientific tasks, for example, including a retrieval task, a search task, etc., and the other two CPUs are effective supplements of the GPU computing nodes, and are used for The first type of computing task is completed, and the in-group CPU is used for model parallel computing and the data is parallelized between groups. The two CPUs in the group each hold half of the parameters of the lightweight neural network model, and cooperate to complete the training of the single model.
本实施例中的“多核”主要体现在计算框架内的物理节点数量是可扩展的,“异构”主要体现在系统的结点由GPU和CPU两种类别组成,“并行”主要体现在数据并行计算、模型并行计算和任务并行计算。The "multi-core" in this embodiment is mainly embodied in the number of physical nodes in the computing framework is extensible, "heterogeneous" is mainly reflected in the system node is composed of two categories of GPU and CPU, "parallel" is mainly reflected in the data Parallel computing, model parallel computing, and task parallel computing.
就本实施例的系统而言,数据并行计算和模型并行计算可以实现完善高效的数据传输。以模型训练为例,假设有4个GPU计算节点完成模型训练任务,将这4个节点分为两组,分别为组1和组2,组1中包含节点GPU1和GPU2,组2中包含节点GPU3和GPU4。轻量级神经网络的模型并行计算和数据并行计算建立在分组基础上,组内GPU做模型并行计算,组间GPU做数据并行计算。组内两节点各持有轻量级神经网络模型的一半参数,协作完成单个模型的训练,即GPU1和GPU2持有模型的一半参数,GPU3和GPU4持有模型的另一半参数,这一过程叫做模型并行计算。组间数据并行计算按同步随机梯度下降进行训练,采用拓扑完成参数交换,即组1内的GPU1和GPU2与组2内的GPU3和GPU4完成数据交换,这一过程 叫做数据并行计算。With the system of the present embodiment, data parallel computing and model parallel computing can achieve perfect and efficient data transmission. Taking model training as an example, suppose there are 4 GPU computing nodes to complete the model training task. The four nodes are divided into two groups, namely group 1 and group 2, group 1 contains nodes GPU1 and GPU2, and group 2 contains nodes. GPU3 and GPU4. Model parallel computing and data parallel computing of lightweight neural networks are based on grouping. GPUs in the group do model parallel computing, and GPUs in the group do data parallel computing. Each node in the group holds half of the parameters of the lightweight neural network model, and cooperates to complete the training of a single model, that is, GPU1 and GPU2 hold half of the parameters of the model, and GPU3 and GPU4 hold the other half of the model. This process is called Model parallel computing. Parallel computing of data between groups is trained by synchronous random gradient descent, and topology is used to complete parameter exchange. That is, GPU1 and GPU2 in group 1 and GPU3 and GPU4 in group 2 complete data exchange. This process is called data parallel computing.
本实施例引入数据并行计算和模型并行计算后,从磁盘读取训练数据,训练数据预处理,轻量级神经网络训练分别占用磁盘、CPU、GPU资源,且均耗时较大。因此引入流水线,使得磁盘、CPU、GPU资源可以同时得到利用,提升整体性能。In this embodiment, after data parallel computing and model parallel computing are introduced, the training data is read from the disk, and the training data is preprocessed. The lightweight neural network training respectively occupies the disk, CPU, and GPU resources, and both take a long time. Therefore, the pipeline is introduced, so that disk, CPU, and GPU resources can be utilized at the same time to improve overall performance.
执行任务的并行过程类比模型训练的并行过程。任务并行计算则指在任务管理层面,多个任务可以被按需调度、并发执行,具体到各计算节点体现为数据并行计算和模型并行计算。Parallel processes that perform tasks are parallel to the parallel process of model training. Parallel computing of tasks means that at the task management level, multiple tasks can be scheduled and executed concurrently, and each computing node is embodied as data parallel computing and model parallel computing.
电力领域的图像数据有其自身的专业性,其中的设备、人员、场景等信息类别通常并不存在于ImageNet、Pascal VOC等常见大型数据集中。电力领域图像业务也具备自身专业性,涉及单标签图像分类、多标签图像分类、目标检测等多种类别,且对系统实时性要求较高。针对以上的业务需求,本实施例提出了一种轻量级神经网络模型,该模型可以适配多种业务需要的算法,同时模型计算量适中,响应速度快,精度较高,有很强的实用价值。The image data in the power field has its own professionalism, and the information categories such as equipment, personnel, and scenes usually do not exist in the common large data sets such as ImageNet and Pascal VOC. The image service in the power field also has its own professionalism, involving multiple categories of image classification, multi-label image classification, target detection, etc., and has high requirements for real-time system. In view of the above business requirements, this embodiment proposes a lightweight neural network model, which can adapt to a variety of algorithms required by the business, while the model calculation amount is moderate, the response speed is fast, the precision is high, and there is a strong Practical value.
本申请实施例轻量级神经网络模型,包括:卷积层、池化层和全连接层;The lightweight neural network model of the embodiment of the present application includes: a convolution layer, a pooling layer, and a fully connected layer;
其中,作为一种示例,(1)模型包括20层,其中,卷积层有17层,池化层有1层,全连接层有1层,卷积层位于模型前端,池化层和全连接层位于模型后端。网络结构较深但是层数适中,能有效提高分类准确率;Wherein, as an example, (1) the model comprises 20 layers, wherein the convolution layer has 17 layers, the pool layer has 1 layer, and the full connection layer has 1 layer, and the convolution layer is located at the front end of the model, the pool layer and the whole The connection layer is at the back end of the model. The network structure is deep but the number of layers is moderate, which can effectively improve the classification accuracy rate;
(2)所述轻量级神经网络模型采用的卷积核为1×1卷积核和1×3、3×1的非对称卷积核,其中1×3、3×1的非对称卷积核可以起到3×3卷积核的等价效果,但是参数大大减少,在保持网络深度的同时增加网络的非线性,降低计算量和参数数量。(2) The convolution kernel used in the lightweight neural network model is a 1×1 convolution kernel and a 1×3, 3×1 asymmetric convolution kernel, where 1×3, 3×1 asymmetric volumes The product core can play the equivalent effect of the 3×3 convolution kernel, but the parameters are greatly reduced, and the network nonlinearity is increased while maintaining the network depth, reducing the amount of calculation and the number of parameters.
(3)所述轻量级神经网络模型具备残差结构,在保持网络深度的同时 加速收敛,有效避免了深度神经网络的梯度消失问题。(3) The lightweight neural network model has a residual structure, accelerates convergence while maintaining the depth of the network, and effectively avoids the gradient disappearance of the deep neural network.
(4)所述轻量级神经网络模型具备弃权、批标准化的可选配置参数,引入正则化机制,加快训练速度的同时可以减轻过拟合,增强网络泛化能力。(4) The lightweight neural network model has optional configuration parameters of abandonment and batch standardization, and introduces a regularization mechanism to speed up the training while reducing over-fitting and enhancing network generalization ability.
(5)所述轻量级神经网络模型提供针对输入数据的归一化和增强功能,将所有输入图像归一化为224×224的像素值,并提供镜像反转、裁剪、色调变换等多种数据增强手段,扩充了训练数据集,可以减轻过拟合,增强网络泛化能力。(5) The lightweight neural network model provides normalization and enhancement functions for input data, normalizing all input images to pixel values of 224×224, and providing image inversion, cropping, tone conversion, and the like. A data enhancement method that expands the training data set, can reduce over-fitting and enhance network generalization ability.
(6)所述轻量级神经网络模型最后的一层即损失函数层具备可适配性,使用归一化指数函数(softmax函数)作为损失函数用单标签分类,使用交叉熵函数作为多标签分类,满足多种电力业务需求。(6) The last layer of the lightweight neural network model, ie the loss function layer, is adaptable, using a normalized exponential function (softmax function) as a loss function with a single label classification, using a cross entropy function as a multi-label Classification to meet a variety of power business needs.
相比于其他的神经网络,本申请实施例提出的网络模型层数适中,保持准确率的同时提升了训练速度和算法执行速度,训练用时和算法执行用时小于ResNet、GoogLeNet等常用卷积神经网络。Compared with other neural networks, the number of network model layers proposed in the embodiments of the present application is moderate, and the accuracy and the execution speed of the algorithm are improved while the training time and algorithm execution time are smaller than the commonly used convolutional neural networks such as ResNet and GoogLeNet. .
本实施例的轻量级神经网络模型如下表所示:The lightweight neural network model of this embodiment is shown in the following table:
表1Table 1
Figure PCTCN2018114389-appb-000001
Figure PCTCN2018114389-appb-000001
Figure PCTCN2018114389-appb-000002
Figure PCTCN2018114389-appb-000002
结合表1所示,本实施例的轻量级神经网络模型包括:As shown in Table 1, the lightweight neural network model of this embodiment includes:
第1层为卷积层,步长为2,输入的大小为224×224×3,采用两个级联的卷积核1×3和3×1;The first layer is a convolution layer, the step size is 2, the input size is 224 × 224 × 3, using two concatenated convolution kernels 1 × 3 and 3 × 1;
第2层为卷积层,步长为1,输入的大小为112×112×32,采用两个级联的卷积核1×3和3×1;The second layer is a convolution layer, the step size is 1, the input size is 112 × 112 × 32, using two cascaded convolution kernels 1 × 3 and 3 × 1;
第3层为卷积层,步长为1,输入的大小为112×112×32,采用1×1的卷积核;The third layer is a convolution layer with a step size of 1, and the input size is 112×112×32, and a 1×1 convolution kernel is used;
第4层为卷积层,步长为2,输入的大小为112×112×64,采用两个级联的卷积核1×3和3×1;The fourth layer is a convolution layer, the step size is 2, the input size is 112×112×64, and two concatenated convolution kernels are used, 1×3 and 3×1;
第5层为卷积层,步长为1,输入的大小为56×56×64,采用两个级联的卷积核1×3和3×1;The fifth layer is a convolution layer, the step size is 1, the input size is 56 × 56 × 64, using two cascaded convolution kernels 1 × 3 and 3 × 1;
第6层为卷积层,步长为1,输入的大小为56×56×128,采用1×1的卷积核;The sixth layer is a convolution layer with a step size of 1, and the input size is 56×56×128, and a 1×1 convolution kernel is used;
第7层为卷积层,步长为2,输入的大小为56×56×128,采用两个级联的卷积核1×3和3×1;The seventh layer is a convolution layer, the step size is 2, the input size is 56×56×128, and two concatenated convolution kernels are used, 1×3 and 3×1;
第8层为卷积层,步长为1,输入的大小为28×28×128,采用两个级联的卷积核1×3和3×1;The eighth layer is a convolution layer, the step size is 1, the input size is 28 × 28 × 128, using two cascaded convolution kernels 1 × 3 and 3 × 1;
第9层为卷积层,步长为1,输入的大小为28×28×256,采用1×1卷积核;The ninth layer is a convolution layer with a step size of 1, and the input size is 28×28×256, using a 1×1 convolution kernel;
第10层为卷积层,步长为2,输入的大小为28×28×256,采用两个级联的卷积核1×3和3×1;The 10th layer is a convolutional layer with a step size of 2, the input size is 28×28×256, and two concatenated convolution kernels are used 1×3 and 3×1;
第11层为卷积层,步长为1,输入的大小为14×14×256,采用两个级联的卷积核1×3和3×1;The 11th layer is a convolutional layer, the step size is 1, the input size is 14×14×256, and two concatenated convolution kernels are used, 1×3 and 3×1;
第12层为卷积层,步长为1,输入的大小为14×14×512,采用1×1卷积核;The 12th layer is a convolution layer with a step size of 1, and the input size is 14×14×512, using a 1×1 convolution kernel;
第13层为卷积层,步长为2,输入的大小为14×14×512,采用两个级联的卷积核1×3和3×1;The 13th layer is a convolutional layer, the step size is 2, the input size is 14×14×512, and two concatenated convolution kernels are used, 1×3 and 3×1;
第14层为卷积层,步长为1,输入的大小为7×7×512,采用两个级 联的卷积核1×3和3×1;The 14th layer is a convolutional layer with a step size of 1, the input size is 7×7×512, and two concatenated convolution kernels are used 1×3 and 3×1;
第15层为卷积层,步长为1,输入的大小为7×7×1024,采用1×1卷积核;The 15th layer is a convolutional layer with a step size of 1, and the input size is 7×7×1024, using a 1×1 convolution kernel;
第16层为卷积层,步长为2,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 16th layer is a convolutional layer with a step size of 2, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
第17层为卷积层,步长为1,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 17th layer is a convolutional layer, the step size is 1, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
第18层为平均池化层,步长为1,输入的大小为7×7×1024,池化大小为7×7;The 18th layer is an average pooling layer with a step size of 1, the input size is 7×7×1024, and the pooling size is 7×7;
第19层为全连接层,输入的大小为1×1×1024,包含1000个神经元;The 19th layer is a fully connected layer, and the input size is 1×1×1024, including 1000 neurons;
第20层为损失函数层,可适配,使用softmax函数作为损失函数用单标签分类,使用交叉熵函数作为多标签分类。The 20th layer is the loss function layer, which can be adapted, using the softmax function as the loss function with single label classification, and the cross entropy function as the multi-label classification.
轻量级神经网络模型的训练采用迁移学习的训练方式,先在大型数据库ImageNet上训练得到预训练模型,然后在基于本系统内部的图像和视频数据进行微调训练,得到轻量化快速图像分类模型。测试训练完毕的图像分类模型分类准确率达93.19%,满足实际电力生产场景的需求。The training of lightweight neural network model adopts the training method of migration learning. Firstly, the pre-training model is trained on the large database ImageNet, and then the fine-tuning training is carried out based on the image and video data inside the system to obtain a lightweight and fast image classification model. The classification accuracy of the image classification model after testing and training is 93.19%, which meets the needs of actual power production scenarios.
本实施例的图像管理模块配置为管理电力内外网的图像业务,面向图像业务,提供电力外网的文件上传、电力内网的文件导入、文件检索、文件下载、统计展示等基本功能。The image management module of the embodiment is configured to manage image services of the internal and external networks of the power, and provides basic functions such as file uploading of the power external network, file import of the power intranet, file retrieval, file downloading, and statistical display for the image service.
本实施例中,文件上传、导入、检索、下载、展示时会利用文件的属性,即图像和视频数据的关联信息。图像和视频数据的关联信息包括以下至少之一:视图类别、设备树、场景树、标签树、缺陷树、专业类型、文件来源等。其中,视图类别包括图像类别和视频数据类别;设备树表征拍摄图像或视频数据的前端设备地址;场景树表征拍摄图像或视频数据的电力场景;标签树表征拍摄图像或视频数据的具体内容,可由系统管理员制 定;缺陷树是可选项,表征拍摄图像或视频数据存在的缺陷问题,可由系统管理员制定;专业类型表征拍摄图像或视频数据的电力专业名称,如输电线路、变电站、换流站、机房、基建中心、营业厅;文件来源表征拍摄图像或视频数据的手段,如无人机拍摄、固定摄像头拍摄、机器人拍摄、手持终端拍摄等。In this embodiment, the attributes of the file, that is, the associated information of the image and the video data, are utilized when the file is uploaded, imported, retrieved, downloaded, and displayed. The association information of the image and video data includes at least one of the following: a view category, a device tree, a scene tree, a tag tree, a defect tree, a professional type, a file source, and the like. The view category includes an image category and a video data category; the device tree represents a front-end device address of the captured image or the video data; the scene tree represents a power scene of the captured image or the video data; and the tag tree represents a specific content of the captured image or the video data, Developed by the system administrator; the defect tree is optional and represents a defect in the captured image or video data, which can be determined by the system administrator; the professional type characterizes the power professional name of the captured image or video data, such as transmission lines, substations, converter stations , computer room, infrastructure center, business hall; document source means the means of shooting images or video data, such as drone shooting, fixed camera shooting, robot shooting, handheld terminal shooting.
本实施例的系统处理的图像业务按如下级别分类:The image services processed by the system of this embodiment are classified according to the following levels:
一级图像业务包括图像通用功能任务和图像业务应用任务;The first-level image service includes image common function tasks and image business application tasks;
二级图像业务包括图像通用功能任务下的图像去重、低质量图像剔除、视频转码、视频压缩、快速浏览,以及图像业务应用任务下面向运检的任务、面向调度的任务、面向基建的任务、面向营销的任务;The secondary image service includes image deduplication, low-quality image rejection, video transcoding, video compression, fast browsing, and image service application tasks to the inspection task, scheduling-oriented tasks, and infrastructure-oriented Task, marketing-oriented tasks;
三级图像业务包括面向运检的任务下的作业车辆检测、导线异物检测、树竹生长检测、导线覆冰检测、烟火检测、金具锈蚀检测、金具丢失检测、绝缘子破裂检测、绝缘子丢失检测、绝缘子污闪检测、变电站表计数字识别、变压器油枕漏油检测、变电站人员异常行为检测、变电站人员着装规范检测、变电站人员出入检测、烟火检测等;还包括面向调度的任务下的变压器开关状态识别和隔离开关状态识别等;还包括面向基建的任务下的进出口车辆检测、进出口车牌识别、进出口人员检测、进出口人员异常行为检测、人员着装规范检测、明火检测等;还包括面向营销的任务下的营业环境质量检测、服务人员到岗离岗情况检测、服务人员仪容仪表质量检测、服务人员工作行为质量检测、客户行为分析及异常识别等。The three-level image service includes work vehicle detection, wire foreign object detection, tree and bamboo growth detection, wire icing detection, pyrotechnic detection, fitting corrosion detection, fitting loss detection, insulator crack detection, insulator loss detection, and insulator under the task of inspection. Pollution flash detection, substation meter digital identification, transformer oil pillow oil leakage detection, substation personnel abnormal behavior detection, substation personnel dress code detection, substation personnel access detection, pyrotechnic detection, etc.; also includes transformer switching status recognition under scheduling tasks And isolation switch status recognition, etc.; also includes import and export vehicle detection, import and export license plate identification, import and export personnel inspection, import and export personnel abnormal behavior detection, personnel dressing specification detection, open flame detection, etc. for infrastructure-oriented tasks; The business environment quality inspection under the task, the service personnel to leave the post, the quality inspection of the service personnel, the quality of service personnel, the customer behavior analysis and the abnormality identification.
本实施例的图像标注模块,提供半自动的图像标注工具,配置为为轻量级神经网络模型提供训练数据集的标注信息。作为一种示例,图像标注分为两种,一种是系统自动标注,即输入图像到轻量级神经网络模型,以模型输出的分类结果为标注信息;另一种是用户识图标注,即用户查看图像数据,手动标注出图像内包含的物体类别和所在区域。用户识图标注可 作为系统自动标注的有效补充,可以过滤掉系统自动标注的不准确信息。The image annotation module of the embodiment provides a semi-automatic image annotation tool configured to provide annotation information of the training data set for the lightweight neural network model. As an example, the image annotation is divided into two types, one is automatic labeling by the system, that is, the input image is transferred to the lightweight neural network model, and the classification result output by the model is labeled information; the other is the user identification icon, that is, The user views the image data and manually labels the object category and the area contained in the image. The user identification icon can be used as an effective supplement to the system's automatic labeling, which can filter out inaccurate information automatically marked by the system.
本实施例的模型训练模块,配置为在多核异构并行计算模块内的GPU计算节点上训练轻量级神经网络模型。以图像标注的图像和视频数据为数据集,面向图像分类、目标检测、图像分割等机器学习任务,通过多核异构并行计算模块进行模型训练。The model training module of this embodiment is configured to train a lightweight neural network model on a GPU computing node within the multi-core heterogeneous parallel computing module. The image and video data marked by the image are used as data sets for machine learning tasks such as image classification, target detection and image segmentation, and the model training is carried out through the multi-core heterogeneous parallel computing module.
其中,训练过程中可以暂停、撤销、继续训练任务。若训练结果不够理想,则修改网络参数,迭代次数继续训练,直到得到满意的模型,满意的模型存储于GPU计算节点内,可替换原有的模型。Among them, the training process can be suspended, revoked, and continued. If the training result is not ideal, the network parameters are modified, and the number of iterations continues to be trained until a satisfactory model is obtained. The satisfactory model is stored in the GPU computing node, and the original model can be replaced.
本实施例的所述模型训练模块支持在线训练和离线训练两种方式。离线训练是指将所有数据一次性输入系统后开启模型训练任务。在线训练指模型训练任务启动后,可以输入新的数据到系统,加入正在迭代的模型后继续训练。The model training module of this embodiment supports two modes of online training and offline training. Offline training means that all data is entered into the system once and the model training task is started. Online training means that after the model training task is started, new data can be input to the system, and the model that is being iterated is added to continue training.
本实施例的算法应用模块,面向电力内外网图像业务,在多核异构并行计算模块内的GPU计算节点上利用轻量级神经网络模型执行智能分析任务。系统提供了面向电力场景、设备、人员、事件类别的多种算法模型,以适配不同的智能分析任务。具体的任务名称见图像管理模块。The algorithm application module of the embodiment is directed to the image service of the internal and external network of the power, and performs the intelligent analysis task by using the lightweight neural network model on the GPU computing node in the multi-core heterogeneous parallel computing module. The system provides a variety of algorithm models for power scenarios, devices, people, and event categories to adapt to different intelligent analysis tasks. The specific task name can be found in the image management module.
其中,所述算法应用模块执行智能分析任务的算法模型有两个来源,一是模型训练模块得到的轻量级神经网络模型,二是内置于GPU计算节点的其他成熟模型。The algorithm application module performs an intelligent analysis task algorithm model has two sources, one is a lightweight neural network model obtained by the model training module, and the other is a mature model built in the GPU computing node.
其中,任务执行过程中可以暂停、撤销、继续训练任务,任务结束之后可以删除任务。任务执行完毕后,用户可以根据结果操作图像和视频数据,基于上文提到的功能,包括但不限于,删除重复的图像和低质量图像,压缩长视频,转码不同格式的视频,对图像中的目标分类,检测和分割图像中的目标等。Among them, the task can be suspended, revoked, and resumed during the execution of the task, and the task can be deleted after the task ends. After the task is executed, the user can manipulate the image and video data according to the results, based on the functions mentioned above, including but not limited to, deleting duplicate images and low quality images, compressing long videos, transcoding videos of different formats, and pairing images. Target classification in the target, detecting and segmenting targets in the image, etc.
本系统具备可扩展性和普适性:系统的可扩展性体现为:The system is scalable and universal: the scalability of the system is reflected in:
1.系统层面,电力行业其他业务平台(例如统一视频监控平台以及运检、调度、营销、基建等电力信息化平台)通过网络服务接口与图像管理模块通信,获取经本系统处理后的目标图像和视频数据及其关联信息;图像管理模块通过网络服务接口与电力行业其他业务平台通信,获取电力系统的原始图像和视频数据及其关联信息,作为后续分析的数据源;图像管理模块也接收本地上传的图像和视频数据,作为后续分析数据源的有效补充;1. At the system level, other business platforms in the power industry (such as unified video surveillance platform and power information platform for transportation inspection, dispatching, marketing, infrastructure, etc.) communicate with the image management module through the network service interface to obtain the target image processed by the system. And the video data and its associated information; the image management module communicates with other service platforms of the power industry through the network service interface, and acquires the original image and video data of the power system and its associated information as a data source for subsequent analysis; the image management module also receives the local Uploaded image and video data as an effective complement to subsequent analysis data sources;
2.硬件层面,系统基于多核异构并行计算框架,不同的CPU节点或GPU节点可接入系统,实现多核并行运算和数据高效存储,完成离线或在线模型训练和智能任务执行功能;2. At the hardware level, the system is based on a multi-core heterogeneous parallel computing framework. Different CPU nodes or GPU nodes can access the system to achieve multi-core parallel computing and efficient data storage, and complete offline or online model training and intelligent task execution functions;
3.软件层面,本实施例提供了一种轻量级神经网络,算法中的网络模型可扩展,根据不同的任务特性适配不同的神经网络模型。3. Software level, this embodiment provides a lightweight neural network, the network model in the algorithm can be extended, and different neural network models are adapted according to different task characteristics.
系统的普适性体现为:The universality of the system is reflected as:
1.数据适用方面,平台内存储的结构化数据均按照国网标准统一编码格式,数据格式无需转换即可在电力行业业务平台间传输;1. In terms of data application, the structured data stored in the platform is in accordance with the national network standard unified coding format, and the data format can be transmitted between the power industry service platforms without conversion;
2.算法适用方面,系统提供了面向电力场景、设备、人员、事件类别的多种算法,可以适用多种电力分析场合。2. In terms of algorithm application, the system provides a variety of algorithms for power scenarios, devices, personnel, and event categories, and can be applied to a variety of power analysis scenarios.
有上述描述可见,本实施例中:It can be seen from the above description that in this embodiment:
1.采用了适合本系统的多核异构并行计算框架,其中各节点分工明确,逻辑清晰,交互合理;GPU计算节点承担模型训练和智能任务执行功能,完成密集计算;CPU存储管理结点承担数据存储功能,内嵌一种关系型数据库和一种非关系型数据库,CPU计算节点承担科学计算功能,同时辅助GPU计算节点完成部分密集计算功能;1. Adopt multi-core heterogeneous parallel computing framework suitable for this system, in which each node has clear division of labor, clear logic and reasonable interaction; GPU computing node undertakes model training and intelligent task execution function, completes intensive calculation; CPU storage management node bears data The storage function embeds a relational database and a non-relational database. The CPU compute node assumes the scientific computing function and assists the GPU computing node to perform part of the intensive computing function;
2.提出了一种轻量级神经网络模型作为系统的算法核心,神经网络可包含20层,采用了小卷积核和非对称卷积核结构、残差结构、弃权机制、批标准化机制,能有效提高分类准确率,同时加速收敛,增强网络泛化能 力,保持准确率的同时提升了训练速度和算法执行速度;同时网络提供针对输入数据的归一化和增强功能,能有效扩充数据集。轻量化快速图像分类模型基于现有的电力数据,参数是基于电力数据集训练得到的,并非基于通用的数据集,具备创新性和实用价值;A lightweight neural network model is proposed as the core of the system. The neural network can contain 20 layers, using small convolution kernel and asymmetric convolution kernel structure, residual structure, abandonment mechanism and batch standardization mechanism. It can effectively improve the classification accuracy, accelerate the convergence, enhance the network generalization ability, and maintain the accuracy rate while improving the training speed and algorithm execution speed. At the same time, the network provides normalization and enhancement functions for input data, which can effectively expand the data set. . The lightweight fast image classification model is based on existing power data. The parameters are based on the power data set training. It is not based on a common data set and is innovative and practical.
3.系统具备可扩展性和普适性,支持和电力行业其他业务平台的数据传递,支持多核异构的GPU节点和CPU节点并行计算,具备可扩展的算法网络模型,适用电力场景、设备、人员、事件类别的多种电力分析场合;3. The system has scalability and universality, supports data transmission with other business platforms in the power industry, supports parallel computing of multi-core heterogeneous GPU nodes and CPU nodes, and has a scalable algorithm network model for power scenarios and devices. Multiple power analysis scenarios for personnel and event categories;
本系统基于多核异构并行计算框架,可高效实现在线或离线图像数据训练,形成轻量化快速图像分类模型,可实现电力内外网图像业务应用,能最大化发挥图像数据的价值,具备较好的应用前景。The system is based on multi-core heterogeneous parallel computing framework, which can efficiently realize online or offline image data training, form a lightweight and fast image classification model, and can realize the application of power internal and external network image services, which can maximize the value of image data and have better performance. Application prospects.
以上所述仅是本申请的可选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明技术原理的前提下,还可以做出若干改进和变形,这些改进和变形也应视为本发明的保护范围。The above description is only an alternative embodiment of the present application, and it should be noted that those skilled in the art can make several improvements and modifications without departing from the technical principles of the present invention. Deformation should also be considered as the scope of protection of the present invention.

Claims (10)

  1. 一种基于多核异构并行计算的电力人工智能视觉分析系统,所述系统包括多核异构并行计算模块和业务应用模块;所述业务应用模块和所述多核异构并行计算模块之间通过网络服务接口传输数据;A power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing, the system comprising a multi-core heterogeneous parallel computing module and a business application module; the service application module and the multi-core heterogeneous parallel computing module are served through a network Interface transfer data;
    所述多核异构并行计算模块包括图形处理器GPU计算节点、中央处理器CPU存储管理节点和CPU计算节点,各节点之间通过交换机连接;所述GPU计算节点配置为执行模型训练,完成第一类计算任务;所述CPU存储管理节点配置为数据存储;所述CPU计算节点配置为执行第二类计算任务,以及辅助所述GPU计算节点执行部分所述第一类计算任务;The multi-core heterogeneous parallel computing module includes a graphics processor GPU computing node, a central processing unit CPU storage management node, and a CPU computing node, and each node is connected through a switch; the GPU computing node is configured to perform model training, completing the first a class computing task; the CPU storage management node is configured as a data store; the CPU compute node is configured to perform a second type of computing task, and assists the GPU computing node to perform a portion of the first type of computing task;
    所述业务应用模块包括图像管理模块、图像标注模块、模型训练模块和算法应用模块;所述图像管理模块配置为处理图像业务;所述图像标注模块配置为为轻量级神经网络模型提供训练数据集的标注信息;所述模型训练模块配置为基于所述多核异构并行计算模块训练轻量级神经网络模型;所述算法应用模块,配置为在所述多核异构并行计算模块上利用所述轻量级神经网络模型执行图像分析任务。The business application module includes an image management module, an image annotation module, a model training module, and an algorithm application module; the image management module is configured to process an image service; and the image annotation module is configured to provide training data for a lightweight neural network model Set labeling information; the model training module configured to train a lightweight neural network model based on the multi-core heterogeneous parallel computing module; the algorithm application module configured to utilize the multi-core heterogeneous parallel computing module The lightweight neural network model performs image analysis tasks.
  2. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein
    所述图像管理模块通过网络服务接口与电力行业其他业务平台通信,配置为向所述电力行业其他业务平台发送处理后的目标图像和视频数据及其关联信息;还配置为获取原始图像和视频数据及其关联信息;所述电力行业其他业务平台包括以下至少之一:统一视频监控平台以及运检、调度、营销、基建电力信息化平台。The image management module communicates with other service platforms of the power industry through a network service interface, and is configured to send the processed target image and video data and related information to other service platforms of the power industry; and is configured to acquire original image and video data. And related information; the other business platforms of the power industry include at least one of the following: a unified video monitoring platform and a transportation inspection, dispatching, marketing, and infrastructure power information platform.
  3. 根据权利要求2所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述关联信息包括图像和视频数据的属性,包括以下至少之一:视图类别、设备树、场景树、标签树、缺陷树、专业类型、 文件来源;The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 2, wherein the association information comprises attributes of image and video data, including at least one of the following: a view category, a device tree, and a scene. Tree, tag tree, defect tree, professional type, file source;
    所述视图类别包括图像类别和视频数据类别;所述设备树表征拍摄图像或视频数据的前端设备地址;所述场景树表征拍摄图像或视频数据的电力场景;所述标签树表征拍摄图像或视频数据的具体内容;所述缺陷树表征拍摄图像或视频存在的缺陷问题;所述专业类型描述拍摄图像或视频数据的电力专业名称;所述文件来源表征拍摄图像或视频数据的手段。The view category includes an image category and a video data category; the device tree characterizes a front end device address of the captured image or video data; the scene tree characterizes a power scene of the captured image or video data; the tag tree characterizes the captured image or video The specific content of the data; the defect tree characterizing a defect in the captured image or video; the professional type describing a power professional name of the captured image or video data; the file source characterizing means for capturing the image or video data.
  4. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述CPU存储管理节点内嵌关系型数据库和非关系型数据库;所述关系型数据库配置为存储图像和视频数据的关联关系,所述非关系型数据库配置为存储图像和视频数据。The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the CPU stores a management node embedded relational database and a non-relational database; the relational database is configured to be stored. An association of image and video data, the non-relational database being configured to store image and video data.
  5. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述轻量级神经网络模型,包括:卷积层、池化层和全连接层;The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the lightweight neural network model comprises: a convolution layer, a pooling layer and a fully connected layer;
    所述卷积层位于模型前端,所述池化层和所述全连接层位于模型后端;The convolution layer is located at a front end of the model, and the pooling layer and the fully connected layer are located at a back end of the model;
    所述轻量级神经网络模型采用1×1卷积核和1×3、3×1的非对称卷积核。The lightweight neural network model uses a 1×1 convolution kernel and a 1×3, 3×1 asymmetric convolution kernel.
  6. 根据权利要求1至5任一项所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述轻量级神经网络模型包括:The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to any one of claims 1 to 5, wherein the lightweight neural network model comprises:
    第1层为卷积层,步长为2,输入的大小为224×224×3,采用两个级联的卷积核1×3和3×1;The first layer is a convolution layer, the step size is 2, the input size is 224 × 224 × 3, using two concatenated convolution kernels 1 × 3 and 3 × 1;
    第2层为卷积层,步长为1,输入的大小为112×112×32,采用两个级联的卷积核1×3和3×1;The second layer is a convolution layer, the step size is 1, the input size is 112 × 112 × 32, using two cascaded convolution kernels 1 × 3 and 3 × 1;
    第3层为卷积层,步长为1,输入的大小为112×112×32,采用1×1的卷积核;The third layer is a convolution layer with a step size of 1, and the input size is 112×112×32, and a 1×1 convolution kernel is used;
    第4层为卷积层,步长为2,输入的大小为112×112×64,采用两个 级联的卷积核1×3和3×1;The fourth layer is a convolution layer, the step size is 2, the input size is 112×112×64, and two concatenated convolution kernels are used, 1×3 and 3×1;
    第5层为卷积层,步长为1,输入的大小为56×56×64,采用两个级联的卷积核1×3和3×1;The fifth layer is a convolution layer, the step size is 1, the input size is 56 × 56 × 64, using two cascaded convolution kernels 1 × 3 and 3 × 1;
    第6层为卷积层,步长为1,输入的大小为56×56×128,采用1×1的卷积核;The sixth layer is a convolution layer with a step size of 1, and the input size is 56×56×128, and a 1×1 convolution kernel is used;
    第7层为卷积层,步长为2,输入的大小为56×56×128,采用两个级联的卷积核1×3和3×1;The seventh layer is a convolution layer, the step size is 2, the input size is 56×56×128, and two concatenated convolution kernels are used, 1×3 and 3×1;
    第8层为卷积层,步长为1,输入的大小为28×28×128,采用两个级联的卷积核1×3和3×1;The eighth layer is a convolution layer, the step size is 1, the input size is 28 × 28 × 128, using two cascaded convolution kernels 1 × 3 and 3 × 1;
    第9层为卷积层,步长为1,输入的大小为28×28×256,采用1×1卷积核;The ninth layer is a convolution layer with a step size of 1, and the input size is 28×28×256, using a 1×1 convolution kernel;
    第10层为卷积层,步长为2,输入的大小为28×28×256,采用两个级联的卷积核1×3和3×1;The 10th layer is a convolutional layer with a step size of 2, the input size is 28×28×256, and two concatenated convolution kernels are used 1×3 and 3×1;
    第11层为卷积层,步长为1,输入的大小为14×14×256,采用两个级联的卷积核1×3和3×1;The 11th layer is a convolutional layer, the step size is 1, the input size is 14×14×256, and two concatenated convolution kernels are used, 1×3 and 3×1;
    第12层为卷积层,步长为1,输入的大小为14×14×512,采用1×1卷积核;The 12th layer is a convolution layer with a step size of 1, and the input size is 14×14×512, using a 1×1 convolution kernel;
    第13层为卷积层,步长为2,输入的大小为14×14×512,采用两个级联的卷积核1×3和3×1;The 13th layer is a convolutional layer, the step size is 2, the input size is 14×14×512, and two concatenated convolution kernels are used, 1×3 and 3×1;
    第14层为卷积层,步长为1,输入的大小为7×7×512,采用两个级联的卷积核1×3和3×1;The 14th layer is a convolutional layer with a step size of 1, the input size is 7×7×512, and two concatenated convolution kernels are used 1×3 and 3×1;
    第15层为卷积层,步长为1,输入的大小为7×7×1024,采用1×1卷积核;The 15th layer is a convolutional layer with a step size of 1, and the input size is 7×7×1024, using a 1×1 convolution kernel;
    第16层为卷积层,步长为2,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 16th layer is a convolutional layer with a step size of 2, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
    第17层为卷积层,步长为1,输入的大小为7×7×1024,采用两个级联的卷积核1×3和3×1;The 17th layer is a convolutional layer, the step size is 1, the input size is 7×7×1024, and two concatenated convolution kernels are used 1×3 and 3×1;
    第18层为平均池化层,步长为1,输入的大小为7×7×1024,池化大小为7×7;The 18th layer is an average pooling layer with a step size of 1, the input size is 7×7×1024, and the pooling size is 7×7;
    第19层为全连接层,输入的大小为1×1×1024,包含1000个神经元;The 19th layer is a fully connected layer, and the input size is 1×1×1024, including 1000 neurons;
    第20层为损失函数层,使用softmax函数作为损失函数用单标签分类,使用交叉熵函数作为多标签分类。The 20th layer is the loss function layer, using the softmax function as the loss function with a single label classification, and the cross entropy function as the multi-label classification.
  7. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述图像业务按如下级别分类:The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the image services are classified according to the following levels:
    一级图像业务包括图像通用功能任务和图像业务应用任务;The first-level image service includes image common function tasks and image business application tasks;
    二级图像业务包括图像通用功能任务下的图像去重、低质量图像剔除、视频转码、视频压缩、快速浏览,以及图像业务应用任务下面向运检的任务、面向调度的任务、面向基建的任务、面向营销的任务;The secondary image service includes image deduplication, low-quality image rejection, video transcoding, video compression, fast browsing, and image service application tasks to the inspection task, scheduling-oriented tasks, and infrastructure-oriented Task, marketing-oriented tasks;
    三级图像业务包括面向运检的任务下的作业车辆检测、导线异物检测、树竹生长检测、导线覆冰检测、烟火检测、金具锈蚀检测、金具丢失检测、绝缘子破裂检测、绝缘子丢失检测、绝缘子污闪检测、变电站表计数字识别、变压器油枕漏油检测、变电站人员异常行为检测、变电站人员着装规范检测、变电站人员出入检测、烟火检测;还包括面向调度的任务下的变压器开关状态识别和隔离开关状态识别;还包括面向基建的任务下的进出口车辆检测、进出口车牌识别、进出口人员检测、进出口人员异常行为检测、人员着装规范检测、明火检测;还包括面向营销的任务下的营业环境质量检测、服务人员到岗离岗情况检测、服务人员仪容仪表质量检测、服务人员工作行为质量检测、客户行为分析及异常识别。The three-level image service includes work vehicle detection, wire foreign object detection, tree and bamboo growth detection, wire icing detection, pyrotechnic detection, fitting corrosion detection, fitting loss detection, insulator crack detection, insulator loss detection, and insulator under the task of inspection. Pollution flash detection, substation meter digital identification, transformer oil sump oil leak detection, substation personnel abnormal behavior detection, substation personnel dress code detection, substation personnel access detection, pyrotechnic detection; also includes transformer switch state recognition and task-oriented tasks Isolation switch status recognition; also includes import and export vehicle detection, import and export license plate identification, import and export personnel detection, import and export personnel abnormal behavior detection, personnel dressing specification detection, open flame detection under the task of infrastructure construction; The quality of the business environment, the inspection of the service personnel to the post, the quality inspection of the service personnel, the quality of the service personnel, the customer behavior analysis and the abnormal identification.
  8. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述图像标注包括两种类型,一种类型是系统自动 标注,所述系统自动标注为图像输入至轻量级神经网络模型后、以输出的分类结果为标注信息;另一种类型是用户识图标注。The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the image annotation comprises two types, one type is automatic labeling by a system, and the system is automatically labeled as image input. After the lightweight neural network model, the output classification result is labeled information; the other type is the user identification icon.
  9. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述模型训练模块以图像标注的图像和视频数据为数据集,面向图像分类、目标检测、图像分割的机器学习任务,通过多核异构并行计算模块进行模型训练;The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the model training module uses image-annotated image and video data as a data set, and faces image classification, target detection, and image. Segmented machine learning tasks, model training through multi-core heterogeneous parallel computing modules;
    所述模型训练模块支持在线训练和离线训练;所述离线训练是指将数据一次性输入后执行模型训练任务;所述在线训练指模型训练任务启动后,在模型训练任务执行过程中输入新的数据。The model training module supports online training and offline training; the offline training refers to performing model training tasks after one-time input of data; and the online training refers to inputting new models in the execution of model training tasks after the model training tasks are started. data.
  10. 根据权利要求1所述的一种基于多核异构并行计算的电力人工智能视觉分析系统,其中,所述算法应用模块,配置为通过所述模型训练模块得到的所述轻量级神经网络模型和/或内置于所述GPU计算节点中的其他成熟模型执行图像分析。The power artificial intelligence visual analysis system based on multi-core heterogeneous parallel computing according to claim 1, wherein the algorithm application module is configured to obtain the lightweight neural network model obtained by the model training module and / or other mature models built into the GPU compute node to perform image analysis.
PCT/CN2018/114389 2017-12-05 2018-11-07 Power artificial-intelligence visual-analysis system on basis of multi-core heterogeneous parallel computing WO2019109771A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN201711268417.6A CN108171117B (en) 2017-12-05 2017-12-05 Electric power artificial intelligence visual analysis system based on multicore heterogeneous Computing
CN201711268417.6 2017-12-05

Publications (1)

Publication Number Publication Date
WO2019109771A1 true WO2019109771A1 (en) 2019-06-13

Family

ID=62524353

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2018/114389 WO2019109771A1 (en) 2017-12-05 2018-11-07 Power artificial-intelligence visual-analysis system on basis of multi-core heterogeneous parallel computing

Country Status (2)

Country Link
CN (1) CN108171117B (en)
WO (1) WO2019109771A1 (en)

Cited By (41)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334642A (en) * 2019-07-01 2019-10-15 河南牧业经济学院 The machine vision recognition method and system of one boar behavior
CN110658725A (en) * 2019-08-19 2020-01-07 周口师范学院 Energy supervision and prediction system and method based on artificial intelligence
CN110766604A (en) * 2019-09-04 2020-02-07 国网浙江省电力有限公司湖州供电公司 Expansion method of image data of double-column horizontal rotary isolating switch
CN110910440A (en) * 2019-09-30 2020-03-24 中国电力科学研究院有限公司 Power transmission line length determination method and system based on power image data
CN111027388A (en) * 2019-11-12 2020-04-17 国网天津市电力公司 Method and system for monitoring safety operation behaviors of constructors based on image recognition
CN111080652A (en) * 2019-12-23 2020-04-28 西安电子科技大学 Optical remote sensing image segmentation method based on multi-scale lightweight cavity convolution
CN111079645A (en) * 2019-12-16 2020-04-28 国网重庆市电力公司永川供电分公司 Insulator self-explosion identification method based on AlexNet network
CN111126138A (en) * 2019-11-18 2020-05-08 施博凯 AI image recognition method for garbage classification
CN111126196A (en) * 2019-12-10 2020-05-08 安徽银河物联通信技术有限公司 Equipment oil leakage detection method
CN111159095A (en) * 2020-01-02 2020-05-15 中国航空工业集团公司西安航空计算技术研究所 Heterogeneous integrated embedded intelligent computing implementation method
CN111310899A (en) * 2020-02-19 2020-06-19 山东大学 Electric power defect identification method based on symbiotic relationship and small sample learning
CN111339977A (en) * 2020-03-03 2020-06-26 河南中光学集团有限公司 Small target intelligent identification system based on remote video monitoring and identification method thereof
CN111353432A (en) * 2020-02-28 2020-06-30 安徽华润金蟾药业股份有限公司 Rapid honeysuckle medicinal material cleaning method and system based on convolutional neural network
CN111368978A (en) * 2020-03-02 2020-07-03 开放智能机器(上海)有限公司 Precision improving method for offline quantization tool
CN111462167A (en) * 2020-04-21 2020-07-28 济南浪潮高新科技投资发展有限公司 Intelligent terminal video analysis algorithm combining edge calculation and deep learning
CN111537515A (en) * 2020-03-31 2020-08-14 国网辽宁省电力有限公司朝阳供电公司 Iron tower bolt defect display method and system based on three-dimensional live-action model
CN111783968A (en) * 2020-06-30 2020-10-16 山东信通电子股份有限公司 Power transmission line monitoring method and system based on cloud edge cooperation
CN111858465A (en) * 2020-06-29 2020-10-30 西南电子技术研究所(中国电子科技集团公司第十研究所) Large-scale matrix QR decomposition parallel computing structure
CN112598142A (en) * 2020-12-16 2021-04-02 明阳智慧能源集团股份公司 Wind turbine generator overhaul work quality examination assisting method and system
CN113310897A (en) * 2021-05-31 2021-08-27 广东骏亚电子科技股份有限公司 AI visual inspection system application based on 5G algorithm
CN113326873A (en) * 2021-05-19 2021-08-31 云南电网有限责任公司电力科学研究院 Method for automatically classifying opening and closing states of power equipment based on data enhancement
CN113343392A (en) * 2021-07-06 2021-09-03 浙江天铂云科光电股份有限公司 Intelligent detection and positioning method for oil conservator
CN113505064A (en) * 2021-07-07 2021-10-15 广东电力信息科技有限公司 Heterogeneous information flow-based electric power big data service system testing method
CN113657348A (en) * 2021-08-31 2021-11-16 江苏中科云墨数字科技有限公司 Intelligent analysis method and system for operation violation behaviors of transformer substation
CN113780312A (en) * 2019-11-21 2021-12-10 同济大学 Highway road surface condition detecting system
CN114187156A (en) * 2021-12-17 2022-03-15 江西洪都航空工业集团有限责任公司 Intelligent recognition method for city management affair component under mobile background
CN114244975A (en) * 2021-11-15 2022-03-25 国能大渡河革什扎水电开发有限公司 Mobile operation process control method and system based on video edge calculation
CN114760445A (en) * 2022-04-21 2022-07-15 北京新童瑞科技有限公司 Remote control video double-confirmation method for primary electric power equipment
CN114757307A (en) * 2022-06-14 2022-07-15 中国电力科学研究院有限公司 Artificial intelligence automatic training method, system, device and storage medium
CN114760168A (en) * 2022-03-22 2022-07-15 广东电力通信科技有限公司 AI intelligent edge computing gateway
CN114827614A (en) * 2022-04-18 2022-07-29 重庆邮电大学 Method for realizing LCEVC video coding optimization
CN114937247A (en) * 2022-07-21 2022-08-23 四川金信石信息技术有限公司 Transformer substation monitoring method and system based on deep learning and electronic equipment
CN115116005A (en) * 2022-06-29 2022-09-27 济南银捷实业有限公司 Method for closing cash box, training field unmanned algorithm model and judging logic
CN115129479A (en) * 2022-08-24 2022-09-30 南方电网数字电网研究院有限公司 Method and device for processing edge data of power chip in partition mode and computer equipment
CN115240075A (en) * 2022-09-22 2022-10-25 山东大学 Construction and training method of electric power vision multi-granularity pre-training large model
CN115379286A (en) * 2022-10-24 2022-11-22 通号通信信息集团有限公司 Intelligent video analysis box, intelligent video analysis system and method
CN115984675A (en) * 2022-12-01 2023-04-18 扬州万方科技股份有限公司 System and method for realizing multi-channel video decoding and AI intelligent analysis
CN116309520A (en) * 2023-04-03 2023-06-23 江南大学 Strip steel surface defect detection system
CN116450486A (en) * 2023-06-16 2023-07-18 浪潮电子信息产业股份有限公司 Modeling method, device, equipment and medium for nodes in multi-element heterogeneous computing system
CN117389742A (en) * 2023-11-10 2024-01-12 深圳市天鹤科技有限公司 Edge computing method, device and storage medium for machine vision
CN118334663A (en) * 2024-06-13 2024-07-12 杭州宇泛智能科技股份有限公司 One-stop artificial intelligent image processing model construction method and device

Families Citing this family (27)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN108171117B (en) * 2017-12-05 2019-05-21 南京南瑞信息通信科技有限公司 Electric power artificial intelligence visual analysis system based on multicore heterogeneous Computing
CN110717574B (en) * 2018-07-11 2023-07-07 杭州海康威视数字技术股份有限公司 Neural network operation method and device and heterogeneous intelligent chip
CN109255345A (en) * 2018-08-21 2019-01-22 国网浙江省电力有限公司电力科学研究院 A kind of cable tunnel iron rust recognition methods based on convolutional neural networks
CN111198760B (en) * 2018-11-20 2024-08-09 北京搜狗科技发展有限公司 Data processing method and device
CN109727376B (en) * 2018-12-29 2022-03-04 北京沃东天骏信息技术有限公司 Method and device for generating configuration file and vending equipment
CN109829542B (en) * 2019-01-29 2021-04-16 武汉星巡智能科技有限公司 Multi-core processor-based multi-element deep network model reconstruction method and device
CN109947573A (en) * 2019-03-26 2019-06-28 北京智芯微电子科技有限公司 Intelligence suitable for electric system edge calculations accelerates chip
CN110175571A (en) * 2019-05-28 2019-08-27 华翔翔能电气股份有限公司 The intellectual monitoring of substation equipment state and recognition methods
CN110705425B (en) * 2019-09-25 2022-06-28 广州西思数字科技有限公司 Tongue picture multi-label classification method based on graph convolution network
CN111147603A (en) * 2019-09-30 2020-05-12 华为技术有限公司 Method and device for networking reasoning service
CN112784989B (en) * 2019-11-08 2024-05-03 阿里巴巴集团控股有限公司 Inference system, inference method, electronic device, and computer storage medium
CN111260553A (en) * 2020-01-13 2020-06-09 哈尔滨工程大学 Domestic vision computing system based on remote lossless video transmission
CN111582073A (en) * 2020-04-23 2020-08-25 浙江大学 Transformer substation violation identification method based on ResNet101 characteristic pyramid
CN111738084A (en) * 2020-05-21 2020-10-02 山东大学 Real-time target detection method and system based on CPU-GPU heterogeneous multiprocessor system on chip
CN111931713B (en) * 2020-09-21 2021-01-29 成都睿沿科技有限公司 Abnormal behavior detection method and device, electronic equipment and storage medium
CN112101266B (en) * 2020-09-25 2024-09-17 重庆电政信息科技有限公司 Multi-ARM-based motion recognition model distributed reasoning method
CN112164446B (en) * 2020-10-13 2022-04-22 电子科技大学 Medical image report generation method based on multi-network fusion
CN112528983B (en) * 2020-12-16 2023-12-26 国网江苏省电力有限公司检修分公司 GIS isolation/grounding switch video image acquisition system under dim light condition
CN113096002A (en) * 2021-04-02 2021-07-09 联捷计算科技(深圳)有限公司 Video frame scale transformation heterogeneous system and method and storage medium
CN113225537A (en) * 2021-05-10 2021-08-06 合肥中科类脑智能技术有限公司 Video image analysis system and method
CN113515829B (en) * 2021-05-21 2023-07-21 华北电力大学(保定) Situation awareness method for transmission line hardware defects under extremely cold disasters
CN113222134B (en) * 2021-07-12 2021-10-26 深圳市永达电子信息股份有限公司 Brain-like computing system, method and computer readable storage medium
CN113590321B (en) * 2021-07-30 2024-02-27 西安电子科技大学 Task configuration method for heterogeneous distributed machine learning cluster
WO2023015500A1 (en) * 2021-08-11 2023-02-16 Baidu.Com Times Technology (Beijing) Co., Ltd. Multiple-model heterogeneous computing
CN113762120A (en) * 2021-08-27 2021-12-07 南京南瑞信息通信科技有限公司 Insulator image segmentation method and device, electronic equipment and storage medium
CN113850186A (en) * 2021-09-24 2021-12-28 中国劳动关系学院 Intelligent streaming media video big data analysis method based on convolutional neural network
CN114915665B (en) * 2022-07-13 2022-10-21 香港中文大学(深圳) Heterogeneous task scheduling method based on hierarchical strategy

Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106504233A (en) * 2016-10-18 2017-03-15 国网山东省电力公司电力科学研究院 Image electric power widget recognition methodss and system are patrolled and examined based on the unmanned plane of Faster R CNN
CN107341127A (en) * 2017-07-05 2017-11-10 西安电子科技大学 Convolutional neural networks accelerated method based on OpenCL standards
CN108171117A (en) * 2017-12-05 2018-06-15 南京南瑞信息通信科技有限公司 Electric power artificial intelligence visual analysis system based on multinuclear heterogeneous Computing

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US8175617B2 (en) * 2009-10-28 2012-05-08 Digimarc Corporation Sensor-based mobile search, related methods and systems
WO2016149881A1 (en) * 2015-03-20 2016-09-29 Intel Corporation Object recogntion based on boosting binary convolutional neural network features
US10332509B2 (en) * 2015-11-25 2019-06-25 Baidu USA, LLC End-to-end speech recognition
CN106527455A (en) * 2017-01-03 2017-03-22 北京博瑞空间科技发展有限公司 UAV landing control method and device

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN106504233A (en) * 2016-10-18 2017-03-15 国网山东省电力公司电力科学研究院 Image electric power widget recognition methodss and system are patrolled and examined based on the unmanned plane of Faster R CNN
CN107341127A (en) * 2017-07-05 2017-11-10 西安电子科技大学 Convolutional neural networks accelerated method based on OpenCL standards
CN108171117A (en) * 2017-12-05 2018-06-15 南京南瑞信息通信科技有限公司 Electric power artificial intelligence visual analysis system based on multinuclear heterogeneous Computing

Cited By (59)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110334642A (en) * 2019-07-01 2019-10-15 河南牧业经济学院 The machine vision recognition method and system of one boar behavior
CN110658725A (en) * 2019-08-19 2020-01-07 周口师范学院 Energy supervision and prediction system and method based on artificial intelligence
CN110766604A (en) * 2019-09-04 2020-02-07 国网浙江省电力有限公司湖州供电公司 Expansion method of image data of double-column horizontal rotary isolating switch
CN110766604B (en) * 2019-09-04 2023-06-06 国网浙江省电力有限公司湖州供电公司 Expansion method for image data of double-column horizontal rotary isolating switch
CN110910440B (en) * 2019-09-30 2023-04-07 中国电力科学研究院有限公司 Power transmission line length determination method and system based on power image data
CN110910440A (en) * 2019-09-30 2020-03-24 中国电力科学研究院有限公司 Power transmission line length determination method and system based on power image data
CN111027388A (en) * 2019-11-12 2020-04-17 国网天津市电力公司 Method and system for monitoring safety operation behaviors of constructors based on image recognition
CN111126138A (en) * 2019-11-18 2020-05-08 施博凯 AI image recognition method for garbage classification
CN113780312B (en) * 2019-11-21 2024-04-12 同济大学 Highway road surface condition detecting system
CN113780312A (en) * 2019-11-21 2021-12-10 同济大学 Highway road surface condition detecting system
CN111126196A (en) * 2019-12-10 2020-05-08 安徽银河物联通信技术有限公司 Equipment oil leakage detection method
CN111079645A (en) * 2019-12-16 2020-04-28 国网重庆市电力公司永川供电分公司 Insulator self-explosion identification method based on AlexNet network
CN111080652A (en) * 2019-12-23 2020-04-28 西安电子科技大学 Optical remote sensing image segmentation method based on multi-scale lightweight cavity convolution
CN111159095A (en) * 2020-01-02 2020-05-15 中国航空工业集团公司西安航空计算技术研究所 Heterogeneous integrated embedded intelligent computing implementation method
CN111159095B (en) * 2020-01-02 2023-05-12 中国航空工业集团公司西安航空计算技术研究所 Heterogeneous fusion embedded intelligent computing implementation method
CN111310899A (en) * 2020-02-19 2020-06-19 山东大学 Electric power defect identification method based on symbiotic relationship and small sample learning
CN111310899B (en) * 2020-02-19 2023-07-11 山东大学 Power defect identification method based on symbiotic relation and small sample learning
CN111353432B (en) * 2020-02-28 2023-08-01 安徽华润金蟾药业股份有限公司 Rapid clean selection method and system for honeysuckle medicinal materials based on convolutional neural network
CN111353432A (en) * 2020-02-28 2020-06-30 安徽华润金蟾药业股份有限公司 Rapid honeysuckle medicinal material cleaning method and system based on convolutional neural network
CN111368978A (en) * 2020-03-02 2020-07-03 开放智能机器(上海)有限公司 Precision improving method for offline quantization tool
CN111368978B (en) * 2020-03-02 2023-03-24 开放智能机器(上海)有限公司 Precision improving method for offline quantization tool
CN111339977A (en) * 2020-03-03 2020-06-26 河南中光学集团有限公司 Small target intelligent identification system based on remote video monitoring and identification method thereof
CN111537515B (en) * 2020-03-31 2023-05-12 国网辽宁省电力有限公司朝阳供电公司 Iron tower bolt defect display method and system based on three-dimensional live-action model
CN111537515A (en) * 2020-03-31 2020-08-14 国网辽宁省电力有限公司朝阳供电公司 Iron tower bolt defect display method and system based on three-dimensional live-action model
CN111462167A (en) * 2020-04-21 2020-07-28 济南浪潮高新科技投资发展有限公司 Intelligent terminal video analysis algorithm combining edge calculation and deep learning
CN111858465A (en) * 2020-06-29 2020-10-30 西南电子技术研究所(中国电子科技集团公司第十研究所) Large-scale matrix QR decomposition parallel computing structure
CN111858465B (en) * 2020-06-29 2023-06-06 西南电子技术研究所(中国电子科技集团公司第十研究所) Large-scale matrix QR decomposition parallel computing system
CN111783968B (en) * 2020-06-30 2024-05-31 山东信通电子股份有限公司 Power transmission line monitoring method and system based on cloud edge cooperation
CN111783968A (en) * 2020-06-30 2020-10-16 山东信通电子股份有限公司 Power transmission line monitoring method and system based on cloud edge cooperation
CN112598142A (en) * 2020-12-16 2021-04-02 明阳智慧能源集团股份公司 Wind turbine generator overhaul work quality examination assisting method and system
CN112598142B (en) * 2020-12-16 2024-02-02 明阳智慧能源集团股份公司 Wind turbine maintenance working quality inspection auxiliary method and system
CN113326873A (en) * 2021-05-19 2021-08-31 云南电网有限责任公司电力科学研究院 Method for automatically classifying opening and closing states of power equipment based on data enhancement
CN113310897A (en) * 2021-05-31 2021-08-27 广东骏亚电子科技股份有限公司 AI visual inspection system application based on 5G algorithm
CN113343392A (en) * 2021-07-06 2021-09-03 浙江天铂云科光电股份有限公司 Intelligent detection and positioning method for oil conservator
CN113505064B (en) * 2021-07-07 2022-05-17 广东电力信息科技有限公司 Heterogeneous information flow-based electric power big data service system testing method
CN113505064A (en) * 2021-07-07 2021-10-15 广东电力信息科技有限公司 Heterogeneous information flow-based electric power big data service system testing method
CN113657348A (en) * 2021-08-31 2021-11-16 江苏中科云墨数字科技有限公司 Intelligent analysis method and system for operation violation behaviors of transformer substation
CN114244975A (en) * 2021-11-15 2022-03-25 国能大渡河革什扎水电开发有限公司 Mobile operation process control method and system based on video edge calculation
CN114187156A (en) * 2021-12-17 2022-03-15 江西洪都航空工业集团有限责任公司 Intelligent recognition method for city management affair component under mobile background
CN114760168A (en) * 2022-03-22 2022-07-15 广东电力通信科技有限公司 AI intelligent edge computing gateway
CN114827614A (en) * 2022-04-18 2022-07-29 重庆邮电大学 Method for realizing LCEVC video coding optimization
CN114827614B (en) * 2022-04-18 2024-03-22 重庆邮电大学 Method for realizing LCEVC video coding optimization
CN114760445A (en) * 2022-04-21 2022-07-15 北京新童瑞科技有限公司 Remote control video double-confirmation method for primary electric power equipment
CN114757307B (en) * 2022-06-14 2022-09-06 中国电力科学研究院有限公司 Artificial intelligence automatic training method, system, device and storage medium
CN114757307A (en) * 2022-06-14 2022-07-15 中国电力科学研究院有限公司 Artificial intelligence automatic training method, system, device and storage medium
CN115116005A (en) * 2022-06-29 2022-09-27 济南银捷实业有限公司 Method for closing cash box, training field unmanned algorithm model and judging logic
CN114937247A (en) * 2022-07-21 2022-08-23 四川金信石信息技术有限公司 Transformer substation monitoring method and system based on deep learning and electronic equipment
CN115129479A (en) * 2022-08-24 2022-09-30 南方电网数字电网研究院有限公司 Method and device for processing edge data of power chip in partition mode and computer equipment
CN115240075B (en) * 2022-09-22 2022-12-13 山东大学 Construction and training method of electric power vision multi-granularity pre-training large model
CN115240075A (en) * 2022-09-22 2022-10-25 山东大学 Construction and training method of electric power vision multi-granularity pre-training large model
CN115379286A (en) * 2022-10-24 2022-11-22 通号通信信息集团有限公司 Intelligent video analysis box, intelligent video analysis system and method
CN115984675A (en) * 2022-12-01 2023-04-18 扬州万方科技股份有限公司 System and method for realizing multi-channel video decoding and AI intelligent analysis
CN115984675B (en) * 2022-12-01 2023-10-13 扬州万方科技股份有限公司 System and method for realizing multipath video decoding and AI intelligent analysis
CN116309520A (en) * 2023-04-03 2023-06-23 江南大学 Strip steel surface defect detection system
CN116450486B (en) * 2023-06-16 2023-09-05 浪潮电子信息产业股份有限公司 Modeling method, device, equipment and medium for nodes in multi-element heterogeneous computing system
CN116450486A (en) * 2023-06-16 2023-07-18 浪潮电子信息产业股份有限公司 Modeling method, device, equipment and medium for nodes in multi-element heterogeneous computing system
CN117389742A (en) * 2023-11-10 2024-01-12 深圳市天鹤科技有限公司 Edge computing method, device and storage medium for machine vision
CN117389742B (en) * 2023-11-10 2024-05-31 深圳市天鹤科技有限公司 Edge computing method, device and storage medium for machine vision
CN118334663A (en) * 2024-06-13 2024-07-12 杭州宇泛智能科技股份有限公司 One-stop artificial intelligent image processing model construction method and device

Also Published As

Publication number Publication date
CN108171117A (en) 2018-06-15
CN108171117B (en) 2019-05-21

Similar Documents

Publication Publication Date Title
WO2019109771A1 (en) Power artificial-intelligence visual-analysis system on basis of multi-core heterogeneous parallel computing
WO2022068196A1 (en) Cross-modal data processing method and device, storage medium, and electronic device
JP7412847B2 (en) Image processing method, image processing device, server, and computer program
CN113486833B (en) Multi-modal feature extraction model training method and device and electronic equipment
CN107908175B (en) On-site intelligent operation and maintenance system for power system
CN109743356B (en) Industrial internet data acquisition method and device, readable storage medium and terminal
CN113704531A (en) Image processing method, image processing device, electronic equipment and computer readable storage medium
CN108764456B (en) Airborne target identification model construction platform, airborne target identification method and equipment
WO2022116616A1 (en) Behavior recognition method based on conversion module
CN108055529A (en) Electric power unmanned plane and robot graphics' data normalization artificial intelligence analysis's system
CN117096867A (en) Short-term power load prediction method, device, system and storage medium
WO2022111387A1 (en) Data processing method and related apparatus
CN113222560A (en) Enterprise service system based on AI consultation and construction method thereof
WO2024040941A1 (en) Neural architecture search method and device, and storage medium
Belhi et al. Deep learning and cultural heritage: the CEPROQHA project case study
CN114332659A (en) Power transmission line defect inspection method and device based on lightweight model issuing
CN116778527A (en) Human body model construction method, device, equipment and storage medium
US20240303957A1 (en) End-edge-cloud coordination system and method based on digital retina, and device
CN112905571B (en) Train rail transit sensor data management method and device
WO2024094127A1 (en) Parameter tuning method and apparatus, and computer device and storage medium
CN107679097A (en) A kind of distributed data processing method, system and storage medium
WO2023174256A1 (en) Data compression method and related device
CN114925210B (en) Knowledge graph construction method, device, medium and equipment
CN110968596A (en) Data processing method based on label system
CN113139490B (en) Image feature matching method and device, computer equipment and storage medium

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 18885440

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 18885440

Country of ref document: EP

Kind code of ref document: A1