Nothing Special   »   [go: up one dir, main page]

CN110443172A - A kind of object detection method and system based on super-resolution and model compression - Google Patents

A kind of object detection method and system based on super-resolution and model compression Download PDF

Info

Publication number
CN110443172A
CN110443172A CN201910678925.4A CN201910678925A CN110443172A CN 110443172 A CN110443172 A CN 110443172A CN 201910678925 A CN201910678925 A CN 201910678925A CN 110443172 A CN110443172 A CN 110443172A
Authority
CN
China
Prior art keywords
resolution
model
super
faster
network
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201910678925.4A
Other languages
Chinese (zh)
Inventor
阿孜古丽·吾拉木
杨容季
张德政
谢永红
李鹏
史家兴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
University of Science and Technology Beijing USTB
Original Assignee
University of Science and Technology Beijing USTB
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by University of Science and Technology Beijing USTB filed Critical University of Science and Technology Beijing USTB
Priority to CN201910678925.4A priority Critical patent/CN110443172A/en
Publication of CN110443172A publication Critical patent/CN110443172A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06FELECTRIC DIGITAL DATA PROCESSING
    • G06F18/00Pattern recognition
    • G06F18/20Analysing
    • G06F18/21Design or setup of recognition systems or techniques; Extraction of features in feature space; Blind source separation
    • G06F18/214Generating training patterns; Bootstrap methods, e.g. bagging or boosting
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06TIMAGE DATA PROCESSING OR GENERATION, IN GENERAL
    • G06T3/00Geometric image transformations in the plane of the image
    • G06T3/40Scaling of whole images or parts thereof, e.g. expanding or contracting
    • G06T3/4053Scaling of whole images or parts thereof, e.g. expanding or contracting based on super-resolution, i.e. the output image resolution being higher than the sensor resolution
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V20/00Scenes; Scene-specific elements
    • G06V20/10Terrestrial scenes
    • G06V20/13Satellite images
    • GPHYSICS
    • G06COMPUTING; CALCULATING OR COUNTING
    • G06VIMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
    • G06V2201/00Indexing scheme relating to image or video recognition or understanding
    • G06V2201/07Target detection

Landscapes

  • Engineering & Computer Science (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Theoretical Computer Science (AREA)
  • Data Mining & Analysis (AREA)
  • Bioinformatics & Computational Biology (AREA)
  • Artificial Intelligence (AREA)
  • Bioinformatics & Cheminformatics (AREA)
  • Life Sciences & Earth Sciences (AREA)
  • Computer Vision & Pattern Recognition (AREA)
  • Evolutionary Biology (AREA)
  • Evolutionary Computation (AREA)
  • General Engineering & Computer Science (AREA)
  • Astronomy & Astrophysics (AREA)
  • Remote Sensing (AREA)
  • Multimedia (AREA)
  • Image Analysis (AREA)

Abstract

本发明提供一种基于超分辨率和模型压缩的目标检测方法及系统,该方法包括:采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;对Faster‑RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster‑RCNN网络训练目标检测模型;采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。本发明的方案可以对卫星拍摄的低分辨率的卫星图像实现实时目标检测,且具有良好的检测精度。

The present invention provides a target detection method and system based on super-resolution and model compression. The method includes: using an image with high resolution as a training label, using a corresponding image with low resolution as a training sample, and training super-resolution use the trained super-resolution model to process the low-resolution images to be processed to generate corresponding high-resolution images; improve the Faster‑RCNN network according to the preset method, and improve the high-resolution images generated by the super-resolution model. The target detection model is trained using the improved Faster‑RCNN network; the trained target detection model is compressed by the preset model compression method, so that it can be deployed on smart terminals. The solution of the present invention can realize real-time target detection for low-resolution satellite images captured by satellites, and has good detection accuracy.

Description

一种基于超分辨率和模型压缩的目标检测方法及系统A target detection method and system based on super-resolution and model compression

技术领域technical field

本发明涉及目标检测技术领域,特别是指一种基于超分辨率和模型压缩的目标检测方法和系统。The invention relates to the technical field of target detection, in particular to a target detection method and system based on super-resolution and model compression.

背景技术Background technique

目标检测技术是近年来技术发展迭代的非常快的一个计算机视觉领域,在这个领域中具有划时代意义的是RCNN系列目标检测网络。在RCNN系列中,性能最佳,速度最快的当属Faster-RCNN网络,但是针对卫星拍摄的大量的低分辨率的卫星图像上的目标检测,现有目标检测方法的检测检测精度并不理想;此外,现阶段虽然有很多检测精度较高的网络模型,例如Faster-RCNN, Fast-RCNN等等。但是,这些网络在没有做模型压缩的情况下相对YOLOv3等网络速度较慢,因此不能真正的实现实时目标检测。Target detection technology is a field of computer vision that has experienced rapid technological development in recent years. The epoch-making significance in this field is the RCNN series of target detection networks. Among the RCNN series, the Faster-RCNN network has the best performance and the fastest speed. However, for target detection on a large number of low-resolution satellite images captured by satellites, the detection accuracy of existing target detection methods is not ideal. ; In addition, although there are many network models with high detection accuracy at this stage, such as Faster-RCNN, Fast-RCNN and so on. However, these networks are slower than networks such as YOLOv3 without model compression, so they cannot truly achieve real-time target detection.

发明内容SUMMARY OF THE INVENTION

本发明的目的是解决针对卫星拍摄的大量的低分辨率的卫星图像上的目标检测问题,现有目标检测方法无法实现实时准确检测;且现阶段的检测精度较高的网络模型在没有做模型压缩的情况下相对YOLOv3等网络速度较慢,因此不能真正的实现实时目标检测的问题。The purpose of the present invention is to solve the problem of target detection on a large number of low-resolution satellite images captured by satellites, and the existing target detection methods cannot achieve real-time accurate detection; In the case of compression, the network speed is slower than that of YOLOv3, so it cannot really realize the problem of real-time target detection.

为解决上述技术问题,本发明提供一种基于超分辨率和模型压缩的目标检测方法,所述方法包括:In order to solve the above technical problems, the present invention provides a target detection method based on super-resolution and model compression, and the method includes:

采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;Use high-resolution images as training labels, and use corresponding low-resolution images as training samples to train the super-resolution model;

利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;Use the trained super-resolution model to process the low-resolution images to be processed to generate corresponding high-resolution images;

对Faster-RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster-RCNN网络训练目标检测模型;Improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model;

采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。A preset model compression method is used to compress the trained target detection model so that it can be deployed on smart terminals.

进一步地,所述对Faster-RCNN网络按照预设方法进行改进,包括:Further, the Faster-RCNN network is improved according to a preset method, including:

将Faster-RCNN网络的基础网络由VGG16改为ResNeXt101。Change the base network of Faster-RCNN network from VGG16 to ResNeXt101.

进一步地,所述对Faster-RCNN网络按照预设方法进行改进,还包括:Further, the improvement of the Faster-RCNN network according to the preset method also includes:

将Faster-RCNN网络中的卷积层替换为虫蚀卷积层。Replace the convolutional layers in the Faster-RCNN network with worm-eaten convolutional layers.

进一步地,所述对Faster-RCNN网络按照预设方法进行改进,还包括:Further, the improvement of the Faster-RCNN network according to the preset method also includes:

在所述基础网络上使用FPN网络。The FPN network is used on the base network.

进一步地,所述采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上,具体为:Further, using the preset model compression method to compress the trained target detection model so that it can be deployed on the intelligent terminal, specifically:

使用改进后的Faster-RCNN网络中的卷积层后面BN层的gamma参数进行模型剪枝,对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。The gamma parameter of the BN layer behind the convolutional layer in the improved Faster-RCNN network is used for model pruning, and the trained target detection model is compressed so that it can be deployed on smart terminals.

相应地,为解决上述技术问题,本发明还提供一种基于超分辨率和模型压缩的目标检测系统,所述系统包括:Correspondingly, in order to solve the above technical problems, the present invention also provides a target detection system based on super-resolution and model compression, and the system includes:

超分辨率模型训练模块,用于采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;The super-resolution model training module is used to use high-resolution images as training labels, and use corresponding low-resolution images as training samples to train super-resolution models;

高分辨率图像生成模块,用于利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;The high-resolution image generation module is used to process the low-resolution images to be processed by using the trained super-resolution model to generate corresponding high-resolution images;

目标检测模型训练模块,用于对Faster-RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster-RCNN网络训练目标检测模型;The target detection model training module is used to improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model;

模型压缩模块,用于采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。The model compression module is used to compress the trained target detection model by using a preset model compression method, so that it can be deployed on the intelligent terminal.

进一步地,所述目标检测模型训练模块具体用于:Further, the target detection model training module is specifically used for:

将Faster-RCNN网络的基础网络由VGG16改为ResNeXt101。Change the base network of Faster-RCNN network from VGG16 to ResNeXt101.

进一步地,所述目标检测模型训练模块还用于:Further, the target detection model training module is also used for:

将Faster-RCNN网络中的卷积层替换为虫蚀卷积层。Replace the convolutional layers in the Faster-RCNN network with worm-eaten convolutional layers.

进一步地,所述目标检测模型训练模块还用于:Further, the target detection model training module is also used for:

在所述基础网络上使用FPN网络。The FPN network is used on the base network.

进一步地,所述模型压缩模块具体用于:Further, the model compression module is specifically used for:

使用改进后的Faster-RCNN网络中的卷积层后面BN层的gamma参数进行模型剪枝,对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。The gamma parameter of the BN layer behind the convolutional layer in the improved Faster-RCNN network is used for model pruning, and the trained target detection model is compressed so that it can be deployed on smart terminals.

本发明的上述技术方案的有益效果如下:The beneficial effects of the above-mentioned technical solutions of the present invention are as follows:

本发明通过采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;对Faster-RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的 Faster-RCNN网络训练目标检测模型;采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使其能被部署到智能终端上。从而实现了可以对卫星拍摄的低分辨率的卫星图像进行实时目标检测的效果,且具有良好的检测精度。The present invention trains a super-resolution model by using an image with high resolution as a training label and a corresponding image with low resolution as a training sample; and uses the trained super-resolution model to process the low-resolution image to be processed. , generate corresponding high-resolution images; improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model; use the preset The model compression method compresses the trained target detection model so that it can be deployed on smart terminals. Therefore, the effect of real-time target detection on the low-resolution satellite image captured by the satellite is realized, and the detection accuracy is good.

附图说明Description of drawings

图1为本发明的基于超分辨率和模型压缩的目标检测方法的流程示意图;1 is a schematic flowchart of a target detection method based on super-resolution and model compression of the present invention;

图2为RDN网络整体架构示意图;Figure 2 is a schematic diagram of the overall architecture of the RDN network;

图3为RDB模块的结构示意图;Fig. 3 is the structural representation of RDB module;

图4为ResNeXt50和ResNet50结构对比示意图;Figure 4 is a schematic diagram of the structure comparison between ResNeXt50 and ResNet50;

图5为ResNet和ResNeXt的块结构对比示意图;Figure 5 is a schematic diagram of the block structure comparison between ResNet and ResNeXt;

图6为FPN网络的结构示意图;FIG. 6 is a schematic structural diagram of an FPN network;

图7为模型剪枝的步骤示意图。FIG. 7 is a schematic diagram of the steps of model pruning.

具体实施方式Detailed ways

为使本发明要解决的技术问题、技术方案和优点更加清楚,下面将结合附图及具体实施例进行详细描述。In order to make the technical problems, technical solutions and advantages to be solved by the present invention more clear, the following will be described in detail with reference to the accompanying drawings and specific embodiments.

第一实施例first embodiment

本实施例提供一种基于超分辨率和模型压缩的目标检测方法,该方法涉及到图像超分辨率技术,目标检测技术以及模型压缩技术。其中,高精度的目标检测技术是根本,而图像超分辨率和模型压缩技术是为了使本实施例的方法解决目标检测任务的性能更优。This embodiment provides a target detection method based on super-resolution and model compression, and the method involves image super-resolution technology, target detection technology and model compression technology. Among them, the high-precision target detection technology is fundamental, and the image super-resolution and model compression technology are used to make the method of this embodiment perform better in solving the target detection task.

超分辨率技术是指从观测到的低分辨率图像重建出相应的高分辨率图像的一种技术。图片的分辨率太低会极大的影响计算机视觉中的目标检测和语义分割,因为分辨率低的图片会丢失很多的图片信息,因此本实施例采用了超分辨率技术首先将低分辨率的图片转化为高分辨率的图片。Super-resolution technology refers to a technology that reconstructs a corresponding high-resolution image from an observed low-resolution image. If the resolution of the picture is too low, it will greatly affect the target detection and semantic segmentation in computer vision, because the picture with low resolution will lose a lot of picture information, so this embodiment uses the super-resolution technology to first Convert images to high-resolution images.

模型压缩技术是近几年来为了在智能设备上部署深度学习技术而应用广泛的技术,被广泛应用在智能手机,智能穿戴设备等硬件条件不是特别强的终端上。从最初的MobileNet,SqueezeNet和模型剪枝等方法发展到如今的模型自动剪枝,模型重训练等方法,使得智能终端能够大规模的部署人工智能技术。Model compression technology is a technology that has been widely used in recent years to deploy deep learning technology on smart devices. It is widely used in smart phones, smart wearable devices and other terminals where hardware conditions are not particularly strong. From the original MobileNet, SqueezeNet and model pruning methods to today's automatic model pruning, model retraining and other methods, intelligent terminals can deploy artificial intelligence technology on a large scale.

具体地,如图1所示,本实施例的基于超分辨率和模型压缩的目标检测方法,该基于超分辨率和模型压缩的目标检测方法包括:Specifically, as shown in FIG. 1 , the target detection method based on super-resolution and model compression in this embodiment includes:

S101,采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;S101, using an image with a high resolution as a training label, and using a corresponding image with a low resolution as a training sample, to train a super-resolution model;

S102,利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;S102, using the trained super-resolution model to process the low-resolution image to be processed to generate a corresponding high-resolution image;

需要说明的是,上述S101和S102的作用在于对原始模糊不清的卫星图像使用超分辨率技术使其清晰化;S101中训练超分辨率模型使用的是RDN网络,其网络架构如图2所示,RDN网络的参数较少,适合用来作为实时生成高分辨率图像的模型;该网络为了尽可能多的利用特征和尽快的实现训练的收敛,采用了RDB(Residual dense block)的结构。该RDB的结构如图3所示。It should be noted that the functions of the above S101 and S102 are to use super-resolution technology to make the original blurred satellite images clear; the super-resolution model in S101 uses the RDN network, and its network architecture is shown in Figure 2 It is shown that the RDN network has fewer parameters and is suitable for use as a model for generating high-resolution images in real time; the network adopts the structure of RDB (Residual dense block) in order to utilize as many features as possible and achieve training convergence as soon as possible. The structure of the RDB is shown in Figure 3.

S103,对Faster-RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster-RCNN网络训练目标检测模型;S103, improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model;

需要说明的是,上述步骤是采用了改进的Faster-RCNN网络进行目标检测。Faster-RCNN在提出之初,是采用VGG16网络作为基础网络负责对原始的图像提取特征,紧接着将提取出的特征图输入到RPN网络中,RPN网络使用一个3*3 的窗口在特征图上滑动并且将9种anchor应用到原图上,最终训练保留下来一定数量的ROIs作为下一步的输入。It should be noted that the above steps are to use the improved Faster-RCNN network for target detection. At the beginning of the proposal, Faster-RCNN used the VGG16 network as the basic network to extract features from the original image, and then input the extracted feature map into the RPN network, which used a 3*3 window on the feature map. Sliding and applying 9 kinds of anchors to the original image, the final training retains a certain number of ROIs as the input of the next step.

本实施例的第一个改进就是将基础网络由VGG16改为了ResNeXt101。 ResNeXt101网络拥有101个卷积层,且在每一个卷积层上将特征图沿通道方向等分为32份,这能够对原始图像进行更好的特征提取;此外,ResNeXt101 还对低层的特征进行了复用,在一定程度上解决了梯度消失的问题;最后, ResNeXt101相比较ResNet101而言并没有带来额外的参数,对于训练和推导过程的速度没有影响。图4是ResNeXt50和ResNet50的结构对比示意图。The first improvement of this embodiment is to change the basic network from VGG16 to ResNeXt101. The ResNeXt101 network has 101 convolutional layers, and each convolutional layer divides the feature map into 32 equal parts along the channel direction, which enables better feature extraction for the original image; To a certain extent, it solves the problem of gradient disappearance; finally, compared with ResNet101, ResNeXt101 does not bring additional parameters, which has no effect on the speed of training and derivation process. Figure 4 is a schematic diagram of the structure comparison between ResNeXt50 and ResNet50.

从图4中可以看出,虽然在每一个卷积层的输出特征图的通道数增加了两倍,但是ResNeXt网络的参数没有增加,反而还有一定的减少。图5则为 ResNeXt和ResNet的块结构对比示意图。As can be seen from Figure 4, although the number of channels of the output feature map in each convolutional layer has doubled, the parameters of the ResNeXt network have not increased, but have decreased to a certain extent. Figure 5 is a schematic diagram of the block structure comparison between ResNeXt and ResNet.

本实施例对Faster-RCNN网络做的第二个改进是将卷积层替换为虫蚀卷积(Atrous Convolution)层,使用此卷积方式可增加特征图的感受野。使用虫蚀卷积的想法来源于InceptionV3网络,这种语义分割网络使用了虫蚀卷积的方式增加了卷积过程中特征图的感受野。而增加特征图的感受野对于目标检测任务同样非常重要,故在本实施例中,普通卷积层都被替换成了虫蚀卷积层。The second improvement made to the Faster-RCNN network in this embodiment is to replace the convolutional layer with an Atrous Convolution layer. Using this convolutional method can increase the receptive field of the feature map. The idea of using worm-eaten convolution comes from the InceptionV3 network. This semantic segmentation network uses worm-eaten convolution to increase the receptive field of feature maps during the convolution process. And increasing the receptive field of the feature map is also very important for the target detection task, so in this embodiment, the ordinary convolutional layers are replaced by worm-eaten convolutional layers.

本实施例对Faster-RCNN网络做的第三个改进就是在基础网络上使用了 FPN(Feature Pyramid Network)网络,其网络结构如图6所示,它可以使用低层产生的尺寸较大的特征图,而原始的网络只用了最后一层的特征。在使用 FPN之后,原始的Faster-RCNN网络针对一个anchor的9个region proposal 被提升到了15个。The third improvement made to the Faster-RCNN network in this embodiment is that the FPN (Feature Pyramid Network) network is used on the basic network. , while the original network only uses the features of the last layer. After using FPN, the original Faster-RCNN network's 9 region proposals for one anchor are boosted to 15.

S104,采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。S104, using a preset model compression method to compress the trained target detection model, so that it can be deployed on the intelligent terminal.

需要说明的是,Faster-RCNN等目标检测网络训练出来的模型参数多,速度较慢,对硬件的要求较高;而例如YOLOv3等目标检测网络模型参数较少,速度较快,能实现实时的目标检测任务,但是模型的精度相对较低。因此,进行模型压缩是一种能够平衡检测精度和检测速度的有效方法。It should be noted that the target detection network such as Faster-RCNN has many model parameters, the speed is slow, and the hardware requirements are higher; while the target detection network model such as YOLOv3 has fewer parameters, faster speed, and can achieve real-time Object detection task, but the accuracy of the model is relatively low. Therefore, model compression is an effective method that can balance detection accuracy and detection speed.

这种方法的理论基础是人类在思考问题的时候并不是所有的神经元都被激活,因此在深度学习模型中本发明剔除掉一些不必要的连接从而减少模型的参数数量,加快深度学习模型的速度。The theoretical basis of this method is that not all neurons are activated when human beings are thinking about problems. Therefore, in the deep learning model, the present invention removes some unnecessary connections to reduce the number of parameters of the model and speed up the deep learning model. speed.

在模型压缩领域中,最常使用的是方法是模型剪枝,即根据模型剪枝之后的性能表现来决定剪掉哪些连接。在本实施例中采用的基础网络是ResNet101,每一个卷积层后面都会连接一个BN(Batch Normalization)层。因此可以使用BN层的gamma参数来进行模型剪枝。为了使用BN层的gamma参数,需要为 gamma参数加上L1正则惩罚训练模型,新的损失函数变为:In the field of model compression, the most commonly used method is model pruning, which determines which connections to prune according to the performance of the model after pruning. The basic network adopted in this embodiment is ResNet101, and a BN (Batch Normalization) layer is connected behind each convolutional layer. Therefore, the gamma parameter of the BN layer can be used for model pruning. In order to use the gamma parameter of the BN layer, it is necessary to add the L1 regular penalty to the gamma parameter to train the model, and the new loss function becomes:

L=∑(x,y)l(f(x,W),y)+μ∑γ∈τg(γ)。L=∑ (x,y) l(f(x,W),y)+μ∑ γ∈τ g(γ).

对该网络中的所有的gamma进行排序,根据人为给定的剪枝比例,去掉 gamma很小的通道,最后进行微调,这个过程重复多次,得到更好的结果,具体的过程如图7所示。Sort all the gammas in the network, remove the channel with small gamma according to the artificially given pruning ratio, and finally perform fine-tuning. This process is repeated many times to obtain better results. The specific process is shown in Figure 7. Show.

本实施例通过采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;对Faster-RCNN 网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster-RCNN网络训练目标检测模型;采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使其能被部署到智能终端上。实现了可以对卫星拍摄的低分辨率的卫星图像进行实时目标检测的效果,且具有良好的检测精度。In this embodiment, the super-resolution model is trained by using images with high resolution as training labels and corresponding images with low resolution as training samples; processing to generate corresponding high-resolution images; improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model; use the preset The model compression method compresses the trained target detection model so that it can be deployed on smart terminals. The effect of real-time target detection on low-resolution satellite images captured by satellites is realized, and the detection accuracy is good.

第二实施例Second Embodiment

本实施例提供一种基于超分辨率和模型压缩的目标检测系统,该基于超分辨率和模型压缩的目标检测系统包括:This embodiment provides a target detection system based on super-resolution and model compression. The target detection system based on super-resolution and model compression includes:

超分辨率模型训练模块,用于采用分辨率高的图像作为训练的标签,采用对应的分辨率低的图像作为训练的样本,训练超分辨率模型;The super-resolution model training module is used to use high-resolution images as training labels, and use corresponding low-resolution images as training samples to train super-resolution models;

高分辨率图像生成模块,用于利用训练好的超分辨率模型对待处理的低分辨率图像进行处理,生成对应的高分辨率图像;The high-resolution image generation module is used to process the low-resolution images to be processed by using the trained super-resolution model to generate corresponding high-resolution images;

目标检测模型训练模块,用于对Faster-RCNN网络按照预设方法进行改进,对超分辨率模型生成的高分辨率图像,使用改进后的Faster-RCNN网络训练目标检测模型;The target detection model training module is used to improve the Faster-RCNN network according to the preset method, and use the improved Faster-RCNN network to train the target detection model for the high-resolution images generated by the super-resolution model;

模型压缩模块,用于采用预设的模型压缩方法对训练好的目标检测模型进行压缩,使得其能被部署到智能终端上。The model compression module is used to compress the trained target detection model by using a preset model compression method, so that it can be deployed on the intelligent terminal.

本实施例的基于超分辨率和模型压缩的目标检测系统与上述第一实施例中的基于超分辨率和模型压缩的目标检测方法相互对应,其中,该系统中各模块单元所实现的功能与上述方法中的各流程步骤一一对应;故,在此不再赘述。The target detection system based on super-resolution and model compression in this embodiment corresponds to the target detection method based on super-resolution and model compression in the first embodiment. Each process step in the above method corresponds to each other; therefore, it is not repeated here.

此外,需要说明的是,本领域内的技术人员应明白,本发明实施例的实施例可提供为方法、装置、或计算机程序产品。因此,本发明实施例可采用完全硬件实施例、完全软件实施例、或结合软件和硬件方面的实施例的形式。而且,本发明实施例可采用在一个或多个其中包含有计算机可用程序代码的计算机可用存储介质(包括但不限于磁盘存储器、CD-ROM、光学存储器等)上实施的计算机程序产品的形式。In addition, it should be noted that those skilled in the art should understand that the embodiments of the embodiments of the present invention may be provided as methods, apparatuses, or computer program products. Accordingly, embodiments of the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment, or an embodiment combining software and hardware aspects. Furthermore, embodiments of the present invention may take the form of a computer program product implemented on one or more computer-usable storage media having computer-usable program code embodied therein, including but not limited to disk storage, CD-ROM, optical storage, and the like.

本发明实施例是参照根据本发明实施例的方法、终端设备(系统)、和计算机程序产品的流程图和/或方框图来描述的。应理解可由计算机程序指令实现流程图和/或方框图中的每一流程和/或方框、以及流程图和/或方框图中的流程和/或方框的结合。可提供这些计算机程序指令到通用计算机、嵌入式处理机或其他可编程数据处理终端设备的处理器以产生一个机器,使得通过计算机或其他可编程数据处理终端设备的处理器执行的指令产生用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的装置。Embodiments of the present invention are described with reference to flowcharts and/or block diagrams of methods, terminal devices (systems), and computer program products according to embodiments of the present invention. It will be understood that each flow and/or block in the flowchart illustrations and/or block diagrams, and combinations of flows and/or blocks in the flowchart illustrations and/or block diagrams, can be implemented by computer program instructions. These computer program instructions may be provided to the processor of a general purpose computer, embedded processor or other programmable data processing terminal to produce a machine such that the instructions executed by the processor of the computer or other programmable data processing terminal produce Means implementing the functions specified in one or more of the flowcharts and/or one or more blocks of the block diagrams.

这些计算机程序指令也可存储在能引导计算机或其他可编程数据处理终端设备以特定方式工作的计算机可读存储器中,使得存储在该计算机可读存储器中的指令产生包括指令装置的制造品,该指令装置实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能。这些计算机程序指令也可装载到计算机或其他可编程数据处理终端设备上,使得在计算机或其他可编程终端设备上执行一系列操作步骤以产生计算机实现的处理,从而在计算机或其他可编程终端设备上执行的指令提供用于实现在流程图一个流程或多个流程和/或方框图一个方框或多个方框中指定的功能的步骤。These computer program instructions may also be stored in a computer readable memory capable of directing a computer or other programmable data processing terminal equipment to operate in a particular manner, such that the instructions stored in the computer readable memory result in an article of manufacture comprising instruction means, the The instruction means implement the functions specified in the flow or flow of the flowcharts and/or the block or blocks of the block diagrams. These computer program instructions can also be loaded on a computer or other programmable data processing terminal equipment, so that a series of operational steps are performed on the computer or other programmable terminal equipment to produce a computer-implemented process, thereby executing on the computer or other programmable terminal equipment The instructions executed on the above provide steps for implementing the functions specified in the flowchart or blocks and/or the block or blocks of the block diagrams.

尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例做出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明实施例范围的所有变更和修改。Although the preferred embodiments of the present invention have been described, additional changes and modifications to these embodiments may occur to those skilled in the art once the basic inventive concepts are known. Therefore, the appended claims are intended to be construed to include the preferred embodiments as well as all changes and modifications that fall within the scope of the embodiments of the present invention.

还需要说明的是,在本文中,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者终端设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者终端设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者终端设备中还存在另外的相同要素。It should also be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or terminal device comprising a series of elements includes not only those elements, but also other elements not expressly listed or inherent to such process, method, article or terminal equipment. Without further limitation, an element defined by the phrase "comprises a..." does not preclude the presence of additional identical elements in the process, method, article or terminal device comprising said element.

以上所述是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明所述原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above are the preferred embodiments of the present invention. It should be pointed out that for those skilled in the art, without departing from the principles of the present invention, several improvements and modifications can be made. These improvements and modifications It should also be regarded as the protection scope of the present invention.

Claims (10)

1. a kind of object detection method based on super-resolution and model compression characterized by comprising
Label using the image of high resolution as training, the sample using the low image of corresponding resolution ratio as training, Training super-resolution model;
Low-resolution image to be processed is handled using trained super-resolution model, generates corresponding high-resolution Image;
Faster-RCNN network is improved according to presetting method, to the high-definition picture that super-resolution model generates, is made With improved Faster-RCNN network training target detection model;
Trained target detection model is compressed using preset model compression method, enables it to be deployed to intelligence In terminal.
2. the object detection method based on super-resolution and model compression as described in claim 1, which is characterized in that described right Faster-RCNN network is improved according to presetting method, comprising:
The basic network of Faster-RCNN network is changed to ResNeXt101 by VGG16.
3. the object detection method based on super-resolution and model compression as claimed in claim 2, which is characterized in that described right Faster-RCNN network is improved according to presetting method, further includes:
Convolutional layer in Faster-RCNN network is replaced with into worm-eaten convolutional layer.
4. the object detection method based on super-resolution and model compression as claimed in claim 3, which is characterized in that described right Faster-RCNN network is improved according to presetting method, further includes:
FPN network is used on the basic network.
5. the object detection method based on super-resolution and model compression as claimed in claim 4, which is characterized in that described to adopt Trained target detection model is compressed with preset model compression method, enables it to be deployed to intelligent terminal On, specifically:
Model beta pruning is carried out using BN layers behind the convolutional layer in improved Faster-RCNN network of gamma parameters, to instruction The target detection model perfected is compressed, and enables it to be deployed on intelligent terminal.
6. a kind of object detection system based on super-resolution and model compression characterized by comprising
Super-resolution model training module, the label for the image using high resolution as training, using corresponding resolution Sample of the low image of rate as training, training super-resolution model;
High-definition picture generation module, for using trained super-resolution model to low-resolution image to be processed into Row processing, generates corresponding high-definition picture;
Target detection model training module, for being improved to Faster-RCNN network according to presetting method, to super-resolution The high-definition picture that model generates, uses improved Faster-RCNN network training target detection model;
Model compression module is made for being compressed using preset model compression method to trained target detection model Obtaining it can be deployed on intelligent terminal.
7. the object detection system based on super-resolution and model compression as claimed in claim 6, which is characterized in that the mesh Mark detection model training module is specifically used for:
The basic network of Faster-RCNN network is changed to ResNeXt101 by VGG16.
8. the object detection system based on super-resolution and model compression as claimed in claim 7, which is characterized in that the mesh Mark detection model training module is also used to:
Convolutional layer in Faster-RCNN network is replaced with into worm-eaten convolutional layer.
9. the object detection system based on super-resolution and model compression as claimed in claim 8, which is characterized in that the mesh Mark detection model training module is also used to:
FPN network is used on the basic network.
10. the object detection system based on super-resolution and model compression as claimed in claim 9, which is characterized in that described Model compression module is specifically used for:
Model beta pruning is carried out using BN layers behind the convolutional layer in improved Faster-RCNN network of gamma parameters, to instruction The target detection model perfected is compressed, and enables it to be deployed on intelligent terminal.
CN201910678925.4A 2019-07-25 2019-07-25 A kind of object detection method and system based on super-resolution and model compression Pending CN110443172A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201910678925.4A CN110443172A (en) 2019-07-25 2019-07-25 A kind of object detection method and system based on super-resolution and model compression

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201910678925.4A CN110443172A (en) 2019-07-25 2019-07-25 A kind of object detection method and system based on super-resolution and model compression

Publications (1)

Publication Number Publication Date
CN110443172A true CN110443172A (en) 2019-11-12

Family

ID=68431653

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201910678925.4A Pending CN110443172A (en) 2019-07-25 2019-07-25 A kind of object detection method and system based on super-resolution and model compression

Country Status (1)

Country Link
CN (1) CN110443172A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111160368A (en) * 2019-12-24 2020-05-15 中国建设银行股份有限公司 Method, device and equipment for detecting target in image and storage medium
CN111368696A (en) * 2020-02-28 2020-07-03 淮阴工学院 Detection method and system for illegal driving behavior of hazardous chemicals transport vehicle based on visual collaboration
CN111582377A (en) * 2020-05-09 2020-08-25 济南浪潮高新科技投资发展有限公司 Edge end target detection method and system based on model compression
CN111652211A (en) * 2020-05-21 2020-09-11 哈尔滨市科佳通用机电股份有限公司 A fault detection method for foreign objects hanging on the mounting seat of the anti-snake shock absorber of a motor vehicle
CN111833335A (en) * 2020-07-17 2020-10-27 燕山大学 A method and system for defect detection of lattice structures based on super-resolution reconstruction
CN112929665A (en) * 2021-01-28 2021-06-08 北京博雅慧视智能技术研究院有限公司 Target tracking method, device, equipment and medium combining super-resolution and video coding
CN113191945A (en) * 2020-12-03 2021-07-30 陕西师范大学 High-energy-efficiency image super-resolution system and method for heterogeneous platform
CN113221925A (en) * 2021-06-18 2021-08-06 北京理工大学 Target detection method and device based on multi-scale image
CN113435384A (en) * 2021-07-07 2021-09-24 中国人民解放军国防科技大学 Target detection method, device and equipment for medium-low resolution optical remote sensing image

Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389797A (en) * 2015-10-16 2016-03-09 西安电子科技大学 Unmanned aerial vehicle video small-object detecting method based on super-resolution reconstruction
CN105741252A (en) * 2015-11-17 2016-07-06 西安电子科技大学 Sparse representation and dictionary learning-based video image layered reconstruction method
CN108734143A (en) * 2018-05-28 2018-11-02 江苏迪伦智能科技有限公司 A kind of transmission line of electricity online test method based on binocular vision of crusing robot
CN109191369A (en) * 2018-08-06 2019-01-11 三星电子(中国)研发中心 2D pictures turn method, storage medium and the device of 3D model
CN109472801A (en) * 2018-11-22 2019-03-15 廖祥 It is a kind of for multiple dimensioned neuromorphic detection and dividing method
CN109784476A (en) * 2019-01-12 2019-05-21 福州大学 A method of improving DSOD network

Patent Citations (6)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105389797A (en) * 2015-10-16 2016-03-09 西安电子科技大学 Unmanned aerial vehicle video small-object detecting method based on super-resolution reconstruction
CN105741252A (en) * 2015-11-17 2016-07-06 西安电子科技大学 Sparse representation and dictionary learning-based video image layered reconstruction method
CN108734143A (en) * 2018-05-28 2018-11-02 江苏迪伦智能科技有限公司 A kind of transmission line of electricity online test method based on binocular vision of crusing robot
CN109191369A (en) * 2018-08-06 2019-01-11 三星电子(中国)研发中心 2D pictures turn method, storage medium and the device of 3D model
CN109472801A (en) * 2018-11-22 2019-03-15 廖祥 It is a kind of for multiple dimensioned neuromorphic detection and dividing method
CN109784476A (en) * 2019-01-12 2019-05-21 福州大学 A method of improving DSOD network

Non-Patent Citations (5)

* Cited by examiner, † Cited by third party
Title
AI研习社: ""一文教你如何pytorch 构建 faster r-cnn"", 《HTTP://ZHUANLAN.ZHIHU.COM/P/56710152》 *
GEORGE PAPANDREOU 等: ""Towards Accurate Multi-person Pose Estimation in the Wild"", 《ARXIV.ORG》 *
JACOB SHERMEYER 等: ""The Effects of Super-Resolution on Object Detection Performance in Satellite Imagery"", 《2019 IEEE/CVF CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW)》 *
YIWU YAO 等: ""CREATING LIGHTWEIGHT OBJECT DETECTORS WITH MODEL COMPRESSION FOR DEPLOYMENT ON EDGE DEVICES"", 《ARXIV.ORG》 *
YULUN ZHANG 等: ""Residual Dense Network for Image Super-Resolution"", 《PROCEEDINGS OF THE IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION (CVPR)》 *

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111160368A (en) * 2019-12-24 2020-05-15 中国建设银行股份有限公司 Method, device and equipment for detecting target in image and storage medium
CN111368696A (en) * 2020-02-28 2020-07-03 淮阴工学院 Detection method and system for illegal driving behavior of hazardous chemicals transport vehicle based on visual collaboration
CN111582377A (en) * 2020-05-09 2020-08-25 济南浪潮高新科技投资发展有限公司 Edge end target detection method and system based on model compression
CN111652211A (en) * 2020-05-21 2020-09-11 哈尔滨市科佳通用机电股份有限公司 A fault detection method for foreign objects hanging on the mounting seat of the anti-snake shock absorber of a motor vehicle
CN111833335A (en) * 2020-07-17 2020-10-27 燕山大学 A method and system for defect detection of lattice structures based on super-resolution reconstruction
CN113191945A (en) * 2020-12-03 2021-07-30 陕西师范大学 High-energy-efficiency image super-resolution system and method for heterogeneous platform
CN113191945B (en) * 2020-12-03 2023-10-27 陕西师范大学 Heterogeneous platform-oriented high-energy-efficiency image super-resolution system and method thereof
CN112929665A (en) * 2021-01-28 2021-06-08 北京博雅慧视智能技术研究院有限公司 Target tracking method, device, equipment and medium combining super-resolution and video coding
CN113221925A (en) * 2021-06-18 2021-08-06 北京理工大学 Target detection method and device based on multi-scale image
CN113221925B (en) * 2021-06-18 2022-11-11 北京理工大学 Target detection method and device based on multi-scale image
CN113435384A (en) * 2021-07-07 2021-09-24 中国人民解放军国防科技大学 Target detection method, device and equipment for medium-low resolution optical remote sensing image

Similar Documents

Publication Publication Date Title
CN110443172A (en) A kind of object detection method and system based on super-resolution and model compression
CN108334934B (en) Convolutional neural network compression method based on pruning and distillation
CN111028177B (en) An edge-based deep learning image de-blurring method
Zheng et al. Cascaded dilated dense network with two-step data consistency for MRI reconstruction
CN112380921A (en) Road detection method based on Internet of vehicles
CN115761472B (en) Underwater dim light scene reconstruction method based on fusion event and RGB data
CN109035149A (en) A kind of license plate image based on deep learning goes motion blur method
CN113222825B (en) Infrared image super-resolution reconstruction method based on visible light image training and application
CN112699844B (en) An Image Super-resolution Method Based on Multi-scale Residual Hierarchical Densely Connected Network
CN110942454A (en) Agricultural image semantic segmentation method
EP4239585A1 (en) Video loop recognition method and apparatus, computer device, and storage medium
CN110443764A (en) Video repairing method, device and server
CN114913588A (en) Face image restoration and recognition method applied to complex scene
CN109376580A (en) A deep learning-based identification method for power tower components
CN116188402A (en) Insulator defect identification method based on improved SSD algorithm
CN111881743A (en) Human face feature point positioning method based on semantic segmentation
WO2019100065A1 (en) Image segmentation using neural networks
CN113065653A (en) Design method of lightweight convolutional neural network for mobile terminal image classification
CN115984424A (en) Electronic component defect image generation method and system based on diffusion model
CN112016639B (en) Flexible separable convolution framework and feature extraction method and application thereof in VGG and ResNet
CN118447031A (en) A dermatoscopic image segmentation method based on diffusion fuzzy learning
CN117058552A (en) Lightweight pest detection method based on improved YOLOv7 and RKNPU2
CN116030342A (en) Method, device, equipment and storage medium for detecting water shortage state of greening plants
CN113516604B (en) Image restoration method
CN111027542A (en) Target detection method improved based on fast RCNN algorithm

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20191112

RJ01 Rejection of invention patent application after publication