Nothing Special   »   [go: up one dir, main page]

CN109753853A - One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again - Google Patents

One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again Download PDF

Info

Publication number
CN109753853A
CN109753853A CN201711076330.9A CN201711076330A CN109753853A CN 109753853 A CN109753853 A CN 109753853A CN 201711076330 A CN201711076330 A CN 201711076330A CN 109753853 A CN109753853 A CN 109753853A
Authority
CN
China
Prior art keywords
pedestrian
network
frame
ppn
detection
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201711076330.9A
Other languages
Chinese (zh)
Inventor
单鼎一
刘惟锦
张晓林
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Changfeng Science Technology Industry Group Corp
Original Assignee
China Changfeng Science Technology Industry Group Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Changfeng Science Technology Industry Group Corp filed Critical China Changfeng Science Technology Industry Group Corp
Priority to CN201711076330.9A priority Critical patent/CN109753853A/en
Publication of CN109753853A publication Critical patent/CN109753853A/en
Pending legal-status Critical Current

Links

Landscapes

  • Image Analysis (AREA)

Abstract

The present invention provides that one kind is completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, and the predeterminable area under different angle camera extracts video frame, artificial to demarcate pedestrian position frame and relevant label information composition training data;Using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, it adds local pedestrian's candidate network PPN and generates candidate pedestrian's frame position, result is exported according to PPN network and carries out the operation of ROI-pooling pondization, carries out Fusion Features using three full articulamentums;Using the output of the last one full articulamentum as character representation, characteristics dictionary-characteristic key library is built, all pedestrian's features in the deep learning feature of the determined pedestrian area part of detection model and characteristic key library are sought similarity mode;When two characteristic similarities meet preset requirement, the maximum artificial same person of similarity in pedestrian and the picture library in test picture is determined.

Description

One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again
Technical field
The invention belongs to mode identification technologies, more particularly, to the depth identified for pedestrian detection and pedestrian again Spend learning method.
Background technique
Pedestrian identifies the technology for referring under non-overlap video camera different perspectives picture Auto-matching with a group traveling together's object again, Have in mind and identifying work without the camera specific objective pedestrian under the public ken, due to offices such as video definition partial occlusions Limit, is difficult directly to find same target by specifying informations such as faces.Weight identification technology requires to pass through pedestrian's different topography texture Etc. information, suitable feature space under measurement criterion complete identification match.This task first has to pedestrian detection and chooses height generally Rate pedestrian's frame, after feature extraction and similarity mode are carried out to multiple candidate frames, finally lock searched targets.
New lover of the deep learning as video image processing task, under mass data and the auxiliary of high-performance computer, Object identification, target detection and tracking, the tasks such as image segmentation all significantly machine learning algorithms of beyond tradition.It can be in practice The combination of high-precision detection algorithm and efficient tracing algorithm tends not to play the effect of one-plus-one is greater than two, and reason is to detect Algorithm, which obtains target frame and pedestrian's weight recognition training collection picture, has position deviation, and testing result is that algorithm is asked under natural scene It obtains, and pedestrian's weight recognition training collection picture is mostly artificial the problems such as cutting acquisition, causing data-bias asymmetric.
Summary of the invention
For the disadvantages described above of the prior art, the invention proposes one kind based on deep learning be completed at the same time pedestrian detection with Pedestrian's recognition methods again, purpose make detection and the weight better seamless connection of identification mission, overcome intermediate transition phase data not The problems such as symmetrical, promotes pedestrian's weight identification technology precision.Furthermore it designs detection and identifies general feature again, when greatly improving Between efficiency, guarantee algorithm operation real-time requirement.
Technical scheme is as follows:
One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, it is characterised in that: in same deep learning network The prediction of pedestrian candidate frame is carried out in structure, pedestrian detection frame returns, the strategy of multirow people classification combination learning, major network structure Mainly include a large amount of convolutional layer for VGG16 network+PPN Area generation network+connect identification layer entirely, pond layer with connect entirely Layer;Specifically includes the following steps:
(1) data set constructs: the predeterminable area under different angle camera extracts video frame, manually demarcates pedestrian position Frame forms training data to relevant label information, or obtains similar data by channel and construct data set;
(2) it recognition training: using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, adds Local pedestrian's candidate network PPN generates candidate pedestrian's frame position, exports result according to PPN network and carries out the pond ROI-pooling Operation carries out Fusion Features using three full articulamentums, it is last simultaneously using pedestrian detection frame offset error regression function with it is more Pedestrian target Classification Loss function carries out model parameter adjustment;
(3) feature calculation: using the output of the last one full articulamentum as character representation, this feature construction tagged word is utilized Allusion quotation-characteristic key library, deep learning feature and characteristic key library of the test phase the determined pedestrian area part of detection model In all pedestrian's features seek similarity mode;
(4) similarity confirms: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture and figure The maximum artificial same person of similarity in valut.
A kind of deep learning that is based on proposed by the present invention is completed at the same time pedestrian detection and pedestrian's recognition methods again, uses convolution Neural network maps the automatic learning characteristic from large-scale data by multilayered nonlinear.Furthermore pass through Model Fusion and target letter Number combination learnings, pedestrian detection and pedestrian's weight identification mission can common characteristic and weighting parameter it is shared, improve Classification and Identification energy Raising efficiency while power, preferably completion pedestrian weight identification mission.The present invention specifically has the advantage that
1, the invention proposes the deep learning pedestrian retrievals of a kind of end-to-end (end to end), and pedestrian to be added to identify again Frame.
2, training process is closer to natural actual scene, strong antijamming capability, and the identification again being more suitable under natural scene is appointed Business.
3, Detection task and identification mission weight are shared, and common features accomplish seamless connection.
4, the present invention can efficiently handle in real time the pedestrian detection of monitoring system and pedestrian identifies problem again.
Detailed description of the invention
Fig. 1 is convolutional neural networks structure chart of the invention;
Fig. 2 is that deep learning pedestrian detection and pedestrian of the invention identify supervised training flow chart again;
Fig. 3 is the operation test flow chart that pedestrian detection and pedestrian of the invention identify again.
Specific embodiment
It is the core depth convolutional neural networks structure chart of invention shown in Fig. 1.
The major network structure that the present invention uses is VGG16 network+PPN Area generation network+connect identification layer entirely, mainly Including a large amount of convolutional layer, pond layer and full articulamentum.Model is using the implicit identification feature in convolutional network study picture The artificial design of traditional characteristic is overcome to interfere, wherein the corresponding goal regression function of PPN pedestrian candidate region network helps Generate high likelihood pedestrian frame.ROI-pooling layers solve the problems, such as that pedestrian's frame extraction characteristic pattern is not of uniform size, are similar to The function of resize.The pedestrian position regressive object of end further corrects pedestrian's frame position, and pedestrian's class object is corresponding complete The output of articulamentum three is as final character representation.
The present invention is using technical solution: the prediction of pedestrian candidate frame is carried out in same deep learning network structure, Pedestrian detection frame returns, the strategy of multirow people classification combination learning, specifically includes the following steps:
Step 1: data set building: the predeterminable area under different angle camera extracts video frame, manually demarcates pedestrian Position frame forms training data to relevant label information, or obtains similar data by channel and construct data set.
Step 2: recognition training: the present invention is using net based on preceding 5 convolutional layers of VGG16 convolutional neural networks structure Network, and part pedestrian's candidate network PPN is added and generates candidate pedestrian's frame position, result is exported according to PPN network and carries out ROI The operation of (Region of interest)-pooling pondization carries out Fusion Features, last while benefit using three full articulamentums Model parameter adjustment is carried out with pedestrian detection frame offset error regression function and multirow people target classification loss function.Largely instructing Practicing data can rapid fine adjustment global depth convolutional Neural net under deep learning error-duration model and gradient decline optimisation strategy are supported Network.This discovery detection algorithm is indicated with identification common features again, reduces model complexity, raising time efficiency, using joint The differentiation ability to express of feature is reinforced in study.
Step 3: feature calculation: the present invention, can be special using this using the output of the last one full articulamentum as character representation Levy construction feature dictionary-characteristic key library.Deep learning feature of the test phase the determined pedestrian area part of detection model Similarity mode is sought with pedestrian's features all in characteristic key library.
Step 4: similarity confirmation: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture With the maximum artificial same person of similarity in picture library.
By the following examples, in conjunction with attached drawing, implementation of the invention is further illustrated.
It is that deep learning pedestrian detection and pedestrian of the invention identify supervised training flow chart again shown in Fig. 2, how is explanation Carry out network monitoring training:
The building of S201 data set: pedestrian's view is acquired in the case where disturbing scene certainly, under the camera of different angle different zones Frequently (cooperate without pedestrian, can be every frame sampling), it is artificial to demarcate pedestrian position frame and label information in video frame, to save workload Can be by the tracing algorithm aid mark of high quality, later period artificial nucleus are to the number of an accurate markup information of high-resolution It is an important ring for all algorithm tasks according to library.
S202 trains PPN pedestrian candidate network module, and PPN is a kind of full convolutional network, can suggest for detection is generated The task of frame is end-to-endly trained.It is basic network forward-propagating with VGG16 convolutional neural networks, according to every in Feature Mapping figure 9 of a position generation are selected the friendship of the location information of frame and authentic signature and ratio determines whether that candidate frame has pedestrian, and carry out Two-value classification learns with the reversed error that frame position returns.
S203 and S204 standardizes pedestrian's Feature Mapping figure: after the completion of PPN network training, fixed relevant parameter, and network Forward-propagating forms the Feature Mapping figure of full figure, according to pedestrian's frame prediction result of PPN network, cuts in full figure Feature Mapping figure The feature for taking single pedestrian uses ROI-pooling standardized feature figure size.
S205 and S206: the training in three full articulamentums with the loss function of pedestrian is returned using pedestrian's feature and position Study gives under learning rate, seeks the updated value of weight according to local derviation with chain type derivation principle by gradient decline, and optimization fine tuning is deep Convolutional neural networks are spent, restrains and stablizes until model.
The extraction of S207 pedestrian's expression feature: the output valve of the last one corresponding full articulamentum of each pedestrian's frame is as it Corresponding character representation.
It is the operation test flow chart that pedestrian detection and pedestrian of the invention identify again shown in Fig. 3, detailed process is as follows:
S301 inputs the pedestrian's picture to be searched: pedestrian inputs picture and is input to network forward-propagating, does not enable PPN network Part is directly extracted the activation value of the last one full articulamentum of the pedestrian as expression feature and is saved.
S302 and S303: camera real-time data collection, every frame picture is input in network, according to the generation of PPN network Frame suggestion, further obtains the Feature Mapping figure of pedestrian, and carries out ROI-pooling standardized feature figure size.
The feature of each pedestrian of S304 continues forward pass, extracts the output valve of the last one full articulamentum as its corresponding spy Sign indicates.
S305 calculates similarity: calculating the feature for searching people in 301 and food inspection to the similarity of pedestrian's feature, can lead to A variety of calculation methods such as COS distance, Euclidean distance are crossed, apart from two features of smaller expression more like being more likely same a group traveling together's mesh Mark.
S306 definitive result: similarity, which reaches preset value, can be determined as same people.

Claims (1)

1. one kind is completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, it is characterised in that: in same deep learning network knot The prediction of pedestrian candidate frame is carried out in structure, pedestrian detection frame returns, the strategy of multirow people classification combination learning, and major network structure is VGG16 network+PPN Area generation network+connect identification layer entirely mainly includes a large amount of convolutional layer, pond layer and full articulamentum; Specifically includes the following steps:
(1) data set constructs: predeterminable area under different angle camera extracts video frame, it is artificial demarcate pedestrian position frame with Relevant label information forms training data, or obtains similar data by channel and construct data set;
(2) recognition training: using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, part is added Pedestrian candidate network PPN generates candidate pedestrian's frame position, exports result according to PPN network and carries out the operation of ROI-pooling pondization, Fusion Features are carried out using three full articulamentums, it is last to utilize pedestrian detection frame offset error regression function and multirow people mesh simultaneously It marks Classification Loss function and carries out model parameter adjustment;
(3) feature calculation: using the output of the last one full articulamentum as character representation, this feature construction characteristics dictionary-is utilized Characteristic key library, pedestrians all in the deep learning feature of the determined pedestrian area part of detection model and characteristic key library are special Solicit similarity mode;
(4) similarity confirms: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture and picture library The middle maximum artificial same person of similarity.
CN201711076330.9A 2017-11-06 2017-11-06 One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again Pending CN109753853A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN201711076330.9A CN109753853A (en) 2017-11-06 2017-11-06 One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN201711076330.9A CN109753853A (en) 2017-11-06 2017-11-06 One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again

Publications (1)

Publication Number Publication Date
CN109753853A true CN109753853A (en) 2019-05-14

Family

ID=66428459

Family Applications (1)

Application Number Title Priority Date Filing Date
CN201711076330.9A Pending CN109753853A (en) 2017-11-06 2017-11-06 One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again

Country Status (1)

Country Link
CN (1) CN109753853A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110866487A (en) * 2019-11-12 2020-03-06 复旦大学 Large-scale pedestrian detection and re-identification sample set construction method and device
CN111046724A (en) * 2019-10-21 2020-04-21 武汉大学 Pedestrian retrieval method based on area matching network
CN111401286A (en) * 2020-03-24 2020-07-10 武汉大学 Pedestrian retrieval method based on component weight generation network
CN111539257A (en) * 2020-03-31 2020-08-14 苏州科达科技股份有限公司 Personnel re-identification method, device and storage medium
CN112613472A (en) * 2020-12-31 2021-04-06 上海交通大学 Pedestrian detection method and system based on deep search matching
CN112686088A (en) * 2019-10-20 2021-04-20 广东毓秀科技有限公司 Cross-lens pedestrian retrieval method based on pedestrian re-identification
CN112767346A (en) * 2021-01-18 2021-05-07 北京医准智能科技有限公司 Multi-image-based full-convolution single-stage mammary image lesion detection method and device
CN113516146A (en) * 2020-12-21 2021-10-19 腾讯科技(深圳)有限公司 Data classification method, computer and readable storage medium

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN112686088A (en) * 2019-10-20 2021-04-20 广东毓秀科技有限公司 Cross-lens pedestrian retrieval method based on pedestrian re-identification
CN111046724A (en) * 2019-10-21 2020-04-21 武汉大学 Pedestrian retrieval method based on area matching network
CN111046724B (en) * 2019-10-21 2021-09-14 武汉大学 Pedestrian retrieval method based on area matching network
CN110866487A (en) * 2019-11-12 2020-03-06 复旦大学 Large-scale pedestrian detection and re-identification sample set construction method and device
CN110866487B (en) * 2019-11-12 2023-01-17 复旦大学 Large-scale pedestrian detection and re-identification sample set construction method and device
CN111401286A (en) * 2020-03-24 2020-07-10 武汉大学 Pedestrian retrieval method based on component weight generation network
CN111401286B (en) * 2020-03-24 2022-03-04 武汉大学 Pedestrian retrieval method based on component weight generation network
CN111539257B (en) * 2020-03-31 2022-07-26 苏州科达科技股份有限公司 Person re-identification method, device and storage medium
CN111539257A (en) * 2020-03-31 2020-08-14 苏州科达科技股份有限公司 Personnel re-identification method, device and storage medium
CN113516146A (en) * 2020-12-21 2021-10-19 腾讯科技(深圳)有限公司 Data classification method, computer and readable storage medium
CN112613472A (en) * 2020-12-31 2021-04-06 上海交通大学 Pedestrian detection method and system based on deep search matching
CN112613472B (en) * 2020-12-31 2022-04-26 上海交通大学 Pedestrian detection method and system based on deep search matching
CN112767346A (en) * 2021-01-18 2021-05-07 北京医准智能科技有限公司 Multi-image-based full-convolution single-stage mammary image lesion detection method and device
CN112767346B (en) * 2021-01-18 2021-10-29 北京医准智能科技有限公司 Multi-image-based full-convolution single-stage mammary image lesion detection method and device

Similar Documents

Publication Publication Date Title
CN109753853A (en) One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again
Tao et al. An object detection system based on YOLO in traffic scene
CN107609525B (en) Remote sensing image target detection method for constructing convolutional neural network based on pruning strategy
CN103679674B (en) Method and system for splicing images of unmanned aircrafts in real time
CN106127204B (en) A kind of multi-direction meter reading Region detection algorithms of full convolutional neural networks
CN107871124B (en) A kind of Remote Sensing Target detection method based on deep neural network
CN107451607B (en) A kind of personal identification method of the typical character based on deep learning
CN106022237B (en) A kind of pedestrian detection method of convolutional neural networks end to end
CN108319972A (en) A kind of end-to-end difference online learning methods for image, semantic segmentation
CN113516664A (en) Visual SLAM method based on semantic segmentation dynamic points
CN110443818A (en) A kind of Weakly supervised semantic segmentation method and system based on scribble
CN107818302A (en) Non-rigid multi-scale object detection method based on convolutional neural network
CN109635748B (en) Method for extracting road characteristics in high-resolution image
CN107833213A (en) A kind of Weakly supervised object detecting method based on pseudo- true value adaptive method
CN106845373A (en) Towards pedestrian's attribute forecast method of monitor video
CN110110646A (en) A kind of images of gestures extraction method of key frame based on deep learning
CN107657625A (en) Merge the unsupervised methods of video segmentation that space-time multiple features represent
CN106408030A (en) SAR image classification method based on middle lamella semantic attribute and convolution neural network
CN107146237A (en) A kind of method for tracking target learnt based on presence with estimating
CN111462140B (en) Real-time image instance segmentation method based on block stitching
CN110516633A (en) A kind of method for detecting lane lines and system based on deep learning
Arun et al. Effective and efficient multi-crop pest detection based on deep learning object detection models
JP2022082493A (en) Pedestrian re-identification method for random shielding recovery based on noise channel
CN113505670A (en) Remote sensing image weak supervision building extraction method based on multi-scale CAM and super-pixels
CN107730553A (en) A kind of Weakly supervised object detecting method based on pseudo- true value search method

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
WD01 Invention patent application deemed withdrawn after publication

Application publication date: 20190514

WD01 Invention patent application deemed withdrawn after publication