CN109753853A - One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again - Google Patents
One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again Download PDFInfo
- Publication number
- CN109753853A CN109753853A CN201711076330.9A CN201711076330A CN109753853A CN 109753853 A CN109753853 A CN 109753853A CN 201711076330 A CN201711076330 A CN 201711076330A CN 109753853 A CN109753853 A CN 109753853A
- Authority
- CN
- China
- Prior art keywords
- pedestrian
- network
- frame
- ppn
- detection
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Pending
Links
Landscapes
- Image Analysis (AREA)
Abstract
The present invention provides that one kind is completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, and the predeterminable area under different angle camera extracts video frame, artificial to demarcate pedestrian position frame and relevant label information composition training data;Using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, it adds local pedestrian's candidate network PPN and generates candidate pedestrian's frame position, result is exported according to PPN network and carries out the operation of ROI-pooling pondization, carries out Fusion Features using three full articulamentums;Using the output of the last one full articulamentum as character representation, characteristics dictionary-characteristic key library is built, all pedestrian's features in the deep learning feature of the determined pedestrian area part of detection model and characteristic key library are sought similarity mode;When two characteristic similarities meet preset requirement, the maximum artificial same person of similarity in pedestrian and the picture library in test picture is determined.
Description
Technical field
The invention belongs to mode identification technologies, more particularly, to the depth identified for pedestrian detection and pedestrian again
Spend learning method.
Background technique
Pedestrian identifies the technology for referring under non-overlap video camera different perspectives picture Auto-matching with a group traveling together's object again,
Have in mind and identifying work without the camera specific objective pedestrian under the public ken, due to offices such as video definition partial occlusions
Limit, is difficult directly to find same target by specifying informations such as faces.Weight identification technology requires to pass through pedestrian's different topography texture
Etc. information, suitable feature space under measurement criterion complete identification match.This task first has to pedestrian detection and chooses height generally
Rate pedestrian's frame, after feature extraction and similarity mode are carried out to multiple candidate frames, finally lock searched targets.
New lover of the deep learning as video image processing task, under mass data and the auxiliary of high-performance computer,
Object identification, target detection and tracking, the tasks such as image segmentation all significantly machine learning algorithms of beyond tradition.It can be in practice
The combination of high-precision detection algorithm and efficient tracing algorithm tends not to play the effect of one-plus-one is greater than two, and reason is to detect
Algorithm, which obtains target frame and pedestrian's weight recognition training collection picture, has position deviation, and testing result is that algorithm is asked under natural scene
It obtains, and pedestrian's weight recognition training collection picture is mostly artificial the problems such as cutting acquisition, causing data-bias asymmetric.
Summary of the invention
For the disadvantages described above of the prior art, the invention proposes one kind based on deep learning be completed at the same time pedestrian detection with
Pedestrian's recognition methods again, purpose make detection and the weight better seamless connection of identification mission, overcome intermediate transition phase data not
The problems such as symmetrical, promotes pedestrian's weight identification technology precision.Furthermore it designs detection and identifies general feature again, when greatly improving
Between efficiency, guarantee algorithm operation real-time requirement.
Technical scheme is as follows:
One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, it is characterised in that: in same deep learning network
The prediction of pedestrian candidate frame is carried out in structure, pedestrian detection frame returns, the strategy of multirow people classification combination learning, major network structure
Mainly include a large amount of convolutional layer for VGG16 network+PPN Area generation network+connect identification layer entirely, pond layer with connect entirely
Layer;Specifically includes the following steps:
(1) data set constructs: the predeterminable area under different angle camera extracts video frame, manually demarcates pedestrian position
Frame forms training data to relevant label information, or obtains similar data by channel and construct data set;
(2) it recognition training: using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, adds
Local pedestrian's candidate network PPN generates candidate pedestrian's frame position, exports result according to PPN network and carries out the pond ROI-pooling
Operation carries out Fusion Features using three full articulamentums, it is last simultaneously using pedestrian detection frame offset error regression function with it is more
Pedestrian target Classification Loss function carries out model parameter adjustment;
(3) feature calculation: using the output of the last one full articulamentum as character representation, this feature construction tagged word is utilized
Allusion quotation-characteristic key library, deep learning feature and characteristic key library of the test phase the determined pedestrian area part of detection model
In all pedestrian's features seek similarity mode;
(4) similarity confirms: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture and figure
The maximum artificial same person of similarity in valut.
A kind of deep learning that is based on proposed by the present invention is completed at the same time pedestrian detection and pedestrian's recognition methods again, uses convolution
Neural network maps the automatic learning characteristic from large-scale data by multilayered nonlinear.Furthermore pass through Model Fusion and target letter
Number combination learnings, pedestrian detection and pedestrian's weight identification mission can common characteristic and weighting parameter it is shared, improve Classification and Identification energy
Raising efficiency while power, preferably completion pedestrian weight identification mission.The present invention specifically has the advantage that
1, the invention proposes the deep learning pedestrian retrievals of a kind of end-to-end (end to end), and pedestrian to be added to identify again
Frame.
2, training process is closer to natural actual scene, strong antijamming capability, and the identification again being more suitable under natural scene is appointed
Business.
3, Detection task and identification mission weight are shared, and common features accomplish seamless connection.
4, the present invention can efficiently handle in real time the pedestrian detection of monitoring system and pedestrian identifies problem again.
Detailed description of the invention
Fig. 1 is convolutional neural networks structure chart of the invention;
Fig. 2 is that deep learning pedestrian detection and pedestrian of the invention identify supervised training flow chart again;
Fig. 3 is the operation test flow chart that pedestrian detection and pedestrian of the invention identify again.
Specific embodiment
It is the core depth convolutional neural networks structure chart of invention shown in Fig. 1.
The major network structure that the present invention uses is VGG16 network+PPN Area generation network+connect identification layer entirely, mainly
Including a large amount of convolutional layer, pond layer and full articulamentum.Model is using the implicit identification feature in convolutional network study picture
The artificial design of traditional characteristic is overcome to interfere, wherein the corresponding goal regression function of PPN pedestrian candidate region network helps
Generate high likelihood pedestrian frame.ROI-pooling layers solve the problems, such as that pedestrian's frame extraction characteristic pattern is not of uniform size, are similar to
The function of resize.The pedestrian position regressive object of end further corrects pedestrian's frame position, and pedestrian's class object is corresponding complete
The output of articulamentum three is as final character representation.
The present invention is using technical solution: the prediction of pedestrian candidate frame is carried out in same deep learning network structure,
Pedestrian detection frame returns, the strategy of multirow people classification combination learning, specifically includes the following steps:
Step 1: data set building: the predeterminable area under different angle camera extracts video frame, manually demarcates pedestrian
Position frame forms training data to relevant label information, or obtains similar data by channel and construct data set.
Step 2: recognition training: the present invention is using net based on preceding 5 convolutional layers of VGG16 convolutional neural networks structure
Network, and part pedestrian's candidate network PPN is added and generates candidate pedestrian's frame position, result is exported according to PPN network and carries out ROI
The operation of (Region of interest)-pooling pondization carries out Fusion Features, last while benefit using three full articulamentums
Model parameter adjustment is carried out with pedestrian detection frame offset error regression function and multirow people target classification loss function.Largely instructing
Practicing data can rapid fine adjustment global depth convolutional Neural net under deep learning error-duration model and gradient decline optimisation strategy are supported
Network.This discovery detection algorithm is indicated with identification common features again, reduces model complexity, raising time efficiency, using joint
The differentiation ability to express of feature is reinforced in study.
Step 3: feature calculation: the present invention, can be special using this using the output of the last one full articulamentum as character representation
Levy construction feature dictionary-characteristic key library.Deep learning feature of the test phase the determined pedestrian area part of detection model
Similarity mode is sought with pedestrian's features all in characteristic key library.
Step 4: similarity confirmation: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture
With the maximum artificial same person of similarity in picture library.
By the following examples, in conjunction with attached drawing, implementation of the invention is further illustrated.
It is that deep learning pedestrian detection and pedestrian of the invention identify supervised training flow chart again shown in Fig. 2, how is explanation
Carry out network monitoring training:
The building of S201 data set: pedestrian's view is acquired in the case where disturbing scene certainly, under the camera of different angle different zones
Frequently (cooperate without pedestrian, can be every frame sampling), it is artificial to demarcate pedestrian position frame and label information in video frame, to save workload
Can be by the tracing algorithm aid mark of high quality, later period artificial nucleus are to the number of an accurate markup information of high-resolution
It is an important ring for all algorithm tasks according to library.
S202 trains PPN pedestrian candidate network module, and PPN is a kind of full convolutional network, can suggest for detection is generated
The task of frame is end-to-endly trained.It is basic network forward-propagating with VGG16 convolutional neural networks, according to every in Feature Mapping figure
9 of a position generation are selected the friendship of the location information of frame and authentic signature and ratio determines whether that candidate frame has pedestrian, and carry out
Two-value classification learns with the reversed error that frame position returns.
S203 and S204 standardizes pedestrian's Feature Mapping figure: after the completion of PPN network training, fixed relevant parameter, and network
Forward-propagating forms the Feature Mapping figure of full figure, according to pedestrian's frame prediction result of PPN network, cuts in full figure Feature Mapping figure
The feature for taking single pedestrian uses ROI-pooling standardized feature figure size.
S205 and S206: the training in three full articulamentums with the loss function of pedestrian is returned using pedestrian's feature and position
Study gives under learning rate, seeks the updated value of weight according to local derviation with chain type derivation principle by gradient decline, and optimization fine tuning is deep
Convolutional neural networks are spent, restrains and stablizes until model.
The extraction of S207 pedestrian's expression feature: the output valve of the last one corresponding full articulamentum of each pedestrian's frame is as it
Corresponding character representation.
It is the operation test flow chart that pedestrian detection and pedestrian of the invention identify again shown in Fig. 3, detailed process is as follows:
S301 inputs the pedestrian's picture to be searched: pedestrian inputs picture and is input to network forward-propagating, does not enable PPN network
Part is directly extracted the activation value of the last one full articulamentum of the pedestrian as expression feature and is saved.
S302 and S303: camera real-time data collection, every frame picture is input in network, according to the generation of PPN network
Frame suggestion, further obtains the Feature Mapping figure of pedestrian, and carries out ROI-pooling standardized feature figure size.
The feature of each pedestrian of S304 continues forward pass, extracts the output valve of the last one full articulamentum as its corresponding spy
Sign indicates.
S305 calculates similarity: calculating the feature for searching people in 301 and food inspection to the similarity of pedestrian's feature, can lead to
A variety of calculation methods such as COS distance, Euclidean distance are crossed, apart from two features of smaller expression more like being more likely same a group traveling together's mesh
Mark.
S306 definitive result: similarity, which reaches preset value, can be determined as same people.
Claims (1)
1. one kind is completed at the same time pedestrian detection and pedestrian knows method for distinguishing again, it is characterised in that: in same deep learning network knot
The prediction of pedestrian candidate frame is carried out in structure, pedestrian detection frame returns, the strategy of multirow people classification combination learning, and major network structure is
VGG16 network+PPN Area generation network+connect identification layer entirely mainly includes a large amount of convolutional layer, pond layer and full articulamentum;
Specifically includes the following steps:
(1) data set constructs: predeterminable area under different angle camera extracts video frame, it is artificial demarcate pedestrian position frame with
Relevant label information forms training data, or obtains similar data by channel and construct data set;
(2) recognition training: using preceding 5 convolutional layers of VGG16 convolutional neural networks structure as basic network, part is added
Pedestrian candidate network PPN generates candidate pedestrian's frame position, exports result according to PPN network and carries out the operation of ROI-pooling pondization,
Fusion Features are carried out using three full articulamentums, it is last to utilize pedestrian detection frame offset error regression function and multirow people mesh simultaneously
It marks Classification Loss function and carries out model parameter adjustment;
(3) feature calculation: using the output of the last one full articulamentum as character representation, this feature construction characteristics dictionary-is utilized
Characteristic key library, pedestrians all in the deep learning feature of the determined pedestrian area part of detection model and characteristic key library are special
Solicit similarity mode;
(4) similarity confirms: when two characteristic similarities meet preset requirement, determining the pedestrian in test picture and picture library
The middle maximum artificial same person of similarity.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711076330.9A CN109753853A (en) | 2017-11-06 | 2017-11-06 | One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN201711076330.9A CN109753853A (en) | 2017-11-06 | 2017-11-06 | One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again |
Publications (1)
Publication Number | Publication Date |
---|---|
CN109753853A true CN109753853A (en) | 2019-05-14 |
Family
ID=66428459
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN201711076330.9A Pending CN109753853A (en) | 2017-11-06 | 2017-11-06 | One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN109753853A (en) |
Cited By (8)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN110866487A (en) * | 2019-11-12 | 2020-03-06 | 复旦大学 | Large-scale pedestrian detection and re-identification sample set construction method and device |
CN111046724A (en) * | 2019-10-21 | 2020-04-21 | 武汉大学 | Pedestrian retrieval method based on area matching network |
CN111401286A (en) * | 2020-03-24 | 2020-07-10 | 武汉大学 | Pedestrian retrieval method based on component weight generation network |
CN111539257A (en) * | 2020-03-31 | 2020-08-14 | 苏州科达科技股份有限公司 | Personnel re-identification method, device and storage medium |
CN112613472A (en) * | 2020-12-31 | 2021-04-06 | 上海交通大学 | Pedestrian detection method and system based on deep search matching |
CN112686088A (en) * | 2019-10-20 | 2021-04-20 | 广东毓秀科技有限公司 | Cross-lens pedestrian retrieval method based on pedestrian re-identification |
CN112767346A (en) * | 2021-01-18 | 2021-05-07 | 北京医准智能科技有限公司 | Multi-image-based full-convolution single-stage mammary image lesion detection method and device |
CN113516146A (en) * | 2020-12-21 | 2021-10-19 | 腾讯科技(深圳)有限公司 | Data classification method, computer and readable storage medium |
-
2017
- 2017-11-06 CN CN201711076330.9A patent/CN109753853A/en active Pending
Cited By (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112686088A (en) * | 2019-10-20 | 2021-04-20 | 广东毓秀科技有限公司 | Cross-lens pedestrian retrieval method based on pedestrian re-identification |
CN111046724A (en) * | 2019-10-21 | 2020-04-21 | 武汉大学 | Pedestrian retrieval method based on area matching network |
CN111046724B (en) * | 2019-10-21 | 2021-09-14 | 武汉大学 | Pedestrian retrieval method based on area matching network |
CN110866487A (en) * | 2019-11-12 | 2020-03-06 | 复旦大学 | Large-scale pedestrian detection and re-identification sample set construction method and device |
CN110866487B (en) * | 2019-11-12 | 2023-01-17 | 复旦大学 | Large-scale pedestrian detection and re-identification sample set construction method and device |
CN111401286A (en) * | 2020-03-24 | 2020-07-10 | 武汉大学 | Pedestrian retrieval method based on component weight generation network |
CN111401286B (en) * | 2020-03-24 | 2022-03-04 | 武汉大学 | Pedestrian retrieval method based on component weight generation network |
CN111539257B (en) * | 2020-03-31 | 2022-07-26 | 苏州科达科技股份有限公司 | Person re-identification method, device and storage medium |
CN111539257A (en) * | 2020-03-31 | 2020-08-14 | 苏州科达科技股份有限公司 | Personnel re-identification method, device and storage medium |
CN113516146A (en) * | 2020-12-21 | 2021-10-19 | 腾讯科技(深圳)有限公司 | Data classification method, computer and readable storage medium |
CN112613472A (en) * | 2020-12-31 | 2021-04-06 | 上海交通大学 | Pedestrian detection method and system based on deep search matching |
CN112613472B (en) * | 2020-12-31 | 2022-04-26 | 上海交通大学 | Pedestrian detection method and system based on deep search matching |
CN112767346A (en) * | 2021-01-18 | 2021-05-07 | 北京医准智能科技有限公司 | Multi-image-based full-convolution single-stage mammary image lesion detection method and device |
CN112767346B (en) * | 2021-01-18 | 2021-10-29 | 北京医准智能科技有限公司 | Multi-image-based full-convolution single-stage mammary image lesion detection method and device |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN109753853A (en) | One kind being completed at the same time pedestrian detection and pedestrian knows method for distinguishing again | |
Tao et al. | An object detection system based on YOLO in traffic scene | |
CN107609525B (en) | Remote sensing image target detection method for constructing convolutional neural network based on pruning strategy | |
CN103679674B (en) | Method and system for splicing images of unmanned aircrafts in real time | |
CN106127204B (en) | A kind of multi-direction meter reading Region detection algorithms of full convolutional neural networks | |
CN107871124B (en) | A kind of Remote Sensing Target detection method based on deep neural network | |
CN107451607B (en) | A kind of personal identification method of the typical character based on deep learning | |
CN106022237B (en) | A kind of pedestrian detection method of convolutional neural networks end to end | |
CN108319972A (en) | A kind of end-to-end difference online learning methods for image, semantic segmentation | |
CN113516664A (en) | Visual SLAM method based on semantic segmentation dynamic points | |
CN110443818A (en) | A kind of Weakly supervised semantic segmentation method and system based on scribble | |
CN107818302A (en) | Non-rigid multi-scale object detection method based on convolutional neural network | |
CN109635748B (en) | Method for extracting road characteristics in high-resolution image | |
CN107833213A (en) | A kind of Weakly supervised object detecting method based on pseudo- true value adaptive method | |
CN106845373A (en) | Towards pedestrian's attribute forecast method of monitor video | |
CN110110646A (en) | A kind of images of gestures extraction method of key frame based on deep learning | |
CN107657625A (en) | Merge the unsupervised methods of video segmentation that space-time multiple features represent | |
CN106408030A (en) | SAR image classification method based on middle lamella semantic attribute and convolution neural network | |
CN107146237A (en) | A kind of method for tracking target learnt based on presence with estimating | |
CN111462140B (en) | Real-time image instance segmentation method based on block stitching | |
CN110516633A (en) | A kind of method for detecting lane lines and system based on deep learning | |
Arun et al. | Effective and efficient multi-crop pest detection based on deep learning object detection models | |
JP2022082493A (en) | Pedestrian re-identification method for random shielding recovery based on noise channel | |
CN113505670A (en) | Remote sensing image weak supervision building extraction method based on multi-scale CAM and super-pixels | |
CN107730553A (en) | A kind of Weakly supervised object detecting method based on pseudo- true value search method |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
WD01 | Invention patent application deemed withdrawn after publication |
Application publication date: 20190514 |
|
WD01 | Invention patent application deemed withdrawn after publication |