CN112287855A - Driving behavior detection method and device based on multitask neural network - Google Patents
Driving behavior detection method and device based on multitask neural network Download PDFInfo
- Publication number
- CN112287855A CN112287855A CN202011207383.1A CN202011207383A CN112287855A CN 112287855 A CN112287855 A CN 112287855A CN 202011207383 A CN202011207383 A CN 202011207383A CN 112287855 A CN112287855 A CN 112287855A
- Authority
- CN
- China
- Prior art keywords
- neural network
- network
- output
- sample
- training
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000013528 artificial neural network Methods 0.000 title claims abstract description 185
- 238000001514 detection method Methods 0.000 title claims abstract description 49
- 238000012549 training Methods 0.000 claims abstract description 42
- 238000000034 method Methods 0.000 claims abstract description 32
- 230000006870 function Effects 0.000 claims description 42
- 230000006399 behavior Effects 0.000 claims description 31
- 230000004927 fusion Effects 0.000 claims description 10
- 238000004590 computer program Methods 0.000 claims description 6
- 238000005070 sampling Methods 0.000 claims description 3
- 238000005516 engineering process Methods 0.000 abstract description 7
- 238000004364 calculation method Methods 0.000 abstract description 6
- 230000000694 effects Effects 0.000 abstract description 6
- 238000004891 communication Methods 0.000 description 7
- 238000010586 diagram Methods 0.000 description 6
- 230000008569 process Effects 0.000 description 6
- 230000006855 networking Effects 0.000 description 5
- 238000012545 processing Methods 0.000 description 4
- 206010039203 Road traffic accident Diseases 0.000 description 3
- 230000008878 coupling Effects 0.000 description 3
- 238000010168 coupling process Methods 0.000 description 3
- 238000005859 coupling reaction Methods 0.000 description 3
- 238000012544 monitoring process Methods 0.000 description 2
- 238000012795 verification Methods 0.000 description 2
- 208000037170 Delayed Emergence from Anesthesia Diseases 0.000 description 1
- 238000010276 construction Methods 0.000 description 1
- 238000000802 evaporation-induced self-assembly Methods 0.000 description 1
- 238000012986 modification Methods 0.000 description 1
- 230000004048 modification Effects 0.000 description 1
- 230000003287 optical effect Effects 0.000 description 1
- 238000006467 substitution reaction Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V40/00—Recognition of biometric, human-related or animal-related patterns in image or video data
- G06V40/10—Human or animal bodies, e.g. vehicle occupants or pedestrians; Body parts, e.g. hands
- G06V40/16—Human faces, e.g. facial parts, sketches or expressions
- G06V40/161—Detection; Localisation; Normalisation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/04—Architecture, e.g. interconnection topology
- G06N3/045—Combinations of networks
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06N—COMPUTING ARRANGEMENTS BASED ON SPECIFIC COMPUTATIONAL MODELS
- G06N3/00—Computing arrangements based on biological models
- G06N3/02—Neural networks
- G06N3/08—Learning methods
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING OR COUNTING
- G06V—IMAGE OR VIDEO RECOGNITION OR UNDERSTANDING
- G06V20/00—Scenes; Scene-specific elements
- G06V20/50—Context or environment of the image
- G06V20/59—Context or environment of the image inside of a vehicle, e.g. relating to seat occupancy, driver state or inner lighting conditions
- G06V20/597—Recognising the driver's state or behaviour, e.g. attention or drowsiness
Landscapes
- Engineering & Computer Science (AREA)
- Theoretical Computer Science (AREA)
- Physics & Mathematics (AREA)
- General Physics & Mathematics (AREA)
- General Health & Medical Sciences (AREA)
- Health & Medical Sciences (AREA)
- Artificial Intelligence (AREA)
- Molecular Biology (AREA)
- Software Systems (AREA)
- Life Sciences & Earth Sciences (AREA)
- Mathematical Physics (AREA)
- Biomedical Technology (AREA)
- Biophysics (AREA)
- Computational Linguistics (AREA)
- Data Mining & Analysis (AREA)
- Evolutionary Computation (AREA)
- Multimedia (AREA)
- Computing Systems (AREA)
- General Engineering & Computer Science (AREA)
- Oral & Maxillofacial Surgery (AREA)
- Human Computer Interaction (AREA)
- Image Analysis (AREA)
Abstract
The invention provides a driving behavior detection method and device based on a multitask neural network, and relates to the technical field of driving state detection, wherein the method comprises the following steps: determining a first neural network based on the convolutional layer of the underlying neural network, the first neural network comprising: a first output network and a second output network; determining a second neural network based on the basic neural network and the Hourglass network, and determining a third neural network based on the first neural network and the second neural network; and then training a third neural network by using the sample information, and finally predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample. The method can solve the problems of long operation time consumption and low prediction accuracy in the existing driving behavior detection technology, thereby realizing the effect of improving the calculation performance.
Description
Technical Field
The invention relates to the technical field of driving state detection, in particular to a driving behavior detection method and device based on a multitask neural network.
Background
With the increase of the number of private cars, motor vehicles become a very common trip mode, but in recent years, the accident rate of motor vehicles is high, and particularly, driving behaviors such as fatigue driving or inattention of a driver are gradually an important reason for frequent traffic accidents, so that the detection and supervision of the driving behaviors of the driver are of great significance for effectively reducing the occurrence of the traffic accidents. The driver fatigue monitoring technology generally performs head posture estimation and face key point detection on a driver, so that the attention detection, face verification and the like of the driver are realized.
However, when the existing driving behavior detection technology carries out head posture estimation and face key point detection at the same time, a separate neural network is usually adopted for realization, and the problems of long operation time consumption and low prediction accuracy exist.
Disclosure of Invention
The invention aims to provide a driving behavior detection method and device based on a multitask neural network, so as to solve the technical problems of long operation time consumption and low prediction accuracy in the prior art.
In a first aspect, an embodiment of the present invention provides a driving behavior detection method based on a multitask neural network, where the method includes: determining a first neural network based on a convolutional layer of a base neural network, the first neural network comprising: a first output network and a second output network;
determining a second neural network based on the base neural network and the Hourglass network;
determining a third neural network based on the first neural network and the second neural network;
training the third neural network using sample information; the sample information comprises a head image of a sample, the head image comprising face information of the sample;
predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head pose angle of the sample.
In some embodiments, the step of determining the first neural network based on the convolutional layer of the underlying neural network comprises:
connecting a specified number of convolutional layers in the basic neural network with the convolutional layers of the first output network;
and connecting the last convolution layer of the basic neural network with the convolution layer of the second output network.
In some embodiments, the first output network comprises a keypoint detection network; the second output network comprises a head pose estimation network.
In some embodiments, the step of determining a second neural network based on the base neural network and the HourGlass network comprises:
taking a plurality of convolution layers in the basic neural network as down-sampling convolution layers of a first Hourglass network to construct a first cascade network;
wherein the first HourGlass network further comprises the same number of upsampled convolutional layers as the downsampled convolutional layers.
In some embodiments, the step of determining a second neural network based on the base neural network and the HourGlass network further comprises:
connecting a plurality of Hourglass networks to construct a second cascade network;
and connecting the second cascade network after the first cascade network to determine a second neural network.
In some embodiments, the step of training the third neural network using sample information comprises:
training a first loss function of the first output network and a second loss function of the second output network using sample information;
training a fusion loss function of the second neural network based on training results of the first loss function and the second loss function.
In some embodiments, the step of training the second neural network using sample information comprises:
training a first loss function of the first output network, a second loss function of the second output network, and a fusion loss function of the second neural network simultaneously using sample information.
In a second aspect, an embodiment of the present invention provides a driving behavior detection apparatus based on a multitasking neural network, including:
a first determination module to determine a first neural network based on a convolutional layer of a base neural network, the first neural network comprising: a first output network and a second output network;
a second determining module for determining a second neural network based on the basis of the basic neural network and the Hourglass network;
a third determination module to determine a third neural network based on the first neural network and the second neural network;
a training module for training the third neural network using sample information; the sample information comprises a head image of a sample, the head image comprising face information of the sample;
and the prediction module is used for predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample.
In a third aspect, an embodiment of the present invention provides an electronic device, including a memory and a processor, where the memory stores a computer program operable on the processor, and the processor implements the steps of the method according to any one of the first aspect when executing the computer program.
In a fourth aspect, embodiments of the present invention provide a computer-readable storage medium storing machine executable instructions that, when invoked and executed by a processor, cause the processor to perform the method of any of the first aspects.
The invention provides a driving behavior detection method and a device based on a multitask neural network, wherein the method comprises the following steps: determining a first neural network based on the convolutional layer of the underlying neural network, the first neural network comprising: a first output network and a second output network; determining a second neural network based on the basic neural network and the Hourglass network, and determining a third neural network based on the first neural network and the second neural network; and then training a third neural network by using the sample information, and finally predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample. The method can solve the problems of long operation time consumption and low prediction accuracy in the existing driving behavior detection technology, thereby realizing the effect of improving the calculation performance.
The calculation time is saved, and the detection precision is improved.
Drawings
In order to more clearly illustrate the embodiments of the present invention or the technical solutions in the prior art, the drawings used in the description of the embodiments or the prior art will be briefly described below, and it is obvious that the drawings in the following description are some embodiments of the present invention, and other drawings can be obtained by those skilled in the art without creative efforts.
Fig. 1 is a schematic flowchart of a driving behavior detection method based on a multitask neural network according to an embodiment of the present invention;
FIG. 2 is a schematic diagram of a HourGlass network according to an embodiment of the present invention;
FIG. 3 is a schematic structural diagram of a multitasking neural network according to an embodiment of the present invention;
fig. 4 is a schematic structural diagram of a driving behavior detection apparatus based on a multitask neural network according to an embodiment of the present invention;
fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the drawings in the embodiments of the present invention, and it is obvious that the described embodiments are some, but not all, embodiments of the present invention. The components of embodiments of the present invention generally described and illustrated in the figures herein may be arranged and designed in a wide variety of different configurations.
Thus, the following detailed description of the embodiments of the present invention, presented in the figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of selected embodiments of the invention. All other embodiments, which can be derived by a person skilled in the art from the embodiments given herein without making any creative effort, shall fall within the protection scope of the present invention. Some embodiments of the invention are described in detail below with reference to the accompanying drawings. The embodiments described below and the features of the embodiments can be combined with each other without conflict.
The detection and supervision of the driving behavior of the driver are of great significance to effectively reduce the occurrence of traffic accidents. The driver fatigue monitoring technology generally performs head posture estimation and face key point detection on a driver, so that the attention detection, face verification and the like of the driver are realized. However, when processing head pose estimation and face key point detection, the existing driving behavior detection technology generally adopts two separate networks to respectively estimate the head pose and detect the face key points of the face ROI after a face detection framework, and this method is time-consuming on an embedded system. That is, in the prior art, when head pose estimation and face key point detection are performed simultaneously, a separate neural network is usually required to be used for realization, and the problems of long operation time consumption and low prediction accuracy exist.
Based on the above, the embodiment of the invention provides a driving behavior detection method and device based on a multitask neural network, and the technical problems of long operation time consumption and low prediction accuracy in the prior art can be solved through the method. To facilitate understanding of the present embodiment, first, a driving behavior detection method based on a multitask neural network disclosed in the embodiment of the present invention is described in detail, referring to a schematic flow chart of the driving behavior detection method based on the multitask neural network shown in fig. 1, where the method may be executed by an electronic device, and mainly includes the following steps S110 to S150:
s110, determining a first neural network based on the convolution layer of the basic neural network, wherein the first neural network comprises: a first output network and a second output network;
the basic neural network may be a neural network including convolutional layers, and as a specific example, the basic neural network may be a lightweight network MobilenetV2 or a residual network ResNet.
In some embodiments, the first output network may be a keypoint detection network, which may be used to predict keypoint coordinates of the input image; the second output network may be a head pose estimation network that may be used to predict an angle of a head pose of the input image.
The construction method of the first neural network comprises the following steps: and selecting a specified number of convolutional layers from the basic neural network to be connected with the convolutional layer of the first output network, and connecting the last convolutional layer of the basic neural network with the convolutional layer of the second output network. The first neural network constructed in the way can simultaneously realize two functions of key point detection and head posture estimation, can reduce the operation time and improve the prediction efficiency.
S120, determining a second neural network based on the basic neural network and the Hourglass network;
the HourGlass network generally includes the structure shown in part (a) of fig. 2, and each square in the diagram is a residual block shown in part (B) of fig. 2.
The second neural network is formed by fusing a basic neural network and a Hourglass network. In some embodiments, several convolutional layers may be decimated in the underlying neural network as the downsampled convolutional layers of the first HourGlass network, such as the left downsampled convolutional portion (also referred to as the downsampled convolutional portion) in part (a) of fig. 2. The first HourGlass network also includes the same number of upsampled convolutional layers as the downsampled convolutional layers, e.g., the upsampled convolutional portion on the right in portion (a) of fig. 2. Therefore, the basic model and the Hourglass network are fused, and the first-level networking network is successfully constructed and can be used as a second neural network.
However, in order to obtain a better training effect, a second cascade network can be constructed by connecting a plurality of Hourglass networks, and then the second cascade network is connected to the first cascade network to form a second neural network consisting of a plurality of Hourglass networks. As a specific example, the number of HourGlass networks that build the second level networking may be 3. The prediction result of the second neural network formed by the plurality of Hourglass networks is obviously superior to the prediction results of the basic neural network and the single Hourglass network, and the prediction precision is improved.
S130, determining a third neural network based on the first neural network and the second neural network;
referring to fig. 3, the third neural network includes: the first output network and the second output network are determined by a second neural network formed by fusing the basic neural network and the Hourglass network and a convolution layer based on the basic neural network. The third neural network may thus comprise at least three loss functions, respectively: a fusion loss function of the second neural network, a first loss function of the first output network, and a second loss function of the second output network. The loss function can be added into an optimizer during the training of the neural network, and the training of the corresponding neural network is realized through the training of the loss function and the parameters of the loss function.
S140, training a third neural network by using the sample information;
wherein the sample information comprises a head image of the sample, and the head image comprises face information of the sample.
Because the third neural network includes the second neural network, the first output network, and the second output network, training the third neural network using the sample information includes: the second neural network, the first output network, and the second output network are trained using the sample information, respectively.
As a specific example, the training process may include:
step (A): training a first loss function of the first output network and a second loss function of the second output network using the sample information;
step (B): and training the fusion loss function of the second neural network based on the training results of the first loss function and the second loss function.
Alternatively, the training process may include:
step (C): a first loss function of the first output network, a second loss function of the second output network, and a fusion loss function of the second neural network are trained simultaneously using the sample information.
In the training process, in order to improve the precision of the training parameters, a second neural network formed by the first cascade network and the second-level networking network can be trained, and the training precision of the fusion loss function of the second neural network can be improved.
S150, predicting the sample information based on the trained third neural network;
the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample.
In order to improve the calculation speed, a second neural network comprising only the first-level networking network and a third neural network consisting of the first output network and the second output network can be selected for prediction during prediction, so that the prediction efficiency can be improved, and the calculation performance can be saved.
The driving behavior detection method based on the multitask neural network comprises the following steps: determining a first neural network based on the convolutional layer of the underlying neural network, the first neural network comprising: a first output network and a second output network; determining a second neural network based on the basic neural network and the Hourglass network, and determining a third neural network based on the first neural network and the second neural network; and then training a third neural network by using the sample information, and finally predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample. The method can solve the problems of long operation time consumption and low prediction accuracy in the existing driving behavior detection technology, thereby realizing the effect of improving the calculation performance.
The embodiment of the present application further provides a driving behavior detection device based on a multitask neural network, and with reference to fig. 4, the device includes:
a first determining module 310 for determining a first neural network based on a convolutional layer of a base neural network, the first neural network comprising: a first output network and a second output network;
a second determining module 320 for determining a second neural network based on the base neural network and the Hourglass network;
a third determining module 330 for determining a third neural network based on the first neural network and the second neural network;
a training module 340 for training the third neural network using the sample information; the sample information comprises a head image of the sample, and the head image comprises face information of the sample;
and the predicting module 350 is configured to predict the sample information based on the trained third neural network, where a first output network is used to predict the key point coordinates of the sample, and a second output network is used to predict the head pose angle of the sample.
In some embodiments, the first determining module is further configured to:
connecting a specified number of convolutional layers in the basic neural network with the convolutional layers of the first output network;
and connecting the last convolution layer of the basic neural network with the convolution layer of the second output network.
Wherein the first output network comprises a key point detection network; the second output network comprises a head pose estimation network.
In some embodiments, the second determining module is further configured to: taking a plurality of convolution layers in the basic neural network as down-sampling convolution layers of a first Hourglass network to construct a first cascade network; wherein the first HourGlass network further comprises the same number of upsampled convolutional layers as the downsampled convolutional layers.
In some embodiments, the second determining module is further configured to: connecting a plurality of Hourglass networks to construct a second cascade network; and connecting a second-level networking network behind the first cascade network to determine a second neural network.
In some embodiments, the training module is further to: training a first loss function of the first output network and a second loss function of the second output network using the sample information; and training the fusion loss function of the second neural network based on the training results of the first loss function and the second loss function.
In some embodiments, the training module is further to: a first loss function of the first output network, a second loss function of the second output network, and a fusion loss function of the second neural network are trained simultaneously using the sample information.
The driving behavior detection device based on the multitask neural network provided by the embodiment of the application can be specific hardware on equipment or software or firmware installed on the equipment. The device provided by the embodiment of the present application has the same implementation principle and technical effect as the foregoing method embodiments, and for the sake of brief description, reference may be made to the corresponding contents in the foregoing method embodiments where no part of the device embodiments is mentioned. It is clear to those skilled in the art that, for convenience and brevity of description, the specific working processes of the foregoing systems, apparatuses and units may refer to the corresponding processes in the foregoing method embodiments, and are not described herein again. The driving behavior detection device based on the multitask neural network provided by the embodiment of the application has the same technical characteristics as the driving behavior detection method based on the multitask neural network provided by the embodiment, so that the same technical problems can be solved, and the same technical effects can be achieved.
The embodiment of the application further provides an electronic device, and specifically, the electronic device comprises a processor and a storage device; the storage means has stored thereon a computer program which, when executed by the processor, performs the method of any of the above described embodiments.
Fig. 5 is a schematic structural diagram of an electronic device according to an embodiment of the present application, where the electronic device 400 includes: a processor 40, a memory 41, a bus 42 and a communication interface 43, wherein the processor 40, the communication interface 43 and the memory 41 are connected through the bus 42; the processor 40 is arranged to execute executable modules, such as computer programs, stored in the memory 41.
The Memory 41 may include a high-speed Random Access Memory (RAM) and may also include a non-volatile Memory (non-volatile Memory), such as at least one disk Memory. The communication connection between the network element of the system and at least one other network element is realized through at least one communication interface 43 (which may be wired or wireless), and the internet, a wide area network, a local network, a metropolitan area network, etc. may be used.
The bus 42 may be an ISA bus, PCI bus, EISA bus, or the like. The bus may be divided into an address bus, a data bus, a control bus, etc. For ease of illustration, only one double-headed arrow is shown in FIG. 4, but that does not indicate only one bus or one type of bus.
The memory 41 is used for storing a program, the processor 40 executes the program after receiving an execution instruction, and the method executed by the apparatus defined by the flow process disclosed in any of the foregoing embodiments of the present invention may be applied to the processor 40, or implemented by the processor 40.
The processor 40 may be an integrated circuit chip having signal processing capabilities. In implementation, the steps of the above method may be performed by integrated logic circuits of hardware or instructions in the form of software in the processor 40. The Processor 40 may be a general-purpose Processor, and includes a Central Processing Unit (CPU), a Network Processor (NP), and the like; the device can also be a Digital Signal Processor (DSP), an Application Specific Integrated Circuit (ASIC), a Field-Programmable Gate Array (FPGA), or other Programmable logic devices, discrete Gate or transistor logic devices, discrete hardware components. The various methods, steps and logic blocks disclosed in the embodiments of the present invention may be implemented or performed. A general purpose processor may be a microprocessor or the processor may be any conventional processor or the like. The steps of the method disclosed in connection with the embodiments of the present invention may be directly implemented by a hardware decoding processor, or implemented by a combination of hardware and software modules in the decoding processor. The software module may be located in ram, flash memory, rom, prom, or eprom, registers, etc. storage media as is well known in the art. The storage medium is located in a memory 41, and the processor 40 reads the information in the memory 41 and completes the steps of the method in combination with the hardware thereof.
Corresponding to the method, the embodiment of the application also provides a computer readable storage medium, wherein the computer readable storage medium stores machine executable instructions, and when the computer executable instructions are called and executed by a processor, the computer executable instructions cause the processor to execute the steps of the method.
In the embodiments provided in the present application, it should be understood that the disclosed apparatus and method may be implemented in other ways. The above-described embodiments of the apparatus are merely illustrative, and for example, the division of the units is only one logical division, and there may be other divisions when actually implemented, and for example, a plurality of units or components may be combined or integrated into another system, or some features may be omitted, or not executed. In addition, the shown or discussed mutual coupling or direct coupling or communication connection may be an indirect coupling or communication connection of devices or units through some communication interfaces, and may be in an electrical, mechanical or other form.
The units described as separate parts may or may not be physically separate, and parts displayed as units may or may not be physical units, may be located in one place, or may be distributed on a plurality of network units. Some or all of the units can be selected according to actual needs to achieve the purpose of the solution of the embodiment.
In addition, functional units in the embodiments provided in the present application may be integrated into one processing unit, or each unit may exist alone physically, or two or more units are integrated into one unit.
The functions, if implemented in the form of software functional units and sold or used as a stand-alone product, may be stored in a computer readable storage medium. Based on such understanding, the technical solution of the present invention may be embodied in the form of a software product, which is stored in a storage medium and includes instructions for causing a computer device (which may be a personal computer, an electronic device, or a network device) to perform all or part of the steps of the method according to the embodiments of the present invention. And the aforementioned storage medium includes: a U-disk, a removable hard disk, a Read-Only Memory (ROM), a Random Access Memory (RAM), a magnetic disk or an optical disk, and other various media capable of storing program codes.
It should be noted that: like reference numbers and letters indicate like items in the figures, and thus once an item is defined in a figure, it need not be further defined or explained in subsequent figures, and moreover, the terms "first," "second," "third," etc. are used merely to distinguish one description from another and are not to be construed as indicating or implying relative importance.
Finally, it should be noted that: the above-mentioned embodiments are only specific embodiments of the present invention, which are used for illustrating the technical solutions of the present invention and not for limiting the same, and the protection scope of the present invention is not limited thereto, although the present invention is described in detail with reference to the foregoing embodiments, those skilled in the art should understand that: any person skilled in the art can modify or easily conceive the technical solutions described in the foregoing embodiments or equivalent substitutes for some technical features within the technical scope of the present disclosure; such modifications, changes or substitutions do not depart from the spirit and scope of the embodiments of the present invention, and they should be construed as being included therein.
Claims (10)
1. A driving behavior detection method based on a multitask neural network is characterized by comprising the following steps:
determining a first neural network based on a convolutional layer of a base neural network, the first neural network comprising: a first output network and a second output network;
determining a second neural network based on the base neural network and the Hourglass network;
determining a third neural network based on the first neural network and the second neural network;
training the third neural network using sample information; the sample information comprises a head image of a sample, the head image comprising face information of the sample;
predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head pose angle of the sample.
2. The multitask neural network-based driving behavior detection method according to claim 1, wherein the step of determining the first neural network based on the convolutional layer of the basic neural network comprises:
connecting a specified number of convolutional layers in the basic neural network with the convolutional layers of the first output network;
and connecting the last convolution layer of the basic neural network with the convolution layer of the second output network.
3. The multitask neural network-based driving behavior detection method of claim 2, wherein the first output network comprises a keypoint detection network; the second output network comprises a head pose estimation network.
4. The multitask neural network-based driving behavior detection method according to claim 1, wherein the step of determining a second neural network based on the base neural network and the Hourglass network includes:
taking a plurality of convolution layers in the basic neural network as down-sampling convolution layers of a first Hourglass network to construct a first cascade network;
wherein the first HourGlass network further comprises the same number of upsampled convolutional layers as the downsampled convolutional layers.
5. The multitask neural network-based driving behavior detection method of claim 4, wherein the step of determining a second neural network based on the base neural network and the Hourglass network further comprises:
connecting a plurality of Hourglass networks to construct a second cascade network;
and connecting the second cascade network after the first cascade network to determine a second neural network.
6. The multitask neural network-based driving behavior detection method according to claim 5, wherein the step of training the third neural network using the sample information includes:
training a first loss function of the first output network and a second loss function of the second output network using sample information;
training a fusion loss function of the second neural network based on training results of the first loss function and the second loss function.
7. The multitask neural network-based driving behavior detection method according to claim 5, wherein the step of training the second neural network using the sample information includes:
training a first loss function of the first output network, a second loss function of the second output network, and a fusion loss function of the second neural network simultaneously using sample information.
8. A multitask neural network-based driving behavior detection apparatus, comprising:
a first determination module to determine a first neural network based on a convolutional layer of a base neural network, the first neural network comprising: a first output network and a second output network;
a second determining module for determining a second neural network based on the basis of the basic neural network and the Hourglass network;
a third determination module to determine a third neural network based on the first neural network and the second neural network;
a training module for training the third neural network using sample information; the sample information comprises a head image of a sample, the head image comprising face information of the sample;
and the prediction module is used for predicting the sample information based on the trained third neural network, wherein the first output network is used for predicting the key point coordinates of the sample, and the second output network is used for predicting the head posture angle of the sample.
9. An electronic device comprising a memory and a processor, wherein the memory stores a computer program operable on the processor, and wherein the processor implements the steps of the method of any of claims 1 to 7 when executing the computer program.
10. A computer readable storage medium having stored thereon machine executable instructions which, when invoked and executed by a processor, cause the processor to execute the method of any of claims 1 to 7.
Priority Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011207383.1A CN112287855B (en) | 2020-11-02 | 2020-11-02 | Driving behavior detection method and device based on multi-task neural network |
Applications Claiming Priority (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
CN202011207383.1A CN112287855B (en) | 2020-11-02 | 2020-11-02 | Driving behavior detection method and device based on multi-task neural network |
Publications (2)
Publication Number | Publication Date |
---|---|
CN112287855A true CN112287855A (en) | 2021-01-29 |
CN112287855B CN112287855B (en) | 2024-05-10 |
Family
ID=74350376
Family Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CN202011207383.1A Active CN112287855B (en) | 2020-11-02 | 2020-11-02 | Driving behavior detection method and device based on multi-task neural network |
Country Status (1)
Country | Link |
---|---|
CN (1) | CN112287855B (en) |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112926553A (en) * | 2021-04-25 | 2021-06-08 | 北京芯盾时代科技有限公司 | Training method and device for motion detection network |
CN113537115A (en) * | 2021-07-26 | 2021-10-22 | 东软睿驰汽车技术(沈阳)有限公司 | Method and device for acquiring driving state of driver and electronic equipment |
Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN107038422A (en) * | 2017-04-20 | 2017-08-11 | 杭州电子科技大学 | The fatigue state recognition method of deep learning is constrained based on space geometry |
CN108229490A (en) * | 2017-02-23 | 2018-06-29 | 北京市商汤科技开发有限公司 | Critical point detection method, neural network training method, device and electronic equipment |
WO2018121737A1 (en) * | 2016-12-30 | 2018-07-05 | 北京市商汤科技开发有限公司 | Keypoint prediction, network training, and image processing methods, device, and electronic device |
CN108664885A (en) * | 2018-03-19 | 2018-10-16 | 杭州电子科技大学 | Human body critical point detection method based on multiple dimensioned Cascade H ourGlass networks |
CN109376681A (en) * | 2018-11-06 | 2019-02-22 | 广东工业大学 | A kind of more people's Attitude estimation method and system |
CN110084180A (en) * | 2019-04-24 | 2019-08-02 | 北京达佳互联信息技术有限公司 | Critical point detection method, apparatus, electronic equipment and readable storage medium storing program for executing |
CN110309723A (en) * | 2019-06-04 | 2019-10-08 | 东南大学 | A kind of driving behavior recognition methods based on characteristics of human body's disaggregated classification |
CN110443222A (en) * | 2019-08-14 | 2019-11-12 | 北京百度网讯科技有限公司 | Method and apparatus for training face's critical point detection model |
CN110503063A (en) * | 2019-08-28 | 2019-11-26 | 东北大学秦皇岛分校 | Fall detection method based on hourglass convolution autocoding neural network |
CN110807380A (en) * | 2019-10-22 | 2020-02-18 | 北京达佳互联信息技术有限公司 | Human body key point detection method and device |
CN111222522A (en) * | 2018-11-23 | 2020-06-02 | 北京市商汤科技开发有限公司 | Neural network training, road surface detection and intelligent driving control method and device |
US20200218883A1 (en) * | 2017-12-25 | 2020-07-09 | Beijing Sensetime Technology Development Co., Ltd. | Face pose analysis method, electronic device, and storage medium |
US20200290608A1 (en) * | 2018-04-20 | 2020-09-17 | Shenzhen Sensetime Technology Co., Ltd. | Forward collision control method and apparatus, electronic device, program, and medium |
CN111814661A (en) * | 2020-07-07 | 2020-10-23 | 西安电子科技大学 | Human behavior identification method based on residual error-recurrent neural network |
-
2020
- 2020-11-02 CN CN202011207383.1A patent/CN112287855B/en active Active
Patent Citations (14)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2018121737A1 (en) * | 2016-12-30 | 2018-07-05 | 北京市商汤科技开发有限公司 | Keypoint prediction, network training, and image processing methods, device, and electronic device |
CN108229490A (en) * | 2017-02-23 | 2018-06-29 | 北京市商汤科技开发有限公司 | Critical point detection method, neural network training method, device and electronic equipment |
CN107038422A (en) * | 2017-04-20 | 2017-08-11 | 杭州电子科技大学 | The fatigue state recognition method of deep learning is constrained based on space geometry |
US20200218883A1 (en) * | 2017-12-25 | 2020-07-09 | Beijing Sensetime Technology Development Co., Ltd. | Face pose analysis method, electronic device, and storage medium |
CN108664885A (en) * | 2018-03-19 | 2018-10-16 | 杭州电子科技大学 | Human body critical point detection method based on multiple dimensioned Cascade H ourGlass networks |
US20200290608A1 (en) * | 2018-04-20 | 2020-09-17 | Shenzhen Sensetime Technology Co., Ltd. | Forward collision control method and apparatus, electronic device, program, and medium |
CN109376681A (en) * | 2018-11-06 | 2019-02-22 | 广东工业大学 | A kind of more people's Attitude estimation method and system |
CN111222522A (en) * | 2018-11-23 | 2020-06-02 | 北京市商汤科技开发有限公司 | Neural network training, road surface detection and intelligent driving control method and device |
CN110084180A (en) * | 2019-04-24 | 2019-08-02 | 北京达佳互联信息技术有限公司 | Critical point detection method, apparatus, electronic equipment and readable storage medium storing program for executing |
CN110309723A (en) * | 2019-06-04 | 2019-10-08 | 东南大学 | A kind of driving behavior recognition methods based on characteristics of human body's disaggregated classification |
CN110443222A (en) * | 2019-08-14 | 2019-11-12 | 北京百度网讯科技有限公司 | Method and apparatus for training face's critical point detection model |
CN110503063A (en) * | 2019-08-28 | 2019-11-26 | 东北大学秦皇岛分校 | Fall detection method based on hourglass convolution autocoding neural network |
CN110807380A (en) * | 2019-10-22 | 2020-02-18 | 北京达佳互联信息技术有限公司 | Human body key point detection method and device |
CN111814661A (en) * | 2020-07-07 | 2020-10-23 | 西安电子科技大学 | Human behavior identification method based on residual error-recurrent neural network |
Non-Patent Citations (8)
Title |
---|
ADIA: "论文笔记Stacked Hourglass Networks", pages 1 - 11, Retrieved from the Internet <URL:《 https://zhuanlan.zhihu.com/p/45002720》> * |
JING YANG: "Stacked Hourglass Network for Robust Facial Landmark Localisation", 《2017 IEEE CONFERENCE ON COMPUTER VISION AND PATTERN RECOGNITION WORKSHOPS (CVPRW)》, pages 1 - 10 * |
KEVAN YUEN: "An Occluded Stacked Hourglass Approach to Facial Landmark Localization and Occlusion Estimation", 《IEEE TRANSACTIONS ON INTELLIGENT VEHICLES ( VOLUME: 2, ISSUE: 4, DECEMBER 2017)》, 13 November 2017 (2017-11-13), pages 1 - 10 * |
KEVAN YUEN: "Looking at Faces in a Vehicle: A Deep CNN Based Approach and Evaluation", 《2016 IEEE 19TH INTERNATIONAL CONFERENCE ON INTELLIGENT TRANSPORTATION SYSTEMS (ITSC)》, pages 649 - 654 * |
佚名: "卷积神经网络的网络结构——Hourglass", pages 1 - 10, Retrieved from the Internet <URL:《https://blog.csdn.net/u013841196/article/details/81048237 》> * |
夏瀚笙: "基于Stacked Hourglass网络的二维人体骨骼关键点检测", 《万方》, 22 April 2020 (2020-04-22), pages 1 - 56 * |
夏瀚笙: "基于Stacked Hourglass网络的二维人体骨骼关键点检测", 《万方》, pages 1 - 67 * |
都文龙: "基于多尺度级联沙漏网络的人体姿势估计", 《中国优秀硕士学位论文全文数据库 (信息科技辑)》, 15 January 2020 (2020-01-15), pages 138 - 1241 * |
Cited By (2)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN112926553A (en) * | 2021-04-25 | 2021-06-08 | 北京芯盾时代科技有限公司 | Training method and device for motion detection network |
CN113537115A (en) * | 2021-07-26 | 2021-10-22 | 东软睿驰汽车技术(沈阳)有限公司 | Method and device for acquiring driving state of driver and electronic equipment |
Also Published As
Publication number | Publication date |
---|---|
CN112287855B (en) | 2024-05-10 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
US20230408278A1 (en) | Path planning method, apparatus and electronic device | |
JP6651006B2 (en) | Automatic discovery method of gas station POI, automatic discovery device, storage medium and device | |
CN112287855A (en) | Driving behavior detection method and device based on multitask neural network | |
CN109993980B (en) | Traffic flow prediction method, apparatus, computer device and storage medium | |
CN108228442B (en) | Abnormal node detection method and device | |
CN111111176B (en) | Method and device for managing object LOD in game and electronic equipment | |
CN111062642A (en) | Method and device for identifying industrial risk degree of object and electronic equipment | |
CN111091068B (en) | Density estimation model training method and device, storage medium and electronic equipment | |
CN117197796A (en) | Vehicle shielding recognition method and related device | |
CN113111580A (en) | Method for constructing battery SOH evaluation model and method for evaluating battery SOH value | |
CN114048573B (en) | Method, device, equipment and medium for evaluating life of turbine blade of aircraft engine | |
CN112785844B (en) | Curve coordinate system establishing method and device and electronic equipment | |
CN110796115B (en) | Image detection method and device, electronic equipment and readable storage medium | |
CN116502131B (en) | Bearing fault diagnosis model training method and device based on transfer learning | |
JP2013011987A (en) | Abnormal state detection device and abnormal state detection method | |
CN115775366A (en) | Road target detection method, detection device, electronic equipment and storage medium | |
CN111352825B (en) | Data interface testing method and device and server | |
CN108959028B (en) | Method and device for analyzing life cycle of disk | |
CN112907257A (en) | Risk threshold determining method, abnormality detecting device and electronic equipment | |
CN113221907A (en) | Vehicle part segmentation method, device, equipment and storage medium | |
TWI841040B (en) | Electronic device and method for determining scenario data of self driving car | |
CN107135137B (en) | Network group identification method and device | |
CN112987596B (en) | Test method and test device for unmanned equipment | |
CN113299345B (en) | Virus gene classification method and device and electronic equipment | |
JP7350216B2 (en) | Design support device, design support method, and design support program |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
PB01 | Publication | ||
PB01 | Publication | ||
SE01 | Entry into force of request for substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
GR01 | Patent grant | ||
GR01 | Patent grant |