Nothing Special   »   [go: up one dir, main page]

WO2021253961A1 - Intelligent visual perception system - Google Patents

Intelligent visual perception system Download PDF

Info

Publication number
WO2021253961A1
WO2021253961A1 PCT/CN2021/087921 CN2021087921W WO2021253961A1 WO 2021253961 A1 WO2021253961 A1 WO 2021253961A1 CN 2021087921 W CN2021087921 W CN 2021087921W WO 2021253961 A1 WO2021253961 A1 WO 2021253961A1
Authority
WO
WIPO (PCT)
Prior art keywords
target
camera
visual perception
target detection
information
Prior art date
Application number
PCT/CN2021/087921
Other languages
French (fr)
Chinese (zh)
Inventor
李丰
万成凯
Original Assignee
北京世纪瑞尔技术股份有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 北京世纪瑞尔技术股份有限公司 filed Critical 北京世纪瑞尔技术股份有限公司
Publication of WO2021253961A1 publication Critical patent/WO2021253961A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/61Control of cameras or camera modules based on recognised objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/667Camera operation mode switching, e.g. between still and video, sport and normal or high- and low-resolution modes
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/69Control of means for changing angle of the field of view, e.g. optical zoom objectives or electronic zooming

Definitions

  • the invention relates to the field of security and protection, in particular to an intelligent visual perception system.
  • Common movable cameras include pan-tilt cameras, PTZ cameras, etc., which usually have three control parameters of horizontal rotation, vertical rotation and focal length change. By adjusting these parameters, the motion camera can not only change the focal length to obtain different resolution information of the target or area in the shooting scene, but also change the angle to obtain different field of view direction information of the target or area in the shooting scene.
  • these cameras when shooting scenes in different areas, use the pan-tilt to control the direction of the camera's field of view, and the field of view direction is preset. This method only realizes an automatic monitoring.
  • a multi-camera is a commonly used imaging device.
  • the body of the multi-camera is equipped with multiple lenses, and the multiple lenses are distributed along the circumferential direction of the body.
  • the multiple lenses of the current multi-eye camera are usually fixed on the base to perform shooting in various angles and directions.
  • the images taken by multiple lenses can be combined into a panoramic image of a multi-lens camera.
  • the main function of these multi-eye cameras is to achieve panoramic azimuth video shooting, and the discoverable distance of the target and the recognizable distance of the target are both small. If the focal lengths of the lenses of these multi-cameras are all lengthened, these multi-cameras are relatively large in size and not suitable for actual scene applications.
  • the system cannot automatically capture, track, amplify and collect these targets automatically, and cannot obtain detailed information about the targets.
  • the present invention provides an intelligent visual perception system that can find targets in real time, track targets, recognize targets, and alert in time, improve the accuracy and accuracy of the system in capturing environmental abnormalities, and make up for the prior art
  • the shortcomings can be found targets in real time, track targets, recognize targets, and alert in time, improve the accuracy and accuracy of the system in capturing environmental abnormalities, and make up for the prior art
  • the present invention provides an intelligent visual perception system, wherein the system includes a variable focal length camera and a false target feedback feature information database in a designated monitoring area;
  • the camera collects video images of the monitored area at the first resolution, and uses a target detection algorithm for the collected video images to perform a target detection in the area, including: judging whether the designated monitoring area contains suspected targets based on the false target feedback feature information database ;
  • the camera When it is found that the monitoring area contains a suspected target, adjust the camera to collect a video image of the suspected target at the second resolution, and based on the collected video image, use a secondary target detection algorithm to perform secondary target detection to determine the suspected target as True target or false target;
  • the second resolution is greater than the first resolution.
  • the invention detects the valuable target in the area twice, can take into account the contradiction between the larger scene of the monitoring area and the target details, and can detect the suspicious target in the larger monitoring area through one target detection, with large resolution
  • the detection results on the rate images often have high accuracy, and reliable detection results can be obtained through secondary target detection.
  • the target feedback feature information database includes location information of the fake target and feature description information of the fake target;
  • the video image collection of the monitoring area by the camera at the first resolution includes: the video image collection of the monitoring area by the camera with a large field of view and a small resolution;
  • a target detection includes: based on the false target feedback feature information database, judging whether the image features in the monitoring area meet the specified conditions with the false target feedback feature description and location information matching degree of the area. If it is met, it is not judged as a suspected target;
  • the target detection algorithm includes a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background;
  • the monitoring area contains one or more suspected targets
  • adjust the camera's field of view direction and angle of view and perform video image acquisition on the suspected target at the second resolution, including: successively performing the suspected target with a small field of view and a large resolution Video image collection;
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background
  • Updating the false target feedback feature information database according to the information of the false target includes: mapping the false target determined under the large resolution image to the small resolution image, obtaining the position information of the false target, extracting the description information of the false target feature, and updating the false target Feedback feature information database;
  • the suspected target is a real target
  • track and monitor the real target and output large-resolution video data, output alarm information and/or target characteristic information
  • Small field of view and large resolution means that by adjusting the PTZ parameters, the target is zoomed to 1/10 to 4/5 of the height of the video image.
  • Setting the false target feedback feature information database allows targeted screening of targets, which is different from the general calculation mode, can improve the accuracy of the system to capture environmental abnormalities, and has achieved unexpected results.
  • the suspected target is a fake target in the secondary target detection
  • the fake target is described, and the characteristics are fed back to the video image target detection algorithm, which can make the system perform a target detection in this area with a large field of view and a small resolution, and the fake target is merged Features to reduce the probability of false targets being misjudged as suspected targets again.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the invention not only ensures that the largest possible area can be monitored, but also the reliability of the monitoring results. After the useful target is detected, the details of the target can be tracked continuously.
  • the present invention provides an intelligent visual perception system, wherein the small field of view and large resolution refers to adjusting the PTZ parameters to zoom the target to 1/6 to 2/3 of the height of the video image and adjust it to the center of the field of view.
  • the present invention provides an intelligent visual perception system, wherein the target feature information includes feature information that characterizes the specific classification of the target and/or feature information that characterizes the specific identity of the target; the feature information that characterizes the specific classification of the target includes humans, animals, One or more of vehicles and/or models, flying objects, and designated foreign objects, where the designated foreign objects include one or more of natural falling objects and/or diffuse objects, and human remains; those that characterize the specific identity of the target
  • the characteristic information includes one or more of the identity of the person, the type of the animal, the license plate of the car, and the type of other foreign objects.
  • the intelligent visual perception system is applied in the security field.
  • the specific classification information and target of the target are represented
  • the specific identity information is sent to the monitoring terminal to make the alarm information true, specific, and credible, which is beneficial to the operator to take reasonable measures to eliminate the intrusion target in the follow-up, and to improve the efficiency of emergency response.
  • the intelligent visual perception system provided by the present invention sets an update rate for the false target feature description information, and optimizes the false target feedback feature information database.
  • the present invention sets the update rate for the feature description information of the false target, so that the false target feedback feature information database has time characteristics, which is more realistic, accelerates the self-adaptation and self-learning speed of the feature database, and further improves the accuracy of one-time detection of the system. Performance, thereby improving the overall performance of the system.
  • the present invention provides an intelligent visual perception system, wherein there are N monitoring areas, where N ⁇ 1; the system monitoring process includes:
  • the inventor improves the detection efficiency of the monitoring area by setting the target tracking conditions, such as whether a suspected target is found, the target disappearance time, and the tracking time setting value, etc., and realizes the sequential detection of true targets in the area. And tracking, as well as the sequential detection of multiple monitoring areas, further improve the detection area size and detection distance of a single device, thereby improving equipment utilization, reducing equipment costs and floor space, and having good economic benefits.
  • the target tracking conditions such as whether a suspected target is found, the target disappearance time, and the tracking time setting value, etc.
  • the present invention provides an intelligent visual perception system, in which a target detection is performed on all N monitoring areas respectively, and the respective feature weight of each monitoring area is calculated for the area where a suspect target is found in a target detection; according to each monitoring area The size of the respective feature weights is used to perform secondary target detection and tracking for each monitoring area in the specified order.
  • the inventor can calculate the occurrence probability of each monitoring area target and other attributes, ensuring that the system can prioritize the monitoring of areas with relatively large feature weights, which is beneficial to hierarchical early warning and enables operators Prioritize the processing of more suspected targets, or the target moving speed is relatively fast, or other focused monitoring areas in the specific monitoring field where the characteristics of the target are concentrated, and prioritize the elimination of potential safety hazards in this area, thereby reducing the alarm response time of the entire security system .
  • this setting of the priority of regional monitoring can enable railway workers to prioritize the removal of important obstacles affecting the passage of trains on the track and its surroundings, such as a large number of sheep passing by, and road collapse covering the railway track. It won time for incident intervention and system linkage, thus saving most of the casualties and hundreds of millions of property losses.
  • the present invention provides an intelligent visual perception system, wherein the system includes one or more intelligent visual perception devices, and the intelligent visual perception devices include:
  • Camera used for video image acquisition, including one or more of focus motor, zoom motor, drive module, image signal acquisition and processing unit;
  • Transmission mechanism used to adjust the direction and size of the camera's field of view
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the direction and size of the camera's field of view, and/or exchange information with the cloud platform or data center;
  • the communication interface unit is used for information interaction with the cloud platform and/or data center, and/or linkage information interaction with other sensing or action devices on site, and/or other related systems, including wired and/or wireless interfaces;
  • the power management unit is used to supply power to all power-consuming units
  • the protective shell is used to encapsulate each unit and play a protective role.
  • the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device further includes a light unit for supplementing light to the area monitored by the camera; there are one or more light units, including visible light sources and infrared light sources. One or more of the light sources; the transmission mechanism adjusts the direction and size of the field of view of the light unit.
  • the intelligent visual perception device further includes a light unit for supplementing light to the area monitored by the camera; there are one or more light units, including visible light sources and infrared light sources.
  • the transmission mechanism adjusts the direction and size of the field of view of the light unit.
  • video image acquisition and tracking can be achieved through camera focusing, illumination unit fill light, transmission mechanism to drive the angle adjustment of the camera and illumination power supply; suspected target discovery, true and false target judgment, false target feature description, etc., need
  • the data processing unit implements the video image target detection algorithm to achieve this.
  • the intelligent visual perception device provided by the present invention, through the combination of the above hardware and software, can realize primary target detection and secondary target detection in the monitoring area, distinguish whether the suspected target is a true target or a false target; describe the characteristics of the false target, and Real target tracking, etc., is achievable and technologically advanced.
  • the intelligent visual perception system of the present invention includes one or more of the above-mentioned intelligent visual perception devices, which is beneficial to expand the monitoring area, covers a relatively large area and relatively high security level monitoring scene, and provides a full range of safety warnings.
  • the present invention provides an intelligent visual perception system, wherein the system includes one or more intelligent visual perception devices, in the intelligent visual perception device, there are one or more cameras, and the cameras are visible light cameras and/or infrared cameras , Infrared cameras include near-infrared cameras and/or infrared thermal imaging cameras.
  • one or more cameras complete the video image acquisition in the primary target detection, and/or the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
  • the infrared camera completes the video image acquisition in the primary target detection; the visible light camera completes the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
  • a visible light camera completes the video image acquisition in the primary target detection, and/or the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
  • the infrared thermal imaging camera has the characteristics of working normally in the outdoor natural environment of -40°C ⁇ +70°C (direct sunlight), can penetrate smoke, fog, and haze, has high image clarity, can night vision and is sensitive to the temperature of the shooting object, etc. , Used for a target detection, it is sensitive to the identification of moving objects that can usually generate early warning signals, especially living things.
  • the visible light camera has the advantages of stable performance and high camera clarity.
  • the infrared thermal imaging camera performs a target detection and finds the suspected target.
  • the visible light camera is used for secondary target detection, and the suspected target is amplified to realize the judgment of the true target and the false target. , Can meet the requirements of system stability and practicability under harsh environmental conditions, and meet the needs of accurate system judgment.
  • the transmission mechanism is used to adjust the horizontal and/or vertical field of view direction and size of the camera and/or the light unit, including a drive motor, a horizontal shaft, and a vertical shaft; the drive motor drives the camera and/or the light unit around the shaft
  • the horizontal rotation is 0-360 degrees
  • the vertical rotation is 0-180 degrees.
  • the transmission mechanism by driving the camera and the light unit to rotate at a certain angle around the horizontal and vertical shafts, meets the requirements of the camera to collect video images to cover the area that needs to be monitored, and realizes that in the prior art with a simple device structure, multiple
  • the camera can cooperate with each other to realize the image collection needs, and the camera angle can be stably controlled, the stability of the image collection during real target tracking can be realized, and the object of the present invention can be further achieved.
  • the data processing unit adopts a chip with video image processing capabilities, integrates image target detection algorithm programs, and performs real-time video image processing; when the suspected target is identified as a false target, the false target is characterized, and the characteristics are fed back Give the video image target detection algorithm program.
  • the system judges that there is a suspected target, and after the second target detection, the suspected target is found to be a false target, it means that the features of the false target are not included in the feature database, which leads to additional actions of the system. Therefore, the false target is characterized and fed back to the video image target detection algorithm, which is conducive to the rapid identification of specific obstacles that may appear or non-obstacles that appear regularly in a specific monitoring field.
  • the communication interface unit includes an input interface and an output interface; the input interface is used to receive external device signals; the output interface is used to send signals collected or received by the system, and the connection mode includes wireless and/or wired modes; among them, the wireless mode Including one or more of WIFI, BT, ZIGBEE, LORA, 2G, 3G, 4G, 5G, NB-IOT; wired methods include AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, One or more of optical fibers.
  • the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device receives signals from other sensing or motion devices in the monitoring area through an input interface, and when the sensing or motion device sends abnormal situation signals and/ Or when there is an alarm message, the intelligent visual perception device adjusts the direction and focal length of the camera's field of view, and prioritizes target detection in the area where the sensor or action device is located.
  • the intelligent visual perception device can receive the sensor equipment signal in the monitoring area, process the dangerous signal recognized by the external sensor, give priority to the target detection in the area, and realize the secondary verification and target recognition of the information sent with the external sensor , Alarm and characteristic information sending.
  • This linkage mechanism truly realizes the interconnection of sensor devices, saves the system's response time to external early warning events, and at the same time makes up for the fact that external sensors do not understand the specific characteristics of the source of danger, and cannot accurately reflect the alarm information received, even The real problem of being at a loss as to what to do, and wasting the precious saving time before the accident, so as to truly play the role of safety early warning of this system.
  • the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device provides its perception data and/or result information to other sensing or action devices in the monitoring area through an output interface for data fusion and / Or joint judgment, and / or direct control or linkage of sensing or action devices, that is, system linkage.
  • the present invention provides an intelligent visual perception system, wherein other sensing or action devices in the monitoring area include alarm sound and light equipment, access control equipment, fire fighting equipment, obstacle removal equipment, animal repelling equipment, trailer equipment, and cleaning equipment.
  • One or more of the information includes alarm information and/or target characteristic information.
  • the intelligent visual perception device can provide sensory data output to other systems or equipment in the monitoring area. These systems or devices can perform data fusion and joint judgment on the data signals output by the intelligent visual perception device, and can also directly control other local devices. Since the intelligent visual perception device of the present invention can directly control external sensing equipment or system linkage, when there is a hidden accident or emergency danger in the monitoring area, it not only sends alarm information and/or target feature information to the monitoring terminal, but also It can also control or link alarm sound and light equipment, access control equipment, fire fighting equipment, obstacle removal equipment, etc., to eliminate accidental hazards or sources of danger in a targeted manner, and perform emergency braking and on-site diversion of equipment that is operating closely.
  • Measures such as establishing contact between the communication equipment and the monitoring area, reduce the external response time to the alarm information sent by the system, seize the golden gap before the accident, and realize the emergency avoidance of personnel and transportation devices in the monitoring area. This will avoid accidents to the greatest extent, reduce accident damage, and protect the safety of personnel and property, which is of great practical significance.
  • the present invention provides an intelligent visual perception system, wherein the power management unit includes a battery integrated inside the system, and/or an external solar panel, and/or a wired power supply.
  • the power management unit includes a battery integrated inside the system, and/or an external solar panel, and/or a wired power supply.
  • the protective housing includes an interface board, a window, and a fixing seat;
  • the window adopts light-transmitting material to respectively transmit the video image collected by the camera and/or the light emitted by the light unit;
  • the fixing seat is used to fix the protective shell on the external bracket.
  • the protective shell design of the intelligent visual perception device can make the camera, lighting unit, data processing unit, transmission mechanism, communication interface unit, power management unit, etc. of the device be protected by a solid shell.
  • the protective shell is provided with windows for projecting video images and light emitted by the lighting unit, using light-transmitting materials, etc., so that the camera and the lighting unit do not directly contact the external environment, thereby improving the stability and service life of the system.
  • Different numbers of window designs can realize the specific image acquisition needs of the present invention under different application conditions.
  • the present invention provides an intelligent visual perception system, wherein the alarm information and/or target characteristic information are output to the monitoring terminal, and/or the alarm information and/or target characteristic information is output to the data center and/or cloud platform, Start the alarm processing service, distribute information to the monitoring terminal, and complete one or more functions including alarm handling and/or intervention and/or system linkage.
  • the present invention provides an intelligent visual perception system, wherein the cloud platform and/or data center includes a server and software for image recognition through image target detection algorithms; monitoring terminals are one or more for display Target recognition result information, receiving alarm information, remote configuration and control, including intelligent terminal equipment and its running management software, intelligent terminal equipment includes but not limited to one or more of computers and handhelds.
  • the cloud platform and/or data center includes a server and software for image recognition through image target detection algorithms; monitoring terminals are one or more for display Target recognition result information, receiving alarm information, remote configuration and control, including intelligent terminal equipment and its running management software, intelligent terminal equipment includes but not limited to one or more of computers and handhelds.
  • the alarm information, target feature information, etc. sent by the intelligent visual recognition system of the present invention can be sent to the monitoring terminal through the cloud platform and/or data center, ensuring the stability of data transmission and the remote control of the system.
  • the data center combined with other feature databases, it is possible to further identify the true target, including identity information, so as to issue more accurate alarm information and feature information reports.
  • This solution makes up for the shortcomings in the prior art that in the security field, the monitoring equipment sends out fuzzy information, which causes the monitoring terminal to be unable to eliminate the obstacles and eliminate the danger of accidents in time after receiving the information.
  • the present invention provides an intelligent visual perception system, wherein the server includes a virtual server, including one or more of a local server, an edge cloud, and a public cloud.
  • the server includes a virtual server, including one or more of a local server, an edge cloud, and a public cloud.
  • the intelligent visual perception system can find targets, track targets, recognize targets and generate alarm information and target feature recognition information in real time, eliminate non-suspected targets through one target detection, and distinguish between suspected targets through secondary target detection. True and false targets, and feature descriptions of false targets to supplement the feature library, reduce the rate of misjudgment of a target detection, and continue to track true targets.
  • the present invention realizes the intelligent capture, discrimination and feature description of environmental abnormalities, and solves the problem of non-intelligent monitoring of the environment in the security field in the prior art.
  • the operator needs to keep an eye on the screen, and artificially judge whether there is environmental hazard in real time. Resolution, distinguish the actual dangers in the distant place, there is a great possibility of misjudgment, which affects the emergency response time, which delays the timing of accident rescue, and may cause hundreds of millions of property losses and many practical problems of casualties. Has great social value.
  • Figure 1 shows the work flow of the intelligent visual perception system provided by the present invention
  • Fig. 2 shows the working process of target detection and/or identification for multiple monitoring areas provided by the present invention
  • FIG. 3 shows a workflow of performing target detection and/or recognition in sequence by setting area weights provided by the present invention
  • Figure 4 shows the data flow of the intelligent visual perception system provided by the present invention
  • Figure 5 shows a single-camera intelligent visual perception device provided by the present invention
  • Figure 6 shows a single-camera, single-light source intelligent visual perception device provided by the present invention
  • Figure 7 shows a dual-camera, dual-light source intelligent visual perception device provided by the present invention
  • Figure 8 shows a dual-camera, single-light source intelligent visual perception device provided by the present invention
  • Figure 9 shows that the system provided by the present invention finds that there is a suspected target in the distance after a target detection in a railway scene
  • FIG. 10 shows that the system provided by the present invention finds a true target through secondary target detection in a railway scene
  • Figure 11 shows that the system provided by the present invention tracks real targets in a railway scene.
  • the present invention is an application of computer technology in the field of security and protection.
  • the application of multiple software function modules will be involved. The applicant believes that after carefully reading the application documents, accurately understanding the realization principle and purpose of the invention, and combining the existing known technologies, those skilled in the art can fully use the software programming skills they have mastered to realize the invention. .
  • the intelligent visual perception system includes a variable focal length camera and a false target feedback feature information library in a designated monitoring area.
  • First establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
  • the system adjusts the camera's field of view direction and focal length, sets the PTZ parameters, so that the video image covers the preset monitoring area that needs to be monitored, and the large field of view and small resolution perform video image collection on the designated monitoring area .
  • a target detection algorithm is used to perform a target detection in the area on the video image collected by the camera.
  • Perform a target detection in the area including: judging whether the designated monitoring area contains suspected targets based on the false target feedback feature information database. Further, based on the false target feedback feature information database, it is determined whether the image features in the monitoring area meet the specified conditions with the false target feedback feature description and location information matching degree of the area. If it is satisfied, it is not judged as a suspected target.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, for example, when the matching degree reaches 90%, it is considered that the area has a high matching degree with the false target feedback feature description.
  • the object at the corresponding location of the target feedback feature description is not a suspected target, so the area will be classified as a background with a high probability, that is, when the corresponding location of the false target feedback feature description in the area matches the specified value, it will not be classified Classified as a suspected target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • a target detection When a target detection is found to be undoubtedly a target, adjust the camera's field of view direction and focal length, use a large field of view and small resolution to capture the video image of the next designated monitoring area, that is, the preset monitoring area, and use a target detection algorithm to control the camera.
  • the collected video images are subjected to a target detection in the area.
  • the suspected target is captured with a second resolution, and the second resolution is greater than the first resolution.
  • the suspected targets in the monitoring area are collected sequentially with a small field of view and large resolution, and a secondary target detection algorithm is adopted to perform secondary target detection on the collected video images to distinguish the suspected targets as true targets. Or false goals.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the alarm information is output to the monitoring terminal, and target characteristic information is generated. Adjust the direction and focal length of the camera's field of view, and continue to track the target for a period of time. At the same time, the target feature information generated by the secondary target detection is sent to the monitoring terminal.
  • Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, designated foreign objects, that is, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks
  • feature information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, and the license plate of the vehicle.
  • the specific information types and contents are defined according to the specific application environment.
  • the secondary target detection finds that the suspected target of the primary target detection is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image to obtain the position information of the false target, and extract the false target feature description information.
  • Update the false target feedback feature information database to reduce the false detection probability of a target detection in the system.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • target detection and/or recognition are performed on N (0 ⁇ N) preset monitoring areas, and finally, target detection and/or recognition are restarted from the first preset monitoring area, and the execution is repeated The above steps.
  • the M-th monitoring area perform secondary detection on T suspected targets.
  • the secondary detection with large field of view and small resolution means that the video image covers the area that needs to be monitored by setting the PTZ parameters; then The next area, that is, the M+1 monitoring area, performs a second inspection.
  • Small field of view and large resolution means that by adjusting the PTZ parameters, the target height is zoomed to 1/10 to 4/5 of the video image height, and adjusted to the center of the field of view.
  • the system adjusts the direction of the camera's field of view, sets the PTZ parameters so that the video image covers the area that needs to be monitored, and collects video images in the first preset monitoring area with the large field of view and small resolution.
  • a target detection algorithm is used for the video image collected by the camera to perform a target detection in the first preset monitoring area.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the intelligent visual perception device When a target detection is found to be undoubtedly a target, the intelligent visual perception device continues to collect video images of the next preset monitoring area with a large field of view and small resolution.
  • a target detection algorithm is used to perform a target in the area on the collected video image. Detection.
  • the first preset monitoring area feature weight Q1 is given; at the same time, the system continues to collect video images of other preset monitoring areas with a large field of view and small resolution, and uses a secondary target detection algorithm , Perform a target detection in the monitoring area on the collected video images.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the system monitoring process includes:
  • a target detection is performed on all N monitoring areas respectively, and the respective feature weight of each monitoring area is calculated for the area where a suspected target is found in a target detection; according to the size of the respective feature weight of each monitoring area, according to Perform secondary target detection and tracking on each monitoring area in the specified order.
  • a target detection of suspected targets is performed on N (0 ⁇ N) preset monitoring areas in sequence, the area where the suspected target exists is found, and the feature weight Qi is given to the area where the suspected target exists.
  • the feature weight Qi is given to the area where the suspected target exists.
  • the system adjusts the camera's field of view direction and focal length, adjusts the PTZ parameters, zooms the target height, and collects the video image of the suspected target in the first area where the suspected target exists with a small field of view and large resolution, and passes the video image target detection algorithm to The collected video images are subjected to secondary target detection.
  • the first alarm information is output, and the first target characteristic information is generated. Adjust the direction and focal length of the camera's field of view, and continue to track the target for a period of time.
  • the first target feature information generated by the secondary target detection is sent to the cloud platform or data center, and the cloud platform or data center is based on the target feature sent by the intelligent visual perception device Information 1 passes through the target recognition algorithm to perform target recognition, and sends the obtained second target characteristic information and second alarm information to the monitoring terminal; the monitoring terminal responds to the second alarm information, the first target characteristic information and/or the second target Characteristic information is processed and displayed.
  • the first target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides , And one or more of human remains.
  • the second target characteristic information includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the secondary target detection finds that the suspected target of the primary target detection is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the false target feature description information is extracted
  • the false target feature description information is set Determine the update rate, optimize the false target feedback feature information database, and further reduce the probability of false detection of a target detection in the system.
  • the detection and/or recognition method is the same as the detection and/or recognition method for the first area target where there is a suspected target.
  • the target detection and/or recognition is performed again on the first preset monitoring area, and the above steps are automatically executed, and the target detection and/or recognition is performed on the N areas in a loop.
  • the large field of view and small resolution means that the video image covers the area that needs to be monitored by setting the PTZ parameters
  • Small field of view and large resolution means that by adjusting the PTZ parameters, the target height is zoomed to 1/6 to 2/3 of the video image height, and adjusted to the center of the field of view.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • a structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, and a protective casing.
  • the shell is left with a window and power and signal line interfaces.
  • the window is sealed with a light-permeable material for the camera to collect video images.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line.
  • the drive motor drives the camera to rotate 0-360 degrees horizontally around the axis of rotation, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the camera angle, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the transmission mechanism controls the left and right swing positions of the camera.
  • the intelligent visual perception device includes a visible light camera 501, a transmission mechanism 502, a data processing unit 503, a communication interface unit 504, a power management unit 505, and a protective housing 507.
  • the protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the fixing seat is used to fix the protective shell and can be fixed on the external bracket.
  • the housing has a window 508 and a power and signal line interface board 510.
  • the window 508 is sealed with a light-permeable material for video image capture by the camera.
  • the transmission mechanism 502 adjusts the up, down, left, and right field of view directions of the camera 501 by receiving control commands sent by the data processing unit 503.
  • the communication interface unit 504 mainly includes wired and wireless communication interfaces.
  • the power management unit 505 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 503 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view ,
  • the camera 501 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 501 is adjusted to collect video images in this area with a large field of view and small resolution.
  • the data processing unit 503 performs a video image target detection algorithm on the collected The video image performs a target detection of the target.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the field of view direction of the camera 501 so that the field of view direction of the camera 501 is aligned
  • the PTZ parameters By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/3 of the height of the video image, and the video image of the suspected target is collected with this small field of view and large resolution.
  • the data processing unit 503 performs a secondary target detection algorithm on the collected video image. Perform secondary target detection.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 503 When the secondary target detection determines that it is a true target, the data processing unit 503 outputs alarm information and generates the true target characteristic information. At the same time, the data processing unit 503 sends control commands to the camera 501 and the transmission mechanism 502, the transmission mechanism 502 adjusts the direction of the field of view of the camera 501, adjusts the focal length of the camera 501 in real time, and tracks the real target detected in real time. At the same time, the data processing unit 503 sends the characteristic information and alarm information of the real target to the monitoring terminal.
  • Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the field of view of the camera 501 so that The direction of the field of view of the camera 501 is aimed at the next real target for tracking.
  • the secondary target detection determines that it is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 503 characterizes the false target and extracts the false target feature description Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • a structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, and a protective shell.
  • the shell is left with a window and power and signal line interfaces.
  • the window is sealed with a light-permeable material for the camera to collect video images.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line.
  • the drive motor drives the camera to rotate 0-360 degrees horizontally around the axis of rotation, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the camera angle, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the transmission mechanism controls the left and right swing positions of the camera.
  • the intelligent visual perception device includes a visible light camera 501, a transmission mechanism 502, a data processing unit 503, a communication interface unit 504, a power management unit 505, and a protective housing 507.
  • the protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the fixing seat is used to fix the protective shell and is fixed on the external bracket.
  • the housing has a window 508 and a power and signal line interface board 510.
  • the window 508 is sealed with a light-permeable material for video image capture by the camera.
  • the transmission mechanism 502 adjusts the up, down, left, and right field of view directions of the camera 501 by receiving control commands sent by the data processing unit 503.
  • the communication interface unit 504 mainly includes wired and wireless communication interfaces.
  • the power management unit 505 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 503 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view ,
  • the camera 501 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 501 is adjusted to collect video images in this area with a large field of view and small resolution.
  • the data processing unit 503 performs a video image target detection algorithm on the collected The video image performs a target detection of the target.
  • the data processing unit 503 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; set the PTZ parameter to a large field of view and a small resolution Cover the second preset monitoring area, perform video image collection on this area, transmit the collected video image to the data processing unit 503 in real time, and perform a target detection of the target through the video image target detection algorithm.
  • each area feature weight Qi (0 ⁇ i ⁇ N) is obtained from the number of suspected targets in each area.
  • the intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 501.
  • the height of the suspected target is zoomed to 1/2 of the height of the field of view, and the first area with the suspected target is performed with the small field of view and large resolution.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 503 sends control commands to the camera 501 and the transmission mechanism 502.
  • the transmission mechanism 502 adjusts the field of view direction of the camera 501, adjusts the focal length of the camera 501 in real time, and tracks the real target detected in real time;
  • the target characteristic information 1 of the detected real target is sent to the cloud platform or data center;
  • the cloud platform or data center is based on the target characteristic information 1 sent by the intelligent visual perception device through the target recognition algorithm, and the target is identified, and the target characteristic information is obtained 2
  • alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view. Make the field of view of the camera 501 align with the next real target, adjust the focal length of the camera 501, collect video images of the next real target with a small field of view and large resolution, and transmit the collected video images to the data processing unit 503 in real time. 503 continues to perform secondary target detection on the collected video images through the video image target detection algorithm.
  • the secondary target detection determines that the suspected target is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 503 characterizes the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the camera 501 tracks the real target.
  • the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view. Align the field of view of the camera 501 with the second area, adjust the focal length of the camera 501, collect video images in the second area with a small field of view and large resolution, and transmit the collected video images to the data processing unit 503 in real time to perform the secondary target Target detection, the detection method is the same as the second detection method of the first area target.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • a structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, a lighting unit, and a protective housing.
  • the 2 windows are sealed with light-permeable materials.
  • One of the windows is used for video image collection by the camera, and the other window is used for the lighting unit to fill light.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the direction and size of the field of view of the camera and/or the light unit, and/or exchange information with the cloud platform or data center .
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit.
  • the light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a visible light camera 201, a transmission mechanism 202, a data processing unit 203, a communication interface unit 204, a power management unit 205, an illumination unit 206, and a protective housing 207.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell is left with windows 208 and 209, as well as the power and signal line interface board 210.
  • the windows 208 and 209 are sealed with light-permeable materials.
  • the window 208 is used for video image capture by the camera, and the window 209 Used to fill light.
  • the transmission mechanism 202 adjusts the vertical field of view direction of the camera 201 by receiving the control command sent by the data processing unit 203, and adjusts the horizontal field of view direction of the camera 201 through the rotating shaft 211 extending outside the protective housing and mounted on the fixed bracket.
  • the communication interface unit 204 mainly includes wired and wireless communication interfaces.
  • the power management unit 205 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system.
  • the illumination unit 206 is a visible light source, and mainly includes a light-emitting device and a unit for adjusting light intensity and light range.
  • the illumination unit 206 and the camera 201 are fixed together, and the transmission mechanism 202 jointly controls the left and right swing positions of the illumination unit and the camera.
  • the transmission mechanism includes a driving motor and a turntable.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and location of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 203 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201 field of view ,
  • the camera 201 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 201 is adjusted to collect video images in this area with a large field of view and small resolution.
  • the data processing unit 203 performs a video image target detection algorithm on the collected The video image undergoes a target detection.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the field of view of the camera 201 so that the direction of the field of view of the camera 201 is aligned
  • the focal length of the camera 201 zoom the height of the suspected target to 1/10 of the height of the video image, and collect the video image of the suspected target with this small field of view and large resolution.
  • the data processing unit 203 performs a secondary target detection algorithm to collect The video image is subjected to secondary target detection.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 203 When the secondary target detection determines that it is a true target, the data processing unit 203 outputs alarm information and generates the true target characteristic information. At the same time, the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202.
  • the transmission mechanism 202 adjusts the direction of the field of view of the camera 201, adjusts the focal length of the camera 201 in real time, and tracks the detected real target in real time.
  • the data processing unit 203 sends the real target feature information and alarm information to the monitoring terminal.
  • Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the camera 201 tracks the real target.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202.
  • the transmission mechanism 202 adjusts the direction of the camera 201's field of view so that the camera 201 The direction of the field of view is aligned with the next real target for tracking.
  • the secondary target detection determines that it is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 203 characterizes the false target and extracts the false target feature description Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the camera 201 tracks the real target.
  • the PTZ parameter is set for the second preset monitoring area, and the video image is collected with the large field of view and small resolution.
  • the collected video images are transmitted to the data processing unit 203 in real time, and the data processing unit 203 performs a target detection on the collected video images through a video image target detection algorithm.
  • Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
  • the data processing unit 203 performs real-time analysis on the brightness of the captured video image.
  • the brightness is insufficient, it sends a control command to the lighting unit 206 in time to adjust the light intensity and light field angle to make the video image captured by the camera 201
  • the brightness is moderate.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • a structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, a lighting unit, and a protective housing.
  • the 2 windows are sealed with light-permeable materials.
  • One of the windows is used for video image collection by the camera, and the other window is used for the lighting unit to fill light.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a visible light camera 201, a transmission mechanism 202, a data processing unit 203, a communication interface unit 204, a power management unit 205, an illumination unit 206, and a protective housing 207.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell is left with windows 208 and 209, as well as the power and signal line interface board 210.
  • the windows 208 and 209 are sealed with light-permeable materials.
  • the window 208 is used for video image capture by the camera, and the window 209 It is used to fill light in the lighting unit.
  • the transmission mechanism 202 adjusts the vertical field of view direction of the camera 201 by receiving the control command sent by the data processing unit 203, and adjusts the horizontal field of view direction of the camera 201 through the rotating shaft 211 extending outside the protective housing and mounted on the fixed bracket.
  • the communication interface unit 204 mainly includes wired and wireless communication interfaces.
  • the power management unit 205 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is an external solar panel.
  • the light unit 206 is visible light, and mainly includes a light-emitting device and a unit for adjusting light intensity and light range.
  • the illumination unit 206 and the camera 201 are fixed together, and the transmission mechanism 202 jointly controls the left and right swing positions of the illumination unit and the camera.
  • the transmission mechanism includes a driving motor and a turntable.
  • the camera is used for video image acquisition, including focus motor, zoom motor, drive module, image signal acquisition and processing unit, etc.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 203 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201 field of view ,
  • the camera 201 field of view direction is aligned with the second preset monitoring area, the focal length of the camera 201 is adjusted, and the video image is collected in this area with a large field of view and small resolution.
  • the data processing unit 203 performs a video image target detection algorithm on the collected video The image performs a target detection of the target.
  • the data processing unit 203 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; set the PTZ parameter to a large field of view and a small resolution Cover the second preset monitoring area, perform video image collection on this area, transmit the collected video image to the data processing unit 203 in real time, and perform a target detection of the target through the video image target detection algorithm.
  • each area feature weight Qi (0 ⁇ i ⁇ N) is obtained from the number of suspected targets in each area.
  • the intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 201, and by adjusting the PTZ parameters, the height of the suspected target is zoomed to 4/5 of the height of the video image, and the first area where the suspected target exists with the small field of view and large resolution Video image collection is performed, and the collected video images are transmitted to the data processing unit 203 in real time, and the data processing unit 203 performs secondary target detection of suspected targets in the area sequentially on the collected video images through a secondary target detection algorithm.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit When the secondary target detection determines that the suspected target is a true target, the data processing unit generates target feature information 1 and warning information 1 of the true target.
  • the data processing unit 203 sends control commands to the camera 201 and the transmission mechanism 202.
  • the transmission mechanism 202 adjusts the field of view direction of the camera 201, adjusts the focal length of the camera 201 in real time, and tracks the real target detected in real time;
  • the target characteristic information 1 of the detected real target is sent to the cloud platform or data center; the cloud platform or data center is based on the target characteristic information 1 sent by the intelligent visual perception device through the target recognition algorithm, and the target is identified, and the target characteristic information is obtained 2
  • alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the field of view of the camera 201, Make the field of view of the camera 201 aim at the next real target, adjust the focal length of the camera 201, collect video images of the next real target with a small field of view and large resolution, and transmit the collected video images to the data processing unit 203 in real time.
  • the video image target detection algorithm continue to perform secondary target detection on the collected video image.
  • the secondary target detection determines that the suspected target is a false target
  • the false target determined in the large-resolution image is mapped to the small-resolution image
  • the data processing unit 203 features description of the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the camera 201 tracks the real target.
  • the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201's field of view. Align the field of view of the camera 201 with the second area, adjust the focal length of the camera 201, collect video images of the second area with a small field of view and large resolution, and transmit the collected video images to the data processing unit 203 in real time to perform the secondary target Target detection, the detection method is the same as the second detection method of the first area target.
  • the data processing unit 203 analyzes the brightness of the captured video image in real time. When the brightness is insufficient, it sends a control command to the lighting unit 206 in time to adjust the light intensity so that the video image captured by the camera 201 has a moderate brightness. .
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 2 visible light cameras, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing.
  • the shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. Two of the windows are used for the camera to collect video images, and the other two windows are used for the illumination unit to fill light. 2
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a visible light camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, a power management unit 305, The light unit 306-1, the light unit 306-2, and the protective shell constitute 307.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1.
  • the windows 309-2 are all sealed with light-permeable materials.
  • the windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
  • the transmission mechanism 302-1 adjusts the position of the camera 301-1 left and right and the vertical field of view by receiving the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the camera 301-2 by receiving the control commands sent by the data processing unit 303
  • the communication interface unit 304 mainly includes wired and wireless communication interfaces.
  • the power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, which is a wired power supply in this embodiment.
  • the illumination units 306-1 and 306-2 mainly include a light emitting device and a light intensity adjustment unit, and the light sources are visible light and/or infrared light, respectively.
  • the light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 303 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 sends a control command to the camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302-1 Adjust the field of view direction of the camera 301-1 so that the field of view direction of the camera 301-1 is aligned with the second preset monitoring area.
  • the video image is transmitted to the data processing unit 303 in real time to perform a target detection of the target.
  • the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302-2 adjusts the field of view direction of the camera 301-2.
  • the direction of the field of view of the camera 301-2 is aligned with the suspected target, and the focal length of the camera 301-2 is adjusted.
  • the height of the suspected target is scaled to 1/6 of the height of the video image, and the video image of the suspected target is collected with this small field of view and large resolution.
  • the data processing unit 303 performs a secondary target detection algorithm on the collected video image. Perform the secondary target detection of the target.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information 1 and generates the true target characteristic information 1. At the same time, the data processing unit 303 sends control commands to the cameras 301-2 and 301-2.
  • the transmission mechanism 302-2 and the transmission mechanism 302-2 adjust the field of view direction of the camera 301-2, adjust the focal length of the camera 301-2 in real time, and track the real target detected in real time; at the same time, the data processing unit 303 obtains the characteristic information of the real target 1 Send to the cloud platform or data center, and the cloud platform or data center sends the true target characteristic information according to the intelligent visual perception device.
  • the target recognition After the target recognition algorithm, the target recognition is performed, and the true target characteristic information 2 and alarm information 2 are generated.
  • the true target characteristic information 2 and the alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302 -2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aimed at the next real target, and adjust the focal length of the camera 301-2 to track the next real target with a small field of view and large resolution.
  • the secondary target detection determines that it is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 303 performs a feature description on the false target and extracts the feature description of the false target Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the camera 301-2 tracks the real target.
  • all real targets disappear or the tracking time reaches the set value set the PTZ parameters for the second preset monitoring area, and perform video images with the large field of view and small resolution Acquisition, real-time transmission of the collected video images to the data processing unit 303 to perform a target detection, the detection method is the same as the first preset monitoring area target detection method.
  • the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and the lighting unit in time. 306-2. Adjust the light intensity and light field angle of the light unit so that the brightness of the video images collected by the camera 301-1 and the camera 301-2 is moderate.
  • an eighth implementation manner of an intelligent visual perception system is also provided:
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 2 visible light cameras, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing.
  • the shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image acquisition by the camera, and the other 2 windows are used for light supplementation.
  • 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a visible light camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, a power management unit 305, The light unit 306-1, the light unit 306-2 and the protective housing constitute 307.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the window is made of light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit;
  • the fixing seat is used to fix the protective shell, or fix it on the outside On the stand.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the windows are made of light-transmitting materials, which respectively transmit the video images collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell, including the On the external bracket.
  • the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1.
  • the windows 309-2 are all sealed with light-permeable materials.
  • the windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
  • the transmission mechanism 302-1 adjusts the left and right positions of the camera 301-1 and the vertical field of view by receiving the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the left and right positions of the camera 301-2 by receiving the control commands sent by the data processing unit 303.
  • the communication interface unit 304 mainly includes wired and wireless communication interfaces.
  • the power management unit 305 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system.
  • the lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, and the light source is visible light and/or infrared light.
  • the light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 sends a control command to the camera 301-1 and the transmission mechanism 302, and the transmission mechanism 302-1 adjusts the camera
  • the direction of the field of view 301-1 makes the direction of the field of view of the camera 301-1 align with the second preset monitoring area, and adjust the focal length of the camera 301-1 to collect video images of the second preset monitoring area with a large field of view and a small resolution
  • the data processing unit 303 performs one-time target detection of the target in the second preset monitoring area, and the detection method is the same as the first-time detection method of the target in the first preset monitoring area.
  • the data processing unit 303 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; at the same time, the PTZ parameters are set so that the field of view is large and small.
  • the resolution covers the second preset monitoring area, so that the camera 301-1 collects video images of the monitoring area, and transmits the collected video images to the data processing unit 303 in real time.
  • each area feature weight Qi (0 ⁇ i ⁇ N) is obtained from the number of suspected targets in each area.
  • the intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 301-2, zooms the height of the suspected target to 2/3 of the height of the video image, and adjusts it to the center of the field of view, and uses the small field of view and large resolution for the suspected target
  • the first area collects video images, and transmits the collected video images to the data processing unit 303 in real time.
  • the data processing unit 303 performs secondary target detection of suspected targets on the collected video images through a secondary target detection algorithm.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 When the secondary target detection determines that the suspected target is a real target, the data processing unit 303 generates target feature information and alarm information of the real target, and at the same time, the data processing unit 303 sends control commands to the camera 301 and the transmission mechanism 302- 2.
  • the transmission mechanism 302-2 adjusts the field of view direction of the camera 301-2, adjusts the focal length of the camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 sends the target feature information and alarm information to the monitoring terminal.
  • Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the camera 301-2 tracks the real target.
  • the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302- 2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aimed at the next real target for tracking.
  • the secondary target detection determines that the suspected target is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the camera 301-2 tracks the true target.
  • the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302- 2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aligned with the second area, adjust the focal length of the camera 301-2, and collect the video image of the second area with a small field of view and large resolution, and the captured video
  • the image is transmitted to the data processing unit 303 in real time, and the data processing unit 303 performs secondary target detection of the second area target on the collected video image through the video image target detection algorithm.
  • the detection method is the same as the first area target second detection method.
  • step (2) By analogy, follow steps (7) to (11) to perform secondary target detection and/or recognition for W areas, and finally, start from step (2) again, starting from the first preset monitoring area to detect and / Or target recognition, automatic loop.
  • the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and the lighting unit in time. 306-2. Adjust the light intensity of the light unit so that the brightness of the video images collected by the camera 301-1 and the camera 301-2 is moderate.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 1 visible light camera, 1 near-infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing.
  • the shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. 2 windows are used for video image acquisition by 2 cameras, and the other 2 windows are used for light supplementation by the lighting unit. 2
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces, the input interface is used to receive external device signals, and its connection methods include wireless and/or wired methods; among them, wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA ; Wired methods include one or more of RS485, RS422, RS232, CAN bus; the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more of, 3G, 4G, 5G, NB-IOT; wired mode includes one or more of LAN and optical fiber.
  • wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA
  • Wired methods include one or more of RS485, RS422, RS232, CAN bus
  • the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center, and/or Interaction information with other sensors on site and/or other related systems.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit.
  • the light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a near-infrared camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, and a power management unit 305 ,
  • the illumination unit 306-1, the illumination unit 306-2, and the protective shell constitute 307.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1.
  • the windows 309-2 are all sealed with light-permeable materials.
  • the windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
  • the transmission mechanism 302-1 adjusts the left-right and vertical field of view position of the camera 301-1 through the control commands sent by the data processing unit 303
  • the transmission mechanism 302-2 adjusts the left-right and vertical vision of the camera 301-2 through the control commands sent by the data processing unit 303.
  • the communication interface unit 304 mainly includes wired and wireless communication interfaces.
  • the power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, and in this embodiment is an external solar panel.
  • the lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, both of which are infrared light.
  • the light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 ,
  • the transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the near-infrared camera 301-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 303 in real time.
  • the data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 sends control commands to the infrared camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302- 1 Adjust the direction of the infrared camera 301-1's field of view so that the direction of the infrared camera 301-1's field of view is aligned with the second preset monitoring area, adjust the focal length of the infrared camera 301-1, and collect video images in this area with a large field of view and a small resolution
  • the data processing unit 303 performs a target detection of the target on the collected video image through the video image target detection algorithm.
  • the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302-2 adjusts the field of view of the visible light camera 301-2 Direction so that the visual field of the visible light camera 301-2 is aimed at the suspected target, and the focal length of the visible light camera 301-2 is adjusted.
  • the height of the suspected target is scaled to 1/3 of the height of the video image, and the video image of the suspected target is collected with the small field of view and large resolution.
  • the data processing unit 303 performs the secondary target detection algorithm to collect the video image Perform secondary target detection.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information to the monitoring terminal and generates the true target characteristic information. At the same time, the data processing unit 303 sends a control command to the visible light camera 301- 2 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the visual field direction of the visible light camera 301-2, adjusts the focal length of the visible light camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 will Target feature information and alarm information are sent to the monitoring terminal.
  • Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, or characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of a car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the visible light camera 301-2 tracks the real target.
  • the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the direction of the field of view of the visible light camera 301-2, so that the direction of the field of view of the visible light camera 301-2 is aligned with the next real target for tracking.
  • the secondary target detection determines that the suspected target is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, update the false target feedback feature information database, in order to reduce the probability of a target detection error detection algorithm after this.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the visible light camera 301-2 tracks the real target.
  • the data processing unit 303 sends control commands to the infrared camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302-1 adjusts the direction of the infrared camera 301-1 field of view ,
  • the direction of the infrared camera 301-1 field of view is aligned with the second preset monitoring area, the focal length of the infrared camera 301-1 is adjusted, the video image is collected in this area, and the collected video image is transmitted to the data processing unit 303 in real time, and the data is processed
  • the unit 303 performs a target detection of the target on the collected video image through the video image target detection algorithm, and the detection method is the same as the target detection method of the first preset monitoring area.
  • the data processing unit 303 analyzes the brightness of the video images collected by the infrared camera 301-1 and the visible light camera 301-2 in real time, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and 306-1 in time.
  • the light unit 306-2 adjusts the light intensity and light range of the light unit so that the brightness of the video images collected by the infrared camera 301-1 and the visible light camera 301-2 is moderate.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 1 visible light camera, 1 near-infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing.
  • the shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. 2 windows are used for video image acquisition by 2 cameras, and the other 2 windows are used for light supplementation by the lighting unit. 2
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center, and/or Interaction information with other sensors on site and/or other related systems.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit.
  • the light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes a near-infrared camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, and a power management unit 305 ,
  • the illumination unit 306-1, the illumination unit 306-2 and the protective shell constitute 307.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1.
  • the windows 309-2 are all sealed with light-permeable materials.
  • the windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
  • the transmission mechanism 302-1 adjusts the left-right and vertical field of view position of the camera 301-1 through the control commands sent by the data processing unit 303
  • the transmission mechanism 302-2 adjusts the left-right and vertical vision of the camera 301-2 through the control commands sent by the data processing unit 303.
  • the communication interface unit 304 mainly includes wired and wireless communication interfaces.
  • the power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, which is a wired power supply in this embodiment.
  • the lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, and the light source is visible light and/or infrared light.
  • the light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 ,
  • the transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the near-infrared camera 301-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 303 in real time.
  • the data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 sends a control command to the infrared camera 301-1 and the transmission mechanism 302, and the transmission mechanism 302-1 is adjusted
  • the direction of the field of view of the near-infrared camera 301-1 is such that the direction of the field of view of the near-infrared camera 301-1 is aligned with the second preset monitoring area, and the focal length of the near-infrared camera 301-1 is adjusted to monitor the second preset with a large field of view and a small resolution
  • the video image is collected in the area, and the collected video images are transmitted to the data processing unit 303 in real time.
  • the data processing unit 303 performs a target detection of the target in the second preset monitoring area.
  • the detection method is the same as the first detection method of the target in the first preset monitoring area. .
  • the data processing unit 303 gives the first preset monitoring area feature weight Q1, which is weighted by the number of suspected targets in the area and the moving speed of the moving target; set the PTZ parameter , Cover the second preset monitoring area with a large field of view and small resolution, the near-infrared camera 301-1 collects video images of this area, and transmits the collected video images to the data processing unit 303 in real time.
  • each area feature weight Qi (0 ⁇ i ⁇ N) is respectively weighted by the number of suspected targets in each area and the moving speed of the moving target.
  • the intelligent visual perception device adjusts the direction and focal length of the field of view of the near-infrared camera 301-1.
  • the height of the suspected target is zoomed to 1/2 of the height of the field of view, and adjusted to the center of the field of view, so that the small field of view can be used for large resolution Rate the video image collection of the first area where there are suspected targets, and transmit the collected video images to the data processing unit 303 in real time.
  • the data processing unit 303 performs secondary target detection algorithms on the collected video images sequentially. Target Detection.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 When the secondary target detection determines that the suspected target is a real target, the data processing unit 303 generates target feature information 1 and alarm information 1 of the real target, and at the same time, the data processing unit 303 sends control commands to the camera 301 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the visual field direction of the visible light camera 301-2, adjusts the focal length of the visible light camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 detects the real target twice The target characteristic information 1 of the target is sent to the cloud platform or data center; the cloud platform or data center uses the target recognition algorithm to identify the target according to the target characteristic information 1 sent by the intelligent visual perception device, and obtains the target characteristic information 2 and alarm information 2 Send to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the visible light camera 301-2 tracks the true target.
  • the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302 -2 Adjust the direction of the field of view of the visible light camera 301-2 so that the direction of the field of view of the visible light camera 301-2 is aimed at the next real target for tracking.
  • the secondary target detection determines that the suspected target is a false target
  • the false target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the visible light camera 301-2 tracks the real target.
  • the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, the transmission mechanism 302-2 Adjust the field of view direction of the visible light camera 301-2 so that the field of view direction of the visible light camera 301-2 is aligned with the second area, adjust the focal length of the visible light camera 301-2, and collect video images of the second area with a small field of view and large resolution ,
  • the collected video images are transmitted to the data processing unit 303 in real time.
  • the data processing unit 303 performs the second target detection of the second area target on the collected video image through the video image target detection algorithm.
  • the detection method is the same as the second target detection method of the first area target.
  • the detection method is the same.
  • step (1) By analogy, follow the steps (7) to (11) to perform secondary target detection and recognition for W areas, and finally, start from step (1) again, start target recognition from the first preset monitoring area, and automatically cycle.
  • the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and the visible light camera 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and lighting in time.
  • the unit 306-2 adjusts the light intensity of the light unit so that the brightness of the video image collected by the camera 301 is moderate.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 1 visible light camera, 1 infrared thermal imaging camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 1 lighting unit and 1 protective housing.
  • the shell has 3 windows and power and signal line interfaces. The 3 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image collection by 2 cameras, and the other window is used for light supplementation.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system.
  • Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes an infrared thermal imaging camera 401-1, a visible light camera 401-2, a transmission mechanism 402-1, a transmission mechanism 402-2, a data processing unit 403, a communication interface unit 404, and a power management unit 405. Illumination unit 406, and protective housing 407.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell is left with windows 408-1, 408-2, and 409, and a power and signal line interface board 410.
  • the windows 408-1, 408-2, and 409 are all made of light-permeable materials.
  • the window 408-1 is used for the video image collection of the infrared thermal imaging camera 401-1
  • the window 408-2 is used for the video image collection of the visible light camera 401-2
  • the window 409 is used for the light unit 406 to transmit light to the visible light camera 401 -2
  • the monitoring target fills in light.
  • the transmission mechanism 402-1 adjusts the position of the infrared thermal imaging camera 401-1 in the left-right and vertical field of view by receiving the control commands sent by the data processing unit 403; the transmission mechanism 402-2 adjusts the camera 401 by receiving the control commands sent by the data processing unit 403 -2
  • the communication interface unit 404 mainly includes wired and wireless communication interfaces.
  • the power management unit 405 is mainly used to supply power to the entire intelligent video perception device, which is a wired power supply in this embodiment.
  • the light unit 406 mainly includes a light emitting device and a light intensity adjustment unit, and the light source is a laser.
  • the light unit 406 and the camera 401-2 are fixed together, the transmission mechanism 402-1 controls the left and right swinging positions of the infrared thermal imaging camera 401-1, and the transmission mechanism 402-2 jointly controls the light unit 406 and the camera 401-2 to swing left and right. Location.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the infrared thermal imaging camera 401-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 403 in real time.
  • the data processing unit 403 performs a target detection of the target on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 403 sends a control command to the infrared thermal imaging camera 401-1 and the transmission mechanism 402-1, the transmission mechanism 402-1 adjusts the field of view direction of the infrared thermal imaging camera 401-1 so that the field of view of the infrared thermal imaging camera 401-1 is aligned with the second preset monitoring area, and adjusts the focal length of the infrared thermal imaging camera 401-1 to achieve a large field of view and a small resolution Video image collection is performed on the second preset monitoring area at a high rate, and the data processing unit 403 performs a target detection of the target on the collected video image through the video image target detection algorithm.
  • the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402-2 adjusts the field of view of the visible light camera 401-2 Direction so that the visual field of the visible light camera 401-2 is aimed at the suspected target, and the focal length of the visible light camera 401-2 is adjusted.
  • the height of the suspected target is scaled to 1/3 of the height of the video image, and adjusted to the center of the field of view.
  • the video image of the suspected target is collected with this small field of view and large resolution.
  • the data processing unit 403 undergoes a secondary target detection algorithm. , Perform secondary target detection on the collected video images.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 303 When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information 1 to the monitoring terminal, and generates the true target characteristic information 1.
  • the data processing unit 403 sends control commands to the visible light camera 401-2 and the transmission mechanism 402-2.
  • the transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view, adjusts the focal length of the visible light camera 401-2 in real time, and responds to the detected real target Perform real-time tracking; at the same time, the data processing unit 403 sends the true target characteristic information 1 to the cloud platform or data center, and the cloud platform or data center performs target recognition algorithm according to the true target characteristic information 1 sent by the intelligent visual perception device. Identify, generate the true target feature information 2 and alarm information 2, and send the true target feature information 2 and alarm information 2 to the monitoring terminal; the monitoring terminal responds to the alarm information 1, alarm information 2, target feature information 1 and/or target feature Information 2 is processed.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the visible light camera 401-2 tracks the real target.
  • the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, the transmission mechanism 402-2 adjusts the field of view direction of the visible light camera 401-2 so that the field of view direction of the visible light camera 401-2 is aimed at the next real target for tracking.
  • the fake target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 403 characterizes the fake target and extracts the fake target Feature description information, update the false target feedback feature information database, in order to reduce the probability of a target detection error detection algorithm after this.
  • the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the visible light camera 401-2 tracks the real target.
  • the data processing unit 403 sends control commands to the infrared thermal imaging camera 401-1 and the transmission mechanism 402-1, and the transmission mechanism 402-1 adjusts the infrared thermal imaging camera 401- 1 Field of view direction, so that the field of view of the infrared thermal imaging camera 401-1 is aligned with the second preset monitoring area, adjust the focal length of the infrared thermal imaging camera 401-1, and video the second preset monitoring area with a large field of view and small resolution Image acquisition, real-time transmission of the collected video images to the data processing unit 403.
  • the data processing unit 403 performs a target detection on the collected video images through the video image target detection algorithm.
  • the detection method is the same as the first preset monitoring area target detection
  • the method is the
  • Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
  • the data processing unit 403 performs real-time analysis on the brightness of the video image collected by the visible light camera 401-2, and when the brightness is insufficient, it sends a control command to the lighting unit 406 in time, and the lighting unit 406 adjusts the light intensity and light. Range, so that the visible light camera 401-2 captures video images with moderate brightness.
  • An intelligent visual perception system includes one or more intelligent visual perception devices.
  • the intelligent visual perception device is composed of 1 visible light camera, 1 infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 1 lighting unit and 1 protective housing.
  • the shell has 3 windows and power and signal line interfaces. The 3 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image collection by 2 cameras, and the other window is used for light supplementation.
  • the transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line.
  • the driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
  • the communication interface unit mainly includes wired and wireless communication interfaces, the input interface is used to receive external device signals, and its connection methods include wireless and/or wired methods; among them, wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA ; Wired methods include one or more of RS485, RS422, RS232, CAN bus; the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more of, 3G, 4G, 5G, NB-IOT; wired mode includes one or more of LAN and optical fiber.
  • wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA
  • Wired methods include one or more of RS485, RS422, RS232, CAN bus
  • the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more
  • the data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center.
  • the power management unit is mainly used to supply power to the entire intelligent video sensing device.
  • the light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
  • the intelligent visual perception device includes an infrared camera 401-1, a visible light camera 401-2, a transmission mechanism 402-1, a transmission mechanism 402-2, a data processing unit 403, a communication interface unit 404, a power management unit 405, Illumination unit 406, protective housing 407.
  • the protective shell includes an interface board, a window, and a fixing seat.
  • the shell is left with windows 408-1, 408-2, and 409, and a power and signal line interface board 410.
  • the windows 408-1, 408-2, and 409 are all made of light-permeable materials.
  • the window 408-1 is used for the video image collection of the infrared camera 401-1
  • the window 408-2 is used for the video image collection of the visible light camera 401-2
  • the window 409 is used for the light unit 406 to transmit light to the visible light camera 401-2
  • the monitoring target fills in light.
  • the transmission mechanism 402-1 adjusts the left and right positions of the infrared camera 401-1 and the vertical field of view through the control commands sent by the data processing unit 403; the transmission mechanism 402-2 adjusts the left and right positions of the camera 401-2 through the control commands sent by the data processing unit 403 And the position of the up and down field of view.
  • the communication interface unit 404 mainly includes wired and wireless communication interfaces.
  • the power management unit 405 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system.
  • the light unit 406 mainly includes a light emitting device and a unit for adjusting light intensity and light range, and the light source is infrared light.
  • the light unit 406 and the camera 401-2 are fixed together, the transmission mechanism 402-1 controls the left and right swing positions of the infrared camera 401-1, and the transmission mechanism 402-2 jointly controls the left and right swing positions of the light unit 406 and the camera 401-2.
  • the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
  • the system workflow is as follows:
  • the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area.
  • Establish a false target feedback feature information database in a specific monitoring area including false target location information and false target feature description information.
  • the infrared thermal imaging camera 401-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 403 in real time.
  • the data processing unit 403 performs a target detection on the collected video image through a target detection algorithm.
  • a target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target.
  • the primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 403 sends a control command to the infrared camera 401-1 and the transmission mechanism 402, and the transmission mechanism 402-1 is adjusted
  • the field of view direction of the infrared camera 401-1 is such that the field of view direction of the infrared camera 401-1 is aligned with the second preset monitoring area, and the focal length of the infrared camera 401-1 is adjusted to collect video images of this area with a large field of view and a small resolution.
  • the data processing unit 403 performs a target detection of the target on the collected video image through the video image target detection algorithm. .
  • the data processing unit 403 gives the first preset monitoring area feature weight Q1, which is weighted by the number of suspected targets in the area and the moving speed of the moving target; set the PTZ parameter , Cover the second preset monitoring area with a large field of view and small resolution, so that the infrared camera 401-1 collects video images in this area, and transmits the collected video images to the data processing unit 403 in real time.
  • the video image target detection algorithm One target detection.
  • each area feature weight Qi (0 ⁇ i ⁇ N) is respectively weighted by the number of suspected targets in each area and the moving speed of the moving target.
  • the intelligent visual perception device adjusts the field of view direction and focal length of the visible light camera 401-2.
  • the height of the suspected target is zoomed to 1/6 of the field of view height, and adjusted to the center of the field of view, so that the small field of view has a large resolution
  • Video image acquisition is performed on the first area where the suspected target exists, and the data processing unit 403 passes the collected video image through a secondary target detection algorithm to sequentially perform secondary target detection of the suspected target.
  • the secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
  • the data processing unit 403 When the secondary target detection determines that the suspected target is a true target, the data processing unit 403 generates target feature information 1 and warning information 1 of the true target. The data processing unit 403 sends control commands to the camera 401 and the transmission mechanism 402-2. The transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view, adjusts the focal length of the visible light camera 401-2 in real time, and tracks the real target detected in real time.
  • the data processing unit 403 sends the target feature information 1 of the real target detected twice to the cloud platform or data center; the cloud platform or data center uses the target feature information 1 sent by the intelligent visual perception device to perform the target recognition algorithm Identify and send the obtained target characteristic information 2 and alarm information 2 to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
  • Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains.
  • the target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects.
  • the specific information types and contents are defined according to the specific application environment.
  • the visible light camera 401-2 tracks the real target.
  • the data processing unit 403 sends a control command to the camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402 -2 Adjust the direction of the field of view of the visible light camera 401-2 so that the direction of the field of view of the visible light camera 401-2 is aimed at the next real target for tracking.
  • the secondary target detection determines that the suspected target is a fake target
  • the fake target determined under the large-resolution image is mapped to the small-resolution image
  • the data processing unit 403 features the fake target and extracts the fake target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
  • the comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
  • the visible light camera 401-2 tracks the real target.
  • the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, the transmission mechanism 402-2 Adjust the field of view direction of the visible light camera 401-2 so that the field of view direction of the visible light camera 401-2 is aligned with the second area, adjust the focal length of the visible light camera 401-2, and collect video images of the second area with a small field of view and large resolution
  • the data processing unit 403 performs the secondary target detection of the second area target through the video image target detection algorithm on the collected video image, and the detection method is the same as the first area target second detection method.
  • the data processing unit 403 performs real-time analysis on the brightness of the video images collected by the camera 401-1 and the camera 401-2, and when the brightness is insufficient, it sends control commands to the light unit 406-1 and the light unit in time. 406-2. Adjust the light intensity and light range of the light unit, so that the video images captured by the camera 401-1 and the camera 401-2 have moderate brightness.
  • the intelligent visual perception system provided by the present invention is also provided.
  • infrared camera 401-1 is used.
  • the scope of the current monitoring area it can be calculated to cover The maximum magnification of the entire area, by setting the PTZ parameters, the monitoring area is video image collected with the large field of view and small resolution, and the collected video image is shown in FIG. 9.
  • the data processing unit 403 is used to perform a target detection on the collected video image through a video image target detection algorithm, and a suspicious target is found.
  • the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view so that the direction of the visible light camera 401-2’s field of view is aligned with the suspected target, and adjust Visible light camera 401-2 focal length.
  • the height of the suspected target is scaled to 1/3 of the height of the field of view, and adjusted to the center of the field of view.
  • the video image of the suspected target is collected with this small field of view and large resolution. The collected video image is shown in Figure 10. .
  • the data processing unit 403 performs secondary target detection on the collected video image through a video image target detection algorithm, and judges it as a true target.
  • the data processing unit sends the alarm information 1, the alarm information 2, the target characteristic information 1, and/or the target characteristic information 2 to the monitoring terminal.
  • the visible light camera 401-2 tracks the real target, as shown in Fig. 11. Warning information includes suspicious persons breaking into the specific monitoring area of railway operation, detention time, suspicious behavior, real dangers, etc.
  • the target characteristic information includes people and their physical characteristics, and even the identity information of the person after facial feature comparison .
  • the intelligent visual perception system of the present invention realizes the detection of the surrounding driving environment along the railway.
  • intelligent capture is carried out in time , Enlargement, feature description and warning to facilitate follow-up measures to ensure driving safety.
  • the present invention solves the current non-intelligent monitoring of the surrounding environment of the railroad tracks.
  • the monitor needs to keep a close eye on the screen, not dare to slack off, and artificially judge whether there are operational obstacles in real time; and for the actual danger that may exist in the distant place, it may be due to the image If it is not clear, it is ignored, or because the image cannot be zoomed in time, misjudgment is generated, which affects the emergency response time, delays the time of accident rescue, and may cause hundreds of millions of property losses and a real danger of numerous casualties. It can be seen that the present invention has technological advancement and extremely strong social application value.
  • an implementation of an intelligent visual perception system is also provided.
  • the intelligent visual perception device receives information that can be covered by the intelligent visual perception device through the input interface of the communication interface. Signals of other sensing or motion devices in the detection area, when other sensing or motion devices sense abnormal conditions, send alarm information to the intelligent visual perception device, and the intelligent perception device receives the alarm information sent by the sensing or motion device, Prioritize the adjustment of the camera's field of view direction and focal length, and perform target detection in the area where the sensing or action device is located.
  • the sensing device is a vibration sensor buried in the ground. Under normal circumstances, no vehicles will pass by the area where the vibration sensor is located. The intelligent visual perception device does not monitor the area. When a vehicle passes by, the vibration sensor will The visual perception device sends a vibration signal. After the intelligent visual perception device receives the vibration signal, it can adjust the direction and focal length of the field of view, and perform target recognition on the area where the vibration sensor is located.
  • the intelligent visual perception device provides its perception data and/or information to other sensing or action devices in the monitoring area through an output interface; further, Perform data fusion and/or joint judgment with other sensing or action devices in the monitoring area, and/or directly control or system linkage to other sensing or action devices in the monitoring area.
  • sensing or action devices in the monitoring area include alarm sound and light equipment, access control equipment, firefighting equipment, obstacle removal equipment, animal repelling equipment, trailer equipment, cleaning equipment, patrol equipment, impulse mitigation equipment, emergency stop equipment, and shunt equipment
  • alarm sound and light equipment access control equipment, firefighting equipment, obstacle removal equipment, animal repelling equipment, trailer equipment, cleaning equipment, patrol equipment, impulse mitigation equipment, emergency stop equipment, and shunt equipment
  • Example 13 the intelligent visual perception system of the present invention found that the true target was a suspicious person who appeared on the railway for unknown reasons, and immediately sent perception data and information to the sensing or action device in the monitoring area, sensing or Action devices such as patrol vehicles, on-site communication equipment, etc., are connected in parallel to broadcast warning information and train arrival information. For example, through feature recognition, the system finds that the true target is a flock, etc., it can link the animal driving equipment to keep the flock away from the train track and avoid major accidents.
  • the personnel handling process is initiated, and the relevant responsible personnel are sent to the real target area to direct and control suspicious personnel and remove obstacles on the spot to ensure the normal operation of the railway. Personnel safety.
  • the camera inside the intelligent visual perception device has a small rotatable field of view.
  • the housing of the intelligent visual perception device is fixed on the turntable, and the data processing unit of the intelligent visual perception device passes through the control line.
  • the intelligent visual perception device Connected with the transmission mechanism in the turntable, the intelligent visual perception device controls the turntable to rotate at a large angle, and then realizes the monitoring of other areas.
  • multiple cameras can be used to alternate the primary target detection of the target and the secondary target detection of the target to improve the recognition efficiency.
  • the video image data can be uploaded to the server in the edge cloud in real time through 5G technology, as shown in Figure 4, the target is identified twice and/or three times in the service.
  • the intelligent visual perception system of the present invention includes a plurality of intelligent visual perception devices, which respectively cover different monitoring ranges to form a monitoring network, thereby expanding the monitoring field and truly realizing the overall monitoring of the protection area.
  • Cloud platform and/or data center including server and software for image recognition through image target detection algorithm; monitoring terminal, one or more, used to display target recognition result information, receive alarm information, perform remote configuration and control, Including smart terminal equipment and its running management software.
  • the smart terminal equipment includes but is not limited to one or more of a computer and a handheld.
  • the server may be a virtual server, including one or more of a local server, an edge cloud, and a public cloud.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • Closed-Circuit Television Systems (AREA)
  • Alarm Systems (AREA)
  • Image Analysis (AREA)

Abstract

The present invention provides an intelligent visual perception system, comprising a variable focal length camera and a false target feedback feature information database in a designated monitoring area. The camera acquires video images of the monitoring area at a first resolution, and primary target detection in the area is performed by using a primary target detection algorithm, the primary target detection comprising: determining, on the basis of the false target feedback feature information database, whether a suspected target is contained in the designated monitoring area; when it is found that a suspected target is contained in the monitoring area, the camera is adjusted to acquire video images of the suspected target at a second resolution, and secondary target detection is performed using a secondary target detection algorithm to determine whether the suspected target is a true target or a false target; and if the suspected target is a false target, the false target feedback feature information database is updated according to information of the false target, or if the suspected target is a true target, the true target is tracked and monitored. The present intelligent visual perception system improves the precision and accuracy in capturing environmental anomalies.

Description

一种智能视觉感知系统An intelligent visual perception system 技术领域Technical field
本发明涉及安防领域,特别涉及一种智能视觉感知系统。The invention relates to the field of security and protection, in particular to an intelligent visual perception system.
背景技术Background technique
常见的可运动摄像机包括云台摄像机、PTZ摄像机等,它们通常具有水平转动、竖直转动和焦距变化三个控制参数。通过调整这些参数,可运动摄像机不但可以改变焦距,从而获得拍摄场景中目标或区域的不同分辨率信息,并且可以改变角度,从而获得拍摄场景中目标或区域的不同视场方向信息。但是,这些摄像机,拍摄不同区域场景时,都是通过云台来实现摄像机的视场方向控制,而且视场方向位都是预先设置好的,这种方式仅仅实现了一种自动化监测。Common movable cameras include pan-tilt cameras, PTZ cameras, etc., which usually have three control parameters of horizontal rotation, vertical rotation and focal length change. By adjusting these parameters, the motion camera can not only change the focal length to obtain different resolution information of the target or area in the shooting scene, but also change the angle to obtain different field of view direction information of the target or area in the shooting scene. However, these cameras, when shooting scenes in different areas, use the pan-tilt to control the direction of the camera's field of view, and the field of view direction is preset. This method only realizes an automatic monitoring.
多目摄像机是较为常用的摄像设备,多目摄像机的机体上搭载有多个镜头,多个镜头沿着机体的圆周方向分布。目前的多目摄像机的多个镜头通常固定在基座上,进而进行各个角度方向的拍摄。多个镜头所拍摄的图像能合成多目摄像机的全景图像。但是目前这些多目摄像机,主要功能还是实现全景方位视频拍摄,其目标的可发现距离与目标的可辨识距离都较小。如果将这些多目摄像机镜头焦距都加长,则这些多目摄像机体积都比较大,不适合实际场景应用。A multi-camera is a commonly used imaging device. The body of the multi-camera is equipped with multiple lenses, and the multiple lenses are distributed along the circumferential direction of the body. The multiple lenses of the current multi-eye camera are usually fixed on the base to perform shooting in various angles and directions. The images taken by multiple lenses can be combined into a panoramic image of a multi-lens camera. However, at present, the main function of these multi-eye cameras is to achieve panoramic azimuth video shooting, and the discoverable distance of the target and the recognizable distance of the target are both small. If the focal lengths of the lenses of these multi-cameras are all lengthened, these multi-cameras are relatively large in size and not suitable for actual scene applications.
现有的视频监控系统多为被动式的监控。这样的系统基本是“监而不控”,出现问题无法做到快速响应;也不能做到有选择性和智能地采集视频数据,系统消耗资源很大,但效率很低。Most of the existing video surveillance systems are passive surveillance. Such a system is basically "monitoring but not controlling", and it cannot respond quickly when problems occur; it cannot collect video data selectively and intelligently. The system consumes a lot of resources, but its efficiency is very low.
在监测较大的区域时,广角高清全景摄像机虽然能够看到整个场景宽度,但摄像机所能看清的距离非常有限。在视频中出现需要监控目标时,只能从全景视场方向去寻找和发现目标的细节,这种情况下一般只能得到较为粗略的信息。系统不能自动地对这些目标进行主动的捕捉、追踪、放大和采集,无法得到目标的细节信息。When monitoring a large area, although the wide-angle high-definition panoramic camera can see the entire width of the scene, the distance that the camera can see is very limited. When there is a target in the video that needs to be monitored, the details of the target can only be found and found from the direction of the panoramic field of view. In this case, generally only rough information can be obtained. The system cannot automatically capture, track, amplify and collect these targets automatically, and cannot obtain detailed information about the targets.
虽然也有少数智能视频系统可以对目标进行智能识别和跟踪,但它们只能在单个区域上进行,无论是监测区域范围、效率,还是准确度都不理想。同时由于缺少对疑似目标的准确判断,产生了系统额外动作,同时对真假目标没有二次验证,产生很多虚假的报警信息,导致监控人员怠于处理系统报警事件,当危险真正发生时,延误处理时间,导致此类安防系统没有发挥其真正的作用,产生了难以弥补的损失。Although there are a few intelligent video systems that can intelligently identify and track targets, they can only be performed on a single area, and the scope, efficiency, and accuracy of the monitoring area are not ideal. At the same time, due to the lack of accurate judgment of the suspected target, additional actions of the system are generated. At the same time, there is no secondary verification of the true and false targets, resulting in a lot of false alarm information. The processing time has caused this type of security system to fail to play its true role, resulting in irreparable losses.
发明内容Summary of the invention
为了克服现有技术的缺陷,本发明提供了一种智能视觉感知系统,能够实时发现目标、跟踪目标、识别目标并及时告警,提高了系统捕捉环境异常的精度和准确性,弥补 了现有技术的缺憾。In order to overcome the shortcomings of the prior art, the present invention provides an intelligent visual perception system that can find targets in real time, track targets, recognize targets, and alert in time, improve the accuracy and accuracy of the system in capturing environmental abnormalities, and make up for the prior art The shortcomings.
本发明提供的一种智能视觉感知系统,其中,系统包括可变焦距的摄像机和指定监测区域的假目标反馈特征信息库;The present invention provides an intelligent visual perception system, wherein the system includes a variable focal length camera and a false target feedback feature information database in a designated monitoring area;
摄像机以第一分辨率对监测区域进行视频图像采集,对采集的视频图像,采用一次目标检测算法,进行区域内一次目标检测,包括:基于假目标反馈特征信息库判断指定监测区域是否包含疑似目标;The camera collects video images of the monitored area at the first resolution, and uses a target detection algorithm for the collected video images to perform a target detection in the area, including: judging whether the designated monitoring area contains suspected targets based on the false target feedback feature information database ;
当发现监测区域包含疑似目标时,调整摄像机,以第二分辨率对疑似目标进行视频图像采集,并基于采集的视频图像,采用二次目标检测算法,进行二次目标检测,以确定疑似目标为真目标或假目标;When it is found that the monitoring area contains a suspected target, adjust the camera to collect a video image of the suspected target at the second resolution, and based on the collected video image, use a secondary target detection algorithm to perform secondary target detection to determine the suspected target as True target or false target;
当疑似目标为假目标时,根据假目标的信息更新假目标反馈特征信息库;When the suspected target is a false target, update the false target feedback feature information database according to the information of the false target;
当疑似目标为真目标时,对真目标进行跟踪监测;When the suspected target is a true target, follow up and monitor the true target;
第二分辨率大于第一分辨率。The second resolution is greater than the first resolution.
本发明分两次检测出区域中的有价值目标,可以兼顾监控区域的较大场景和目标细节之间的矛盾,通过一次目标检测可以在较大的监控区域中检测出疑似的目标,大分辨率图像上的检测结果往往有较高的准确度,通过二次目标检测可以得到可靠的检测结果。The invention detects the valuable target in the area twice, can take into account the contradiction between the larger scene of the monitoring area and the target details, and can detect the suspicious target in the larger monitoring area through one target detection, with large resolution The detection results on the rate images often have high accuracy, and reliable detection results can be obtained through secondary target detection.
优选地,目标反馈特征信息库包括假目标的位置信息和假目标特征描述信息;Preferably, the target feedback feature information database includes location information of the fake target and feature description information of the fake target;
摄像机以第一分辨率对监测区域进行视频图像采集包括:摄像机以大视野小分辨率对监测区域进行视频图像采集;The video image collection of the monitoring area by the camera at the first resolution includes: the video image collection of the monitoring area by the camera with a large field of view and a small resolution;
一次目标检测包括:基于假目标反馈特征信息库,判断监测区域内的图像特征与该区域的假目标反馈特征描述和位置信息匹配度是否满足指定条件,若满足,则不判为疑似目标;一次目标检测算法包括基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法;A target detection includes: based on the false target feedback feature information database, judging whether the image features in the monitoring area meet the specified conditions with the false target feedback feature description and location information matching degree of the area. If it is met, it is not judged as a suspected target; The target detection algorithm includes a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background;
当发现监测区域包含一个或多个疑似目标时,调整摄像机的视场方向和视场角,以第二分辨率对疑似目标进行视频图像采集,包括:以小视野大分辨率依次对疑似目标进行视频图像采集;When it is found that the monitoring area contains one or more suspected targets, adjust the camera's field of view direction and angle of view, and perform video image acquisition on the suspected target at the second resolution, including: successively performing the suspected target with a small field of view and a large resolution Video image collection;
二次目标检测算法为与背景无关的目标分类检测算法;The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background;
根据假目标的信息更新假目标反馈特征信息库包括:将大分辨率图像下确定的假目标,映射到小分辨率图像中,获取假目标位置信息,并提取假目标特征描述信息,更新假目标反馈特征信息库;Updating the false target feedback feature information database according to the information of the false target includes: mapping the false target determined under the large resolution image to the small resolution image, obtaining the position information of the false target, extracting the description information of the false target feature, and updating the false target Feedback feature information database;
当疑似目标为真目标时,对真目标进行跟踪监测,并输出大分辨率视频数据,输出告警信息和/或目标特征信息;When the suspected target is a real target, track and monitor the real target, and output large-resolution video data, output alarm information and/or target characteristic information;
大视野小分辨率指通过设定PTZ参数,使视频图像覆盖需要监控的区域;Large field of view and small resolution means that the video image covers the area that needs to be monitored by setting the PTZ parameters;
小视野大分辨率指通过调整PTZ参数,使目标缩放到视频图像高度的1/10~4/5。Small field of view and large resolution means that by adjusting the PTZ parameters, the target is zoomed to 1/10 to 4/5 of the height of the video image.
设定假目标反馈特征信息库,可以有针对性地进行目标筛选,有别于通用的计算模式,可以提高系统捕捉环境异常的准确度,取得了意想不到的效果。二次目标检测时确认疑似目标为假目标时,对假目标进行特征描述,将特征反馈给视频图像目标检测算法, 可使得系统对本区域以大视野小分辨率进行一次目标检测时,融合假目标特征,降低假目标再次被误判为疑似目标的概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。Setting the false target feedback feature information database allows targeted screening of targets, which is different from the general calculation mode, can improve the accuracy of the system to capture environmental abnormalities, and has achieved unexpected results. When it is confirmed that the suspected target is a fake target in the secondary target detection, the fake target is described, and the characteristics are fed back to the video image target detection algorithm, which can make the system perform a target detection in this area with a large field of view and a small resolution, and the fake target is merged Features to reduce the probability of false targets being misjudged as suspected targets again. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
本发明既保证能监测尽可能大的区域,又可以保证监测结果的可靠性。而在检测出有用目标后,可以持续跟踪目标的细节。The invention not only ensures that the largest possible area can be monitored, but also the reliability of the monitoring results. After the useful target is detected, the details of the target can be tracked continuously.
优选地,本发明提供的一种智能视觉感知系统,其中,小视野大分辨率指通过调整PTZ参数,使目标缩放到视频图像高度的1/6~2/3,并调整到视野中心。Preferably, the present invention provides an intelligent visual perception system, wherein the small field of view and large resolution refers to adjusting the PTZ parameters to zoom the target to 1/6 to 2/3 of the height of the video image and adjust it to the center of the field of view.
优选地,本发明提供的一种智能视觉感知系统,其中,目标特征信息包括表征目标具体分类的特征信息和/或表征目标具体身份的特征信息;表征目标具体分类的特征信息包括人、动物、车和/或车型、飞行物、指定异物中的一种或多种,其中指定异物包括自然掉落物和/或扩散物、人类遗落物中的一种或多种;表征目标具体身份的特征信息包括人的身份、动物的种类、车的车牌、其它异物种类中的一种或多种。在现实生活中,将智能视觉感知系统应用在安防领域,通过建立智能识别算法和特征库,针对监测领域中出现的疑似目标,当检测发现疑似目标为真目标时,表征目标具体分类信息和目标具体身份信息,发送监测终端,使告警信息真实、具体、可信,有利于操作者后续对于侵入目标采取合理的排除措施,提高应急响应效率。Preferably, the present invention provides an intelligent visual perception system, wherein the target feature information includes feature information that characterizes the specific classification of the target and/or feature information that characterizes the specific identity of the target; the feature information that characterizes the specific classification of the target includes humans, animals, One or more of vehicles and/or models, flying objects, and designated foreign objects, where the designated foreign objects include one or more of natural falling objects and/or diffuse objects, and human remains; those that characterize the specific identity of the target The characteristic information includes one or more of the identity of the person, the type of the animal, the license plate of the car, and the type of other foreign objects. In real life, the intelligent visual perception system is applied in the security field. By establishing intelligent recognition algorithms and feature libraries, for suspected targets in the monitoring field, when the suspected target is detected as a true target, the specific classification information and target of the target are represented The specific identity information is sent to the monitoring terminal to make the alarm information true, specific, and credible, which is beneficial to the operator to take reasonable measures to eliminate the intrusion target in the follow-up, and to improve the efficiency of emergency response.
优选地,本发明提供的一种智能视觉感知系统,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库。本发明通过对假目标特征描述信息设定更新速率,进而使假目标反馈特征信息库带有时间特征,更贴合实际,加快该特征库的自适应和自学习速度,进一步提高了系统一次检测准确性,从而提高系统整体性能。Preferably, the intelligent visual perception system provided by the present invention sets an update rate for the false target feature description information, and optimizes the false target feedback feature information database. The present invention sets the update rate for the feature description information of the false target, so that the false target feedback feature information database has time characteristics, which is more realistic, accelerates the self-adaptation and self-learning speed of the feature database, and further improves the accuracy of one-time detection of the system. Performance, thereby improving the overall performance of the system.
优选地,本发明提供的一种智能视觉感知系统,其中,监测区域为N个,其中N≥1;系统监测过程包括:Preferably, the present invention provides an intelligent visual perception system, wherein there are N monitoring areas, where N≥1; the system monitoring process includes:
在第M个监测区域进行一次目标检测,当一次目标检测发现无疑似目标时,调整摄像机的视场方向和焦距,以大视野小分辨率对下一个监测区域进行一次目标检测,其中,1≤M≤N;Perform a target detection in the M-th monitoring area. When a target detection is found to be undoubtedly a target, adjust the camera's field of view direction and focal length, and perform a target detection on the next monitoring area with a large field of view and small resolution, where 1≤ M≤N;
对第M个监测区域的真目标进行跟踪,当真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和视场角,以小视野大分辨率对第M个监测区域的下一个真目标进行跟踪;Track the real target in the M-th monitoring area. When the real target disappears or the tracking time reaches the set value, adjust the camera's field of view direction and angle of view, and use a small field of view and large resolution to monitor the next one in the M-th monitoring area. Real target tracking;
当第M个监测区域的所有真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和焦距,以大视野小分辨率对下一个监测区域进行一次目标检测;When all real targets in the M-th monitoring area disappear or the tracking time reaches the set value, adjust the camera's field of view direction and focal length, and perform a target detection on the next monitoring area with a large field of view and small resolution;
依次对N个监测区域进行目标检测,循环往复。Perform target detection on N monitoring areas in turn, cyclically.
在本发明中,发明人通过对目标跟踪条件的设置,比如是否发现疑似目标、目标消失时间和跟踪时间设定值等,提高了监测区域检测效率,并且实现了对区域内真目标的依次检测和跟踪,以及对多个监测区域的顺序检测,进一步提高了单个装置所能检测区域大小和检测距离,从而提高了设备利用率,减少了设备成本和占地空间,具有良好的 经济效益。In the present invention, the inventor improves the detection efficiency of the monitoring area by setting the target tracking conditions, such as whether a suspected target is found, the target disappearance time, and the tracking time setting value, etc., and realizes the sequential detection of true targets in the area. And tracking, as well as the sequential detection of multiple monitoring areas, further improve the detection area size and detection distance of a single device, thereby improving equipment utilization, reducing equipment costs and floor space, and having good economic benefits.
优选地,本发明提供的一种智能视觉感知系统,其中,分别对所有N个监测区域进行一次目标检测,对一次目标检测发现疑似目标的区域计算各监测区域各自的特征权重;根据各监测区域各自特征权重的大小,按照指定顺序依序对各监测区域进行二次目标检测和跟踪。Preferably, the present invention provides an intelligent visual perception system, in which a target detection is performed on all N monitoring areas respectively, and the respective feature weight of each monitoring area is calculated for the area where a suspect target is found in a target detection; according to each monitoring area The size of the respective feature weights is used to perform secondary target detection and tracking for each monitoring area in the specified order.
在本发明中,发明人通过对各个监测区域进行权重计算,可以对各个监测区域目标的出现概率等属性进行统计,保证系统可以优先监测特征权重比较大的区域,有利于分级预警,使操作者优先处理疑似目标比较多,或者目标移动速率比较快,或者特定监测领域中的其它重点关注目标特征比较集中的监测区域,优先排除该区域中的安全隐患,从而减少了整个安防系统的告警响应时间。在实际应用中,比如在铁路系统中,此项区分区域监测优先级的设置,可以使铁路工作者优先排除轨道及其周边影响列车通行的重要障碍,比如大量羊群经过、路面塌方掩盖铁路轨道等,赢得了事件干预、系统联动的时间,从而挽回多数人员伤亡和数以亿计的财产损失。In the present invention, by calculating the weight of each monitoring area, the inventor can calculate the occurrence probability of each monitoring area target and other attributes, ensuring that the system can prioritize the monitoring of areas with relatively large feature weights, which is beneficial to hierarchical early warning and enables operators Prioritize the processing of more suspected targets, or the target moving speed is relatively fast, or other focused monitoring areas in the specific monitoring field where the characteristics of the target are concentrated, and prioritize the elimination of potential safety hazards in this area, thereby reducing the alarm response time of the entire security system . In practical applications, such as in the railway system, this setting of the priority of regional monitoring can enable railway workers to prioritize the removal of important obstacles affecting the passage of trains on the track and its surroundings, such as a large number of sheep passing by, and road collapse covering the railway track. It won time for incident intervention and system linkage, thus saving most of the casualties and hundreds of millions of property losses.
优选地,本发明提供的一种智能视觉感知系统,其中,系统包括1个或多个智能视觉感知装置,智能视觉感知装置包括:Preferably, the present invention provides an intelligent visual perception system, wherein the system includes one or more intelligent visual perception devices, and the intelligent visual perception devices include:
摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元中的一种或多种;Camera, used for video image acquisition, including one or more of focus motor, zoom motor, drive module, image signal acquisition and processing unit;
传动机构,用于调整摄像机的视场方向和大小;Transmission mechanism, used to adjust the direction and size of the camera's field of view;
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机视场方向和大小,和/或与云端平台或数据中心进行信息交互;The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the direction and size of the camera's field of view, and/or exchange information with the cloud platform or data center;
通信接口单元,用于与云端平台和/或数据中心进行信息交互,和/或与现场其它传感或动作装置,和/或其它关联系统的联动信息交互,包括有线和/或无线接口;The communication interface unit is used for information interaction with the cloud platform and/or data center, and/or linkage information interaction with other sensing or action devices on site, and/or other related systems, including wired and/or wireless interfaces;
电源管理单元,用于给所有耗电单元供电;The power management unit is used to supply power to all power-consuming units;
防护外壳,用于将各单元封装在内,起到防护作用。The protective shell is used to encapsulate each unit and play a protective role.
优选地,本发明提供的一种智能视觉感知系统,其中,智能视觉感知装置还包括光照单元,用于给摄像机所监测的区域补光;光照单元为1个或多个,包括可见光源、红外光源中的一种或多种;传动机构调整光照单元的视场方向和大小。Preferably, the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device further includes a light unit for supplementing light to the area monitored by the camera; there are one or more light units, including visible light sources and infrared light sources. One or more of the light sources; the transmission mechanism adjusts the direction and size of the field of view of the light unit.
在本发明中,视频图像采集和跟踪,可通过摄像机调焦、光照单元补光、传动机构带动摄像机和光照电源的角度调整实现;疑似目标发现、真假目标判断、假目标特征描述等,需要数据处理单元执行视频图像目标检测算法来实现。本发明提供的智能视觉感知装置,通过上述硬件和软件的结合,可以实现监测区域内一次目标检测和二次目标检测,分辨疑似目标是真目标还是假目标;针对假目标进行特征描述,并对真目标进行跟踪等,具有可实现性和技术先进性。本发明的智能视觉感知系统,包括一个或多个上述智能视觉感知装置,有利于扩充监测区域,覆盖面积比较大、安防级别比较高的监控场景,提供全方位的安全预警。In the present invention, video image acquisition and tracking can be achieved through camera focusing, illumination unit fill light, transmission mechanism to drive the angle adjustment of the camera and illumination power supply; suspected target discovery, true and false target judgment, false target feature description, etc., need The data processing unit implements the video image target detection algorithm to achieve this. The intelligent visual perception device provided by the present invention, through the combination of the above hardware and software, can realize primary target detection and secondary target detection in the monitoring area, distinguish whether the suspected target is a true target or a false target; describe the characteristics of the false target, and Real target tracking, etc., is achievable and technologically advanced. The intelligent visual perception system of the present invention includes one or more of the above-mentioned intelligent visual perception devices, which is beneficial to expand the monitoring area, covers a relatively large area and relatively high security level monitoring scene, and provides a full range of safety warnings.
优选地,本发明提供的一种智能视觉感知系统,其中,系统包括1个或多个智能视觉感知装置,智能视觉感知装置中摄像机为一台或多台,摄像机为可见光摄像机和/或红外摄像机,红外摄像机包括近红外摄像机和/或红外热成像摄像机。Preferably, the present invention provides an intelligent visual perception system, wherein the system includes one or more intelligent visual perception devices, in the intelligent visual perception device, there are one or more cameras, and the cameras are visible light cameras and/or infrared cameras , Infrared cameras include near-infrared cameras and/or infrared thermal imaging cameras.
优选地,由一台或多台摄像机完成一次目标检测中的视频图像采集,和/或二次目标检测中的视频图像采集,和/或真目标跟踪时的视频图像采集。Preferably, one or more cameras complete the video image acquisition in the primary target detection, and/or the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
优选地,由红外摄像机完成一次目标检测中的视频图像采集;由可见光摄像机完成二次目标检测中的视频图像采集,和/或真目标跟踪时的视频图像采集。Preferably, the infrared camera completes the video image acquisition in the primary target detection; the visible light camera completes the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
优选地,由可见光摄像机完成一次目标检测中的视频图像采集,和/或二次目标检测中的视频图像采集,和/或真目标跟踪时的视频图像采集。Preferably, a visible light camera completes the video image acquisition in the primary target detection, and/or the video image acquisition in the secondary target detection, and/or the video image acquisition in the real target tracking.
红外热成像摄像机具有在-40℃~+70℃(阳光直射)的室外自然环境下正常工作,可以透烟、雾、霾,图像清晰度高,可夜视并对拍摄物体的温度敏感等特点,用于一次目标检测,对于通常可产生预警信号的活动物体,尤其是生物,具有识别的敏感性。可见光摄像机具有性能稳定、摄像清晰度高等的优点,通过红外热成像摄像机进行一次目标检测,发现疑似目标,使用可见光摄像机进行二次目标检测,对疑似目标进行放大,实现真目标和假目标的判断,可以满足恶劣的环境条件下,系统的稳定性和实用性要求,并且满足系统精准判断的需要。The infrared thermal imaging camera has the characteristics of working normally in the outdoor natural environment of -40℃~+70℃ (direct sunlight), can penetrate smoke, fog, and haze, has high image clarity, can night vision and is sensitive to the temperature of the shooting object, etc. , Used for a target detection, it is sensitive to the identification of moving objects that can usually generate early warning signals, especially living things. The visible light camera has the advantages of stable performance and high camera clarity. The infrared thermal imaging camera performs a target detection and finds the suspected target. The visible light camera is used for secondary target detection, and the suspected target is amplified to realize the judgment of the true target and the false target. , Can meet the requirements of system stability and practicability under harsh environmental conditions, and meet the needs of accurate system judgment.
优选地,传动机构,用于调整摄像机和/或光照单元的水平和/或竖直视场方向和大小,包括驱动电机、水平转轴、竖直转轴;驱动电机驱动摄像机和/或光照单元绕转轴水平转动0~360度,上下转动0~180度。Preferably, the transmission mechanism is used to adjust the horizontal and/or vertical field of view direction and size of the camera and/or the light unit, including a drive motor, a horizontal shaft, and a vertical shaft; the drive motor drives the camera and/or the light unit around the shaft The horizontal rotation is 0-360 degrees, and the vertical rotation is 0-180 degrees.
该传动机构,通过驱动摄像机和光照单元绕水平和竖直转轴以一定角度转动,满足摄像机采集视频图像覆盖所需要监控的区域的要求,以简单的设备结构实现了在现有技术中,多台摄像机相互配合才能实现的图像采集需要,并且可以稳定控制摄像机角度,实现真目标跟踪时图像采集的稳定性,进一步实现本发明目的。The transmission mechanism, by driving the camera and the light unit to rotate at a certain angle around the horizontal and vertical shafts, meets the requirements of the camera to collect video images to cover the area that needs to be monitored, and realizes that in the prior art with a simple device structure, multiple The camera can cooperate with each other to realize the image collection needs, and the camera angle can be stably controlled, the stability of the image collection during real target tracking can be realized, and the object of the present invention can be further achieved.
优选地,数据处理单元,采用具有视频图像处理能力的芯片,集成图像目标检测算法程序,进行实时视频图像处理;当识别出疑似目标为假目标时,对假目标进行特征描述,并将特征反馈给视频图像目标检测算法程序。Preferably, the data processing unit adopts a chip with video image processing capabilities, integrates image target detection algorithm programs, and performs real-time video image processing; when the suspected target is identified as a false target, the false target is characterized, and the characteristics are fed back Give the video image target detection algorithm program.
在本发明中,一次目标检测时需要对监测区域内进行视频图像采集,判断是否存在疑似目标。该判断过程需要将采集的视频图像与特征库进行比对。如果系统判断存在疑似目标,而经过二次目标检测,发现疑似目标为假目标时,说明该假目标特征未收录入特征库,从而导致了系统额外动作。因此,将假目标进行特征描述,并反馈给视频图像目标检测算法,有利于在特定监测领域中,对可能出现的特定障碍物或定期出现的非障碍物进行快速识别,在一次目标检测时,排除出疑似目标范围,节省系统图像采集和判断的时间,从而减少了真目标的告警信息反馈时间,在特定场景下,更有利于工作人员排除妨害,避免事故的发生,具有极强的现实意义。In the present invention, it is necessary to collect video images in the monitoring area during a target detection to determine whether there is a suspected target. The judgment process needs to compare the collected video images with the feature library. If the system judges that there is a suspected target, and after the second target detection, the suspected target is found to be a false target, it means that the features of the false target are not included in the feature database, which leads to additional actions of the system. Therefore, the false target is characterized and fed back to the video image target detection algorithm, which is conducive to the rapid identification of specific obstacles that may appear or non-obstacles that appear regularly in a specific monitoring field. During a target detection, Exclude the range of suspected targets, save the time of image collection and judgment of the system, thereby reducing the time for the warning information feedback of the true target. In certain scenarios, it is more conducive to the staff to eliminate the obstacles and avoid the occurrence of accidents, which has strong practical significance. .
优选地,通信接口单元包括输入接口和输出接口;输入接口用于接收外部设备信号;输出接口用于发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中, 无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。Preferably, the communication interface unit includes an input interface and an output interface; the input interface is used to receive external device signals; the output interface is used to send signals collected or received by the system, and the connection mode includes wireless and/or wired modes; among them, the wireless mode Including one or more of WIFI, BT, ZIGBEE, LORA, 2G, 3G, 4G, 5G, NB-IOT; wired methods include AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, One or more of optical fibers.
优选地,本发明提供的一种智能视觉感知系统,其中,智能视觉感知装置通过输入接口,接收监测区域内的其它传感或动作装置的信号,当传感或动作装置发送异常情况信号和/或报警信息时,智能视觉感知装置调整摄像机的视场方向和焦距,优先对传感或动作装置所在区域进行目标检测。Preferably, the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device receives signals from other sensing or motion devices in the monitoring area through an input interface, and when the sensing or motion device sends abnormal situation signals and/ Or when there is an alarm message, the intelligent visual perception device adjusts the direction and focal length of the camera's field of view, and prioritizes target detection in the area where the sensor or action device is located.
本发明提供的智能视觉感知装置,可接收监测区域内的传感器设备信号,对于外部传感器识别的危险信号进行处理,优先对该区域进行目标检测,实现与外部传感器发送信息的二次验证和目标识别、告警、特征信息发送。该联动机制,真正实现了传感装置的互联互通,节省了系统对外部预警事件的响应时间,同时弥补了外部传感器对于危险源头的具体特征不了解,收到报警信息不能做出准确反映,甚至无所适从,而浪费了事故发生前的宝贵挽救时间的现实问题,从而真正发挥了本系统的安全预警作用。The intelligent visual perception device provided by the present invention can receive the sensor equipment signal in the monitoring area, process the dangerous signal recognized by the external sensor, give priority to the target detection in the area, and realize the secondary verification and target recognition of the information sent with the external sensor , Alarm and characteristic information sending. This linkage mechanism truly realizes the interconnection of sensor devices, saves the system's response time to external early warning events, and at the same time makes up for the fact that external sensors do not understand the specific characteristics of the source of danger, and cannot accurately reflect the alarm information received, even The real problem of being at a loss as to what to do, and wasting the precious saving time before the accident, so as to truly play the role of safety early warning of this system.
优选地,本发明提供的一种智能视觉感知系统,其中,智能视觉感知装置通过输出接口,向监测区域内的其它传感或动作装置提供其感知数据和/或结果信息,用于数据融合和/或联合判断,和/或对传感或动作装置进行直接控制或联动,即系统联动。Preferably, the present invention provides an intelligent visual perception system, wherein the intelligent visual perception device provides its perception data and/or result information to other sensing or action devices in the monitoring area through an output interface for data fusion and / Or joint judgment, and / or direct control or linkage of sensing or action devices, that is, system linkage.
优选地,本发明提供的一种智能视觉感知系统,其中,监测区域内的其它传感或动作装置包括告警声光设备、门禁设备、消防设备、除障设备、动物驱赶设备、拖车设备、清扫设备、巡视设备、冲力减缓设备、紧急止停设备、分流设备、现场和/或外部通讯设备、防爆设备、医疗救助设备、掩蔽设备、无人机运输设备、人员疏散和/或安全撤离设备中的一种或多种;信息包括告警信息和/或目标特征信息。Preferably, the present invention provides an intelligent visual perception system, wherein other sensing or action devices in the monitoring area include alarm sound and light equipment, access control equipment, fire fighting equipment, obstacle removal equipment, animal repelling equipment, trailer equipment, and cleaning equipment. Equipment, inspection equipment, impulse mitigation equipment, emergency stop equipment, shunt equipment, on-site and/or external communication equipment, explosion-proof equipment, medical rescue equipment, shelter equipment, drone transportation equipment, personnel evacuation and/or safety evacuation equipment One or more of the information; the information includes alarm information and/or target characteristic information.
本发明提供的智能视觉感知装置,可以向监测区域内的其它系统或设备提供传感化数据输出。这些系统或设备可以对该智能视觉感知装置输出的数据信号进行数据融合、联合判断,也可以直接控制本地其它设备。由于本发明中的智能视觉感知装置可以对外部传感设备进行直接控制或者系统联动,当监测区域内存在事故隐患或者紧急危险时,不仅仅向监控终端发送告警信息和/或目标特征信息,同时还可以通过控制或联动告警声光设备、门禁设备、消防设备、除障设备等,有针对性地对事故隐患或者危险源进行排除,对正在紧密运行的设备进行紧急制动和现场分流,使用通讯设备与监测区域建立联系等等措施,减少外部对于系统发出的告警信息的响应时间,抓住事故发生前的黄金间隙,实现监测区域内人员、运输装置的紧急避险。这将最大限度地避免事故的发生、减少事故损害、保全人员和财产安全,具有极强的现实意义。The intelligent visual perception device provided by the present invention can provide sensory data output to other systems or equipment in the monitoring area. These systems or devices can perform data fusion and joint judgment on the data signals output by the intelligent visual perception device, and can also directly control other local devices. Since the intelligent visual perception device of the present invention can directly control external sensing equipment or system linkage, when there is a hidden accident or emergency danger in the monitoring area, it not only sends alarm information and/or target feature information to the monitoring terminal, but also It can also control or link alarm sound and light equipment, access control equipment, fire fighting equipment, obstacle removal equipment, etc., to eliminate accidental hazards or sources of danger in a targeted manner, and perform emergency braking and on-site diversion of equipment that is operating closely. Measures such as establishing contact between the communication equipment and the monitoring area, reduce the external response time to the alarm information sent by the system, seize the golden gap before the accident, and realize the emergency avoidance of personnel and transportation devices in the monitoring area. This will avoid accidents to the greatest extent, reduce accident damage, and protect the safety of personnel and property, which is of great practical significance.
优选地,本发明提供的一种智能视觉感知系统,其中,电源管理单元包括集成在系统内部的电池,和/或外部的太阳能电池板,和/或有线电源。Preferably, the present invention provides an intelligent visual perception system, wherein the power management unit includes a battery integrated inside the system, and/or an external solar panel, and/or a wired power supply.
优选地,防护外壳包括接口板、窗口、固定座;Preferably, the protective housing includes an interface board, a window, and a fixing seat;
接口板上有1个或多个接口,与外部单元连接;There are one or more interfaces on the interface board to connect with external units;
窗口采用透光材料,分别透射摄像机采集的视频图像,和/或光照单元发出的光;The window adopts light-transmitting material to respectively transmit the video image collected by the camera and/or the light emitted by the light unit;
固定座用于将防护外壳固定在外部支架上。The fixing seat is used to fix the protective shell on the external bracket.
本发明提供的智能视觉感知装置的防护外壳设计,可以使该装置的摄像机、光照单元、数据处理单元、传动机构、通信接口单元、电源管理单元等处于坚固外壳的保护下。该防护外壳上设置有窗口,用来投射视频图像和光照单元发出的光,采用透光材料等,也可以使摄像机和光照单元不直接接触外部环境,从而提高了系统的稳定性和使用寿命。不同数目的窗口设计,可实现本发明在不同应用条件下具体的图像采集需要。The protective shell design of the intelligent visual perception device provided by the present invention can make the camera, lighting unit, data processing unit, transmission mechanism, communication interface unit, power management unit, etc. of the device be protected by a solid shell. The protective shell is provided with windows for projecting video images and light emitted by the lighting unit, using light-transmitting materials, etc., so that the camera and the lighting unit do not directly contact the external environment, thereby improving the stability and service life of the system. Different numbers of window designs can realize the specific image acquisition needs of the present invention under different application conditions.
优选地,本发明提供的一种智能视觉感知系统,其中,输出告警信息和/或目标特征信息至监控终端,和/或输出告警信息和/或目标特征信息至数据中心和/或云端平台,启动告警处理服务,分发信息至监控终端,完成包括告警处置和/或干预和/或系统联动中的一种或多种功能。Preferably, the present invention provides an intelligent visual perception system, wherein the alarm information and/or target characteristic information are output to the monitoring terminal, and/or the alarm information and/or target characteristic information is output to the data center and/or cloud platform, Start the alarm processing service, distribute information to the monitoring terminal, and complete one or more functions including alarm handling and/or intervention and/or system linkage.
优选地,本发明提供的一种智能视觉感知系统,其中,云端平台和/或数据中心,包括服务器和通过图像目标检测算法进行图像识别的软件;监控终端,为一个或多个,用于显示目标识别的结果信息、接收报警信息、进行远程配置和控制,包括智能终端设备和其运行的管理软件,智能终端设备包括但不限计算机、手持机中的一种或多种。Preferably, the present invention provides an intelligent visual perception system, wherein the cloud platform and/or data center includes a server and software for image recognition through image target detection algorithms; monitoring terminals are one or more for display Target recognition result information, receiving alarm information, remote configuration and control, including intelligent terminal equipment and its running management software, intelligent terminal equipment includes but not limited to one or more of computers and handhelds.
本发明智能视觉识别系统发出的告警信息、目标特征信息等,可以通过云端平台和/或数据中心发给监控终端,保证了数据传输的稳定性和系统的远程控制。同时在数据中心,结合其它特征库,还可以进一步对真目标进行特征识别,包括身份信息等,从而发布更为精准的告警信息和特征信息报告。该方案弥补了现有技术中,在安防领域,监控设备发出模糊信息,导致监控终端收到信息后,无法及时排除妨碍,消除事故危险的缺点。The alarm information, target feature information, etc. sent by the intelligent visual recognition system of the present invention can be sent to the monitoring terminal through the cloud platform and/or data center, ensuring the stability of data transmission and the remote control of the system. At the same time, in the data center, combined with other feature databases, it is possible to further identify the true target, including identity information, so as to issue more accurate alarm information and feature information reports. This solution makes up for the shortcomings in the prior art that in the security field, the monitoring equipment sends out fuzzy information, which causes the monitoring terminal to be unable to eliminate the obstacles and eliminate the danger of accidents in time after receiving the information.
优选地,本发明提供的一种智能视觉感知系统,其中,服务器包括虚拟服务器,包括本地服务器、边缘云、公共云中的一种或多种。Preferably, the present invention provides an intelligent visual perception system, wherein the server includes a virtual server, including one or more of a local server, an edge cloud, and a public cloud.
本发明提供的智能视觉感知系统,能够实时发现目标、跟踪目标、识别目标并产生告警信息和目标特征识别信息,通过一次目标检测,排除非疑似目标;通过对疑似目标的二次目标检测,分辨真假目标,并对假目标进行特征描述补充特征库,减少一次目标检测的误判率,对真目标进行持续跟踪。本发明实现了环境异常的智能捕捉、甄别和特征描述,解决了现有技术中安防领域,对环境的非智能监测,需要操作者紧盯屏幕,实时地人为判断是否存在环境危险,并且以低分辨率,分辨远方存在的现实危险,有很大的误判可能性,影响了应急响应时间,从而延误了事故挽救的时机,可能导致数以亿计的财产损失和众多人员伤亡的现实问题,具有极大的社会价值。The intelligent visual perception system provided by the present invention can find targets, track targets, recognize targets and generate alarm information and target feature recognition information in real time, eliminate non-suspected targets through one target detection, and distinguish between suspected targets through secondary target detection. True and false targets, and feature descriptions of false targets to supplement the feature library, reduce the rate of misjudgment of a target detection, and continue to track true targets. The present invention realizes the intelligent capture, discrimination and feature description of environmental abnormalities, and solves the problem of non-intelligent monitoring of the environment in the security field in the prior art. The operator needs to keep an eye on the screen, and artificially judge whether there is environmental hazard in real time. Resolution, distinguish the actual dangers in the distant place, there is a great possibility of misjudgment, which affects the emergency response time, which delays the timing of accident rescue, and may cause hundreds of millions of property losses and many practical problems of casualties. Has great social value.
本发明的其它特征和优点将在随后的说明书中阐述,并且,部分地从说明书中变得显而易见,或者通过实施本发明而了解。本发明的目的和其他优点可通过在说明书、权利要求书以及附图中所指出的结构来实现和获得。Other features and advantages of the present invention will be described in the following description, and partly become obvious from the description, or understood by implementing the present invention. The purpose and other advantages of the present invention can be realized and obtained through the structures indicated in the specification, claims and drawings.
附图说明Description of the drawings
为了更清楚地说明本发明实施例或现有技术中的技术方案,下面将对实施例或现有 技术描述中所需要使用的附图作简单地介绍,显而易见地,下面描述中的附图仅仅是本发明的一些实施例,对于本领域普通技术人员来讲,在不付出创造性劳动的前提下,还可以根据这些附图获得其它的附图。In order to explain the embodiments of the present invention or the technical solutions in the prior art more clearly, the following will briefly introduce the drawings that need to be used in the description of the embodiments or the prior art. Obviously, the drawings in the following description are only These are some embodiments of the present invention. For those of ordinary skill in the art, other drawings can be obtained based on these drawings without creative work.
图1表示本发明提供的智能视觉感知系统工作流程;Figure 1 shows the work flow of the intelligent visual perception system provided by the present invention;
图2表示本发明提供的针对多个监测区域进行目标检测和/或识别的工作流程;Fig. 2 shows the working process of target detection and/or identification for multiple monitoring areas provided by the present invention;
图3表示本发明提供的通过设定区域权重,依序进行目标检测和/或识别的工作流程;FIG. 3 shows a workflow of performing target detection and/or recognition in sequence by setting area weights provided by the present invention;
图4表示本发明提供的智能视觉感知系统的数据流转;Figure 4 shows the data flow of the intelligent visual perception system provided by the present invention;
图5表示本发明提供的一种单摄像机智能视觉感知装置;Figure 5 shows a single-camera intelligent visual perception device provided by the present invention;
图6表示本发明提供的一种单摄像机、单光源智能视觉感知装置;Figure 6 shows a single-camera, single-light source intelligent visual perception device provided by the present invention;
图7表示本发明提供的一种双摄像机、双光源智能视觉感知装置;Figure 7 shows a dual-camera, dual-light source intelligent visual perception device provided by the present invention;
图8表示本发明提供的一种双摄像机、单光源智能视觉感知装置;Figure 8 shows a dual-camera, single-light source intelligent visual perception device provided by the present invention;
图9表示本发明提供的系统在铁路场景中经一次目标检测发现远处有疑似目标;Figure 9 shows that the system provided by the present invention finds that there is a suspected target in the distance after a target detection in a railway scene;
图10表示本发明提供的系统在铁路场景中经二次目标检测发现真目标;FIG. 10 shows that the system provided by the present invention finds a true target through secondary target detection in a railway scene;
图11表示本发明提供的系统在铁路场景中对真目标进行跟踪。Figure 11 shows that the system provided by the present invention tracks real targets in a railway scene.
具体实施方式detailed description
为了进一步阐明本发明,下面给出一系列实施例。需要指出的是,这些实施例完全是例证性的。给出这些实施例的目的是为了充分明示本发明的意义和内容,但并不因此将本发明限制在所述的实施例范围之中。In order to further clarify the present invention, a series of examples are given below. It should be pointed out that these examples are completely illustrative. The purpose of these embodiments is to fully clarify the meaning and content of the present invention, but not to limit the present invention to the scope of the embodiments.
首先需要说明的是,本发明是计算机技术在安防领域的一种应用,在本发明的实现过程中,会涉及到多个软件功能模块的应用。本申请人认为,如在仔细阅读申请文件、准确理解本发明的实现原理和发明目的后,在结合现有公知技术的情况下,本领域技术人员完全可以运用其掌握的软件编程技能实现本发明。First of all, it should be noted that the present invention is an application of computer technology in the field of security and protection. During the implementation of the present invention, the application of multiple software function modules will be involved. The applicant believes that after carefully reading the application documents, accurately understanding the realization principle and purpose of the invention, and combining the existing known technologies, those skilled in the art can fully use the software programming skills they have mastered to realize the invention. .
实施例1:Example 1:
一种智能视觉感知系统的第一种实施方式:The first implementation of an intelligent visual perception system:
智能视觉感知系统包括可变焦距的摄像机和指定监测区域的假目标反馈特征信息库。首先,建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。以第一分辨率对监测区域进行视频图像采集。具体地,如图1所示,系统调整摄像机视场方向和焦距,设定PTZ参数,使视频图像覆盖所需要监控的预设监测区域,该大视野小分辨率对指定监测区域进行视频图像采集。采用一次目标检测算法,对摄像机所采集的视频图像,进行区域内一次目标检测。进行区域内一次目标检测,包括:基于假目标反馈特征信息库判断指定监测区域是否包含疑似目标。进一步地,基于假目标反馈特征信息库,判断监测区域内的图像特征与该区域的假目标反馈特征描述和位置信息匹配度是否满足指定条件,若满足,则不判为疑似目标。一次目标检测结合了 假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,示例性地,当匹配度达到90%,则认为该区域内与假目标反馈特征描述对应位置的对象不是疑似目标,从而则该区域将大概率被划分为背景,即当区域内与假目标反馈特征描述对应位置均符合匹配度达到指定值的情况下,不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。The intelligent visual perception system includes a variable focal length camera and a false target feedback feature information library in a designated monitoring area. First, establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information. Perform video image collection on the monitoring area at the first resolution. Specifically, as shown in Figure 1, the system adjusts the camera's field of view direction and focal length, sets the PTZ parameters, so that the video image covers the preset monitoring area that needs to be monitored, and the large field of view and small resolution perform video image collection on the designated monitoring area . A target detection algorithm is used to perform a target detection in the area on the video image collected by the camera. Perform a target detection in the area, including: judging whether the designated monitoring area contains suspected targets based on the false target feedback feature information database. Further, based on the false target feedback feature information database, it is determined whether the image features in the monitoring area meet the specified conditions with the false target feedback feature description and location information matching degree of the area. If it is satisfied, it is not judged as a suspected target. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, for example, when the matching degree reaches 90%, it is considered that the area has a high matching degree with the false target feedback feature description. The object at the corresponding location of the target feedback feature description is not a suspected target, so the area will be classified as a background with a high probability, that is, when the corresponding location of the false target feedback feature description in the area matches the specified value, it will not be classified Classified as a suspected target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
当一次目标检测发现无疑似目标时,调整摄像机视场方向和焦距,以大视野小分辨率对下一个指定监测区域,即预设监测区域进行视频图像采集,并采用一次目标检测算法,对摄像机所采集的视频图像,进行区域内一次目标检测。When a target detection is found to be undoubtedly a target, adjust the camera's field of view direction and focal length, use a large field of view and small resolution to capture the video image of the next designated monitoring area, that is, the preset monitoring area, and use a target detection algorithm to control the camera. The collected video images are subjected to a target detection in the area.
当一次目标检测发现有疑似目标时,通过调整PTZ参数,调整摄像机视场方向和焦距,缩放目标高度,以第二分辨率对疑似目标进行视频图像采集,第二分辨率大于第一分辨率。本发明实施例中,以小视野大分辨率对监测区域的疑似目标依次进行视频图像采集,并采用二次目标检测算法,对采集的视频图像,进行二次目标检测,分辨疑似目标为真目标或假目标。二次目标检测算法为与背景无关的目标分类检测算法。When a suspected target is found in a target detection, by adjusting the PTZ parameters, adjusting the direction and focal length of the camera's field of view, zooming the target height, the suspected target is captured with a second resolution, and the second resolution is greater than the first resolution. In the embodiment of the present invention, the suspected targets in the monitoring area are collected sequentially with a small field of view and large resolution, and a secondary target detection algorithm is adopted to perform secondary target detection on the collected video images to distinguish the suspected targets as true targets. Or false goals. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
当二次目标检测发现是真目标时,输出告警信息至监控终端,并生成目标特征信息。调整摄像机视场方向和焦距,一段时间持续跟踪目标,同时,将二次目标检测生成的目标特征信息,发送至监控终端。When the secondary target is detected to be a true target, the alarm information is output to the monitoring terminal, and target characteristic information is generated. Adjust the direction and focal length of the camera's field of view, and continue to track the target for a period of time. At the same time, the target feature information generated by the secondary target detection is sent to the monitoring terminal.
目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、指定异物,即不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种,或者表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, designated foreign objects, that is, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks One or more of, mudslides, and human remains, or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, and the license plate of the vehicle. The specific information types and contents are defined according to the specific application environment.
对真目标进行跟踪,当真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和大小,以小视野大分辨率对下一个真目标进行跟踪。Track the real target. When the real target disappears or the tracking time reaches the set value, adjust the direction and size of the camera's field of view to track the next real target with a small field of view and large resolution.
当二次目标检测发现一次目标检测的疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,获取假目标位置信息,并提取假目标特征描述信息,更新假目标反馈特征信息库,降低系统一次目标检测的误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。When the secondary target detection finds that the suspected target of the primary target detection is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image to obtain the position information of the false target, and extract the false target feature description information. Update the false target feedback feature information database to reduce the false detection probability of a target detection in the system. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
在进行二次目标检测跟踪时,当所有真目标都消失或跟踪时间达到设定值时,调整摄像机视场方向和焦距,以大视野小分辨率对下一个预设监测区域进行目标识别,识别方法与第一预设监测区域目标识别方法相同。When performing secondary target detection and tracking, when all true targets disappear or the tracking time reaches the set value, adjust the camera's field of view direction and focal length, and perform target recognition on the next preset monitoring area with a large field of view and small resolution. The method is the same as the target recognition method in the first preset monitoring area.
依次类推,如图2所示,对N(0<N)个预设监测区域分别进行目标检测和/或识别,最后,重新从第一预设监测区域开始目标检测和/或识别,重复执行上述步骤。在第M个监测区域中分别对其中的T个疑似目标进行二次检测,其中,二次检测以大视野小分辨率指通过设定PTZ参数,使视频图像覆盖所需要监控的区域;然后对下一个区域,即 第M+1个监测区域进行二次检测。By analogy, as shown in Figure 2, target detection and/or recognition are performed on N (0<N) preset monitoring areas, and finally, target detection and/or recognition are restarted from the first preset monitoring area, and the execution is repeated The above steps. In the M-th monitoring area, perform secondary detection on T suspected targets. The secondary detection with large field of view and small resolution means that the video image covers the area that needs to be monitored by setting the PTZ parameters; then The next area, that is, the M+1 monitoring area, performs a second inspection.
小视野大分辨率指通过调整PTZ参数,使目标高度缩放到视频图像高度的1/10~4/5,并调整到视野中心。Small field of view and large resolution means that by adjusting the PTZ parameters, the target height is zoomed to 1/10 to 4/5 of the video image height, and adjusted to the center of the field of view.
实施例2:Example 2:
在上述实施例的基础上,还提供一种智能视觉感知系统的第二种实施方式:On the basis of the foregoing embodiment, a second implementation manner of an intelligent visual perception system is also provided:
建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。如图1所示,系统调整摄像机视场方向,通过设定PTZ参数,使视频图像覆盖所需要监控的区域,以该大视野小分辨率对第一预设监测区域进行视频图像采集。对摄像机所采集的视频图像采用一次目标检测算法,进行第一预设监测区域内一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information. As shown in Figure 1, the system adjusts the direction of the camera's field of view, sets the PTZ parameters so that the video image covers the area that needs to be monitored, and collects video images in the first preset monitoring area with the large field of view and small resolution. A target detection algorithm is used for the video image collected by the camera to perform a target detection in the first preset monitoring area. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
当一次目标检测发现无疑似目标时,智能视觉感知装置继续以大视野小分辨率对下一个预设监测区域进行视频图像采集,采用一次目标检测算法,对采集的视频图像,进行区域内一次目标检测。When a target detection is found to be undoubtedly a target, the intelligent visual perception device continues to collect video images of the next preset monitoring area with a large field of view and small resolution. A target detection algorithm is used to perform a target in the area on the collected video image. Detection.
当一次目标检测发现有疑似目标时,给出第一预设监测区域特征权重Q1;同时,系统继续以大视野小分辨率对其它预设监测区域进行视频图像采集,并采用二次目标检测算法,对采集的视频图像进行监测区域内一次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。When a suspected target is found in a target detection, the first preset monitoring area feature weight Q1 is given; at the same time, the system continues to collect video images of other preset monitoring areas with a large field of view and small resolution, and uses a secondary target detection algorithm , Perform a target detection in the monitoring area on the collected video images. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
示例性地,监测区域为N个,其中N≥1;系统监测过程包括:Exemplarily, there are N monitoring areas, where N≥1; the system monitoring process includes:
在第M个监测区域进行一次目标检测,当一次目标检测发现无疑似目标时,调整摄像机的视场方向和焦距,以大视野小分辨率对下一个监测区域进行一次目标检测,其中,1≤M≤N;Perform a target detection in the M-th monitoring area. When a target detection is found to be undoubtedly a target, adjust the camera's field of view direction and focal length, and perform a target detection on the next monitoring area with a large field of view and small resolution, where 1≤ M≤N;
对第M个监测区域的真目标进行跟踪,当真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和视场角,以小视野大分辨率对第M个监测区域的下一个真目标进行跟踪;Track the real target in the M-th monitoring area. When the real target disappears or the tracking time reaches the set value, adjust the camera's field of view direction and angle of view, and use a small field of view and large resolution to monitor the next one in the M-th monitoring area. Real target tracking;
当第M个监测区域的所有真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和焦距,以大视野小分辨率对下一个监测区域进行一次目标检测;When all real targets in the M-th monitoring area disappear or the tracking time reaches the set value, adjust the camera's field of view direction and focal length, and perform a target detection on the next monitoring area with a large field of view and small resolution;
依次对N个监测区域进行目标检测,循环往复。Perform target detection on N monitoring areas in turn, cyclically.
进一步地,如图3所示,分别对所有N个监测区域进行一次目标检测,对一次目标检测发现疑似目标的区域计算各监测区域各自的特征权重;根据各监测区域各自特征权重的大小,按照指定顺序依序对各监测区域进行二次目标检测和跟踪。示例性地,依次对N(0<N)个预设监测区域进行疑似目标的一次目标检测,找出存在疑似目标的区域,对存在疑似目标的区域给出特征权重Qi。依据区域特征权重Qi大小,对存在疑似目标 的监测区域进行重新排序,权重最大的为第1区域,其权重为Q1,权重最小的区域为第W(W<=N)区域,其权重为Qw。Further, as shown in Fig. 3, a target detection is performed on all N monitoring areas respectively, and the respective feature weight of each monitoring area is calculated for the area where a suspected target is found in a target detection; according to the size of the respective feature weight of each monitoring area, according to Perform secondary target detection and tracking on each monitoring area in the specified order. Exemplarily, a target detection of suspected targets is performed on N (0<N) preset monitoring areas in sequence, the area where the suspected target exists is found, and the feature weight Qi is given to the area where the suspected target exists. According to the size of the regional feature weight Qi, reorder the monitoring areas with suspected targets. The largest weight is the first area, and its weight is Q1, and the smallest weight is the W (W<=N) area, and its weight is Qw .
系统调整摄像机的视场方向和焦距,通过调整PTZ参数,缩放目标高度,以小视野大分辨率对存在疑似目标的第1区域的疑似目标进行视频图像采集,并经过视频图像目标检测算法,对采集的视频图像进行二次目标检测。The system adjusts the camera's field of view direction and focal length, adjusts the PTZ parameters, zooms the target height, and collects the video image of the suspected target in the first area where the suspected target exists with a small field of view and large resolution, and passes the video image target detection algorithm to The collected video images are subjected to secondary target detection.
当二次目标检测发现一次目标检测的疑似目标是真目标时,输出第一告警信息,并生成第一目标特征信息。调整摄像机视场方向和焦距,一段时间持续跟踪目标,同时,将二次目标检测生成的第一目标特征信息发送至云平台或数据中心,云端平台或数据中心依据智能视觉感知装置发送的目标特征信息1经过目标识别算法,进行目标识别,并将得到的第二目标特征信息和第二告警信息,发送至监控终端;监控终端对第二告警信息、第一目标特征信息和/或第二目标特征信息进行处理和显示。When the secondary target detection finds that the suspected target of the primary target detection is a true target, the first alarm information is output, and the first target characteristic information is generated. Adjust the direction and focal length of the camera's field of view, and continue to track the target for a period of time. At the same time, the first target feature information generated by the secondary target detection is sent to the cloud platform or data center, and the cloud platform or data center is based on the target feature sent by the intelligent visual perception device Information 1 passes through the target recognition algorithm to perform target recognition, and sends the obtained second target characteristic information and second alarm information to the monitoring terminal; the monitoring terminal responds to the second alarm information, the first target characteristic information and/or the second target Characteristic information is processed and displayed.
第一目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。第二目标特征信息包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。The first target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides , And one or more of human remains. The second target characteristic information includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
对真目标进行跟踪,当真目标消失或跟踪时间达到设定值时,调整摄像机的视场方向和大小,以小视野大分辨率对下一个真目标进行跟踪。Track the real target. When the real target disappears or the tracking time reaches the set value, adjust the direction and size of the camera's field of view to track the next real target with a small field of view and large resolution.
当二次目标检测发现一次目标检测的疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步降低系统一次目标检测的误检概率。When the secondary target detection finds that the suspected target of the primary target detection is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, the false target feature description information is extracted, and the false target feature description information is set Determine the update rate, optimize the false target feedback feature information database, and further reduce the probability of false detection of a target detection in the system.
对第1区域进行二次目标检测跟踪时,当所有真目标消失或跟踪时间达到设定值时,调整摄像机视场方向和焦距,以小视野大分辨率对存在疑似目标的第2区域进行目标检测和/或识别,检测和/或识别方法与存在疑似目标的第1区域目标检测和/或识别方法相同。When performing secondary target detection and tracking on the first area, when all true targets disappear or the tracking time reaches the set value, adjust the camera's field of view direction and focal length, and target the second area where there are suspected targets with a small field of view and large resolution The detection and/or recognition method is the same as the detection and/or recognition method for the first area target where there is a suspected target.
依次类推,重复执行上述步骤,对所有存在疑似目标的W个区域进行二次目标检测和/或识别。By analogy, the above steps are repeated to perform secondary target detection and/or recognition for all W regions with suspected targets.
最后,重新从对第一预设监测区域进行目标检测和/或识别,自动执行上述步骤,对N个区域循环进行目标检测和/或识别。Finally, the target detection and/or recognition is performed again on the first preset monitoring area, and the above steps are automatically executed, and the target detection and/or recognition is performed on the N areas in a loop.
其中,大视野小分辨率指通过设定PTZ参数,使视频图像覆盖所需要监控的区域;Among them, the large field of view and small resolution means that the video image covers the area that needs to be monitored by setting the PTZ parameters;
小视野大分辨率指通过调整PTZ参数,使目标高度缩放到视频图像高度的1/6~2/3,并调整到视野中心。Small field of view and large resolution means that by adjusting the PTZ parameters, the target height is zoomed to 1/6 to 2/3 of the video image height, and adjusted to the center of the field of view.
实施例3:Example 3:
在上述实施例的基础上,还提供一种智能视觉感知系统的第三种实施方式:On the basis of the foregoing embodiment, a third implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置 的一种结构由1个可见光摄像机、1传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1防护外壳组成。外壳留有1窗口及电源和信号线接口,窗口采用可透光的材料密封,用于摄像机进行视频图像采集。传动机构通过数据处理单元发送的控制命令调整摄像机的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. A structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, and a protective casing. The shell is left with a window and power and signal line interfaces. The window is sealed with a light-permeable material for the camera to collect video images. The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line. The drive motor drives the camera to rotate 0-360 degrees horizontally around the axis of rotation, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。传动机构控制摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the camera angle, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The transmission mechanism controls the left and right swing positions of the camera.
如图5所示,智能视觉感知装置包含可见光摄像机501、传动机构502、数据处理单元503、通信接口单元504、电源管理单元505、防护外壳组成507。As shown in FIG. 5, the intelligent visual perception device includes a visible light camera 501, a transmission mechanism 502, a data processing unit 503, a communication interface unit 504, a power management unit 505, and a protective housing 507.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;固定座用于固定防护外壳,可固定在外部支架上。在本实施例中,外壳留有窗口508,以及电源和信号线接口板510,窗口508采用可透光的材料密封,用于摄像机进行视频图像采集。传动机构502通过接收数据处理单元503发送的控制命令,调整摄像机501上下左右视场方向。通信接口单元504主要包括有线和无线通信接口。电源管理单元505主要用于给整个智能视频感知装置供电,在本实施例中为系统内部的电池。其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the fixing seat is used to fix the protective shell and can be fixed on the external bracket. In this embodiment, the housing has a window 508 and a power and signal line interface board 510. The window 508 is sealed with a light-permeable material for video image capture by the camera. The transmission mechanism 502 adjusts the up, down, left, and right field of view directions of the camera 501 by receiving control commands sent by the data processing unit 503. The communication interface unit 504 mainly includes wired and wireless communication interfaces. The power management unit 505 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元503。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 503 in real time.
(3)数据处理单元503经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 503 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像 机501视场方向,使得摄像机501视场方向对准第二预设监测区域,并调整摄像机501焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元503经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view , The camera 501 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 501 is adjusted to collect video images in this area with a large field of view and small resolution. The data processing unit 503 performs a video image target detection algorithm on the collected The video image performs a target detection of the target.
(5)当一次目标检测发现有疑似目标时,调整PTZ参数:数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,使得摄像机501视场方向对准疑似目标,调整摄像机501焦距。通过调整PTZ参数,将疑似目标高度缩放到视频图像高度的1/3,以该小视野大分辨率对疑似目标进行视频图像采集,数据处理单元503经过二次目标检测算法,对采集的视频图像进行二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(5) When a suspected target is found in a target detection, adjust the PTZ parameters: the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the field of view direction of the camera 501 so that the field of view direction of the camera 501 is aligned For the suspected target, adjust the focus of the camera 501. By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/3 of the height of the video image, and the video image of the suspected target is collected with this small field of view and large resolution. The data processing unit 503 performs a secondary target detection algorithm on the collected video image. Perform secondary target detection. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(6)当二次目标检测判断出是真目标时,数据处理单元503输出告警信息,并生成该真目标特征信息,同时,数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,实时调整摄像机501焦距,对检测出的真目标进行实时跟踪,同时,数据处理单元503将该真目标特征信息和告警信息发送至监控终端。(6) When the secondary target detection determines that it is a true target, the data processing unit 503 outputs alarm information and generates the true target characteristic information. At the same time, the data processing unit 503 sends control commands to the camera 501 and the transmission mechanism 502, the transmission mechanism 502 adjusts the direction of the field of view of the camera 501, adjusts the focal length of the camera 501 in real time, and tracks the real target detected in real time. At the same time, the data processing unit 503 sends the characteristic information and alarm information of the real target to the monitoring terminal.
目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种,或者表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(7)摄像机501对真目标进行跟踪时,当真目标消失或跟踪时间达到设定值时,数据处理单元503发送控制命令至摄像机501和传动机构502,传动机构502调整摄像机501视场方向,使得摄像机501视场方向对准下一个真目标进行跟踪。(7) When the camera 501 is tracking a real target, when the real target disappears or the tracking time reaches the set value, the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the field of view of the camera 501 so that The direction of the field of view of the camera 501 is aimed at the next real target for tracking.
(8)当二次目标检测判断出是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元503对假目标进行特征描述,提取假目标特征描述信息,更新假目标反馈特征信息库,以降低算法在此之后的一次目标检测误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。(8) When the secondary target detection determines that it is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 503 characterizes the false target and extracts the false target feature description Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000001
Figure PCTCN2021087921-appb-000001
(9)摄像机501对真目标进行跟踪时,当所有真目标消失或跟踪时间达到设定值时,针对第二预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元503,数据处理单元503经过视频图像目标 检测算法,对采集的视频图像进行目标的一次目标检测。(9) When the camera 501 is tracking a real target, when all real targets disappear or the tracking time reaches the set value, the PTZ parameters are set for the second preset monitoring area, and the video image is collected with the large field of view and small resolution , Transmit the collected video images to the data processing unit 503 in real time, and the data processing unit 503 performs a target detection on the collected video images through a video image target detection algorithm.
(10)依次类推,按(2)~(9)步骤对N个预设区域进行目标检测和/或识别,最后,重新从第一预设监测区域开始目标检测和/或识别,自动循环。(10) By analogy, follow the steps (2) to (9) to perform target detection and/or recognition on N preset areas, and finally, restart target detection and/or recognition from the first preset monitoring area, and automatically loop.
实施例4:Example 4:
在上述实施例的基础上,还提供一种智能视觉感知系统的第四种实施方式:On the basis of the foregoing embodiment, a fourth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置的一种结构由1个可见光摄像机、1传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1防护外壳组成。外壳留有1窗口及电源和信号线接口,窗口采用可透光的材料密封,用于摄像机进行视频图像采集。传动机构通过数据处理单元发送的控制命令调整摄像机的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. A structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, and a protective shell. The shell is left with a window and power and signal line interfaces. The window is sealed with a light-permeable material for the camera to collect video images. The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line. The drive motor drives the camera to rotate 0-360 degrees horizontally around the axis of rotation, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。传动机构控制摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the camera angle, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The transmission mechanism controls the left and right swing positions of the camera.
如图5所示,智能视觉感知装置包含可见光摄像机501、传动机构502、数据处理单元503、通信接口单元504、电源管理单元505、防护外壳组成507。As shown in FIG. 5, the intelligent visual perception device includes a visible light camera 501, a transmission mechanism 502, a data processing unit 503, a communication interface unit 504, a power management unit 505, and a protective housing 507.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口508,以及电源和信号线接口板510,窗口508采用可透光的材料密封,用于摄像机进行视频图像采集。传动机构502通过接收数据处理单元503发送的控制命令,调整摄像机501上下左右视场方向。通信接口单元504主要包括有线和无线通信接口。电源管理单元505主要用于给整个智能视频感知装置供电,在本实施例中为系统内部的电池。其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the fixing seat is used to fix the protective shell and is fixed on the external bracket. In this embodiment, the housing has a window 508 and a power and signal line interface board 510. The window 508 is sealed with a light-permeable material for video image capture by the camera. The transmission mechanism 502 adjusts the up, down, left, and right field of view directions of the camera 501 by receiving control commands sent by the data processing unit 503. The communication interface unit 504 mainly includes wired and wireless communication interfaces. The power management unit 505 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元503。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 503 in real time.
(3)数据处理单元503经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 503 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,使得摄像机501视场方向对准第二预设监测区域,并调整摄像机501焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元503经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view , The camera 501 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 501 is adjusted to collect video images in this area with a large field of view and small resolution. The data processing unit 503 performs a video image target detection algorithm on the collected The video image performs a target detection of the target.
(5)当一次目标检测发现有疑似目标时,数据处理单元503给出第一预设监测区域特征权重Q1,Q1由该区域内疑似目标数量获得;设定PTZ参数,以大视野小分辨率覆盖第二预设监测区域,对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元503,经过视频图像目标检测算法进行目标的一次目标检测。(5) When a suspected target is found in a target detection, the data processing unit 503 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; set the PTZ parameter to a large field of view and a small resolution Cover the second preset monitoring area, perform video image collection on this area, transmit the collected video image to the data processing unit 503 in real time, and perform a target detection of the target through the video image target detection algorithm.
(6)依次类推,重复上述步骤(1)~(5),对N个监测区域进行一次目标检测,区分出存在疑似目标的W个区域,并依据区域特征权重Qi的大小对W个区域重新排序,权重最大的为第1区域,权重最小为第W区域。(6) By analogy, repeat the above steps (1) ~ (5), perform a target detection on N monitoring areas, distinguish W areas where there are suspected targets, and renew the W areas according to the size of the area feature weight Qi Sorting, the largest weight is the first area, and the smallest weight is the W-th area.
本实施例中,各区域特征权重Qi(0<i≤N)分别由各区域中疑似目标的数量获得。In this embodiment, each area feature weight Qi (0<i≤N) is obtained from the number of suspected targets in each area.
(7)智能视觉感知装置调整摄像机501视场方向和焦距,通过调整PTZ参数,将疑似目标高度缩放到视野高度的1/2,以该小视野大分辨率对存在疑似目标的第1区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元503,数据处理单元503经过二次目标检测算法,对采集的视频图像依次进行区域内疑似目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(7) The intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 501. By adjusting the PTZ parameters, the height of the suspected target is zoomed to 1/2 of the height of the field of view, and the first area with the suspected target is performed with the small field of view and large resolution. Video image collection, real-time transmission of the collected video images to the data processing unit 503, the data processing unit 503 through the secondary target detection algorithm, on the collected video images in order to perform secondary target detection of suspected targets in the area. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(8)当二次目标检测判断出疑似目标是真目标时,数据处理单元生成该真目标的目标特征信息1和告警信息1。数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,实时调整摄像机501焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元503将二次检测的真目标的目标特征信息1发送至云平台或数据中心;云端平台或数据中心依据智能视觉感知装置发送的目标特征信息1经过目标识别算法,进行目标识别,并将得到目标的特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理和显示。(8) When the secondary target detection determines that the suspected target is a true target, the data processing unit generates target feature information 1 and warning information 1 of the true target. The data processing unit 503 sends control commands to the camera 501 and the transmission mechanism 502. The transmission mechanism 502 adjusts the field of view direction of the camera 501, adjusts the focal length of the camera 501 in real time, and tracks the real target detected in real time; The target characteristic information 1 of the detected real target is sent to the cloud platform or data center; the cloud platform or data center is based on the target characteristic information 1 sent by the intelligent visual perception device through the target recognition algorithm, and the target is identified, and the target characteristic information is obtained 2 And alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(9)摄像机501对真目标进行跟踪时,当该目标消失或跟踪时间达到设定值时,数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,使得摄像机501视场方向对准下一个真目标,调整摄像机501焦距,以小视野大分辨率对下一个真目标进行视频图像采集,将采集的视频图像实时传输至数据处理单元503,数据处理单元503经过视频图像目标检测算法,对采集的视频图像继续进行二次目标检测。(9) When the camera 501 is tracking a real target, when the target disappears or the tracking time reaches a set value, the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view. Make the field of view of the camera 501 align with the next real target, adjust the focal length of the camera 501, collect video images of the next real target with a small field of view and large resolution, and transmit the collected video images to the data processing unit 503 in real time. 503 continues to perform secondary target detection on the collected video images through the video image target detection algorithm.
(10)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元503对假目标进行特征描述,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步提高了一次检测算法的精确度。(10) When the secondary target detection determines that the suspected target is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 503 characterizes the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000002
Figure PCTCN2021087921-appb-000002
(11)摄像机501对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,数据处理单元503发送控制命令给摄像机501和传动机构502,传动机构502调整摄像机501视场方向,使得摄像机501视场方向对准第2区域,调整摄像机501焦距,以小视野大分辨率对第2区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元503,进行目标的二次目标检测,检测方法与第1区域目标二次检测方法相同。(11) The camera 501 tracks the real target. When all real targets disappear or the tracking time reaches the set value, the data processing unit 503 sends a control command to the camera 501 and the transmission mechanism 502, and the transmission mechanism 502 adjusts the direction of the camera 501's field of view. Align the field of view of the camera 501 with the second area, adjust the focal length of the camera 501, collect video images in the second area with a small field of view and large resolution, and transmit the collected video images to the data processing unit 503 in real time to perform the secondary target Target detection, the detection method is the same as the second detection method of the first area target.
(12)依次类推,按(7)~(11)步骤对存在疑似目标的W个区域进行二次目标检测和/或识别,最后,重新从第(2)步开始,从第一预设监测区域开始目标识别,自动循环。(12) By analogy, follow steps (7) to (11) to perform secondary target detection and/or recognition for W areas with suspected targets, and finally, start again from step (2) and start from the first preset monitoring The area starts to recognize the target, and it loops automatically.
(13)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(13) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
实施例5:Example 5:
在上述实施例的基础上,还提供一种智能视觉感知系统的第五种实施方式:On the basis of the foregoing embodiment, a fifth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置的一种结构由1个可见光摄像机、1传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1光照单元、1防护外壳组成。外壳留有2窗口及电源和信号线接口,2窗口采用可透光的材料密封,其中一个窗口,用于摄像机进行视频图像采集,另外一个窗口用于光照单元进行补光。传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。 驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. A structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, a lighting unit, and a protective housing. There are 2 windows and power and signal wire interfaces in the shell. The 2 windows are sealed with light-permeable materials. One of the windows is used for video image collection by the camera, and the other window is used for the lighting unit to fill light. The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元视场方向和大小,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、视场方向范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the direction and size of the field of view of the camera and/or the light unit, and/or exchange information with the cloud platform or data center . The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图6所示,智能视觉感知装置包含可见光摄像机201、传动机构202、数据处理单元203、通信接口单元204、电源管理单元205、光照单元206、防护外壳组成207。As shown in FIG. 6, the intelligent visual perception device includes a visible light camera 201, a transmission mechanism 202, a data processing unit 203, a communication interface unit 204, a power management unit 205, an illumination unit 206, and a protective housing 207.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口208和窗口209,以及电源和信号线接口板210,窗口208和窗口209采用可透光的材料密封,其中窗口208用于摄像机进行视频图像采集,窗口209用于补光。传动机构202通过接收数据处理单元203发送的控制命令,调整摄像机201上下视场方向,同时通过延伸至防护外壳外并安装在固定支架上的转动轴211,调整摄像机201左右视场方向。通信接口单元204主要包括有线和无线通信接口。电源管理单元205主要用于给整个智能视频感知装置供电,在本实施例中为系统内部的电池。光照单元206为可见光光源,主要包括发光设备和光照强度、光照范围调整单元。光照单元206和摄像机201固定在一起,传动机构202共同控制光照单元和摄像机的左右上下摆动位置,传动机构包括驱动电机和转台。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell and is fixed to the external bracket superior. In this embodiment, the shell is left with windows 208 and 209, as well as the power and signal line interface board 210. The windows 208 and 209 are sealed with light-permeable materials. The window 208 is used for video image capture by the camera, and the window 209 Used to fill light. The transmission mechanism 202 adjusts the vertical field of view direction of the camera 201 by receiving the control command sent by the data processing unit 203, and adjusts the horizontal field of view direction of the camera 201 through the rotating shaft 211 extending outside the protective housing and mounted on the fixed bracket. The communication interface unit 204 mainly includes wired and wireless communication interfaces. The power management unit 205 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. The illumination unit 206 is a visible light source, and mainly includes a light-emitting device and a unit for adjusting light intensity and light range. The illumination unit 206 and the camera 201 are fixed together, and the transmission mechanism 202 jointly controls the left and right swing positions of the illumination unit and the camera. The transmission mechanism includes a driving motor and a turntable.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数及位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and location of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元203。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 203 in real time.
(3)数据处理单元203经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 203 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准第二预设监测区域,并调整摄像机201焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元203经过视频图像目标检测算法,对采集的视频图像进行一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201 field of view , The camera 201 field of view direction is aligned with the second preset monitoring area, and the focal length of the camera 201 is adjusted to collect video images in this area with a large field of view and small resolution. The data processing unit 203 performs a video image target detection algorithm on the collected The video image undergoes a target detection.
(5)当一次目标检测发现有疑似目标时,调整PTZ参数:数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准疑似目标,调整摄像机201焦距,将疑似目标高度缩放到视频图像高度的1/10,以该小视野大分辨率对疑似目标进行视频图像采集,数据处理单元203经过二次目标检测算法,对采集的视频图像进行二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(5) When a suspected target is found in a target detection, adjust the PTZ parameters: the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the field of view of the camera 201 so that the direction of the field of view of the camera 201 is aligned For the suspected target, adjust the focal length of the camera 201, zoom the height of the suspected target to 1/10 of the height of the video image, and collect the video image of the suspected target with this small field of view and large resolution. The data processing unit 203 performs a secondary target detection algorithm to collect The video image is subjected to secondary target detection. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(6)当二次目标检测判断出是真目标时,数据处理单元203输出告警信息,并生成该真目标特征信息,同时,数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,实时调整摄像机201焦距,对检测出的真目标进行实时跟踪,同时,数据处理单元203将该真目标特征信息和告警信息发送至监控终端。(6) When the secondary target detection determines that it is a true target, the data processing unit 203 outputs alarm information and generates the true target characteristic information. At the same time, the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202. The transmission mechanism 202 adjusts the direction of the field of view of the camera 201, adjusts the focal length of the camera 201 in real time, and tracks the detected real target in real time. At the same time, the data processing unit 203 sends the real target feature information and alarm information to the monitoring terminal.
目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种,或者表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(7)摄像机201对真目标进行跟踪,当真目标消失或跟踪时间达到设定值时,数据处理单元203发送控制命令至摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准下一个真目标进行跟踪。(7) The camera 201 tracks the real target. When the real target disappears or the tracking time reaches the set value, the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202. The transmission mechanism 202 adjusts the direction of the camera 201's field of view so that the camera 201 The direction of the field of view is aligned with the next real target for tracking.
(8)当二次目标检测判断出是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元203对假目标进行特征描述,提取假目标特征描述信息,更新假目标反馈特征信息库,以降低算法在此之后的一次目标检测误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。(8) When the secondary target detection determines that it is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 203 characterizes the false target and extracts the false target feature description Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000003
Figure PCTCN2021087921-appb-000003
(9)摄像机201对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时, 针对第二预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元203,数据处理单元203经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(9) The camera 201 tracks the real target. When all real targets disappear or the tracking time reaches the set value, the PTZ parameter is set for the second preset monitoring area, and the video image is collected with the large field of view and small resolution. The collected video images are transmitted to the data processing unit 203 in real time, and the data processing unit 203 performs a target detection on the collected video images through a video image target detection algorithm.
(10)依次类推,按(2)~(9)步骤对N个预设区域进行目标检测和/或识别,最后,重新从第一预设监测区域开始目标检测和/或识别,自动循环。(10) By analogy, follow the steps (2) to (9) to perform target detection and/or recognition on N preset areas, and finally, restart target detection and/or recognition from the first preset monitoring area, and automatically loop.
(11)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(11) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
在智能视觉感知装置工作中,数据处理单元203对采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元206,调整光照强度和光场角度,使得摄像机201采集的视频图像亮度适中。In the work of the intelligent visual perception device, the data processing unit 203 performs real-time analysis on the brightness of the captured video image. When the brightness is insufficient, it sends a control command to the lighting unit 206 in time to adjust the light intensity and light field angle to make the video image captured by the camera 201 The brightness is moderate.
实施例6:Example 6:
在上述实施例的基础上,还提供一种智能视觉感知系统的第六种实施方式:On the basis of the foregoing embodiment, a sixth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置的一种结构由1个可见光摄像机、1传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1光照单元、1防护外壳组成。外壳留有2窗口及电源和信号线接口,2窗口采用可透光的材料密封,其中一个窗口,用于摄像机进行视频图像采集,另外一个窗口用于光照单元进行补光。传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. A structure of the intelligent visual perception device is composed of a visible light camera, a transmission mechanism, a data processing unit, a communication interface unit, a power management unit, a lighting unit, and a protective housing. There are 2 windows and power and signal wire interfaces in the shell. The 2 windows are sealed with light-permeable materials. One of the windows is used for video image collection by the camera, and the other window is used for the lighting unit to fill light. The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including a drive motor, a horizontal shaft, a vertical shaft, and a control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、视场方向范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图6所示,智能视觉感知装置包含可见光摄像机201、传动机构202、数据处理单元203、通信接口单元204、电源管理单元205、光照单元206及防护外壳组成207。As shown in FIG. 6, the intelligent visual perception device includes a visible light camera 201, a transmission mechanism 202, a data processing unit 203, a communication interface unit 204, a power management unit 205, an illumination unit 206, and a protective housing 207.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,包括固定在转台上,和/或固定在外部支架上。在本实施例中,外壳留有窗口208和窗口209,以及电源和信号线接口板210,窗口208和窗口209采用可 透光的材料密封,其中窗口208用于摄像机进行视频图像采集,窗口209用于光照单元进行补光。传动机构202通过接收数据处理单元203发送的控制命令,调整摄像机201上下视场方向,同时通过延伸至防护外壳外并安装在固定支架上的转动轴211,调整摄像机201左右视场方向。通信接口单元204主要包括有线和无线通信接口。电源管理单元205主要用于给整个智能视频感知装置供电,在本实施例中为外部的太阳电池板。光照单元206为可见光,主要包括发光设备和光照强度、光照范围调整单元。光照单元206和摄像机201固定在一起,传动机构202共同控制光照单元和摄像机的左右上下摆动位置,传动机构包括驱动电机和转台。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell, including fixing on the turntable On, and/or fixed to an external bracket. In this embodiment, the shell is left with windows 208 and 209, as well as the power and signal line interface board 210. The windows 208 and 209 are sealed with light-permeable materials. The window 208 is used for video image capture by the camera, and the window 209 It is used to fill light in the lighting unit. The transmission mechanism 202 adjusts the vertical field of view direction of the camera 201 by receiving the control command sent by the data processing unit 203, and adjusts the horizontal field of view direction of the camera 201 through the rotating shaft 211 extending outside the protective housing and mounted on the fixed bracket. The communication interface unit 204 mainly includes wired and wireless communication interfaces. The power management unit 205 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is an external solar panel. The light unit 206 is visible light, and mainly includes a light-emitting device and a unit for adjusting light intensity and light range. The illumination unit 206 and the camera 201 are fixed together, and the transmission mechanism 202 jointly controls the left and right swing positions of the illumination unit and the camera. The transmission mechanism includes a driving motor and a turntable.
其中,摄像机用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including focus motor, zoom motor, drive module, image signal acquisition and processing unit, etc.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元203。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 203 in real time.
(3)数据处理单元203经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 203 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准第二预设监测区域,调整摄像机201焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元203经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201 field of view , The camera 201 field of view direction is aligned with the second preset monitoring area, the focal length of the camera 201 is adjusted, and the video image is collected in this area with a large field of view and small resolution. The data processing unit 203 performs a video image target detection algorithm on the collected video The image performs a target detection of the target.
(5)当一次目标检测发现有疑似目标时,数据处理单元203给出第一预设监测区域特征权重Q1,Q1由该区域内疑似目标数量获得;设定PTZ参数,以大视野小分辨率覆盖第二预设监测区域,对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元203,经过视频图像目标检测算法进行目标的一次目标检测。(5) When a suspected target is found in a target detection, the data processing unit 203 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; set the PTZ parameter to a large field of view and a small resolution Cover the second preset monitoring area, perform video image collection on this area, transmit the collected video image to the data processing unit 203 in real time, and perform a target detection of the target through the video image target detection algorithm.
(6)依次类推,重复上述步骤(1)~(5),对N个监测区域目标的一次目标检测,区分出存在疑似目标的W个区域,并依据区域特征权重Qi的大小对W个区域重新排序,权重最大的为第1区域,权重最小为第W区域。(6) By analogy, repeat the above steps (1) ~ (5), perform a target detection of N monitoring area targets, distinguish W areas where there are suspected targets, and compare W areas according to the size of the area feature weight Qi Re-order, the largest weight is the first area, and the smallest weight is the W-th area.
本实施例中,各区域特征权重Qi(0<i≤N)分别由各区域中疑似目标的数量获得。In this embodiment, each area feature weight Qi (0<i≤N) is obtained from the number of suspected targets in each area.
(7)智能视觉感知装置调整摄像机201视场方向和焦距,通过调整PTZ参数,将疑似目标高度缩放到视频图像高度的4/5,以该小视野大分辨率对存在疑似目标的第1 区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元203,数据处理单元203经过二次目标检测算法,对采集的视频图像依次进行区域内疑似目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(7) The intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 201, and by adjusting the PTZ parameters, the height of the suspected target is zoomed to 4/5 of the height of the video image, and the first area where the suspected target exists with the small field of view and large resolution Video image collection is performed, and the collected video images are transmitted to the data processing unit 203 in real time, and the data processing unit 203 performs secondary target detection of suspected targets in the area sequentially on the collected video images through a secondary target detection algorithm. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(8)当二次目标检测判断出疑似目标是真目标时,数据处理单元生成该真目标的目标特征信息1和告警信息1。数据处理单元203发送控制命令至摄像机201和传动机构202,传动机构202调整摄像机201视场方向,实时调整摄像机201焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元203将二次检测的真目标的目标特征信息1发送至云平台或数据中心;云端平台或数据中心依据智能视觉感知装置发送的目标特征信息1经过目标识别算法,进行目标识别,并将得到目标的特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理和显示。(8) When the secondary target detection determines that the suspected target is a true target, the data processing unit generates target feature information 1 and warning information 1 of the true target. The data processing unit 203 sends control commands to the camera 201 and the transmission mechanism 202. The transmission mechanism 202 adjusts the field of view direction of the camera 201, adjusts the focal length of the camera 201 in real time, and tracks the real target detected in real time; The target characteristic information 1 of the detected real target is sent to the cloud platform or data center; the cloud platform or data center is based on the target characteristic information 1 sent by the intelligent visual perception device through the target recognition algorithm, and the target is identified, and the target characteristic information is obtained 2 And alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(9)摄像机201对真目标进行跟踪时,当该目标消失或跟踪时间达到设定值时,数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准下一个真目标,调整摄像机201焦距,以小视野大分辨率对下一个真目标进行视频图像采集,将采集的视频图像实时传输至数据处理单元203,数据处理单元203经过视频图像目标检测算法,对采集的视频图像继续进行二次目标检测。(9) When the camera 201 is tracking a real target, when the target disappears or the tracking time reaches the set value, the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the field of view of the camera 201, Make the field of view of the camera 201 aim at the next real target, adjust the focal length of the camera 201, collect video images of the next real target with a small field of view and large resolution, and transmit the collected video images to the data processing unit 203 in real time. 203 After the video image target detection algorithm, continue to perform secondary target detection on the collected video image.
(10)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元203对假目标进行特征描述,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步提高了一次检测算法的精确度。(10) When the secondary target detection determines that the suspected target is a false target, the false target determined in the large-resolution image is mapped to the small-resolution image, and the data processing unit 203 features description of the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000004
Figure PCTCN2021087921-appb-000004
(11)摄像机201对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,数据处理单元203发送控制命令给摄像机201和传动机构202,传动机构202调整摄像机201视场方向,使得摄像机201视场方向对准第2区域,调整摄像机201焦距,以小 视野大分辨率对第2区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元203,进行目标的二次目标检测,检测方法与第1区域目标二次检测方法相同。(11) The camera 201 tracks the real target. When all real targets disappear or the tracking time reaches the set value, the data processing unit 203 sends a control command to the camera 201 and the transmission mechanism 202, and the transmission mechanism 202 adjusts the direction of the camera 201's field of view. Align the field of view of the camera 201 with the second area, adjust the focal length of the camera 201, collect video images of the second area with a small field of view and large resolution, and transmit the collected video images to the data processing unit 203 in real time to perform the secondary target Target detection, the detection method is the same as the second detection method of the first area target.
(12)依次类推,按(7)~(11)步骤对存在疑似目标的W个区域进行二次目标检测和/或识别,最后,重新从第(2)步开始,从第一预设监测区域开始目标识别,自动循环。(12) By analogy, follow steps (7) to (11) to perform secondary target detection and/or recognition for W areas with suspected targets, and finally, start again from step (2) and start from the first preset monitoring The area starts to recognize the target, and it loops automatically.
(13)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(13) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
在整个智能视觉感知装置工作中,数据处理单元203对采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元206,调整光照强度,使得摄像机201采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 203 analyzes the brightness of the captured video image in real time. When the brightness is insufficient, it sends a control command to the lighting unit 206 in time to adjust the light intensity so that the video image captured by the camera 201 has a moderate brightness. .
实施例7:Example 7:
在上述实施例的基础上,还提供一种智能视觉感知系统的第七种实施方式:On the basis of the foregoing embodiment, a seventh implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由2个可见光摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、2光照单元及1防护外壳组成。外壳留有4窗口及电源和信号线接口,4窗口采用可透光的材料密封,其中2个窗口,用于摄像机进行视频图像采集,另外2个窗口用于光照单元进行补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 2 visible light cameras, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing. The shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. Two of the windows are used for the camera to collect video images, and the other two windows are used for the illumination unit to fill light. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、光照范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图7所示,智能视觉感知装置包含可见光摄像机301-1、可见光摄像机301-2、传动机构302-1、传动机构302-2、数据处理单元303、通信接口单元304、电源管理单元305、光照单元306-1、光照单元306-2、防护外壳组成307。As shown in Figure 7, the intelligent visual perception device includes a visible light camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, a power management unit 305, The light unit 306-1, the light unit 306-2, and the protective shell constitute 307.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口308-1、窗口308-2、 窗口309-1、窗口309-2,以及电源和信号线接口板310,窗口308-1、窗口308-2、窗口309-1、窗口309-2都采用可透光的材料密封,其中窗口308-1和窗口308-2用于摄像机301-1和摄像机301-2的视频图像采集,窗口309-1和窗口309-2用于光照单元306-1和光照单元306-2透光,给摄像机301-1和摄像机301-2监测目标补光。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell and is fixed to the external bracket superior. In this embodiment, the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1. The windows 309-2 are all sealed with light-permeable materials. The windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
传动机构302-1通过接收数据处理单元303发送的控制命令,调整摄像机301-1左右和上下视场方向位置,传动机构302-2通过接收数据处理单元303发送的控制命令,调整摄像机301-2左右和上下视场方向位置。通信接口单元304主要包括有线和无线通信接口。电源管理单元305主要用于给整个智能视频感知装置供电,在本实施例中为有线电源。光照单元306-1和306-2主要包括发光设备和光照强度调整单元,光源分别为可见光和/或红外光。光照单元306-1和摄像机301-1固定在一起,光照单元306-2和摄像机301-2固定在一起,传动机构302-1共同控制光照单元306-1和摄像机301-1的左右上下摆动位置,传动机构302-2共同控制光照单元306-2和摄像机301-2的左右上下摆动位置。其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。The transmission mechanism 302-1 adjusts the position of the camera 301-1 left and right and the vertical field of view by receiving the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the camera 301-2 by receiving the control commands sent by the data processing unit 303 The position of the field of view from left to right and up and down. The communication interface unit 304 mainly includes wired and wireless communication interfaces. The power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, which is a wired power supply in this embodiment. The illumination units 306-1 and 306-2 mainly include a light emitting device and a light intensity adjustment unit, and the light sources are visible light and/or infrared light, respectively. The light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2. Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 303 in real time.
(3)数据处理单元303经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 303 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元303发送控制命令给摄像机301-1和传动机构302-1,传动机构302-1调整摄像机301-1视场方向,使得摄像机301-1视场方向对准第二预设监测区域,调整摄像机301-1焦距,以大视野小分辨率对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 303 sends a control command to the camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302-1 Adjust the field of view direction of the camera 301-1 so that the field of view direction of the camera 301-1 is aligned with the second preset monitoring area. The video image is transmitted to the data processing unit 303 in real time to perform a target detection of the target.
(5)当一次目标检测发现有疑似目标时,调整PTZ参数:数据处理单元303发送控制命令给摄像机301-2和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,使得摄像机301-2视场方向对准疑似目标,调整摄像机301-2焦距。通过调整PTZ参数,将疑似目标高度缩放到视频图像高度的1/6,以该小视野大分辨率对疑似目标进行视频图像采集,数据处理单元303经过二次目标检测算法,对采集的视频图像进行目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(5) When a suspected target is found in a target detection, adjust the PTZ parameters: the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302-2 adjusts the field of view direction of the camera 301-2. The direction of the field of view of the camera 301-2 is aligned with the suspected target, and the focal length of the camera 301-2 is adjusted. By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/6 of the height of the video image, and the video image of the suspected target is collected with this small field of view and large resolution. The data processing unit 303 performs a secondary target detection algorithm on the collected video image. Perform the secondary target detection of the target. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(6)当二次目标检测判断出疑似目标是真目标时,数据处理单元303输出告警信息1,并生成该真目标特征信息1,同时,数据处理单元303发送控制命令给摄像机301-2和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,实时调整摄像机301-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元303将该真目标特征信息1发送至云平台或数据中心,云端平台或数据中心依据智能视觉感知装置发送的该真目标特征信息1经过目标识别算法,进行目标识别,生成该真目标特征信息2和告警信息2,并将真目标特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理和显示。(6) When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information 1 and generates the true target characteristic information 1. At the same time, the data processing unit 303 sends control commands to the cameras 301-2 and 301-2. The transmission mechanism 302-2 and the transmission mechanism 302-2 adjust the field of view direction of the camera 301-2, adjust the focal length of the camera 301-2 in real time, and track the real target detected in real time; at the same time, the data processing unit 303 obtains the characteristic information of the real target 1 Send to the cloud platform or data center, and the cloud platform or data center sends the true target characteristic information according to the intelligent visual perception device. 1 After the target recognition algorithm, the target recognition is performed, and the true target characteristic information 2 and alarm information 2 are generated. The true target characteristic information 2 and the alarm information 2 are sent to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(7)摄像机301-2对真目标进行跟踪时,当该真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令至摄像机301-2和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,使得摄像机301-2视场方向对准下一个真目标,调整摄像机301-2焦距,以小视野大分辨率对下一个真目标进行跟踪。(7) When the camera 301-2 is tracking a real target, when the real target disappears or the tracking time reaches the set value, the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302 -2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aimed at the next real target, and adjust the focal length of the camera 301-2 to track the next real target with a small field of view and large resolution.
(8)当二次目标检测判断出是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元303对假目标进行特征描述,提取假目标特征描述信息,更新假目标反馈特征信息库,以降低算法在此之后的一次目标检测误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。(8) When the secondary target detection determines that it is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 303 performs a feature description on the false target and extracts the feature description of the false target Information, update the false target feedback feature information database to reduce the probability of a target detection error after the algorithm. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000005
Figure PCTCN2021087921-appb-000005
(9)摄像机301-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,针对第二预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,进行一次目标检测,检测方法与第一预设监测区域目标检测方法相同。(9) The camera 301-2 tracks the real target. When all real targets disappear or the tracking time reaches the set value, set the PTZ parameters for the second preset monitoring area, and perform video images with the large field of view and small resolution Acquisition, real-time transmission of the collected video images to the data processing unit 303 to perform a target detection, the detection method is the same as the first preset monitoring area target detection method.
(10)依次类推,按(2)~(9)步骤对N个区域进行目标检测和/或识别,最后,重新从第一预设监测区域开始目标检测和/或识别,自动循环。(10) By analogy, follow the steps (2) to (9) to perform target detection and/or recognition on N areas, and finally, start target detection and/or recognition from the first preset monitoring area again, and automatically loop.
在整个智能视觉感知装置工作中,数据处理单元303对摄像机301-1和摄像机301-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元306-1 和光照单元306-2,调整光照单元光照强度和光场角度,使得摄像机301-1和摄像机301-2采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and the lighting unit in time. 306-2. Adjust the light intensity and light field angle of the light unit so that the brightness of the video images collected by the camera 301-1 and the camera 301-2 is moderate.
实施例8:Example 8:
在上述实施例的基础上,还提供一种智能视觉感知系统的第八种实施方式:On the basis of the foregoing embodiment, an eighth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由2个可见光摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、2光照单元及1防护外壳组成。外壳留有4窗口及电源和信号线接口,4窗口采用可透光的材料密封,其中2个窗口,用于摄像机进行视频图像采集,另外2个窗口用于补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 2 visible light cameras, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing. The shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image acquisition by the camera, and the other 2 windows are used for light supplementation. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、视场方向范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图7所示,智能视觉感知装置包含可见光摄像机301-1、可见光摄像机301-2、传动机构302-1、传动机构302-2、数据处理单元303、通信接口单元304、电源管理单元305、光照单元306-1、光照单元306-2及防护外壳组成307。As shown in Figure 7, the intelligent visual perception device includes a visible light camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, a power management unit 305, The light unit 306-1, the light unit 306-2 and the protective housing constitute 307.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,或固定在外部支架上。在本实施例中,防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,分别透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,包括固定在外部支架上。在本实施例中,外壳留有窗口308-1、窗口308-2、窗口309-1、窗口309-2,以及电源和信号线接口板310,窗口308-1、窗口308-2、窗口309-1、窗口309-2都采用可透光的材料密封,其中窗口308-1和窗口308-2用于摄像机301-1和摄像机301-2的视频图像采集,窗口309-1和窗口309-2用于光照单元306-1和光照单元306-2透光,给摄像机301-1和摄像机301-2监测目标补光。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window is made of light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell, or fix it on the outside On the stand. In this embodiment, the protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the windows are made of light-transmitting materials, which respectively transmit the video images collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell, including the On the external bracket. In this embodiment, the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1. The windows 309-2 are all sealed with light-permeable materials. The windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target.
传动机构302-1通过接收数据处理单元303发送的控制命令调整摄像机301-1左右和上下视场方向位置,传动机构302-2通过接收数据处理单元303发送的控制命令调整 摄像机301-2左右和上下视场方向位置。通信接口单元304主要包括有线和无线通信接口。电源管理单元305主要用于给整个智能视频感知装置供电,在本实施例中为系统内部的电池。光照单元306-1和306-2主要包括发光设备和光照强度调整单元,光源为可见光和/或红外光。光照单元306-1和摄像机301-1固定在一起,光照单元306-2和摄像机301-2固定在一起,传动机构302-1共同控制光照单元306-1和摄像机301-1的左右上下摆动位置,传动机构302-2共同控制光照单元306-2和摄像机301-2的左右上下摆动位置。The transmission mechanism 302-1 adjusts the left and right positions of the camera 301-1 and the vertical field of view by receiving the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the left and right positions of the camera 301-2 by receiving the control commands sent by the data processing unit 303. The position of the up and down field of view. The communication interface unit 304 mainly includes wired and wireless communication interfaces. The power management unit 305 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. The lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, and the light source is visible light and/or infrared light. The light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(2) For the first preset monitoring area, set the PTZ parameters, perform video image collection with the large field of view and small resolution, and transmit the collected video images to the data processing unit 303 in real time.
(3)数据处理单元303经过一次目标检测算法,对所采集的视频图像进行目标的一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元303发送控制命令给摄像机301-1和传动机构302,传动机构302-1调整摄像机301-1视场方向,使得摄像机301-1视场方向对准第二预设监测区域,并调整摄像机301-1焦距,以大视野小分辨率对第二预设监测区域进行视频图像采集,数据处理单元303进行第二预设监测区域目标的一次目标检测,检测方法与第一预设监测区域目标一次检测方法相同。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 303 sends a control command to the camera 301-1 and the transmission mechanism 302, and the transmission mechanism 302-1 adjusts the camera The direction of the field of view 301-1 makes the direction of the field of view of the camera 301-1 align with the second preset monitoring area, and adjust the focal length of the camera 301-1 to collect video images of the second preset monitoring area with a large field of view and a small resolution, The data processing unit 303 performs one-time target detection of the target in the second preset monitoring area, and the detection method is the same as the first-time detection method of the target in the first preset monitoring area.
(5)当一次目标检测发现有疑似目标时,数据处理单元303给出第一预设监测区域特征权重Q1,Q1由该区域内疑似目标数量获得;同时,设定PTZ参数,以大视野小分辨率覆盖第二预设监测区域,使摄像机301-1对该监测区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(5) When a suspected target is found in a target detection, the data processing unit 303 gives the first preset monitoring area feature weight Q1, which is obtained from the number of suspected targets in the area; at the same time, the PTZ parameters are set so that the field of view is large and small. The resolution covers the second preset monitoring area, so that the camera 301-1 collects video images of the monitoring area, and transmits the collected video images to the data processing unit 303 in real time.
(6)依次类推,重复上述步骤(1)~(5),对N个监测区域目标的一次目标检测,区分出存在疑似目标的W个区域,并依据区域特征权重Qi的大小对W个区域重新排序,权重最大的为第1区域,权重最小为第W区域。(6) By analogy, repeat the above steps (1) ~ (5), perform a target detection of N monitoring area targets, distinguish W areas where there are suspected targets, and compare W areas according to the size of the area feature weight Qi Re-order, the largest weight is the first area, and the smallest weight is the W-th area.
本实施例中,各区域特征权重Qi(0<i≤N)分别由各区域中疑似目标的数量获得。In this embodiment, each area feature weight Qi (0<i≤N) is obtained from the number of suspected targets in each area.
(7)智能视觉感知装置调整摄像机301-2视场方向和焦距,将疑似目标高度缩放到视频图像高度的2/3,并调整到视野中心,以该小视野大分辨率对存在疑似目标的第1 区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303经过二次目标检测算法,对采集的视频图像依次进行疑似目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(7) The intelligent visual perception device adjusts the direction and focal length of the field of view of the camera 301-2, zooms the height of the suspected target to 2/3 of the height of the video image, and adjusts it to the center of the field of view, and uses the small field of view and large resolution for the suspected target The first area collects video images, and transmits the collected video images to the data processing unit 303 in real time. The data processing unit 303 performs secondary target detection of suspected targets on the collected video images through a secondary target detection algorithm. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(8)当二次目标检测判断出疑似目标是真目标时,数据处理单元303生成该真目标的目标特征信息和告警信息,同时,数据处理单元303发送控制命令给摄像机301和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,实时调整摄像机301-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元303将目标特征信息和告警信息发送至监控终端。(8) When the secondary target detection determines that the suspected target is a real target, the data processing unit 303 generates target feature information and alarm information of the real target, and at the same time, the data processing unit 303 sends control commands to the camera 301 and the transmission mechanism 302- 2. The transmission mechanism 302-2 adjusts the field of view direction of the camera 301-2, adjusts the focal length of the camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 sends the target feature information and alarm information to the monitoring terminal.
目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种,或者表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, etc., or characteristic information that characterizes the specific identity of the target, including one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(9)摄像机301-2对真目标进行跟踪,当该真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令给摄像机301-2和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,使得摄像机301-2视场方向对准下一个真目标进行跟踪。(9) The camera 301-2 tracks the real target. When the real target disappears or the tracking time reaches the set value, the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302- 2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aimed at the next real target for tracking.
(10)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元303对假目标进行特征描述,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步提高了一次检测算法的精确度。(10) When the secondary target detection determines that the suspected target is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000006
Figure PCTCN2021087921-appb-000006
(11)摄像机301-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令给摄像机301-2和传动机构302-2,传动机构302-2调整摄像机301-2视场方向,使得摄像机301-2视场方向对准第2区域,调整摄像机301-2焦距,以小视野大分辨率对第2区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303经过视频图像目标检测算法,对采集的视频图像进行第2区域目标的二次目标检测,检测方法与第1区域目标二次检测方法相同。(11) The camera 301-2 tracks the true target. When all true targets disappear or the tracking time reaches the set value, the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302- 2 Adjust the field of view direction of the camera 301-2 so that the field of view direction of the camera 301-2 is aligned with the second area, adjust the focal length of the camera 301-2, and collect the video image of the second area with a small field of view and large resolution, and the captured video The image is transmitted to the data processing unit 303 in real time, and the data processing unit 303 performs secondary target detection of the second area target on the collected video image through the video image target detection algorithm. The detection method is the same as the first area target second detection method.
(12)依次类推,按(7)~(11)步骤对W个区域进行二次目标检测和/或识别,最后,重新从第(2)步开始,从第一预设监测区域开始检测和/或目标识别,自动循环。(12) By analogy, follow steps (7) to (11) to perform secondary target detection and/or recognition for W areas, and finally, start from step (2) again, starting from the first preset monitoring area to detect and / Or target recognition, automatic loop.
在整个智能视觉感知装置工作中,数据处理单元303对摄像机301-1和摄像机301-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元306-1 和光照单元306-2,调整光照单元光照强度,使得摄像机301-1和摄像机301-2采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and the lighting unit in time. 306-2. Adjust the light intensity of the light unit so that the brightness of the video images collected by the camera 301-1 and the camera 301-2 is moderate.
实施例9:Example 9:
在上述实施例的基础上,还提供一种智能视觉感知系统的第九种实施方式:On the basis of the foregoing embodiment, a ninth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由1个可见光摄像机、1个近红外摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、2光照单元及1防护外壳组成。外壳留有4窗口及电源和信号线接口,4窗口采用可透光的材料密封,其中2个窗口,用于2摄像机进行视频图像采集,另外2个窗口用于光照单元进行补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 1 visible light camera, 1 near-infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing. The shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. 2 windows are used for video image acquisition by 2 cameras, and the other 2 windows are used for light supplementation by the lighting unit. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,输入接口用于接收外部设备信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA中的一种或多种;有线方式包括RS485、RS422、RS232、CAN总线中的一种或多种;输出接口用于发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces, the input interface is used to receive external device signals, and its connection methods include wireless and/or wired methods; among them, wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA ; Wired methods include one or more of RS485, RS422, RS232, CAN bus; the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more of, 3G, 4G, 5G, NB-IOT; wired mode includes one or more of LAN and optical fiber.
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互,和/或与现场其它传感器,和/或其它关联系统的联动信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、光照范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center, and/or Interaction information with other sensors on site and/or other related systems. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图7所示,智能视觉感知装置包含近红外摄像机301-1、可见光摄像机301-2、传动机构302-1、传动机构302-2、数据处理单元303、通信接口单元304、电源管理单元305、光照单元306-1、光照单元306-2、防护外壳组成307。As shown in Figure 7, the intelligent visual perception device includes a near-infrared camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, and a power management unit 305 , The illumination unit 306-1, the illumination unit 306-2, and the protective shell constitute 307.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,包括固定在外部支架上。在本实施例中,外壳留有窗口308-1、窗口308-2、窗口309-1、窗口309-2,以及电源和信号线接口板310,窗口308-1、窗口308-2、窗口309-1、窗口309-2都采用可透光的材料密封,其中窗口308-1和窗口308-2用于摄像机301-1和摄像机301-2的视频图像采集,窗口309-1和窗口309-2用于光照单元306-1和光照单元306-2透光,给摄像机301-1和摄像机301-2监测目标补光。传动机构302-1通过数据处理单元303发送的控制命令调整摄像机301-1左右和上下视场 方向位置,传动机构302-2通过数据处理单元303发送的控制命令调整摄像机301-2左右和上下视场方向位置。通信接口单元304主要包括有线和无线通信接口。电源管理单元305主要用于给整个智能视频感知装置供电,在本实施例中为外部的太阳能电池板。光照单元306-1和306-2主要包括发光设备和光照强度调整单元,均为红外光。光照单元306-1和摄像机301-1固定在一起,光照单元306-2和摄像机301-2固定在一起,传动机构302-1共同控制光照单元306-1和摄像机301-1的左右上下摆动位置,传动机构302-2共同控制光照单元306-2和摄像机301-2的左右上下摆动位置。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material to transmit the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell, including fixing on the outside On the stand. In this embodiment, the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1. The windows 309-2 are all sealed with light-permeable materials. The windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target. The transmission mechanism 302-1 adjusts the left-right and vertical field of view position of the camera 301-1 through the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the left-right and vertical vision of the camera 301-2 through the control commands sent by the data processing unit 303. Field direction position. The communication interface unit 304 mainly includes wired and wireless communication interfaces. The power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, and in this embodiment is an external solar panel. The lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, both of which are infrared light. The light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,近红外摄像机301-1以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(2) For the first preset monitoring area, PTZ parameters are set, the near-infrared camera 301-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 303 in real time.
(3)数据处理单元303经过一次目标检测算法,对所采集的视频图像进行目标的一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元303发送控制命令给红外摄像机301-1和传动机构302-1,传动机构302-1调整红外摄像机301-1视场方向,使得红外摄像机301-1视场方向对准第二预设监测区域,调整红外摄像机301-1焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元303经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 303 sends control commands to the infrared camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302- 1 Adjust the direction of the infrared camera 301-1's field of view so that the direction of the infrared camera 301-1's field of view is aligned with the second preset monitoring area, adjust the focal length of the infrared camera 301-1, and collect video images in this area with a large field of view and a small resolution The data processing unit 303 performs a target detection of the target on the collected video image through the video image target detection algorithm.
(5)当一次目标检测发现有疑似目标时,调整PTZ参数:数据处理单元303发送控制命令给可见光摄像机301-2和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,使得可见光摄像机301-2视场方向对准疑似目标,调整可见光摄像机301-2焦距。通过调整PTZ参数,将疑似目标高度缩放到视频图像高度的1/3,以该小视野大分辨率对疑似目标进行视频图像采集,数据处理单元303经过二次目标检测算法,对采集的视频图像进行二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(5) When a suspected target is found in a target detection, adjust the PTZ parameters: the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302-2 adjusts the field of view of the visible light camera 301-2 Direction so that the visual field of the visible light camera 301-2 is aimed at the suspected target, and the focal length of the visible light camera 301-2 is adjusted. By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/3 of the height of the video image, and the video image of the suspected target is collected with the small field of view and large resolution. The data processing unit 303 performs the secondary target detection algorithm to collect the video image Perform secondary target detection. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(6)当二次目标检测判断出疑似目标是真目标时,数据处理单元303输出告警信息至监控终端,并生成该真目标特征信息,同时,数据处理单元303发送控制命令给可见光摄像机301-2和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,实时调整可见光摄像机301-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理 单元303将真目标特征信息和告警信息发送至监控终端。(6) When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information to the monitoring terminal and generates the true target characteristic information. At the same time, the data processing unit 303 sends a control command to the visible light camera 301- 2 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the visual field direction of the visible light camera 301-2, adjusts the focal length of the visible light camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 will Target feature information and alarm information are sent to the monitoring terminal.
目标特征信息包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种,或表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, and One or more of human remains, or characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of a car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(7)可见光摄像机301-2对真目标进行跟踪,当该真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令至可见光摄像机301-2和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,使得可见光摄像机301-2视场方向对准下一个真目标进行跟踪。(7) The visible light camera 301-2 tracks the real target. When the real target disappears or the tracking time reaches the set value, the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the direction of the field of view of the visible light camera 301-2, so that the direction of the field of view of the visible light camera 301-2 is aligned with the next real target for tracking.
(8)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元303对假目标进行特征描述,提取假目标特征描述信息,更新假目标反馈特征信息库,以降低算法在此之后的一次目标检测误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。(8) When the secondary target detection determines that the suspected target is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, update the false target feedback feature information database, in order to reduce the probability of a target detection error detection algorithm after this. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000007
Figure PCTCN2021087921-appb-000007
(9)可见光摄像机301-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,针对第二预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303:数据处理单元303发送控制命令给红外摄像机301-1和传动机构302-1,传动机构302-1调整红外摄像机301-1视场方向,使得红外摄像机301-1视场方向对准第二预设监测区域,调整红外摄像机301-1焦距,对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测,检测方法与第一预设监测区域目标检测方法相同。(9) The visible light camera 301-2 tracks the real target. When all real targets disappear or the tracking time reaches the set value, set the PTZ parameters for the second preset monitoring area, and perform video with the large field of view and small resolution Image acquisition, real-time transmission of the collected video images to the data processing unit 303: The data processing unit 303 sends control commands to the infrared camera 301-1 and the transmission mechanism 302-1, and the transmission mechanism 302-1 adjusts the direction of the infrared camera 301-1 field of view , The direction of the infrared camera 301-1 field of view is aligned with the second preset monitoring area, the focal length of the infrared camera 301-1 is adjusted, the video image is collected in this area, and the collected video image is transmitted to the data processing unit 303 in real time, and the data is processed The unit 303 performs a target detection of the target on the collected video image through the video image target detection algorithm, and the detection method is the same as the target detection method of the first preset monitoring area.
(10)依次类推,按(2)~(9)步骤对N个区域进行目标识别,最后,重新从第一监测区域开始目标识别,自动循环。(10) By analogy, follow the steps (2) to (9) to perform target recognition on N areas, and finally, start target recognition from the first monitoring area again, and loop automatically.
在整个智能视觉感知装置工作中,数据处理单元303对红外摄像机301-1和可见光摄像机301-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元306-1和光照单元306-2,调整光照单元光照强度和光照范围,使得红外摄像机301-1和可见光摄像机301-2采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 303 analyzes the brightness of the video images collected by the infrared camera 301-1 and the visible light camera 301-2 in real time, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and 306-1 in time. The light unit 306-2 adjusts the light intensity and light range of the light unit so that the brightness of the video images collected by the infrared camera 301-1 and the visible light camera 301-2 is moderate.
实施例10:Example 10:
在上述实施例的基础上,还提供一种智能视觉感知系统的第十种实施方式:On the basis of the foregoing embodiment, a tenth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由1个可见光摄像机、1个近红外摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、2光照单元及1防护外壳组成。外壳留有4窗口及电源和信号线接口,4窗口采用可透光的材料密封,其中2个窗口,用于2摄像机进行视频图像采集,另外2个窗口用于光照单元进行补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 1 visible light camera, 1 near-infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 2 illumination units and 1 protective housing. The shell has 4 windows and power and signal wire interfaces. The 4 windows are sealed with light-permeable materials. 2 windows are used for video image acquisition by 2 cameras, and the other 2 windows are used for light supplementation by the lighting unit. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互,和/或与现场其它传感器,和/或其它关联系统的联动信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、光照范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focus, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center, and/or Interaction information with other sensors on site and/or other related systems. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图7所示,智能视觉感知装置包含近红外摄像机301-1、可见光摄像机301-2、传动机构302-1、传动机构302-2、数据处理单元303、通信接口单元304、电源管理单元305、光照单元306-1、光照单元306-2及防护外壳组成307。As shown in Figure 7, the intelligent visual perception device includes a near-infrared camera 301-1, a visible light camera 301-2, a transmission mechanism 302-1, a transmission mechanism 302-2, a data processing unit 303, a communication interface unit 304, and a power management unit 305 , The illumination unit 306-1, the illumination unit 306-2 and the protective shell constitute 307.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,分别透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口308-1、窗口308-2、窗口309-1、窗口309-2,以及电源和信号线接口板310,窗口308-1、窗口308-2、窗口309-1、窗口309-2都采用可透光的材料密封,其中窗口308-1和窗口308-2用于摄像机301-1和摄像机301-2的视频图像采集,窗口309-1和窗口309-2用于光照单元306-1和光照单元306-2透光,给摄像机301-1和摄像机301-2监测目标补光。传动机构302-1通过数据处理单元303发送的控制命令调整摄像机301-1左右和上下视场方向位置,传动机构302-2通过数据处理单元303发送的控制命令调整摄像机301-2左右和上下视场方向位置。通信接口单元304主要包括有线和无线通信接口。电源管理单元305主要用于给整个智能视频感知装置供电,在本实施例中为有线电源。光照单元306-1和306-2主要包括发光设备和光照强度调整单元,光源为可见光和/或红外光。光照单元306-1和摄像机301-1固定在一起,光照单元306-2和摄像机301-2固定在一起, 传动机构302-1共同控制光照单元306-1和摄像机301-1的左右上下摆动位置,传动机构302-2共同控制光照单元306-2和摄像机301-2的左右上下摆动位置。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the windows are made of light-transmitting materials, which respectively transmit the video images collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell and is fixed on the outside On the stand. In this embodiment, the shell leaves window 308-1, window 308-2, window 309-1, window 309-2, and power and signal line interface board 310, window 308-1, window 308-2, window 309 -1. The windows 309-2 are all sealed with light-permeable materials. The windows 308-1 and 308-2 are used for the video image collection of the camera 301-1 and the camera 301-2, and the windows 309-1 and 309- 2 is used for the light transmission of the light unit 306-1 and the light unit 306-2, and supplement light for the camera 301-1 and the camera 301-2 to monitor the target. The transmission mechanism 302-1 adjusts the left-right and vertical field of view position of the camera 301-1 through the control commands sent by the data processing unit 303, and the transmission mechanism 302-2 adjusts the left-right and vertical vision of the camera 301-2 through the control commands sent by the data processing unit 303. Field direction position. The communication interface unit 304 mainly includes wired and wireless communication interfaces. The power management unit 305 is mainly used to supply power to the entire intelligent video sensing device, which is a wired power supply in this embodiment. The lighting units 306-1 and 306-2 mainly include light-emitting devices and light intensity adjustment units, and the light source is visible light and/or infrared light. The light unit 306-1 and the camera 301-1 are fixed together, the light unit 306-2 and the camera 301-2 are fixed together, and the transmission mechanism 302-1 jointly controls the left and right swing positions of the light unit 306-1 and the camera 301-1 , The transmission mechanism 302-2 jointly controls the left and right swing positions of the light unit 306-2 and the camera 301-2.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,近红外摄像机301-1以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(2) For the first preset monitoring area, PTZ parameters are set, the near-infrared camera 301-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 303 in real time.
(3)数据处理单元303经过一次目标检测算法,对所采集的视频图像进行目标的一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 303 performs a target detection of the target on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元303发送控制命令给红外摄像机301-1和传动机构302,传动机构302-1调整近红外摄像机301-1视场方向,使得近红外摄像机301-1视场方向对准第二预设监测区域,调整近红外摄像机301-1焦距,以大视野小分辨率对第二预设监测区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303进行第二预设监测区域目标的一次目标检测,检测方法与第一预设监测区域目标一次检测方法相同。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 303 sends a control command to the infrared camera 301-1 and the transmission mechanism 302, and the transmission mechanism 302-1 is adjusted The direction of the field of view of the near-infrared camera 301-1 is such that the direction of the field of view of the near-infrared camera 301-1 is aligned with the second preset monitoring area, and the focal length of the near-infrared camera 301-1 is adjusted to monitor the second preset with a large field of view and a small resolution The video image is collected in the area, and the collected video images are transmitted to the data processing unit 303 in real time. The data processing unit 303 performs a target detection of the target in the second preset monitoring area. The detection method is the same as the first detection method of the target in the first preset monitoring area. .
(5)当一次目标检测发现有疑似目标时,数据处理单元303给出第一预设监测区域特征权重Q1,Q1由该区域内疑似目标数量及移动目标的移动速率加权获得;设定PTZ参数,以大视野小分辨率覆盖第二预设监测区域,由近红外摄像机301-1对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303。(5) When a suspected target is found in a target detection, the data processing unit 303 gives the first preset monitoring area feature weight Q1, which is weighted by the number of suspected targets in the area and the moving speed of the moving target; set the PTZ parameter , Cover the second preset monitoring area with a large field of view and small resolution, the near-infrared camera 301-1 collects video images of this area, and transmits the collected video images to the data processing unit 303 in real time.
(6)依次类推,重复上述步骤(1)~(5),对N个监测区域目标的一次目标检测,区分出存在疑似目标的W个区域,并依据区域特征权重Qi的大小对W个区域重新排序,权重最大的为第1区域,权重最小为第W区域。(6) By analogy, repeat the above steps (1) ~ (5), perform a target detection of N monitoring area targets, distinguish W areas where there are suspected targets, and compare W areas according to the size of the area feature weight Qi Re-order, the largest weight is the first area, and the smallest weight is the W-th area.
本实施例中,各区域特征权重Qi(0<i≤N)分别由各区域中疑似目标的数量及移动目标的移动速率加权获得。In this embodiment, each area feature weight Qi (0<i≤N) is respectively weighted by the number of suspected targets in each area and the moving speed of the moving target.
(7)智能视觉感知装置调整近红外摄像机301-1视场方向和焦距,通过调整PTZ参数,将疑似目标高度缩放到视野高度的1/2,并调整到视野中心,以该小视野大分辨率对存在疑似目标的第1区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303经过二次目标检测算法,对采集的视频图像依次进行疑似目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(7) The intelligent visual perception device adjusts the direction and focal length of the field of view of the near-infrared camera 301-1. By adjusting the PTZ parameters, the height of the suspected target is zoomed to 1/2 of the height of the field of view, and adjusted to the center of the field of view, so that the small field of view can be used for large resolution Rate the video image collection of the first area where there are suspected targets, and transmit the collected video images to the data processing unit 303 in real time. The data processing unit 303 performs secondary target detection algorithms on the collected video images sequentially. Target Detection. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(8)当二次目标检测判断出疑似目标是真目标时,数据处理单元303生成该真目标的目标特征信息1和告警信息1,同时,数据处理单元303发送控制命令给摄像机301和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,实时调整可见光摄像机301-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元303将二次检测真目标的目标特征信息1发送至云平台或数据中心;云端平台或数据中心依据智能视觉感知装置发送的目标特征信息1经过目标识别算法,进行目标识别,并将得到目标的特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理和显示。(8) When the secondary target detection determines that the suspected target is a real target, the data processing unit 303 generates target feature information 1 and alarm information 1 of the real target, and at the same time, the data processing unit 303 sends control commands to the camera 301 and the transmission mechanism 302-2, the transmission mechanism 302-2 adjusts the visual field direction of the visible light camera 301-2, adjusts the focal length of the visible light camera 301-2 in real time, and tracks the real target detected in real time; at the same time, the data processing unit 303 detects the real target twice The target characteristic information 1 of the target is sent to the cloud platform or data center; the cloud platform or data center uses the target recognition algorithm to identify the target according to the target characteristic information 1 sent by the intelligent visual perception device, and obtains the target characteristic information 2 and alarm information 2 Send to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(9)可见光摄像机301-2对真目标进行跟踪,当该真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令给摄像机301-2和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,使得可见光摄像机301-2视场方向对准下一个真目标进行跟踪。(9) The visible light camera 301-2 tracks the true target. When the true target disappears or the tracking time reaches the set value, the data processing unit 303 sends a control command to the camera 301-2 and the transmission mechanism 302-2, and the transmission mechanism 302 -2 Adjust the direction of the field of view of the visible light camera 301-2 so that the direction of the field of view of the visible light camera 301-2 is aimed at the next real target for tracking.
(10)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元303对假目标进行特征描述,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步提高了一次检测算法的精确度。(10) When the secondary target detection determines that the suspected target is a false target, the false target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 303 performs feature description on the false target and extracts the false target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000008
Figure PCTCN2021087921-appb-000008
(11)可见光摄像机301-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,数据处理单元303发送控制命令给可见光摄像机301-2和传动机构302-2,传动机构302-2调整可见光摄像机301-2视场方向,使得可见光摄像机301-2视场方向对准第2区域,调整可见光摄像机301-2焦距,以小视野大分辨率对第2区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元303,数据处理单元303经过视频图像目标检测算法,对采集的视频图像进行第2区域目标的二次目标检测,检测方法与第1区域目标二次检测方法相同。(11) The visible light camera 301-2 tracks the real target. When all real targets disappear or the tracking time reaches the set value, the data processing unit 303 sends a control command to the visible light camera 301-2 and the transmission mechanism 302-2, the transmission mechanism 302-2 Adjust the field of view direction of the visible light camera 301-2 so that the field of view direction of the visible light camera 301-2 is aligned with the second area, adjust the focal length of the visible light camera 301-2, and collect video images of the second area with a small field of view and large resolution , The collected video images are transmitted to the data processing unit 303 in real time. The data processing unit 303 performs the second target detection of the second area target on the collected video image through the video image target detection algorithm. The detection method is the same as the second target detection method of the first area target. The detection method is the same.
(12)依次类推,按(7)~(11)步骤对W个区域进行二次目标检测和识别,最后,重新从第(1)步开始,从第一预设监测区域开始目标识别,自动循环。(12) By analogy, follow the steps (7) to (11) to perform secondary target detection and recognition for W areas, and finally, start from step (1) again, start target recognition from the first preset monitoring area, and automatically cycle.
(13)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(13) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
在整个智能视觉感知装置工作中,数据处理单元303对摄像机301-1和可见光摄像机301-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元306-1和光照单元306-2,调整光照单元光照强度,使得摄像机301采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 303 performs real-time analysis on the brightness of the video images collected by the camera 301-1 and the visible light camera 301-2, and when the brightness is insufficient, it sends control commands to the lighting unit 306-1 and lighting in time. The unit 306-2 adjusts the light intensity of the light unit so that the brightness of the video image collected by the camera 301 is moderate.
实施例11:Example 11:
在上述实施例的基础上,还提供一种智能视觉感知系统的第十一种实施方式:On the basis of the foregoing embodiment, an eleventh implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由1个可见光摄像机、1个红外热成像摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1光照单元及1防护外壳组成。外壳留有3窗口及电源和信号线接口,3窗口采用可透光的材料密封,其中2个窗口,用于2摄像机进行视频图像采集,另外1个窗口用于补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 1 visible light camera, 1 infrared thermal imaging camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 1 lighting unit and 1 protective housing. The shell has 3 windows and power and signal line interfaces. The 3 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image collection by 2 cameras, and the other window is used for light supplementation. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,用于接收外部设备信号及发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces for receiving external device signals and sending signals collected or received by the system. Its connection methods include wireless and/or wired methods; among them, wireless methods include WIFI, BT, ZIGBEE, LORA One or more of, 2G, 3G, 4G, 5G, NB-IOT; wired methods include one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber .
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、光照范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, a light intensity, and light range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图8所示,智能视觉感知装置包含红外热成像摄像机401-1、可见光摄像机401-2、传动机构402-1、传动机构402-2、数据处理单元403、通信接口单元404、电源管理单元405、光照单元406、防护外壳407。As shown in Figure 8, the intelligent visual perception device includes an infrared thermal imaging camera 401-1, a visible light camera 401-2, a transmission mechanism 402-1, a transmission mechanism 402-2, a data processing unit 403, a communication interface unit 404, and a power management unit 405. Illumination unit 406, and protective housing 407.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口408-1、窗口408-2、窗口409,以及电源和信号线接口板410,窗口408-1、窗口408-2、窗口409都采用可透光的材料密封,其中窗口408-1用于红外热成像摄像机401-1视频图像采集,窗口408-2用于可见光摄像机401-2的视频图像采集,窗口409用于光照单元406透光,给可见光摄像机401-2监测目标补光。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell and is fixed to the external bracket superior. In this embodiment, the shell is left with windows 408-1, 408-2, and 409, and a power and signal line interface board 410. The windows 408-1, 408-2, and 409 are all made of light-permeable materials. The window 408-1 is used for the video image collection of the infrared thermal imaging camera 401-1, the window 408-2 is used for the video image collection of the visible light camera 401-2, and the window 409 is used for the light unit 406 to transmit light to the visible light camera 401 -2 The monitoring target fills in light.
传动机构402-1通过接收数据处理单元403发送的控制命令,调整红外热成像摄像机401-1左右和上下视场方向位置;传动机构402-2通过接收数据处理单元403发送的控制命令调整摄像机401-2左右和上下视场方向位置。通信接口单元404主要包括有线和无线通信接口。电源管理单元405主要用于给整个智能视频感知装置供电,在本实施例中为有线电源。光照单元406主要包括发光设备和光照强度调整单元,光源为激光。光照单元406和摄像机401-2固定在一起,传动机构402-1控制红外热成像摄像机401-1的左右上下摆动位置,传动机构402-2共同控制光照单元406和摄像机401-2的左右上下摆动位置。The transmission mechanism 402-1 adjusts the position of the infrared thermal imaging camera 401-1 in the left-right and vertical field of view by receiving the control commands sent by the data processing unit 403; the transmission mechanism 402-2 adjusts the camera 401 by receiving the control commands sent by the data processing unit 403 -2 The position of the left and right field of view and up and down. The communication interface unit 404 mainly includes wired and wireless communication interfaces. The power management unit 405 is mainly used to supply power to the entire intelligent video perception device, which is a wired power supply in this embodiment. The light unit 406 mainly includes a light emitting device and a light intensity adjustment unit, and the light source is a laser. The light unit 406 and the camera 401-2 are fixed together, the transmission mechanism 402-1 controls the left and right swinging positions of the infrared thermal imaging camera 401-1, and the transmission mechanism 402-2 jointly controls the light unit 406 and the camera 401-2 to swing left and right. Location.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,红外热成像摄像机401-1以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元403。(2) For the first preset monitoring area, PTZ parameters are set, the infrared thermal imaging camera 401-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 403 in real time.
(3)数据处理单元403经过一次目标检测算法,对所采集的视频图像进行目标的一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 403 performs a target detection of the target on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元403发送控制命令给红外热成像摄像机401-1和传动机构402-1,传动机构402-1调整红外热成像摄像机401-1视场方向,使得红外热成像摄像机401-1视场方向对准第二预设监测区域,调整红外热成像摄像机401-1焦距,以大视野小分辨率对第二预设监测区域进行视频图像采集,数据处理单元403经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 403 sends a control command to the infrared thermal imaging camera 401-1 and the transmission mechanism 402-1, the transmission mechanism 402-1 adjusts the field of view direction of the infrared thermal imaging camera 401-1 so that the field of view of the infrared thermal imaging camera 401-1 is aligned with the second preset monitoring area, and adjusts the focal length of the infrared thermal imaging camera 401-1 to achieve a large field of view and a small resolution Video image collection is performed on the second preset monitoring area at a high rate, and the data processing unit 403 performs a target detection of the target on the collected video image through the video image target detection algorithm.
(5)当一次目标检测发现有疑似目标时,调整PTZ参数:数据处理单元403发送控制命令至可见光摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,使得可见光摄像机401-2视场方向对准疑似目标,调整可见光摄像机401-2焦距。通过调整PTZ参数,将疑似目标高度缩放到视频图像高度的1/3,并调整到视野中心,以该小视野大分辨率对疑似目标进行视频图像采集,数据处理单元403经过二次目标检测算法,对采集的视频图像进行二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(5) When a suspected target is found in a target detection, adjust the PTZ parameters: the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402-2 adjusts the field of view of the visible light camera 401-2 Direction so that the visual field of the visible light camera 401-2 is aimed at the suspected target, and the focal length of the visible light camera 401-2 is adjusted. By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/3 of the height of the video image, and adjusted to the center of the field of view. The video image of the suspected target is collected with this small field of view and large resolution. The data processing unit 403 undergoes a secondary target detection algorithm. , Perform secondary target detection on the collected video images. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(6)当二次目标检测判断出疑似目标是真目标时,数据处理单元303输出告警信息1至监控终端,并生成该真目标特征信息1。数据处理单元403发送控制命令给可见 光摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,实时调整可见光摄像机401-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元403将该真目标特征信息1发送至云平台或数据中心,云端平台或数据中心依据智能视觉感知装置发送的该真目标特征信息1经过目标识别算法,进行目标识别,生成该真目标特征信息2和告警信息2,并将真目标特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理。(6) When the secondary target detection determines that the suspected target is a true target, the data processing unit 303 outputs alarm information 1 to the monitoring terminal, and generates the true target characteristic information 1. The data processing unit 403 sends control commands to the visible light camera 401-2 and the transmission mechanism 402-2. The transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view, adjusts the focal length of the visible light camera 401-2 in real time, and responds to the detected real target Perform real-time tracking; at the same time, the data processing unit 403 sends the true target characteristic information 1 to the cloud platform or data center, and the cloud platform or data center performs target recognition algorithm according to the true target characteristic information 1 sent by the intelligent visual perception device. Identify, generate the true target feature information 2 and alarm information 2, and send the true target feature information 2 and alarm information 2 to the monitoring terminal; the monitoring terminal responds to the alarm information 1, alarm information 2, target feature information 1 and/or target feature Information 2 is processed.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(7)可见光摄像机401-2对真目标进行跟踪,当该真目标消失或跟踪时间达到设定值时,数据处理单元403发送控制命令至可见光摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,使得可见光摄像机401-2视场方向对准下一个真目标进行跟踪。(7) The visible light camera 401-2 tracks the real target. When the real target disappears or the tracking time reaches the set value, the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, the transmission mechanism 402-2 adjusts the field of view direction of the visible light camera 401-2 so that the field of view direction of the visible light camera 401-2 is aimed at the next real target for tracking.
(8)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元403对假目标进行特征描述,提取假目标特征描述信息,更新假目标反馈特征信息库,以降低算法在此之后的一次目标检测误检概率。随着系统运行时间的增长,假目标特征描述越来越精确,系统再进行一次目标检测的误检概率将会越来越低,准确度会越来越高,系统性能自动得到提升。(8) When the secondary target detection judges that the suspected target is a fake target, the fake target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 403 characterizes the fake target and extracts the fake target Feature description information, update the false target feedback feature information database, in order to reduce the probability of a target detection error detection algorithm after this. As the running time of the system increases, the false target feature description becomes more and more accurate, the false detection probability of another target detection by the system will be lower and lower, the accuracy will be higher and higher, and the system performance will be automatically improved.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000009
Figure PCTCN2021087921-appb-000009
(9)可见光摄像机401-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,针对第二预设监测区域,设定PTZ参数,以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元403:数据处理单元403发送控制命令给红外热成像摄像机401-1和传动机构402-1,传动机构402-1调整红外热成像摄像机401-1视场方向,使得红外热成像摄像机401-1视场方向对准第二预设监测区域,调整红外热成像摄像机401-1焦距,以大视野小分辨率对第二预设监测区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元403,数据处理单元403经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测,检测方法与第一预设监测区域目标检测方法相同。(9) The visible light camera 401-2 tracks the real target. When all real targets disappear or the tracking time reaches the set value, set the PTZ parameters for the second preset monitoring area, and perform video with the large field of view and small resolution Image acquisition, real-time transmission of the collected video images to the data processing unit 403: The data processing unit 403 sends control commands to the infrared thermal imaging camera 401-1 and the transmission mechanism 402-1, and the transmission mechanism 402-1 adjusts the infrared thermal imaging camera 401- 1 Field of view direction, so that the field of view of the infrared thermal imaging camera 401-1 is aligned with the second preset monitoring area, adjust the focal length of the infrared thermal imaging camera 401-1, and video the second preset monitoring area with a large field of view and small resolution Image acquisition, real-time transmission of the collected video images to the data processing unit 403. The data processing unit 403 performs a target detection on the collected video images through the video image target detection algorithm. The detection method is the same as the first preset monitoring area target detection The method is the same.
(10)依次类推,按(2)~(9)步骤对N个区域进行目标识别,最后,重新从第一预设监测区域开始目标识别,自动循环。(10) By analogy, follow the steps (2) to (9) to perform target recognition on N areas, and finally, start target recognition from the first preset monitoring area again, and loop automatically.
(11)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(11) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
在整个智能视觉感知装置工作中,数据处理单元403对可见光摄像机401-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元406,光照单元406调整光照强度和光照范围,使得可见光摄像机401-2采集视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 403 performs real-time analysis on the brightness of the video image collected by the visible light camera 401-2, and when the brightness is insufficient, it sends a control command to the lighting unit 406 in time, and the lighting unit 406 adjusts the light intensity and light. Range, so that the visible light camera 401-2 captures video images with moderate brightness.
实施例12:Example 12:
在上述实施例的基础上,还提供一种智能视觉感知系统的第十二种实施方式:On the basis of the foregoing embodiment, a twelfth implementation manner of an intelligent visual perception system is also provided:
一种智能视觉感知系统,包含一个或多个智能视觉感知装置。该智能视觉感知装置由1个可见光摄像机、1个红外摄像机、2传动机构、1数据处理单元、1通信接口单元、1电源管理单元、1光照单元及1防护外壳组成。外壳留有3窗口及电源和信号线接口,3窗口采用可透光的材料密封,其中2个窗口,用于2摄像机进行视频图像采集,另外1个窗口用于补光。2传动机构通过数据处理单元发送的控制命令调整摄像机和光照单元的水平和上下视场方向位置,包括驱动电机、水平转轴、竖直转轴、控制线等。驱动电机驱动摄像机和光照单元绕转轴水平转动0~360度,上下转动0~180度。An intelligent visual perception system includes one or more intelligent visual perception devices. The intelligent visual perception device is composed of 1 visible light camera, 1 infrared camera, 2 transmission mechanisms, 1 data processing unit, 1 communication interface unit, 1 power management unit, 1 lighting unit and 1 protective housing. The shell has 3 windows and power and signal line interfaces. The 3 windows are sealed with light-permeable materials. Among them, 2 windows are used for video image collection by 2 cameras, and the other window is used for light supplementation. 2 The transmission mechanism adjusts the horizontal and vertical field of view positions of the camera and the illumination unit through the control commands sent by the data processing unit, including the drive motor, the horizontal shaft, the vertical shaft, and the control line. The driving motor drives the camera and the light unit to rotate 0-360 degrees horizontally around the rotating shaft, and 0-180 degrees up and down.
通信接口单元主要包括有线和无线通信接口,输入接口用于接收外部设备信号,其连接方式包括无线和/或有线方式;其中,无线方式包括WIFI、BT、ZIGBEE、LORA中的一种或多种;有线方式包括RS485、RS422、RS232、CAN总线中的一种或多种;输出接口用于发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,无线方式包括2G、3G、4G、5G、NB-IOT中的一种或多种;有线方式包括LAN、光纤中的一种或多种。The communication interface unit mainly includes wired and wireless communication interfaces, the input interface is used to receive external device signals, and its connection methods include wireless and/or wired methods; among them, wireless methods include one or more of WIFI, BT, ZIGBEE, and LORA ; Wired methods include one or more of RS485, RS422, RS232, CAN bus; the output interface is used to send the signals collected or received by the system, and its connection methods include wireless and/or wired methods; among them, the wireless method includes 2G One or more of, 3G, 4G, 5G, NB-IOT; wired mode includes one or more of LAN and optical fiber.
数据处理单元,用于对摄像机采集的视频数据进行分析处理,控制摄像机进行焦距调整,控制传动机构调整摄像机和/或光照单元角度,和/或与云端平台或数据中心进行信息交互。电源管理单元主要用于给整个智能视频感知装置供电。光照单元主要包括发光设备和光照强度、视场方向范围调整单元,光照单元和摄像机固定在一起,传动机构共同控制光照单元和摄像机的左右上下摆动位置。The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the angle of the camera and/or the light unit, and/or exchange information with the cloud platform or data center. The power management unit is mainly used to supply power to the entire intelligent video sensing device. The light unit mainly includes a light-emitting device, light intensity, and field of view direction range adjustment unit. The light unit and the camera are fixed together, and the transmission mechanism jointly controls the left and right swing positions of the light unit and the camera.
如图8所示,智能视觉感知装置包含红外摄像机401-1、可见光摄像机401-2、传动机构402-1、传动机构402-2、数据处理单元403、通信接口单元404、电源管理单元405、光照单元406、防护外壳407。As shown in Figure 8, the intelligent visual perception device includes an infrared camera 401-1, a visible light camera 401-2, a transmission mechanism 402-1, a transmission mechanism 402-2, a data processing unit 403, a communication interface unit 404, a power management unit 405, Illumination unit 406, protective housing 407.
防护外壳包括接口板、窗口、固定座。接口板上有1个或多个接口,与外部单元连接;窗口采用透光材料,透射摄像机采集的视频图像,和/或光照单元发出的光;固定座用于固定防护外壳,固定在外部支架上。在本实施例中,外壳留有窗口408-1、窗口408-2、窗口409,以及电源和信号线接口板410,窗口408-1、窗口408-2、窗口409都采用可 透光的材料密封,其中窗口408-1用于红外摄像机401-1视频图像采集,窗口408-2用于可见光摄像机401-2的视频图像采集,窗口409用于光照单元406透光,给可见光摄像机401-2监测目标补光。The protective shell includes an interface board, a window, and a fixing seat. There are one or more interfaces on the interface board, which are connected to the external unit; the window adopts light-transmitting material, which transmits the video image collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell and is fixed to the external bracket superior. In this embodiment, the shell is left with windows 408-1, 408-2, and 409, and a power and signal line interface board 410. The windows 408-1, 408-2, and 409 are all made of light-permeable materials. The window 408-1 is used for the video image collection of the infrared camera 401-1, the window 408-2 is used for the video image collection of the visible light camera 401-2, and the window 409 is used for the light unit 406 to transmit light to the visible light camera 401-2 The monitoring target fills in light.
传动机构402-1通过数据处理单元403发送的控制命令,调整红外摄像机401-1左右和上下视场方向位置;传动机构402-2通过数据处理单元403发送的控制命令,调整摄像机401-2左右和上下视场方向位置。通信接口单元404主要包括有线和无线通信接口。电源管理单元405主要用于给整个智能视频感知装置供电,在本实施例中为系统内部的电池。光照单元406主要包括发光设备和光照强度和光照范围调整单元,光源为红外光。光照单元406和摄像机401-2固定在一起,传动机构402-1控制红外摄像机401-1的左右上下摆动位置,传动机构402-2共同控制光照单元406和摄像机401-2的左右上下摆动位置。The transmission mechanism 402-1 adjusts the left and right positions of the infrared camera 401-1 and the vertical field of view through the control commands sent by the data processing unit 403; the transmission mechanism 402-2 adjusts the left and right positions of the camera 401-2 through the control commands sent by the data processing unit 403 And the position of the up and down field of view. The communication interface unit 404 mainly includes wired and wireless communication interfaces. The power management unit 405 is mainly used to supply power to the entire intelligent video perception device, and in this embodiment is a battery inside the system. The light unit 406 mainly includes a light emitting device and a unit for adjusting light intensity and light range, and the light source is infrared light. The light unit 406 and the camera 401-2 are fixed together, the transmission mechanism 402-1 controls the left and right swing positions of the infrared camera 401-1, and the transmission mechanism 402-2 jointly controls the left and right swing positions of the light unit 406 and the camera 401-2.
其中,摄像机,用于视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元等。Among them, the camera is used for video image acquisition, including a focus motor, a zoom motor, a drive module, an image signal acquisition and processing unit, and so on.
系统工作流程如下:The system workflow is as follows:
(1)根据装置所需监测的范围,预设监测区域的个数位置,每个区域以能覆盖该区域的最大倍率来设置该区域的PTZ参数。建立特定监测区域的假目标反馈特征信息库,包括假目标的位置信息和假目标特征描述信息。(1) According to the monitoring range of the device, the number and position of the monitoring area are preset, and the PTZ parameter of each area is set with the maximum magnification that can cover the area. Establish a false target feedback feature information database in a specific monitoring area, including false target location information and false target feature description information.
(2)针对第一预设监测区域,设定PTZ参数,红外热成像摄像机401-1以该大视野小分辨率进行视频图像采集,将采集的视频图像实时传输至数据处理单元403。(2) For the first preset monitoring area, PTZ parameters are set, the infrared thermal imaging camera 401-1 collects video images with the large field of view and small resolution, and transmits the collected video images to the data processing unit 403 in real time.
(3)数据处理单元403经过一次目标检测算法,对所采集的视频图像进行一次目标检测。一次目标检测结合了假目标反馈特征库信息,如果某区域的图像特征与该区域的假目标反馈特征描述具有高匹配度,则该区域将大概率被划分为背景,而不将其归为疑似目标。一次目标检测算法为基于固定背景模型的运动目标检测算法,和/或与背景无关的目标分类检测算法。(3) The data processing unit 403 performs a target detection on the collected video image through a target detection algorithm. A target detection combines the false target feedback feature library information. If the image feature of a certain area has a high degree of matching with the false target feedback feature description of the area, the area will be classified as a background with a high probability, and it will not be classified as a suspect Target. The primary target detection algorithm is a moving target detection algorithm based on a fixed background model, and/or a target classification detection algorithm that has nothing to do with the background.
(4)当一次目标检测发现无疑似目标时,设定PTZ参数,覆盖第二预设监测区域:数据处理单元403发送控制命令给红外摄像机401-1和传动机构402,传动机构402-1调整红外摄像机401-1视场方向,使得红外摄像机401-1视场方向对准第二预设监测区域,并调整红外摄像机401-1焦距,以大视野小分辨率对该区域进行视频图像采集,数据处理单元403经过视频图像目标检测算法,对采集的视频图像进行目标的一次目标检测。。(4) When a target detection is found to be undoubtedly a target, set the PTZ parameters to cover the second preset monitoring area: the data processing unit 403 sends a control command to the infrared camera 401-1 and the transmission mechanism 402, and the transmission mechanism 402-1 is adjusted The field of view direction of the infrared camera 401-1 is such that the field of view direction of the infrared camera 401-1 is aligned with the second preset monitoring area, and the focal length of the infrared camera 401-1 is adjusted to collect video images of this area with a large field of view and a small resolution. The data processing unit 403 performs a target detection of the target on the collected video image through the video image target detection algorithm. .
(5)当一次目标检测发现有疑似目标时,数据处理单元403给出第一预设监测区域特征权重Q1,Q1由该区域内疑似目标数量及移动目标的移动速率加权获得;设定PTZ参数,以大视野小分辨率覆盖第二预设监测区域,使红外摄像机401-1对该区域进行视频图像采集,将采集的视频图像实时传输至数据处理单元403,经过视频图像目标检测算法,进行一次目标检测。(5) When a suspected target is found in a target detection, the data processing unit 403 gives the first preset monitoring area feature weight Q1, which is weighted by the number of suspected targets in the area and the moving speed of the moving target; set the PTZ parameter , Cover the second preset monitoring area with a large field of view and small resolution, so that the infrared camera 401-1 collects video images in this area, and transmits the collected video images to the data processing unit 403 in real time. After the video image target detection algorithm, One target detection.
(6)依次类推,重复上述步骤(1)~(5),对N个监测区域目标的一次目标检测, 区分出存在疑似目标的W个区域,并依据区域特征权重Qi的大小对W个区域重新排序,权重最大的为第1区域,权重最小为第W区域。(6) By analogy, repeat the above steps (1) ~ (5), perform a target detection of N monitoring area targets, distinguish W areas where there are suspected targets, and compare the W areas according to the size of the area feature weight Qi Re-order, the largest weight is the first area, and the smallest weight is the W-th area.
本实施例中,各区域特征权重Qi(0<i≤N)分别由各区域中疑似目标的数量及移动目标的移动速率加权获得。In this embodiment, each area feature weight Qi (0<i≤N) is respectively weighted by the number of suspected targets in each area and the moving speed of the moving target.
(7)智能视觉感知装置调整可见光摄像机401-2视场方向和焦距,通过调整PTZ参数,将疑似目标高度缩放到视野高度的1/6,并调整到视野中心,以该小视野大分辨率对存在疑似目标的第1区域进行视频图像采集,数据处理单元403将采集的视频图像,经过二次目标检测算法,依序进行疑似目标的二次目标检测。二次目标检测算法为与背景无关的目标分类检测算法。(7) The intelligent visual perception device adjusts the field of view direction and focal length of the visible light camera 401-2. By adjusting the PTZ parameters, the height of the suspected target is zoomed to 1/6 of the field of view height, and adjusted to the center of the field of view, so that the small field of view has a large resolution Video image acquisition is performed on the first area where the suspected target exists, and the data processing unit 403 passes the collected video image through a secondary target detection algorithm to sequentially perform secondary target detection of the suspected target. The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with the background.
(8)当二次目标检测判断出疑似目标是真目标时,数据处理单元403生成该真目标的目标特征信息1和告警信息1。数据处理单元403发送控制命令至摄像机401和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,实时调整可见光摄像机401-2焦距,对检测出的真目标进行实时跟踪;同时,数据处理单元403将二次检测的真目标的目标特征信息1发送至云平台或数据中心;云端平台或数据中心依据智能视觉感知装置发送的目标特征信息1经过目标识别算法,进行目标识别,并将得到目标的特征信息2和告警信息2发送至监控终端;监控终端对告警信息1、告警信息2、目标特征信息1和/或目标特征信息2进行处理和显示。(8) When the secondary target detection determines that the suspected target is a true target, the data processing unit 403 generates target feature information 1 and warning information 1 of the true target. The data processing unit 403 sends control commands to the camera 401 and the transmission mechanism 402-2. The transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view, adjusts the focal length of the visible light camera 401-2 in real time, and tracks the real target detected in real time. ; At the same time, the data processing unit 403 sends the target feature information 1 of the real target detected twice to the cloud platform or data center; the cloud platform or data center uses the target feature information 1 sent by the intelligent visual perception device to perform the target recognition algorithm Identify and send the obtained target characteristic information 2 and alarm information 2 to the monitoring terminal; the monitoring terminal processes and displays the alarm information 1, the alarm information 2, the target characteristic information 1 and/or the target characteristic information 2.
目标特征信息1包括表征目标具体分类的特征信息,包括人、动物、车和/或车型、飞行物、不应当出现的其它异物,包括自然掉落物和/或扩散物,例如落石、泥石流,和人类遗落物等中的一种或多种。目标特征信息2包括表征目标具体身份的特征信息,包括人的身份、动物的种类、车的车牌、其它异物的种类中的一种或多种。具体信息种类和内容根据特定的应用环境进行定义。Target feature information 1 includes feature information that characterizes the specific classification of the target, including people, animals, cars and/or models, flying objects, other foreign objects that should not appear, including natural falling objects and/or diffuse objects, such as falling rocks, mudslides, And one or more of human remains. The target characteristic information 2 includes characteristic information that characterizes the specific identity of the target, including one or more of the identity of a person, the type of animal, the license plate of the car, and the type of other foreign objects. The specific information types and contents are defined according to the specific application environment.
(9)可见光摄像机401-2对真目标进行跟踪,当该真目标消失或跟踪时间达到设定值时,数据处理单元403发送控制命令给摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,使得可见光摄像机401-2视场方向对准下一个真目标进行跟踪。(9) The visible light camera 401-2 tracks the real target. When the real target disappears or the tracking time reaches the set value, the data processing unit 403 sends a control command to the camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402 -2 Adjust the direction of the field of view of the visible light camera 401-2 so that the direction of the field of view of the visible light camera 401-2 is aimed at the next real target for tracking.
(10)当二次目标检测判断出疑似目标是假目标时,将大分辨率图像下确定的假目标,映射到小分辨率图像中,数据处理单元403对假目标进行特征描述,提取假目标特征描述信息,对假目标特征描述信息设定更新速率,优化假目标反馈特征信息库,进一步提高了一次检测算法的精确度。(10) When the secondary target detection determines that the suspected target is a fake target, the fake target determined under the large-resolution image is mapped to the small-resolution image, and the data processing unit 403 features the fake target and extracts the fake target Feature description information, set the update rate for the false target feature description information, optimize the false target feedback feature information database, and further improve the accuracy of a detection algorithm.
对比例为一次图像采集,并采用基于深度神经网络的YOLO V3目标分类检测算法进行一次目标检测。The comparative example is an image acquisition, and the YOLO V3 target classification detection algorithm based on the deep neural network is used to perform a target detection.
经试验验证,随运行周期的增加,系统一次目标检测准确度分别为:It has been verified by experiments that with the increase of the operating cycle, the accuracy of the system's one-time target detection is as follows:
Figure PCTCN2021087921-appb-000010
Figure PCTCN2021087921-appb-000010
Figure PCTCN2021087921-appb-000011
Figure PCTCN2021087921-appb-000011
(11)可见光摄像机401-2对真目标进行跟踪,当所有真目标消失或跟踪时间达到设定值时,数据处理单元403发送控制命令给可见光摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,使得可见光摄像机401-2视场方向对准第2区域,调整可见光摄像机401-2焦距,以小视野大分辨率对第2区域进行视频图像采集,数据处理单元403对采集的视频图像经过视频图像目标检测算法,进行第2区域目标的二次目标检测,检测方法与第1区域目标二次检测方法相同。(11) The visible light camera 401-2 tracks the real target. When all real targets disappear or the tracking time reaches the set value, the data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, the transmission mechanism 402-2 Adjust the field of view direction of the visible light camera 401-2 so that the field of view direction of the visible light camera 401-2 is aligned with the second area, adjust the focal length of the visible light camera 401-2, and collect video images of the second area with a small field of view and large resolution The data processing unit 403 performs the secondary target detection of the second area target through the video image target detection algorithm on the collected video image, and the detection method is the same as the first area target second detection method.
(12)依次类推,按(7)~(11)步骤对W个区域进行二次目标检测和识别,最后,重新从第(1)步开始,从第一预设监测区域开始目标检测,自动循环。(12) By analogy, follow steps (7) to (11) to perform secondary target detection and recognition for W areas, and finally, start again from step (1), start target detection from the first preset monitoring area, and automatically cycle.
(13)多个智能视觉感知装置组成智能视觉感知系统,通过对各智能视觉感知装置设定不同监测范围,对各监测范围内预设的各监测区域进行如上目标检测和/或识别,以实现对更广区域的目标监测。(13) Multiple intelligent visual perception devices form an intelligent visual perception system. By setting different monitoring ranges for each intelligent visual perception device, the above-mentioned target detection and/or recognition are performed on the preset monitoring areas within each monitoring range to achieve Target monitoring of a wider area.
在整个智能视觉感知装置工作中,数据处理单元403对摄像机401-1和摄像机401-2采集的视频图像亮度做实时分析,当亮度不足时,及时发送控制命令给光照单元406-1和光照单元406-2,调整光照单元光照强度和光照范围,使得摄像机401-1和摄像机401-2采集的视频图像亮度适中。During the work of the entire intelligent visual perception device, the data processing unit 403 performs real-time analysis on the brightness of the video images collected by the camera 401-1 and the camera 401-2, and when the brightness is insufficient, it sends control commands to the light unit 406-1 and the light unit in time. 406-2. Adjust the light intensity and light range of the light unit, so that the video images captured by the camera 401-1 and the camera 401-2 have moderate brightness.
实施例13:Example 13:
在上述实施例的基础上,还提供本发明提供的智能视觉感知系统,在铁路应用中,依据实施例12的技术方案,采用红外摄像机401-1,根据当前监测区域的范围,计算出能覆盖整个区域的最大倍率,通过设定PTZ参数,以该大视野小分辨率对该监测区域进行视频图像采集,所采集的视频图像,如图9所示。采用数据处理单元403对所采集的视频图像,经过视频图像目标检测算法,进行一次目标检测,发现有疑似目标。On the basis of the above-mentioned embodiment, the intelligent visual perception system provided by the present invention is also provided. In railway applications, according to the technical solution of embodiment 12, infrared camera 401-1 is used. According to the scope of the current monitoring area, it can be calculated to cover The maximum magnification of the entire area, by setting the PTZ parameters, the monitoring area is video image collected with the large field of view and small resolution, and the collected video image is shown in FIG. 9. The data processing unit 403 is used to perform a target detection on the collected video image through a video image target detection algorithm, and a suspicious target is found.
数据处理单元403发送控制命令给可见光摄像机401-2和传动机构402-2,传动机构402-2调整可见光摄像机401-2视场方向,使得可见光摄像机401-2视场方向对准疑似目标,调整可见光摄像机401-2焦距。通过调整PTZ参数,将疑似目标高度缩放到视野高度的1/3,并调整到视野中心,以该小视野大分辨率对疑似目标进行视频图像采集,所采集的视频图像,如图10所示。The data processing unit 403 sends a control command to the visible light camera 401-2 and the transmission mechanism 402-2, and the transmission mechanism 402-2 adjusts the direction of the visible light camera 401-2 field of view so that the direction of the visible light camera 401-2’s field of view is aligned with the suspected target, and adjust Visible light camera 401-2 focal length. By adjusting the PTZ parameters, the height of the suspected target is scaled to 1/3 of the height of the field of view, and adjusted to the center of the field of view. The video image of the suspected target is collected with this small field of view and large resolution. The collected video image is shown in Figure 10. .
数据处理单元403经过视频图像目标检测算法,对采集的视频图像进行二次目标检测,判断为真目标。数据处理单元将告警信息1、告警信息2、目标特征信息1、和/或目标特征信息2发送至监控终端。同时,可见光摄像机401-2对真目标进行跟踪,如图11所示。告警信息包括可疑人员闯入铁路运行的特定监测区域,滞留时间,可疑行为,带来的现实危险等,目标特征信息包括人及人的身体特征,甚至经过面部特征比对后的人的身份信息。The data processing unit 403 performs secondary target detection on the collected video image through a video image target detection algorithm, and judges it as a true target. The data processing unit sends the alarm information 1, the alarm information 2, the target characteristic information 1, and/or the target characteristic information 2 to the monitoring terminal. At the same time, the visible light camera 401-2 tracks the real target, as shown in Fig. 11. Warning information includes suspicious persons breaking into the specific monitoring area of railway operation, detention time, suspicious behavior, real dangers, etc. The target characteristic information includes people and their physical characteristics, and even the identity information of the person after facial feature comparison .
采用本发明的智能视觉感知系统,实现了对铁路沿线周边行车环境检测,当铁路行 车限界内出现非法人员闯入、异物被抛入、落石掉入、边坡坍塌等情况时,及时进行智能捕捉、放大、特征描述和告警,以方便采取后续措施,保证行车安全。本发明解决了当前铁路方面对铁轨周边环境的非智能监测,一方面需要监控者紧盯屏幕,不敢懈怠,实时地人为判断是否存在运行障碍;并且对于远方可能存在的现实危险,可能因为图像不清,加以忽略,或者因为不能及时进行图像放大,产生误判,而影响了应急响应时间,从而延误了事故挽救的时机,可能导致数以亿计的财产损失和众多人员伤亡的现实危险。可见,本发明具有技术先进性和极强的社会应用价值。The intelligent visual perception system of the present invention realizes the detection of the surrounding driving environment along the railway. When illegal personnel break in, foreign objects are thrown in, falling rocks, and slope collapse occur within the railway driving limit, intelligent capture is carried out in time , Enlargement, feature description and warning to facilitate follow-up measures to ensure driving safety. The present invention solves the current non-intelligent monitoring of the surrounding environment of the railroad tracks. On the one hand, the monitor needs to keep a close eye on the screen, not dare to slack off, and artificially judge whether there are operational obstacles in real time; and for the actual danger that may exist in the distant place, it may be due to the image If it is not clear, it is ignored, or because the image cannot be zoomed in time, misjudgment is generated, which affects the emergency response time, delays the time of accident rescue, and may cause hundreds of millions of property losses and a real danger of numerous casualties. It can be seen that the present invention has technological advancement and extremely strong social application value.
实施例14:Example 14:
在上述实施例的基础上,还提供一种智能视觉感知系统的一种实施方式,基于上述各实施例,智能视觉感知装置通过通信接口的输入接口,接收放置于智能视觉感知装置所能覆盖的检测区域内的其它传感或动作装置信号,当其它传感或动作装置感知到异常情况时,发送报警信息给智能视觉感知装置,智能感知装置接收到该传感或动作装置发送的报警信息,优先调整摄像机的视场方向和焦距,对该传感或动作装置所在区域进行目标检测。On the basis of the above-mentioned embodiments, an implementation of an intelligent visual perception system is also provided. Based on the above-mentioned embodiments, the intelligent visual perception device receives information that can be covered by the intelligent visual perception device through the input interface of the communication interface. Signals of other sensing or motion devices in the detection area, when other sensing or motion devices sense abnormal conditions, send alarm information to the intelligent visual perception device, and the intelligent perception device receives the alarm information sent by the sensing or motion device, Prioritize the adjustment of the camera's field of view direction and focal length, and perform target detection in the area where the sensing or action device is located.
例如,传感装置为埋在地面的振动传感器,在正常情况下,振动传感器所在区域不会有车辆等经过,智能视觉感知装置不对该区域进行监测,当有车辆等经过时,振动传感器给智能视觉感知装置发送振动信号,智能视觉感知装置收到振动信号后,即可调整视场方向和焦距,对振动传感器所在的区域进行目标识别。For example, the sensing device is a vibration sensor buried in the ground. Under normal circumstances, no vehicles will pass by the area where the vibration sensor is located. The intelligent visual perception device does not monitor the area. When a vehicle passes by, the vibration sensor will The visual perception device sends a vibration signal. After the intelligent visual perception device receives the vibration signal, it can adjust the direction and focal length of the field of view, and perform target recognition on the area where the vibration sensor is located.
一种智能视觉感知系统的另一种实施方式,基于上述各实施例,智能视觉感知装置通过输出接口,向监测区域内的其它传感或动作装置提供其感知数据和/或信息;进一步地,与监测区域内的其它传感或动作装置进行数据融合和/或联合判断,和/或对监测区域内的其它传感或动作装置进行直接控制或系统联动。监测区域内的其它传感或动作装置包括告警声光设备、门禁设备、消防设备、除障设备、动物驱赶设备、拖车设备、清扫设备、巡视设备、冲力减缓设备、紧急止停设备、分流设备、现场和/或外部通讯设备、防爆设备、医疗救助设备、掩蔽设备、无人机运输设备、人员疏散和/或安全撤离设备中的一种或多种;信息包括告警信息和/或目标特征信息。Another implementation of an intelligent visual perception system, based on the above embodiments, the intelligent visual perception device provides its perception data and/or information to other sensing or action devices in the monitoring area through an output interface; further, Perform data fusion and/or joint judgment with other sensing or action devices in the monitoring area, and/or directly control or system linkage to other sensing or action devices in the monitoring area. Other sensing or action devices in the monitoring area include alarm sound and light equipment, access control equipment, firefighting equipment, obstacle removal equipment, animal repelling equipment, trailer equipment, cleaning equipment, patrol equipment, impulse mitigation equipment, emergency stop equipment, and shunt equipment One or more of, on-site and/or external communication equipment, explosion-proof equipment, medical rescue equipment, masking equipment, drone transportation equipment, personnel evacuation and/or safe evacuation equipment; the information includes warning information and/or target characteristics information.
例如,在实施例13中,本发明的智能视觉感知系统发现了真目标是不明原因出现在铁道上的可疑人员,立即向监测区域内的传感或动作装置发送感知数据和信息,传感或动作装置例如巡视车辆、现场通讯设备等,并联动广播播放警告信息和列车到站信息。如系统通过特征识别,发现真目标是羊群等,可联动动物驱赶设备,使羊群远离列车行进轨道,避免重大事故的发生。同时,根据本发明的智能视觉感知系统发布的告警信息和目标特征信息,启动人员处警流程,由相关责任人员派员前往真目标区域,现场指挥、控制可疑人员并排除妨碍,保证铁路正常运行和人员安全。For example, in Example 13, the intelligent visual perception system of the present invention found that the true target was a suspicious person who appeared on the railway for unknown reasons, and immediately sent perception data and information to the sensing or action device in the monitoring area, sensing or Action devices such as patrol vehicles, on-site communication equipment, etc., are connected in parallel to broadcast warning information and train arrival information. For example, through feature recognition, the system finds that the true target is a flock, etc., it can link the animal driving equipment to keep the flock away from the train track and avoid major accidents. At the same time, according to the alarm information and target feature information issued by the intelligent visual perception system of the present invention, the personnel handling process is initiated, and the relevant responsible personnel are sent to the real target area to direct and control suspicious personnel and remove obstacles on the spot to ensure the normal operation of the railway. Personnel safety.
在上述实施例中,可以全部或部分地通过软件、硬件任意组合来实现。例如,智能视觉感知装置内部的摄像机可转动的视场方向角度较小,在需要其它方向区域进行监测 时,智能视觉感知装置的外壳固定在转台上,智能视觉感知装置的数据处理单元通过控制线与转台内的传动机构连接,智能视觉感知装置控制转台大角度旋转,进而实现其它区域的监测。例如,智能视觉感知装置中有多台摄像机时,可以使用多台摄像机进行交替目标的一次目标检测和目标的二次目标检测,提高识别效率。在监测区域内有较多目标进行实时监控时,可以通过5G技术将视频图像数据实时上传至边缘云中的服务器,如图4所示,在服务其中进行目标二次和/或三次识别。例如,本发明的智能视觉感知系统包含多个智能视觉感知装置,分别覆盖不同的监测范围,形成监测网络,从而扩大监测领域,真正实现防护区域的整体监测。In the above-mentioned embodiments, it may be implemented in whole or in part by any combination of software and hardware. For example, the camera inside the intelligent visual perception device has a small rotatable field of view. When other areas are required for monitoring, the housing of the intelligent visual perception device is fixed on the turntable, and the data processing unit of the intelligent visual perception device passes through the control line. Connected with the transmission mechanism in the turntable, the intelligent visual perception device controls the turntable to rotate at a large angle, and then realizes the monitoring of other areas. For example, when there are multiple cameras in the intelligent visual perception device, multiple cameras can be used to alternate the primary target detection of the target and the secondary target detection of the target to improve the recognition efficiency. When there are many targets in the monitoring area for real-time monitoring, the video image data can be uploaded to the server in the edge cloud in real time through 5G technology, as shown in Figure 4, the target is identified twice and/or three times in the service. For example, the intelligent visual perception system of the present invention includes a plurality of intelligent visual perception devices, which respectively cover different monitoring ranges to form a monitoring network, thereby expanding the monitoring field and truly realizing the overall monitoring of the protection area.
云端平台和/或数据中心,包括服务器和通过图像目标检测算法进行图像识别的软件;监控终端,为一个或多个,用于显示目标识别的结果信息、接收报警信息、进行远程配置和控制,包括智能终端设备和其运行的管理软件,智能终端设备包括但不限计算机、手持机中的一种或多种。服务器可以为虚拟服务器,包括本地服务器、边缘云、公共云中的一种或多种。Cloud platform and/or data center, including server and software for image recognition through image target detection algorithm; monitoring terminal, one or more, used to display target recognition result information, receive alarm information, perform remote configuration and control, Including smart terminal equipment and its running management software. The smart terminal equipment includes but is not limited to one or more of a computer and a handheld. The server may be a virtual server, including one or more of a local server, an edge cloud, and a public cloud.
以上所述,仅为本发明的具体实施方式,但本发明的保护范围并不局限于此,任何熟悉本技术领域的技术人员在本发明揭露的技术范围内,可轻易想到变化或替换,都应涵盖在本发明的保护范围之内。因此,本发明的保护范围应以所述权利要求的保护范围为准。The above are only specific embodiments of the present invention, but the protection scope of the present invention is not limited thereto. Any person skilled in the art can easily think of changes or substitutions within the technical scope disclosed by the present invention. It should be covered within the protection scope of the present invention. Therefore, the protection scope of the present invention should be subject to the protection scope of the claims.

Claims (22)

  1. 一种智能视觉感知系统,其特征在于,所述系统包括可变焦距的摄像机和指定监测区域的假目标反馈特征信息库;An intelligent visual perception system, characterized in that the system includes a camera with a variable focal length and a false target feedback feature information database in a designated monitoring area;
    所述摄像机以第一分辨率对所述监测区域进行视频图像采集,对采集的视频图像,采用一次目标检测算法,进行区域内一次目标检测,包括:基于假目标反馈特征信息库判断所述指定监测区域是否包含疑似目标;The camera collects video images of the monitored area at the first resolution, and uses a target detection algorithm to perform a target detection in the area on the collected video images, including: judging the designated based on the false target feedback feature information database Whether the monitoring area contains suspected targets;
    当发现所述监测区域包含疑似目标时,调整所述摄像机,以第二分辨率对所述疑似目标进行视频图像采集,并基于采集的视频图像,采用二次目标检测算法,进行二次目标检测,以确定所述疑似目标为真目标或假目标;When it is found that the monitoring area contains a suspected target, the camera is adjusted to collect a video image of the suspected target at the second resolution, and based on the collected video image, a secondary target detection algorithm is used for secondary target detection To determine whether the suspected target is a true target or a fake target;
    当所述疑似目标为假目标时,根据所述假目标的信息更新假目标反馈特征信息库;When the suspected target is a false target, update the false target feedback feature information database according to the information of the false target;
    当所述疑似目标为真目标时,对所述真目标进行跟踪监测;When the suspected target is a true target, tracking and monitoring the true target;
    所述第二分辨率大于所述第一分辨率;The second resolution is greater than the first resolution;
    所述假目标反馈特征信息库包括假目标的位置信息和假目标特征描述信息;The false target feedback feature information database includes the location information of the false target and the false target feature description information;
    所述摄像机以第一分辨率对所述监测区域进行视频图像采集包括:摄像机以大视野小分辨率对所述监测区域进行视频图像采集;The video image collection of the monitoring area by the camera at the first resolution includes: the video image collection of the monitoring area by the camera with a large field of view and a small resolution;
    所述一次目标检测包括:基于假目标反馈特征信息库,判断所述监测区域内的图像特征与该区域的假目标特征描述信息和位置信息匹配度是否满足指定条件,若满足,则不判为疑似目标;所述一次目标检测算法包括基于固定背景模型的运动目标检测算法和与背景无关的目标分类检测算法;The one-time target detection includes: based on the false target feedback feature information database, judging whether the image feature in the monitoring area matches the false target feature description information and position information of the area meets a specified condition, if it is satisfied, it is not judged as Suspected target; the primary target detection algorithm includes a moving target detection algorithm based on a fixed background model and a target classification detection algorithm that has nothing to do with the background;
    当发现监测区域包含一个或多个疑似目标时,调整所述摄像机的视场方向和视场角,以第二分辨率对所述疑似目标进行视频图像采集,包括:以小视野大分辨率依次对所述疑似目标进行视频图像采集;When it is found that the monitoring area contains one or more suspected targets, adjust the field of view direction and angle of view of the camera, and collect video images of the suspected targets at the second resolution, including: successively with a small field of view and a large resolution Video image collection of the suspected target;
    所述二次目标检测算法为与背景无关的目标分类检测算法;The secondary target detection algorithm is a target classification detection algorithm that has nothing to do with background;
    根据所述假目标的信息更新假目标反馈特征信息库包括:将所述大分辨率图像下确定的假目标,映射到所述小分辨率图像中,获取假目标位置信息,并提取所述假目标特征描述信息,更新所述假目标反馈特征信息库;Updating the false target feedback feature information database according to the information of the false target includes: mapping the false target determined under the large-resolution image to the small-resolution image, acquiring the position information of the false target, and extracting the false target Target feature description information, updating the false target feedback feature information database;
    当所述疑似目标为真目标时,对所述真目标进行跟踪监测,并输出大分辨率视频数据,输出告警信息和/或目标特征信息;When the suspected target is a true target, tracking and monitoring the true target, and outputting large-resolution video data, outputting alarm information and/or target characteristic information;
    所述大视野小分辨率指通过设定PTZ参数,使所述视频图像覆盖需要监控的区域;The large field of view and small resolution means that the video image covers the area that needs to be monitored by setting PTZ parameters;
    所述小视野大分辨率指通过调整PTZ参数,使所述目标缩放到所述视频图像高度的1/10~4/5;The small field of view and large resolution means that by adjusting PTZ parameters, the target is scaled to 1/10 to 4/5 of the height of the video image;
    对所述假目标特征描述信息设定更新速率,优化所述假目标反馈特征信息库。An update rate is set for the false target characteristic description information, and the false target feedback characteristic information database is optimized.
  2. 如权利要求1所述的智能视觉感知系统,其特征在于,所述小视野大分辨率指通过调整PTZ参数,使所述目标缩放到所述视频图像高度的1/6~2/3,并调整到视野中心。The intelligent visual perception system of claim 1, wherein the small field of view and large resolution refers to adjusting the PTZ parameters to make the target zoom to 1/6 to 2/3 of the height of the video image, and Adjust to the center of the field of view.
  3. 如权利要求1所述的智能视觉感知系统,其特征在于,所述目标特征信息包括表征目标具体分类的特征信息和/或表征目标具体身份的特征信息;所述表征目标具体分类的特征信息包括人、动物、车和/或车型、飞行物、指定异物中的一种或多种,其中所述指定异物包括自然掉落物和/或扩散物、人类遗落物中的一种或多种;所述表征目标具体身份的特征信息包括人的身份、动物的种类、车的车牌、其它异物种类中的一种或多种。The intelligent visual perception system according to claim 1, wherein the target feature information includes feature information that characterizes the specific classification of the target and/or feature information that characterizes the specific identity of the target; the feature information that characterizes the specific classification of the target includes One or more of humans, animals, vehicles and/or models, flying objects, and designated foreign objects, wherein the designated foreign objects include one or more of natural falling objects and/or diffuse objects, and human remains The characteristic information that characterizes the specific identity of the target includes one or more of the identity of the person, the type of animal, the license plate of the car, and the type of other foreign objects.
  4. 如权利要求1所述的智能视觉感知系统,其特征在于,所述监测区域为N个,其中N≥1;系统监测过程包括:The intelligent visual perception system of claim 1, wherein there are N monitoring areas, where N≥1; the system monitoring process includes:
    在第M个监测区域进行一次目标检测,当所述一次目标检测发现无疑似目标时,调整所述摄像机的视场方向和焦距,以大视野小分辨率对下一个监测区域进行一次目标检测,其中,1≤M≤N;Perform a target detection in the M-th monitoring area. When the first target detection finds that there is no doubt that it is a target, adjust the field of view direction and focal length of the camera, and perform a target detection on the next monitoring area with a large field of view and a small resolution. Among them, 1≤M≤N;
    对第M个监测区域的所述真目标进行跟踪,当所述真目标消失或跟踪时间达到设定值时,调整所述摄像机的视场方向和视场角,以小视野大分辨率对第M个监测区域的下一个所述真目标进行跟踪;The real target in the M-th monitoring area is tracked. When the real target disappears or the tracking time reaches a set value, adjust the field of view direction and angle of the camera, and use a small field of view and a large resolution on the first Tracking the next said real target in M monitoring areas;
    当第M个监测区域的所有所述真目标消失或跟踪时间达到设定值时,调整所述摄像机的视场方向和焦距,以所述大视野小分辨率对下一个监测区域进行一次目标检测;When all the true targets in the M-th monitoring area disappear or the tracking time reaches the set value, adjust the field of view direction and focal length of the camera, and perform a target detection on the next monitoring area with the large field of view and small resolution ;
    依次对所述N个监测区域进行目标检测,循环往复。Perform target detection on the N monitoring areas in turn, cyclically.
  5. 如权利要求1所述的智能视觉感知系统,其特征在于,所述监测区域为N个,N≥1;The intelligent visual perception system of claim 1, wherein there are N monitoring areas, and N≥1;
    分别对所有N个监测区域进行一次目标检测,对一次目标检测发现疑似目标的区域计算各监测区域各自的特征权重;根据各监测区域各自特征权重的大小,按照指定顺序依序对所述各监测区域进行二次目标检测和跟踪。Perform a target detection on all N monitoring areas respectively, and calculate the respective feature weights of each monitoring area for the area where a suspected target is found in a target detection; according to the size of the respective feature weights of each monitoring area, perform the monitoring of each of the monitoring areas in order according to the specified order Area for secondary target detection and tracking.
  6. 如权利要求1所述的智能视觉感知系统,其特征在于,所述系统包括1个或多个智能视觉感知装置,所述智能视觉感知装置包括:The intelligent visual perception system of claim 1, wherein the system comprises one or more intelligent visual perception devices, and the intelligent visual perception devices comprise:
    所述摄像机,用于所述视频图像采集,包括聚焦电机、变焦电机、驱动模块、图像信号采集处理单元中的一种或多种;The camera is used for the video image collection, and includes one or more of a focus motor, a zoom motor, a drive module, and an image signal collection and processing unit;
    传动机构,用于调整所述摄像机的视场方向和大小;A transmission mechanism for adjusting the direction and size of the field of view of the camera;
    数据处理单元,用于对所述摄像机采集的视频数据进行分析处理,控制所述摄像机进行焦距调整,控制所述传动机构调整所述摄像机视场方向和大小,和/或与云端平台或数据中心进行信息交互;The data processing unit is used to analyze and process the video data collected by the camera, control the camera to adjust the focal length, control the transmission mechanism to adjust the direction and size of the camera's field of view, and/or interact with the cloud platform or data center Carry out information exchange;
    通信接口单元,用于与所述云端平台和/或数据中心进行信息交互,和/或与现场其它传感或动作装置,和/或其它关联系统的联动信息交互,包括有线和/或无线接口;The communication interface unit is used for information interaction with the cloud platform and/or data center, and/or linkage information interaction with other on-site sensing or action devices, and/or other associated systems, including wired and/or wireless interfaces ;
    电源管理单元,用于给所有耗电单元供电;The power management unit is used to supply power to all power-consuming units;
    防护外壳,用于将各单元封装在内,起到防护作用。The protective shell is used to encapsulate each unit and play a protective role.
  7. 如权利要求6所述的智能视觉感知系统,其特征在于,所述智能视觉感知装置还包括光照单元,用于给所述摄像机所监测的区域补光;所述光照单元为1个或多个, 包括可见光源、红外光源中的一种或多种;所述传动机构调整所述光照单元的视场方向和大小。The intelligent visual perception system according to claim 6, wherein the intelligent visual perception device further comprises a light unit for supplementing light to the area monitored by the camera; the light unit is one or more , Including one or more of a visible light source and an infrared light source; the transmission mechanism adjusts the direction and size of the field of view of the illumination unit.
  8. 如权利要求6所述的智能视觉感知系统,其特征在于,所述智能视觉感知装置中摄像机为一台或多台,所述摄像机为可见光摄像机和/或红外摄像机,所述红外摄像机包括近红外摄像机和/或红外热成像摄像机。The intelligent visual perception system according to claim 6, wherein there are one or more cameras in the intelligent visual perception device, the cameras are visible light cameras and/or infrared cameras, and the infrared cameras include near-infrared cameras. Camera and/or infrared thermal imaging camera.
  9. 如权利要求8所述的智能视觉感知系统,其特征在于,由一台或多台摄像机完成所述一次目标检测中的视频图像采集,和/或所述二次目标检测中的视频图像采集,和/或所述真目标跟踪时的视频图像采集。8. The intelligent visual perception system of claim 8, wherein one or more cameras complete the video image acquisition in the primary target detection and/or the video image acquisition in the secondary target detection, And/or the video image collection during the real target tracking.
  10. 如权利要求9所述的智能视觉感知系统,其特征在于,由所述红外摄像机完成所述一次目标检测中的视频图像采集;由所述可见光摄像机完成所述二次目标检测中的视频图像采集,和/或所述真目标跟踪时的视频图像采集。The intelligent visual perception system of claim 9, wherein the infrared camera completes the video image acquisition in the primary target detection; the visible light camera completes the video image acquisition in the secondary target detection , And/or the video image acquisition during the real target tracking.
  11. 如权利要求9所述的智能视觉感知系统,其特征在于,由所述可见光摄像机完成所述一次目标检测中的视频图像采集,和/或所述二次目标检测中的视频图像采集,和/或所述真目标跟踪时的视频图像采集。The intelligent visual perception system of claim 9, wherein the visible light camera completes the video image acquisition in the primary target detection, and/or the video image acquisition in the secondary target detection, and/ Or the video image collection during the real target tracking.
  12. 如权利要求7所述的智能视觉感知系统,其特征在于,所述传动机构,用于调整所述摄像机和/或光照单元的水平和/或竖直视场方向和大小,包括驱动电机、水平转轴、竖直转轴;所述驱动电机驱动所述摄像机和/或光照单元绕所述转轴水平转动0~360度,上下转动0~180度。The intelligent visual perception system according to claim 7, wherein the transmission mechanism is used to adjust the horizontal and/or vertical field of view direction and size of the camera and/or the light unit, and includes a drive motor, a horizontal A rotating shaft and a vertical rotating shaft; the driving motor drives the camera and/or the light unit to rotate horizontally around the rotating shaft by 0-360 degrees, and up and down by 0-180 degrees.
  13. 如权利要求6所述的智能视觉感知系统,其特征在于,所述数据处理单元,采用具有视频图像处理能力的芯片,集成图像目标检测算法程序,进行实时视频图像处理;当识别出所述疑似目标为假目标时,对所述假目标进行特征描述,并将所述特征反馈给所述图像目标检测算法程序。The intelligent visual perception system of claim 6, wherein the data processing unit adopts a chip with video image processing capabilities, integrates an image target detection algorithm program, and performs real-time video image processing; when the suspect is identified When the target is a false target, the false target is characterized, and the characteristic is fed back to the image target detection algorithm program.
  14. 如权利要求6所述的智能视觉感知系统,其特征在于,所述通信接口单元包括输入接口和输出接口;所述输入接口用于接收外部设备信号;所述输出接口用于发送系统所采集或接收的信号,其连接方式包括无线和/或有线方式;其中,所述无线方式包括WIFI、BT、ZIGBEE、LORA、2G、3G、4G、5G、NB-IOT中的一种或多种;所述有线方式包括AI/AO、DI/DO、RS485、RS422、RS232、CAN总线、LAN、光纤中的一种或多种。The intelligent visual perception system according to claim 6, wherein the communication interface unit includes an input interface and an output interface; the input interface is used to receive signals from external devices; the output interface is used to transmit or The connection mode of the received signal includes wireless and/or wired mode; wherein, the wireless mode includes one or more of WIFI, BT, ZIGBEE, LORA, 2G, 3G, 4G, 5G, and NB-IOT; The wired mode includes one or more of AI/AO, DI/DO, RS485, RS422, RS232, CAN bus, LAN, and optical fiber.
  15. 如权利要求14所述的智能视觉感知系统,其特征在于,所述智能视觉感知装置通过所述输入接口,接收所述监测区域内的其它传感或动作装置的信号,当所述传感或动作装置发送异常情况信号和/或报警信息时,所述智能视觉感知装置调整所述摄像机的视场方向和焦距,优先对所述传感或动作装置所在区域进行目标检测。The intelligent visual perception system of claim 14, wherein the intelligent visual perception device receives signals from other sensing or action devices in the monitoring area through the input interface, and when the sensing or When the action device sends an abnormal situation signal and/or alarm information, the intelligent visual perception device adjusts the direction and focal length of the camera's field of view, and prioritizes target detection in the area where the sensing or action device is located.
  16. 如权利要求14所述的智能视觉感知系统,其特征在于,所述智能视觉感知装置通过所述输出接口,向所述监测区域内的其它传感或动作装置提供其感知数据和/或结果信息,用于数据融合和/或联合判断,和/或对所述传感或动作装置进行直接控制或联动。The intelligent visual perception system of claim 14, wherein the intelligent visual perception device provides its perception data and/or result information to other sensing or action devices in the monitoring area through the output interface , Used for data fusion and/or joint judgment, and/or direct control or linkage of the sensing or action device.
  17. 如权利要求15或16所述的智能视觉感知系统,其特征在于,所述监测区域内的其它传感或动作装置包括告警声光设备、门禁设备、消防设备、除障设备、动物驱赶设备、拖车设备、清扫设备、巡视设备、冲力减缓设备、紧急止停设备、分流设备、现场和/或外部通讯设备、防爆设备、医疗救助设备、掩蔽设备、无人机运输设备、人员疏散和/或安全撤离设备中的一种或多种;所述信息包括告警信息和/或目标特征信息。The intelligent visual perception system according to claim 15 or 16, wherein other sensing or action devices in the monitoring area include alarm sound and light equipment, access control equipment, fire fighting equipment, obstacle removal equipment, animal repelling equipment, Trailer equipment, cleaning equipment, inspection equipment, impulse mitigation equipment, emergency stop equipment, shunt equipment, on-site and/or external communication equipment, explosion-proof equipment, medical rescue equipment, shelter equipment, drone transportation equipment, personnel evacuation and/or One or more of the safe evacuation equipment; the information includes alarm information and/or target characteristic information.
  18. 如权利要求6所述的智能视觉感知系统,其特征在于,所述电源管理单元包括集成在系统内部的电池,和/或外部的太阳能电池板,和/或有线电源。The intelligent visual perception system of claim 6, wherein the power management unit includes a battery integrated inside the system, and/or an external solar panel, and/or a wired power supply.
  19. 如权利要求7所述的智能视觉感知系统,其特征在于,所述防护外壳包括接口板、窗口、固定座;所述接口板上有1个或多个接口,与外部单元连接;所述窗口采用透光材料,分别透射所述摄像机采集的视频图像,和/或所述光照单元发出的光;所述固定座用于将所述防护外壳固定在外部支架上。The intelligent visual perception system according to claim 7, wherein the protective housing includes an interface board, a window, and a fixing seat; the interface board has one or more interfaces, which are connected to an external unit; the window The light-transmitting material is used to respectively transmit the video images collected by the camera and/or the light emitted by the light unit; the fixing seat is used to fix the protective shell on the external support.
  20. 如权利要求1所述的智能视觉感知系统,其特征在于,输出告警信息和/或目标特征信息至监控终端,和/或输出所述告警信息和/或目标特征信息至数据中心和/或云端平台,启动告警处理服务,分发信息至所述监控终端,完成包括告警处置和/或干预和/或系统联动中的一种或多种功能。The intelligent visual perception system of claim 1, wherein the alarm information and/or target characteristic information is output to a monitoring terminal, and/or the alarm information and/or target characteristic information is output to a data center and/or cloud The platform starts the alarm processing service, distributes information to the monitoring terminal, and completes one or more functions including alarm handling and/or intervention and/or system linkage.
  21. 如权利要求20所述的智能视觉感知系统,其特征在于,所述云端平台和/或数据中心,包括服务器和通过图像目标检测算法进行图像识别的软件;所述监控终端,为一个或多个,用于显示目标识别的结果信息、接收报警信息、进行远程配置和控制,包括智能终端设备和其运行的管理软件,所述智能终端设备包括但不限计算机、手持机中的一种或多种。The intelligent visual perception system of claim 20, wherein the cloud platform and/or data center includes a server and software for image recognition through image target detection algorithms; the monitoring terminal is one or more , Used to display the result information of target recognition, receive alarm information, perform remote configuration and control, including intelligent terminal equipment and its running management software, said intelligent terminal equipment including but not limited to one or more of computers and handhelds kind.
  22. 如权利要求21所述的智能视觉感知系统,其特征在于,所述服务器包括虚拟服务器,包括本地服务器、边缘云、公共云中的一种或多种。The intelligent visual perception system of claim 21, wherein the server comprises a virtual server, including one or more of a local server, an edge cloud, and a public cloud.
PCT/CN2021/087921 2020-06-15 2021-04-16 Intelligent visual perception system WO2021253961A1 (en)

Applications Claiming Priority (2)

Application Number Priority Date Filing Date Title
CN202010544411.2A CN111770266B (en) 2020-06-15 2020-06-15 Intelligent visual perception system
CN202010544411.2 2020-06-15

Publications (1)

Publication Number Publication Date
WO2021253961A1 true WO2021253961A1 (en) 2021-12-23

Family

ID=72721021

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2021/087921 WO2021253961A1 (en) 2020-06-15 2021-04-16 Intelligent visual perception system

Country Status (2)

Country Link
CN (1) CN111770266B (en)
WO (1) WO2021253961A1 (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114218570A (en) * 2022-02-22 2022-03-22 广州思涵信息科技有限公司 Method and system for preventing computer information from being stolen and stolen based on AI and RPA technology
CN114332779A (en) * 2022-03-15 2022-04-12 云丁网络技术(北京)有限公司 Method for monitoring target object and related equipment
CN114640853A (en) * 2022-05-18 2022-06-17 滨州市人防工程与指挥保障中心 Unmanned aerial vehicle image processing system that cruises
CN114884842A (en) * 2022-04-13 2022-08-09 哈工大机器人(合肥)国际创新研究院 Visual security detection system and method for dynamically configuring tasks
CN114977511A (en) * 2022-06-21 2022-08-30 众芯汉创(北京)科技有限公司 Wind power plant transformer substation monitoring system based on block chain
CN116485799A (en) * 2023-06-25 2023-07-25 成都考拉悠然科技有限公司 Method and system for detecting foreign matter coverage of railway track
CN117241447A (en) * 2023-11-14 2023-12-15 深圳市创先照明科技有限公司 Light control method, light control device, electronic equipment and computer readable storage medium

Families Citing this family (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN111770266B (en) * 2020-06-15 2021-04-06 北京世纪瑞尔技术股份有限公司 Intelligent visual perception system
CN112383714A (en) * 2020-11-13 2021-02-19 珠海大横琴科技发展有限公司 Target object tracking method and device
CN112511610A (en) * 2020-11-19 2021-03-16 上海营邑城市规划设计股份有限公司 Vehicle-mounted patrol intelligent method and system based on urban fine management conditions
CN112543497A (en) * 2020-12-10 2021-03-23 四川省数字商企智能科技有限公司 Safety monitoring system of degree of depth perception
CN113055656B (en) * 2021-03-30 2023-07-11 深圳市多酷科技有限公司 Monitoring system and monitoring method
CN113301308B (en) * 2021-05-25 2022-06-17 浙江安防职业技术学院 Video monitoring device for safety monitoring
CN113507577A (en) * 2021-07-07 2021-10-15 杭州海康威视系统技术有限公司 Target object detection method, device, equipment and storage medium
CN113822216A (en) * 2021-09-29 2021-12-21 上海商汤智能科技有限公司 Event detection method, device, system, electronic equipment and storage medium
CN115002341B (en) * 2022-04-28 2024-11-05 中科蓝卓(北京)信息科技有限公司 Target monitoring method and system based on defense segmentation
CN117872974A (en) * 2023-12-20 2024-04-12 枣庄市喜神科技有限公司 Production system, method and device based on digitalization and Internet of things
CN117727161A (en) * 2023-12-22 2024-03-19 西安长地空天科技有限公司 Highway falling stone monitoring system and method

Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110128385A1 (en) * 2009-12-02 2011-06-02 Honeywell International Inc. Multi camera registration for high resolution target capture
CN106023249A (en) * 2016-05-13 2016-10-12 电子科技大学 Moving object detection method based on local binary similarity pattern
CN106842350A (en) * 2016-12-26 2017-06-13 首都师范大学 Combine moving target detecting system and detection method with platform different resolution sensor
CN109282799A (en) * 2018-11-29 2019-01-29 上海航天控制技术研究所 Method is quickly tracked down and arrested in a kind of classification for target
CN109635146A (en) * 2018-11-28 2019-04-16 武汉虹信技术服务有限责任公司 A kind of target query method and system based on characteristics of image
CN109685062A (en) * 2019-01-02 2019-04-26 南方科技大学 Target detection method, device, equipment and storage medium
CN111770266A (en) * 2020-06-15 2020-10-13 北京世纪瑞尔技术股份有限公司 Intelligent visual perception system

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN101465033B (en) * 2008-05-28 2011-01-26 丁国锋 Automatic tracking recognition system and method
CN101924923B (en) * 2010-08-03 2013-01-09 杭州翰平电子技术有限公司 Embedded intelligent automatic zooming snapping system and method thereof
CN109993065B (en) * 2019-03-06 2022-08-23 开易(北京)科技有限公司 Driver behavior detection method and system based on deep learning

Patent Citations (7)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US20110128385A1 (en) * 2009-12-02 2011-06-02 Honeywell International Inc. Multi camera registration for high resolution target capture
CN106023249A (en) * 2016-05-13 2016-10-12 电子科技大学 Moving object detection method based on local binary similarity pattern
CN106842350A (en) * 2016-12-26 2017-06-13 首都师范大学 Combine moving target detecting system and detection method with platform different resolution sensor
CN109635146A (en) * 2018-11-28 2019-04-16 武汉虹信技术服务有限责任公司 A kind of target query method and system based on characteristics of image
CN109282799A (en) * 2018-11-29 2019-01-29 上海航天控制技术研究所 Method is quickly tracked down and arrested in a kind of classification for target
CN109685062A (en) * 2019-01-02 2019-04-26 南方科技大学 Target detection method, device, equipment and storage medium
CN111770266A (en) * 2020-06-15 2020-10-13 北京世纪瑞尔技术股份有限公司 Intelligent visual perception system

Cited By (11)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN114218570A (en) * 2022-02-22 2022-03-22 广州思涵信息科技有限公司 Method and system for preventing computer information from being stolen and stolen based on AI and RPA technology
CN114218570B (en) * 2022-02-22 2022-05-17 广州思涵信息科技有限公司 Method and system for preventing computer information from being stolen and stolen based on AI and RPA technologies
CN114332779A (en) * 2022-03-15 2022-04-12 云丁网络技术(北京)有限公司 Method for monitoring target object and related equipment
CN114884842A (en) * 2022-04-13 2022-08-09 哈工大机器人(合肥)国际创新研究院 Visual security detection system and method for dynamically configuring tasks
CN114884842B (en) * 2022-04-13 2023-09-05 哈工大机器人(合肥)国际创新研究院 Visual security detection system and method for dynamic configuration task
CN114640853A (en) * 2022-05-18 2022-06-17 滨州市人防工程与指挥保障中心 Unmanned aerial vehicle image processing system that cruises
CN114977511A (en) * 2022-06-21 2022-08-30 众芯汉创(北京)科技有限公司 Wind power plant transformer substation monitoring system based on block chain
CN116485799A (en) * 2023-06-25 2023-07-25 成都考拉悠然科技有限公司 Method and system for detecting foreign matter coverage of railway track
CN116485799B (en) * 2023-06-25 2023-09-15 成都考拉悠然科技有限公司 Method and system for detecting foreign matter coverage of railway track
CN117241447A (en) * 2023-11-14 2023-12-15 深圳市创先照明科技有限公司 Light control method, light control device, electronic equipment and computer readable storage medium
CN117241447B (en) * 2023-11-14 2024-03-05 深圳市创先照明科技有限公司 Light control method, light control device, electronic equipment and computer readable storage medium

Also Published As

Publication number Publication date
CN111770266A (en) 2020-10-13
CN111770266B (en) 2021-04-06

Similar Documents

Publication Publication Date Title
WO2021253961A1 (en) Intelligent visual perception system
US11645904B2 (en) Drone-augmented emergency response services
US20210311476A1 (en) Patrol robot and patrol robot management system
CN108297058B (en) Intelligent security robot and automatic inspection method thereof
EP4105101A1 (en) Monitoring system, monitoring method, and monitoring device for railway train
CN101465033B (en) Automatic tracking recognition system and method
CN113486777B (en) Behavior analysis method and device of target object, electronic equipment and storage medium
CN107483889A (en) The tunnel monitoring system of wisdom building site control platform
CN108206932A (en) A kind of campus intelligent monitoring management system
CN112288984A (en) Three-dimensional visual unattended substation intelligent linkage system based on video fusion
KR20070029760A (en) Monitoring devices
CN103929592A (en) All-dimensional intelligent monitoring equipment and method
CN210899299U (en) Tunnel monitoring system
KR101326707B1 (en) Camera system for vehicle number recognition and security
CN108205896A (en) A kind of Campus transport safety intelligence monitoring and controlling manages system
KR101925807B1 (en) Tunnel Autonomous operating system based Embedded using Multimedia and CPS Technology
KR20080044812A (en) The automatic guard system to prevent the crime and accident using computer video image analysis technology
CN108638082A (en) Security robot system based on Internet of Things
CN108234926A (en) A kind of human behavior discriminance analysis system
RU2268497C2 (en) System and method for automated video surveillance and recognition of objects and situations
CN114566056B (en) Highway tunnel driving safety risk identification, prevention and control method and system
CN113096337B (en) Moving target identification processing method for complex background and intelligent security system
CN109544870A (en) Alarm decision method and intelligent monitor system for intelligent monitor system
CN114913663B (en) Abnormality detection method, abnormality detection device, computer device, and storage medium
KR102513372B1 (en) Hybrid type object tracking system based on sound and image and control method using the same

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 21825604

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 21825604

Country of ref document: EP

Kind code of ref document: A1