Nothing Special   »   [go: up one dir, main page]

WO2022141271A1 - Control method and control device for platform system, platform system, and storage medium - Google Patents

Control method and control device for platform system, platform system, and storage medium Download PDF

Info

Publication number
WO2022141271A1
WO2022141271A1 PCT/CN2020/141711 CN2020141711W WO2022141271A1 WO 2022141271 A1 WO2022141271 A1 WO 2022141271A1 CN 2020141711 W CN2020141711 W CN 2020141711W WO 2022141271 A1 WO2022141271 A1 WO 2022141271A1
Authority
WO
WIPO (PCT)
Prior art keywords
photographing
photographing device
shooting
picture
pan
Prior art date
Application number
PCT/CN2020/141711
Other languages
French (fr)
Chinese (zh)
Inventor
杨俊超
王振动
王协平
Original Assignee
深圳市大疆创新科技有限公司
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by 深圳市大疆创新科技有限公司 filed Critical 深圳市大疆创新科技有限公司
Priority to CN202080069083.3A priority Critical patent/CN114556904A/en
Priority to PCT/CN2020/141711 priority patent/WO2022141271A1/en
Publication of WO2022141271A1 publication Critical patent/WO2022141271A1/en

Links

Images

Classifications

    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • H04N23/671Focus control based on electronic image sensor signals in combination with active ranging signals, e.g. using light or sound signals emitted toward objects
    • FMECHANICAL ENGINEERING; LIGHTING; HEATING; WEAPONS; BLASTING
    • F16ENGINEERING ELEMENTS AND UNITS; GENERAL MEASURES FOR PRODUCING AND MAINTAINING EFFECTIVE FUNCTIONING OF MACHINES OR INSTALLATIONS; THERMAL INSULATION IN GENERAL
    • F16MFRAMES, CASINGS OR BEDS OF ENGINES, MACHINES OR APPARATUS, NOT SPECIFIC TO ENGINES, MACHINES OR APPARATUS PROVIDED FOR ELSEWHERE; STANDS; SUPPORTS
    • F16M11/00Stands or trestles as supports for apparatus or articles placed thereon ; Stands for scientific apparatus such as gravitational force meters
    • F16M11/02Heads
    • F16M11/04Means for attachment of apparatus; Means allowing adjustment of the apparatus relatively to the stand
    • F16M11/06Means for attachment of apparatus; Means allowing adjustment of the apparatus relatively to the stand allowing pivoting
    • GPHYSICS
    • G01MEASURING; TESTING
    • G01SRADIO DIRECTION-FINDING; RADIO NAVIGATION; DETERMINING DISTANCE OR VELOCITY BY USE OF RADIO WAVES; LOCATING OR PRESENCE-DETECTING BY USE OF THE REFLECTION OR RERADIATION OF RADIO WAVES; ANALOGOUS ARRANGEMENTS USING OTHER WAVES
    • G01S5/00Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations
    • G01S5/02Position-fixing by co-ordinating two or more direction or position line determinations; Position-fixing by co-ordinating two or more distance determinations using radio waves
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/61Control of cameras or camera modules based on recognised objects
    • H04N23/611Control of cameras or camera modules based on recognised objects where the recognised objects include parts of the human body
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/67Focus control based on electronic image sensor signals
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N23/00Cameras or camera modules comprising electronic image sensors; Control thereof
    • H04N23/60Control of cameras or camera modules
    • H04N23/695Control of camera direction for changing a field of view, e.g. pan, tilt or based on tracking of objects
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04NPICTORIAL COMMUNICATION, e.g. TELEVISION
    • H04N5/00Details of television systems
    • H04N5/222Studio circuitry; Studio devices; Studio equipment

Definitions

  • the present application relates to the field of image capturing, and in particular, to a control method of a pan-tilt system, a control device, a pan-tilt system and a storage medium.
  • auxiliary shooting devices only support the auxiliary stabilization function in the photography-related functions, and cannot assist people in shooting, which greatly reduces the functions of the auxiliary shooting devices.
  • the present application provides a control method of a pan-tilt system, a control device, a pan-tilt system, and a storage medium, which are used for assisting shooting and enhancing auxiliary functions to improve shooting quality.
  • a first aspect of the present application is to provide a control method of a pan/tilt system, wherein the pan/tilt system includes a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device
  • the method includes: acquiring depth data obtained by a depth sensor observing the environment; determining a target object in the environment closest to the photographing device according to the depth data; controlling the cloud
  • the stage mechanism adjusts the posture of the photographing device to adjust the target object to a preset position in the photographing picture of the photographing device.
  • a second aspect of the present invention is to provide a control device for a pan-tilt system
  • the pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and adjusting the posture of the photographing device, and the photographing device is used to monitor the environment
  • the control device further includes: a memory and a processor; the memory is used to store a computer program; the processor is used to call and execute the computer program to perform the following steps: acquiring the Depth data obtained by observing the environment; determining the target object in the environment closest to the photographing device according to the depth data; controlling the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
  • a third aspect of the present invention is to provide a pan/tilt system, including a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the control device described in the second aspect.
  • a fourth aspect of the present invention is to provide a computer-readable storage medium, the storage medium is a computer-readable storage medium, and program instructions are stored in the computer-readable storage medium, and the program instructions are used in the first aspect. method described.
  • An embodiment of the present application provides a control method for a pan-tilt system, wherein the pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used for photographing an environment, and the method includes : Obtain the depth data observed by the depth sensor on the environment; determine the target object in the environment with the closest distance to the shooting device according to the depth data; control the pan-tilt mechanism to adjust the posture of the shooting device to adjust the target object to the position in the shooting screen of the shooting device Preset position.
  • the depth data is acquired by the depth sensor to more accurately determine the target object, so that the camera is adjusted through the pan-tilt mechanism to adjust the target object to a preset position for subsequent focusing and shooting, so as to assist the user in the Shoot, enhance shooting functions, and improve shooting quality.
  • FIG. 1 is a schematic flowchart of a control method of a pan-tilt system according to an embodiment of the present invention
  • FIG. 2 is a schematic diagram of a point cloud image provided by an embodiment of the present invention.
  • FIG. 3 is a specific schematic diagram of a control stabilizer provided by an embodiment of the present invention.
  • FIG. 4 is a schematic diagram of an outline of gesture recognition provided by an embodiment of the present invention.
  • FIG. 5 is a schematic structural diagram of a control device of a pan-tilt system according to an embodiment of the present invention.
  • FIG. 6 is a schematic structural diagram of a control device provided by an embodiment of the present invention.
  • the depth data is acquired by the depth sensor to more accurately determine the target object, and the camera is adjusted through the pan-tilt mechanism to adjust the target object to a preset position for subsequent focusing and shooting, thereby assisting the user Shoot, enhance your shooting capabilities, and improve your shooting quality.
  • FIG. 1 is a schematic flowchart of a control method of a pan-tilt system provided by an embodiment of the present invention
  • the method 100 provided by an embodiment of the present application may be executed by a pan-tilt system, such as a handheld pan-tilt.
  • the pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and adjusting the posture of the photographing device, and the photographing device is used for photographing the environment, such as a camera, a camera, and the like.
  • the method 100 includes the following steps:
  • the embodiments of the present application can also be implemented by a handheld stabilizer, a vehicle-mounted camera camera device stabilizer, an airborne camera camera device stabilizer, and the like, and these implementation devices may have a pan/tilt system.
  • the depth sensor is used to obtain depth data, which can be an active 3D depth sensor, which can autonomously emit the required visible light, invisible light and other electromagnetic waves or signals such as sound, and receive the signal reflected by the object to obtain the depth data.
  • Depth sensors include binocular vision sensors, TOF (Time of Flight) sensors, millimeter-wave radar or lidar.
  • the depth data may be data corresponding to a depth image or a point cloud image.
  • a photographing device such as a camera
  • a depth sensor is fixedly installed with a depth sensor.
  • an active 3D depth sensor can be external and fixed on the camera hot shoe.
  • the depth sensor can also be built in the P (Pitch, which is rotated around the X axis) axis of the gimbal mechanism. At this time, the calibration needs to be calibrated and the relative position of the camera.
  • the depth sensor can also be built in the gimbal R (Roll, which is rotated around the Z axis) axis. At this time, the relative position of the calibration and the camera and the relative angle of the camera and the camera need to be calculated in real time.
  • the depth sensor is built in the Y (Yaw, rotating around the Y axis) axis (or handle) of the gimbal. At this time, it is necessary to calibrate the relative position of the camera and the camera. The factory calibration can be done once. If the error between the center of the camera and the center of the gimbal mechanism is ignored , you only need to calibrate and the position of the center of the gimbal, you can do it once and for all.
  • the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device.
  • the optical axis of the camera is parallel to the optical axis of the depth sensor.
  • the above-mentioned environment is the environment of the shooting screen of the shooting device, that is, the environment in the shooting screen.
  • the user before taking an image, the user can move the lens of the camera to move the person to be photographed to a position near the center of the camera's field of view, and then the active 3D depth sensor can first actively emit detection into the shooting environment After it actively transmits the detection signal, it records the time when the detection signal is reflected from the transmitted to the object in the shooting environment to calculate the distance between the reflection surface of the object and the active 3D depth sensor.
  • the movement of the object is not considered here, or the movement of the object is ignored.
  • the speed of the moving object can also be calculated according to the change of the wavelength of the detection signal.
  • the active 3D depth sensor actively transmits the detection signal, which is reflected by the surface of the object.
  • the processor of the PTZ system can record the time interval between transmitting the detection signal and receiving the reflected signal, and then can calculate the depth information of the current pixel point according to the signal propagation speed, that is, the depth data (that is, the distance between the sensor and the active 3D depth sensor).
  • the depth data that is, the distance between the sensor and the active 3D depth sensor.
  • three-dimensional depth information can be obtained, that is, depth data, that is, a point cloud image is obtained, which can also be called a depth image (that is, corresponding to each Depth information or depth data of a pixel).
  • Figure 2 shows a point cloud image, that is, a depth image.
  • the embodiment of the present application can solve the problem that the reliability of image capturing of the device is affected under special weather or light conditions through the active 3D depth sensor;
  • the gimbal is externally combined.
  • the target object may be an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture.
  • the closest distance to the photographing device can also be determined by the closest distance to the depth sensor (the photographing device can be set in the same vertical direction as the depth sensor), for example, the angle of view of the camera corresponds to the person A who is the closest to the camera, and the person A is the closest to the depth sensor. . And person A is the closest to the preset position in the shooting screen.
  • determining the target object closest to the photographing device in the environment according to the depth data includes: determining the distances between multiple objects in the environment and the depth sensor according to the depth data; Identify the target object.
  • the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen; the target object is determined from the multiple objects according to the distances between the multiple objects and the depth sensor, including : Determine the positions of multiple objects in the shooting screen according to the installation position relationship between the depth sensor and the shooting device; determine the target from the multiple objects according to the distances between the multiple objects and the depth sensor and the positions of the multiple objects in the shooting screen object.
  • the processor of the gimbal system obtains the depth image through the depth sensor 301 , that is, after the depth data 302 , the matrix convolution operation combined with various gradient operators can be used to easily obtain points Cloud image, that is, the contour information or contour data of each object in the depth image.
  • the center point of the contour information which may be the depth data corresponding to the center point of the rectangular frame circumscribing the contour information, the physical distance between the object corresponding to the contour information and the active 3D depth sensor is determined.
  • the contour information can also be selected to represent the contour information, so as to determine the distance to the active 3D depth sensor according to the depth data corresponding to the point. After the distance is determined, the object corresponding to the contour information closest to the active 3D depth sensor can be determined. When multiple objects are closest to the active 3D depth sensor (then it can be directly considered that the multiple objects are the closest to the camera, and the error can be ignored), then the installation between the active 3D depth sensor and the camera can be continued according to the The positional relationship can determine the mapping relationship between the coordinate system corresponding to the active 3D depth sensor, that is, the coordinate system where the depth data is located, and the camera coordinate system.
  • the depth data can be mapped into the camera coordinate system, that is, the position in the shooting picture corresponding to the camera is determined. Therefore, after determining the objects closest to the active 3D depth sensor, the positions of the above-mentioned contour information center points of these objects in the shooting picture can be determined, and whether it is the closest to the preset position in the shooting picture can be determined by selecting The center point closest to the preset position in the shooting picture, so as to deduce the contour information corresponding to the center point and the corresponding target object.
  • contour information in the point cloud image is sufficient to implement many functions including the above functions. Considering that the photographed people and the background will not be on the same plane, the focus deviation can be obtained according to the outline information in the point cloud image, without the need to accurately obtain the human shape or other shapes, which can reduce the processing time of the processor, save the code space and In consideration of reducing system power consumption, complex shape recognition is not performed for functions that do not require complex operations. In addition, the acquisition of contour information can also directly obtain the contour information of a specified shape by using a related method of machine learning.
  • determining the target object from the plurality of objects according to the distances between the plurality of objects and the depth sensor includes: determining the object with the smallest distance from the depth sensor among the plurality of objects as the target object.
  • the object with the smallest distance from the depth sensor can be directly used as the target object, or the distance can be used as the target object.
  • the smallest depth sensor is considered the smallest distance to the camera, and the object that is the smallest distance to the depth sensor is also the smallest object to the camera.
  • the distance between the multiple objects and the photographing device may also be determined based on the installation position relationship between the depth sensor and the photographing device, so as to select the target object.
  • determining the target object from the multiple objects according to the distance between the multiple objects and the depth sensor includes: determining the multiple objects and the shooting device according to the distance between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the shooting device The distance between the devices; the object with the smallest distance from the photographing device among the multiple objects is determined as the target object.
  • the physical position relationship between the depth sensor and the photographing device can be determined according to the installation position relationship between the two, so that according to this physical position relationship, the The distances between the multiple objects and the depth sensor are converted into distances between the multiple objects and the photographing device, and the object with the smallest distance from the photographing device can be selected as the target object.
  • the active 3D depth sensor can be externally placed and fixed on the camera hot shoe, then according to this positional relationship and the distance between multiple objects and the active 3D depth sensor, conversion can be performed, such as through the triangle formula, to determine multiple The distance between the object and the camera.
  • the preset position is the center of the shooting screen, and may also be other positions on the shooting screen.
  • the preset position is determined by the user, and the user can set it through the photographing device or other display screens provided by the PTZ system, and the user can operate on the display screen or the screen of the photographing device to select the preset position. Thereby, the pan/tilt system determines the preset position in response to this operation.
  • controlling the pan-tilt mechanism to adjust the posture of the photographing device includes: controlling the pan-tilt mechanism to adjust the posture of the photographing device according to the deviation between the position of the target object in the photographing screen and a preset position.
  • the pan-tilt system can be corrected according to the deviation obtained between the position of the target object's contour information center point in the shooting picture and the preset position, and the controller of the pan-tilt system is controlled stably device, such as the angle of the gimbal mechanism, so as to adjust the posture of the camera, so that the camera zooms after the posture is adjusted.
  • the controller of the gimbal system can send control commands to the gimbal mechanism to adjust the angle and zoom the camera according to the deviation value corresponding to the deviation.
  • the controller can control the motor rotation of the gimbal mechanism according to the angle deviation, such as controlling the rotation of the P-axis motor, thereby rotating the Pitch axis and adjusting the position of the camera.
  • the depth data of the target object for example, the center point of its corresponding contour information
  • the position in the point cloud image and the preset position in the depth data or the position in the point cloud image (which can be determined by the above two methods.
  • the mapping relationship of each coordinate system is reversed, such as mapping from the data in the camera coordinate system to the data in the depth sensor coordinate system, that is, the depth data in the depth sensor coordinate system) Control the PTZ mechanism to adjust the shooting The posture of the device will not be repeated here.
  • the contour information near the middle of the camera's field of view (for example, within a certain preset area in the middle of the field of view) can be retained, and the center position of the corresponding contour information in the shooting picture can be calculated. Location. Then, the deviation between the position and the preset position can be determined, so as to make adjustment, which will not be repeated here.
  • the protruding contour at the nearest position of the center point can be automatically aligned.
  • This function can be used to assist people who want to be in the center of the field of view when taking pictures or selfies, and quickly and accurately correct them to the center.
  • the PTZ system can also realize the following functions:
  • the method 100 further includes: determining, according to the depth data, an object occupying the largest image area in the shooting picture of the shooting device; The entirety of the subject is in the shooting frame.
  • the pan-tilt system can determine the position of each object in the shooting screen according to the depth data, or the image area of each object in the shooting screen, so as to determine the size of each image area, so as to control the pan-tilt
  • the mechanism and/or the camera are adjusted so that the object with the largest image area is located at the preset position, which will not be repeated here.
  • controlling the pan/tilt mechanism and/or the focal length of the photographing device to be located at a preset position in the photographing screen and making the entire object lie in the photographing frame includes: if the entire object is not in the photographing frame, controlling the focal length of the photographing device So that the whole of the object is located in the shooting screen; the pan-tilt mechanism is controlled to adjust the shooting posture of the shooting device so that the object is located in the preset position in the shooting screen.
  • the object with the largest image area is not completely in the shooting screen, you can adjust the focus by controlling the focal length of the camera, so that the image area of the object is in the shooting screen.
  • the position of the object is preset by adjusting the camera posture by controlling the pan-tilt mechanism, which will not be repeated here.
  • the pan-tilt system uses the pan-tilt mechanism, so that after the photographing device adjusts the object with the largest image area to a preset position, it can compose a picture for the object.
  • the gimbal system can calculate the contour information center of the object and the length and width of the circumscribed rectangle of the contour information.
  • the preset composition ratio such as the golden ratio
  • the angle and the focal length of the shooting device make the composition ratio the closest to the required ratio (the gimbal mechanism does not move, and the proportion of the outline itself is not appropriate. ). Thus, quick composition can be achieved.
  • the method 100 further includes: determining the focus object in the picture of the photographing device; determining the distance of the focusing object according to the depth data; controlling the photographing device to focus according to the distance, so that the photographing device focuses on the focusing object.
  • determining the focus object in the picture of the shooting device includes: identifying the focus object in the shooting picture.
  • the focus object may be the target object described above, such as a person. Further, it can also be a characteristic part of a person. Such as human face, facial features, such as nose, eyes or mouth.
  • the feature part of the person is the feature part of the target object.
  • the gimbal system moves the target object to the preset position of the shooting screen through the gimbal mechanism, it can identify the focus object of the object, such as the nose, so as to focus on it, that is, the focus object. Then, the distance between the focal point and the camera is determined by the method described above, so as to focus.
  • the preset position can be used as the focus to focus.
  • the gimbal system can enable close-up focus selection.
  • the camera can be controlled to zoom according to the corresponding focal distance (which has been acquired by the depth sensor).
  • the method 100 further includes: recognizing the specific gesture of the user according to the depth data; and controlling the pan-tilt system or the photographing device according to the recognized specific gesture.
  • the specific gesture refers to a preset gesture, such as a gesture of greeting with both hands, as shown in FIG. 4 .
  • the user can enable the gesture mode through the application program of the PTZ system, and the PTZ system can first recognize the specific gesture, such as a gesture of greeting with both hands.
  • the PTZ system can obtain the outline information of the gesture, and then compare it with the outline information of the preset gesture to see if it is the same, or if the similarity exceeds the threshold, then it is determined to be a specific gesture, and the specific gesture can be enabled.
  • Corresponding functions such as waking up the tracking function of the PTZ system, or taking pictures, etc.
  • the PTZ system can set multiple preset gestures, and different preset gestures can correspond to different functions. After the PTZ system obtains the outline information of the user's gestures, it can be performed separately with multiple preset gestures. Compare and determine the corresponding preset gesture to wake up the function.
  • the method 100 further includes: determining the position of the tracking object in the shooting screen of the shooting device according to the depth data; controlling the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in the shooting screen.
  • the tracking object may be a person, a car, or the like.
  • the tracking function can be awakened according to the gesture, or the tracking function can be directly enabled to track the tracking object, such as taking a picture or a video.
  • the gimbal system can use the depth data to determine the position of the tracking object in the shooting screen, such as a preset position, in the way described above, so as to control the gimbal mechanism according to the position to adjust the camera posture for tracking shooting. .
  • the tracking object can be preset to be a person. Therefore, after obtaining the depth data, the tracking object can be determined according to the corresponding contour information, and the contour information can be compared with the contour information of the preset tracking object to determine. , and then trace.
  • the photographing or photographing of the tracking can be implemented in combination with the functions described above.
  • the embodiment of the present application proposes to use an active 3D depth sensor to process it to obtain a 3D point cloud image, that is, to combine the depth data with the control of a stabilizer and a photographing device, such as a camera device, to realize automatic centering, quick composition, close-up focus selection, gestures Recognition and other functions, combined with its active characteristics, increase the reliability of the target tracking function.
  • this type of sensor can use simpler software methods to achieve better results than image processing.
  • FIG. 5 is a schematic structural diagram of a control device of a pan-tilt system according to an embodiment of the present invention
  • the device 500 can be applied to a pan-tilt system, for example, a hand-held pan-tilt system.
  • a pan-tilt mechanism for adjusting the posture of the photographing device the photographing device is used to photograph the environment, and the device 500 can execute the above-mentioned control method of the pan-tilt system.
  • the apparatus 500 includes: an acquisition module 501 , a determination module 502 and a control module 503 . The functions of each module are described in detail below:
  • the acquiring module 501 is configured to acquire depth data obtained by the depth sensor observing the environment.
  • the determining module 502 is configured to determine, according to the depth data, a target object that is closest to the photographing device in the environment.
  • the control module 503 is configured to control the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing image of the photographing device.
  • the determining module 502 includes: a first determining unit, configured to determine the distances between multiple objects in the environment and the depth sensor according to the depth data; The target object is determined in the object.
  • the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen;
  • the determining module 502 includes: a second determining unit, which is used for determining according to the depth sensor and the shooting device. The installation position relationship between the multiple objects determines the positions of the multiple objects in the shooting picture; the second determining unit is used for determining the target from the multiple objects according to the distances between the multiple objects and the depth sensor and the positions of the multiple objects in the shooting picture. object.
  • the first determining unit is configured to: determine the object with the smallest distance from the depth sensor among the multiple objects as the target object.
  • the first determining unit is configured to: determine the distance between the multiple objects and the shooting device according to the distances between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the shooting device; The object with the smallest distance between the photographing devices is determined as the target object.
  • control module 503 is configured to control the pan-tilt mechanism according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
  • the target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture.
  • the preset position is the center of the shooting screen.
  • the preset position is determined by the user.
  • the photographing device is fixedly installed with the depth sensor.
  • the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device.
  • the depth sensor includes binocular vision sensor, TOF sensor, millimeter wave radar or lidar.
  • the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
  • the determining module 502 is further configured to: determine the object that occupies the largest image area in the shooting picture of the shooting device according to the depth data; the control module 503 is further configured to: control the pan-tilt mechanism and/or the focal length of the shooting device to make A preset position in the shooting screen and the entirety of the subject is located in the shooting screen.
  • control module 503 includes: a first control unit for controlling the focal length of the photographing device if the whole of the object is not in the shooting picture, so that the whole of the object is in the shooting picture; a second control unit for controlling the cloud
  • the stage mechanism is used to adjust the shooting posture of the shooting device so that the object is located at a preset position in the shooting picture.
  • the determining module 502 is also used for: determining the focus object in the picture of the photographing device; the determining module 502 is also used for: determining the distance of the focusing object according to the depth data; the control module 503 is also used for: controlling the shooting device according to the distance Focus so that the camera focuses on the subject in focus.
  • the determining module 502 is configured to: identify the focus object in the shooting picture.
  • the device 500 further includes: an identification module for identifying a specific gesture of the user according to the depth data; and a control module 503 for controlling the pan-tilt system or the photographing device according to the identified specific gesture.
  • the determination module 502 is also used to determine the position of the tracking object in the shooting picture of the shooting device according to the depth data; the control module 503 is also used to: control the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in in the shooting screen.
  • the structure of the control device 500 of the pan-tilt system shown in FIG. 5 may be implemented as an electronic device, and the electronic device may be a control device of the pan-tilt system.
  • the control device 600 of the PTZ system may include: one or more processors 601 and one or more memories 602 .
  • the pan/tilt system includes a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used for photographing the environment.
  • the memory 602 is used to store a program that supports the electronic device to execute the control method of the pan-tilt system provided in the embodiments shown in FIG. 1 to FIG. 4 .
  • the processor 601 is configured to execute programs stored in the memory 602 .
  • the program includes one or more computer instructions, wherein the one or more computer instructions can implement the following steps when executed by the processor 601:
  • the processor 601 is specifically configured to: determine the distances between multiple objects in the environment and the depth sensor according to the depth data; and determine the target object from the multiple objects according to the distances between the multiple objects and the depth sensor.
  • the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen; the processor 601 is specifically used for: according to the installation position between the depth sensor and the shooting device The relationship determines the positions of the multiple objects in the shooting picture; the target object is determined from the multiple objects according to the distances of the depth sensors of the multiple objects and the positions of the multiple objects in the shooting picture.
  • the processor 601 is specifically configured to: determine the object with the smallest distance from the depth sensor among the multiple objects as the target object.
  • the processor 601 is specifically configured to: determine the distance between the multiple objects and the photographing device according to the distances between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the photographing device; The object with the smallest distance between the photographing devices is determined as the target object.
  • the processor 601 is specifically configured to: control the pan-tilt mechanism according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
  • the target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture.
  • the preset position is the center of the shooting screen.
  • the preset position is determined by the user.
  • the photographing device is fixedly installed with the depth sensor.
  • the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device.
  • the depth sensor includes binocular vision sensor, TOF sensor, millimeter wave radar or lidar.
  • the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
  • the processor 601 is further configured to: determine the object that occupies the largest image area in the photographing picture of the photographing device according to the depth data; control the pan-tilt mechanism and/or the focal length of the photographing device to be located at a preset position in the photographing picture and The entirety of the subject is in the shooting frame.
  • the processor 601 is specifically configured to: if the whole of the object is not in the shooting screen, control the focal length of the shooting device so that the whole of the object is located in the shooting screen; control the pan-tilt mechanism to adjust the shooting posture of the shooting device to make the object at the preset position in the shooting screen.
  • the processor 601 is further configured to: determine the focusing object in the picture of the photographing device; determine the distance of the focusing object according to the depth data; and control the focusing of the photographing device according to the distance, so that the photographing device focuses on the focusing object.
  • the processor 601 is specifically configured to: identify the focus object in the shooting picture.
  • the processor 601 is further configured to: recognize the specific gesture of the user according to the depth data; and control the pan-tilt system or the photographing device according to the recognized specific gesture.
  • the processor 601 is further configured to: determine the position of the tracking object in the shooting picture of the shooting device according to the depth data; and control the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in the shooting picture.
  • an embodiment of the present invention provides a computer-readable storage medium, where the storage medium is a computer-readable storage medium, and program instructions are stored in the computer-readable storage medium, and the program instructions are used to implement the above-mentioned methods of FIG. 1 to FIG. 4 . .
  • a pan-tilt system provided by an embodiment of the present invention includes a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and a control device as shown in FIG. 6 .
  • the disclosed related detection apparatus eg, IMU
  • the embodiments of the remote control device described above are only illustrative.
  • the division of the modules or units is only a logical function division.
  • there may be other division methods, such as multiple units or components. May be combined or may be integrated into another system, or some features may be omitted, or not implemented.
  • the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of the remote control device or unit, which may be electrical, mechanical or other forms.
  • the units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
  • each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit.
  • the above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
  • the integrated unit if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium.
  • the technical solution of the present invention is essentially or the part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions for causing a computer processor (processor) to execute all or part of the steps of the methods described in the various embodiments of the present invention.
  • the aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes.

Landscapes

  • Engineering & Computer Science (AREA)
  • Multimedia (AREA)
  • Signal Processing (AREA)
  • General Engineering & Computer Science (AREA)
  • Mechanical Engineering (AREA)
  • Physics & Mathematics (AREA)
  • General Physics & Mathematics (AREA)
  • Radar, Positioning & Navigation (AREA)
  • Remote Sensing (AREA)
  • Studio Devices (AREA)

Abstract

A control method (100) and control device (600) for a platform system, the platform system, and a storage medium. The platform system comprises a platform mechanism used for carrying a photographing device and adjusting the posture of the photographing device, the photographing device being used for photographing an environment. The control method (100) comprises: acquiring depth data (302) obtained by a depth sensor (301) observing the environment; according to the depth data (302), determining a target object closest to the photographing device in the environment; and controlling the platform mechanism to adjust the posture of the photographing device so as to adjust the target object to a preset position in a photographing screen of the photographing device. Therefore, the depth data (302) is acquired by means of the depth sensor (301) to more accurately determine the target object, thereby adjusting the photographing device by means of the platform mechanism, so as to adjust the target object to a preset position for subsequent focusing and photographing.

Description

云台系统的控制方法、控制设备、云台系统和存储介质Control method, control device, PTZ system and storage medium of PTZ system 技术领域technical field
本申请涉及领域图像拍摄领域,尤其涉及一种云台系统的控制方法、控制设备、云台系统和存储介质。The present application relates to the field of image capturing, and in particular, to a control method of a pan-tilt system, a control device, a pan-tilt system and a storage medium.
背景技术Background technique
近年来,随着科技的发展,影像设备逐渐普及,拍照留念以及拍摄视频分享逐渐成了人们喜欢的事情。为了避开学习摄影技巧费时费力的问题,发明了辅助拍摄的设备,如稳定器。人们可以在各种场景、各种运动模式中轻松拍出图像稳定、变焦流畅的画面。In recent years, with the development of science and technology, imaging equipment has gradually become popular, and people like taking pictures and sharing videos. In order to avoid the time-consuming and laborious problem of learning photography skills, equipment to assist shooting, such as stabilizers, was invented. People can easily take pictures with stable images and smooth zooming in various scenes and various sports modes.
然而,目前大多数的辅助拍摄的设备在摄影相关功能中仅仅支持辅助增稳功能,无法辅助人们进行拍摄,这样使得辅助拍摄的设备的功能大大缩水。However, at present, most of the auxiliary shooting devices only support the auxiliary stabilization function in the photography-related functions, and cannot assist people in shooting, which greatly reduces the functions of the auxiliary shooting devices.
发明内容SUMMARY OF THE INVENTION
本申请提供了一种云台系统的控制方法、控制设备、云台系统和存储介质,用于辅助拍摄,增强辅助功能,以提高拍摄质量。The present application provides a control method of a pan-tilt system, a control device, a pan-tilt system, and a storage medium, which are used for assisting shooting and enhancing auxiliary functions to improve shooting quality.
本申请的第一方面是为了提供一种云台系统的控制方法,其中,所述云台系统包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构,所述拍摄装置用于对环境进行拍摄,所述方法包括:获取深度传感器对所述环境观测得到的深度数据;根据所述深度数据确定所述环境中与所述拍摄装置距离最近的目标对象;控制所述云台机构以调整所述拍摄装置的姿态以将所述目标对象调整至所述拍摄装置的拍摄画面中的预设位置。A first aspect of the present application is to provide a control method of a pan/tilt system, wherein the pan/tilt system includes a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device For photographing an environment, the method includes: acquiring depth data obtained by a depth sensor observing the environment; determining a target object in the environment closest to the photographing device according to the depth data; controlling the cloud The stage mechanism adjusts the posture of the photographing device to adjust the target object to a preset position in the photographing picture of the photographing device.
本发明的第二方面是为了提供一种云台系统的控制设备,云台系统包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构,所述拍摄装置用于对环境进行拍摄;所述控制设备还包括:存储器和处理器;所述存储器,用于存储计算机程序;所述处理器,用于调用并执行所述计算机程序,以执行以下步骤:获取深度传感器对所述环境观测得到的深度数据;根据所述深度数据确定所述环境中与所述拍摄装置距离最近的目标对象;控制所述 云台机构以调整所述拍摄装置的姿态以将所述目标对象调整至所述拍摄装置的拍摄画面中的预设位置。A second aspect of the present invention is to provide a control device for a pan-tilt system, the pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and adjusting the posture of the photographing device, and the photographing device is used to monitor the environment The control device further includes: a memory and a processor; the memory is used to store a computer program; the processor is used to call and execute the computer program to perform the following steps: acquiring the Depth data obtained by observing the environment; determining the target object in the environment closest to the photographing device according to the depth data; controlling the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
本发明的第三方面是为了提供一种云台系统,包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构以及上述第二方面所述的控制设备。A third aspect of the present invention is to provide a pan/tilt system, including a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the control device described in the second aspect.
本发明的第四方面是为了提供一种计算机可读存储介质,所述存储介质为计算机可读存储介质,该计算机可读存储介质中存储有程序指令,所述程序指令用于第一方面所述的方法。A fourth aspect of the present invention is to provide a computer-readable storage medium, the storage medium is a computer-readable storage medium, and program instructions are stored in the computer-readable storage medium, and the program instructions are used in the first aspect. method described.
本申请实施例提供一种云台系统的控制方法,其中,云台系统包括用于承载拍摄装置且用于调整拍摄装置的姿态的云台机构,拍摄装置用于对环境进行拍摄,该方法包括:获取深度传感器对环境观测得到的深度数据;根据深度数据确定环境中与拍摄装置距离最近的目标对象;控制云台机构以调整拍摄装置的姿态以将目标对象调整至拍摄装置的拍摄画面中的预设位置。由此,通过深度传感器获取到深度数据来较为准确地确定目标对象,从而通过云台机构进行调整拍摄装置从而将目标对象调整到预设位置,以进行后续的对焦和拍摄,以此辅助用户进行拍摄,增强拍摄功能以及提高拍摄质量。An embodiment of the present application provides a control method for a pan-tilt system, wherein the pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used for photographing an environment, and the method includes : Obtain the depth data observed by the depth sensor on the environment; determine the target object in the environment with the closest distance to the shooting device according to the depth data; control the pan-tilt mechanism to adjust the posture of the shooting device to adjust the target object to the position in the shooting screen of the shooting device Preset position. In this way, the depth data is acquired by the depth sensor to more accurately determine the target object, so that the camera is adjusted through the pan-tilt mechanism to adjust the target object to a preset position for subsequent focusing and shooting, so as to assist the user in the Shoot, enhance shooting functions, and improve shooting quality.
附图说明Description of drawings
此处所说明的附图用来提供对本申请的进一步理解,构成本申请的一部分,本申请的示意性实施例及其说明用于解释本申请,并不构成对本申请的不当限定。在附图中:The drawings described herein are used to provide further understanding of the present application and constitute a part of the present application. The schematic embodiments and descriptions of the present application are used to explain the present application and do not constitute an improper limitation of the present application. In the attached image:
图1为本发明实施例提供的一种云台系统的控制方法的流程示意图;1 is a schematic flowchart of a control method of a pan-tilt system according to an embodiment of the present invention;
图2为本发明实施例提供的点云图像的示意图;2 is a schematic diagram of a point cloud image provided by an embodiment of the present invention;
图3为本发明实施例提供的控制稳定器的具体示意图;3 is a specific schematic diagram of a control stabilizer provided by an embodiment of the present invention;
图4为本发明实施例提供的手势识别的轮廓的示意图;4 is a schematic diagram of an outline of gesture recognition provided by an embodiment of the present invention;
图5为本发明实施例提供的一种云台系统的控制装置的结构示意图;5 is a schematic structural diagram of a control device of a pan-tilt system according to an embodiment of the present invention;
图6为本发明实施例提供的一种控制设备的结构示意图。FIG. 6 is a schematic structural diagram of a control device provided by an embodiment of the present invention.
具体实施方式Detailed ways
为使本发明实施例的目的、技术方案和优点更加清楚,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述, 显然,所描述的实施例是本发明一部分实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都属于本发明保护的范围。In order to make the purposes, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments These are some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.
除非另有定义,本文所使用的所有的技术和科学术语与属于本发明的技术领域的技术人员通常理解的含义相同。本文中在本发明的说明书中所使用的术语只是为了描述具体的实施例的目的,不是旨在于限制本发明。Unless otherwise defined, all technical and scientific terms used herein have the same meaning as commonly understood by one of ordinary skill in the art to which this invention belongs. The terms used herein in the description of the present invention are for the purpose of describing specific embodiments only, and are not intended to limit the present invention.
为了便于理解本申请的技术方案和技术效果,下面对现有技术进行简要说明:In order to facilitate the understanding of the technical solutions and technical effects of the present application, the prior art is briefly described below:
根据前文可知,由于目前大多数的稳定器在摄影相关功能中仅仅支持辅助增稳功能,这样使得稳定器的功能大大缩水。As can be seen from the foregoing, since most of the current stabilizers only support the auxiliary stabilization function in the photography-related functions, the function of the stabilizer is greatly reduced.
本申请实施例通过深度传感器获取到深度数据来较为准确地确定目标对象,从而通过云台机构进行调整拍摄装置从而将目标对象调整到预设位置,以进行后续的对焦和拍摄,以此辅助用户进行拍摄,增强拍摄功能以及提高拍摄质量。In this embodiment of the present application, the depth data is acquired by the depth sensor to more accurately determine the target object, and the camera is adjusted through the pan-tilt mechanism to adjust the target object to a preset position for subsequent focusing and shooting, thereby assisting the user Shoot, enhance your shooting capabilities, and improve your shooting quality.
下面结合附图,对本发明的一些实施方式作详细说明。在各实施例之间不冲突的情况下,下述的实施例及实施例中的特征可以相互组合。Some embodiments of the present invention will be described in detail below with reference to the accompanying drawings. The following embodiments and features in the embodiments may be combined with each other without conflict between the embodiments.
图1为本发明实施例提供的一种云台系统的控制方法的流程示意图;本申请实施例提供的该方法100可以由云台系统执行,如,手持云台。云台系统包括用于承载拍摄装置且用于调整拍摄装置的姿态的云台机构,拍摄装置用于对环境进行拍摄,如相机,摄像头等。该方法100包括以下步骤:1 is a schematic flowchart of a control method of a pan-tilt system provided by an embodiment of the present invention; the method 100 provided by an embodiment of the present application may be executed by a pan-tilt system, such as a handheld pan-tilt. The pan-tilt system includes a pan-tilt mechanism for carrying a photographing device and adjusting the posture of the photographing device, and the photographing device is used for photographing the environment, such as a camera, a camera, and the like. The method 100 includes the following steps:
101:获取深度传感器对环境观测得到的深度数据。101: Acquire depth data obtained by the depth sensor observing the environment.
102:根据深度数据确定环境中与拍摄装置距离最近的目标对象。102: Determine a target object in the environment that is closest to the photographing device according to the depth data.
103:控制云台机构以调整拍摄装置的姿态以将目标对象调整至拍摄装置的拍摄画面中的预设位置。103: Control the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
需要说明的是,本申请实施例还可以通过手持稳定器,车载拍照摄影设备稳定器,机载拍照摄影设备稳定器等来执行,这些执行设备可以具有云台系统。It should be noted that, the embodiments of the present application can also be implemented by a handheld stabilizer, a vehicle-mounted camera camera device stabilizer, an airborne camera camera device stabilizer, and the like, and these implementation devices may have a pan/tilt system.
以下针对上述步骤进行详细地阐述:The above steps are described in detail below:
101:获取深度传感器对环境观测得到的深度数据。101: Acquire depth data obtained by the depth sensor observing the environment.
其中,深度传感器是用于获取深度数据的,可以是属于主动式3D深度传感器,其可以自主发射需要的可见光,不可见光等电磁波或者声音等信号, 接受物体反射回来的信号,从而得到深度数据。深度传感器包括双目视觉传感器、TOF(Time of Flight,飞行时间)传感器、毫米波雷达或激光雷达。深度数据可以为从深度图像或点云图像对应的数据。Among them, the depth sensor is used to obtain depth data, which can be an active 3D depth sensor, which can autonomously emit the required visible light, invisible light and other electromagnetic waves or signals such as sound, and receive the signal reflected by the object to obtain the depth data. Depth sensors include binocular vision sensors, TOF (Time of Flight) sensors, millimeter-wave radar or lidar. The depth data may be data corresponding to a depth image or a point cloud image.
其中,拍摄装置,如相机与深度传感器固定安装。例如,主动式3D深度传感器可以外置并固定在相机热靴上。Among them, a photographing device, such as a camera, is fixedly installed with a depth sensor. For example, an active 3D depth sensor can be external and fixed on the camera hot shoe.
除此以外,深度传感器还可以内置在云台机构的P(Pitch,是围绕X轴旋转)轴,这时标定是需要标定和相机的相对位置。深度传感器也可以内置在云台R(Roll,是围绕Z轴旋转)轴,此时标定和相机的相对位置以及并需要实时计算和相机和相对角度。In addition, the depth sensor can also be built in the P (Pitch, which is rotated around the X axis) axis of the gimbal mechanism. At this time, the calibration needs to be calibrated and the relative position of the camera. The depth sensor can also be built in the gimbal R (Roll, which is rotated around the Z axis) axis. At this time, the relative position of the calibration and the camera and the relative angle of the camera and the camera need to be calculated in real time.
深度传感器内置在云台Y(Yaw,围绕Y轴旋转)轴(或者手柄),此时标定时需要标定和相机的相对位置、出厂标定一次即可,如果忽略相机中心和云台机构中心的误差,只需要标定和云台中心的位置,可以一劳永逸。The depth sensor is built in the Y (Yaw, rotating around the Y axis) axis (or handle) of the gimbal. At this time, it is necessary to calibrate the relative position of the camera and the camera. The factory calibration can be done once. If the error between the center of the camera and the center of the gimbal mechanism is ignored , you only need to calibrate and the position of the center of the gimbal, you can do it once and for all.
其中,云台机构包括用于安装拍摄装置的承载基座;深度传感器安装在承载基座或者拍摄装置上。拍摄装置的光轴与深度传感器的光轴平行。Wherein, the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device. The optical axis of the camera is parallel to the optical axis of the depth sensor.
上述环境为拍摄装置的拍摄画面的环境,即拍摄画面中的环境。The above-mentioned environment is the environment of the shooting screen of the shooting device, that is, the environment in the shooting screen.
例如,根据前文所述,在进行图像拍摄前,用户可以移动相机的镜头将需要拍摄的人物移动到相机的视场中央附近位置,而后主动式3D深度传感器可以先向拍摄环境中主动式发射探测信号,其主动发射探测信号后,记录探测信号从发射到被拍摄环境中的物体反射回来的时间,来计算物体反射面和主动式3D深度传感器的距离。(这里不考虑物体移动,或者忽略物体移动,理论上依据多普勒效应,移动物体的速度也可以根据探测信号波长的变化计算得出,主动式3D深度传感器主动发射探测信号,经过物体表面反射到敏感器件,云台系统的处理器可以记录发射探测信号和接收反射信号的时间间隔,进而可以根据信号传播速度计算当前像素点的深度信息,即深度数据(即与主动式3D深度传感器之间的距离)。结合发射探测信号的角度变化(一般由信号相位角决定),可以获取三维的深度信息,即深度数据,也即是获取到了点云图像,也可以称为深度图像(即对应每一个像素点的深度信息或深度数据)。其中,图2示出了点云图像,即深度图像。For example, according to the above, before taking an image, the user can move the lens of the camera to move the person to be photographed to a position near the center of the camera's field of view, and then the active 3D depth sensor can first actively emit detection into the shooting environment After it actively transmits the detection signal, it records the time when the detection signal is reflected from the transmitted to the object in the shooting environment to calculate the distance between the reflection surface of the object and the active 3D depth sensor. (The movement of the object is not considered here, or the movement of the object is ignored. In theory, according to the Doppler effect, the speed of the moving object can also be calculated according to the change of the wavelength of the detection signal. The active 3D depth sensor actively transmits the detection signal, which is reflected by the surface of the object. To the sensitive device, the processor of the PTZ system can record the time interval between transmitting the detection signal and receiving the reflected signal, and then can calculate the depth information of the current pixel point according to the signal propagation speed, that is, the depth data (that is, the distance between the sensor and the active 3D depth sensor). In combination with the angle change of the transmitted detection signal (generally determined by the signal phase angle), three-dimensional depth information can be obtained, that is, depth data, that is, a point cloud image is obtained, which can also be called a depth image (that is, corresponding to each Depth information or depth data of a pixel). Among them, Figure 2 shows a point cloud image, that is, a depth image.
需要说明的是,本申请实施例可以通过主动式3D深度传感器解决了设备在特殊天气或者光线条件下,图像拍摄可靠性受到影响的问题;同时将主动式3D深度传感器与云台系统,如手持云台进行外置结合。It should be noted that the embodiment of the present application can solve the problem that the reliability of image capturing of the device is affected under special weather or light conditions through the active 3D depth sensor; The gimbal is externally combined.
102:根据深度数据确定环境中与拍摄装置距离最近的目标对象。102: Determine a target object in the environment that is closest to the photographing device according to the depth data.
其中,目标对象可以为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象。距离拍摄装置最近也可以通过距离深度传感器最近来确定(拍摄装置可以与深度传感器设置在同一竖直方向上),如,相机的视角对应拍摄画面中A人物距离相机最近,A人物距离深度传感器最近。且A人物在拍摄画面中距离预设位置中最近。The target object may be an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture. The closest distance to the photographing device can also be determined by the closest distance to the depth sensor (the photographing device can be set in the same vertical direction as the depth sensor), for example, the angle of view of the camera corresponds to the person A who is the closest to the camera, and the person A is the closest to the depth sensor. . And person A is the closest to the preset position in the shooting screen.
具体的,根据深度数据确定环境中与拍摄装置距离最近的目标对象,包括:根据深度数据确定环境中多个对象与深度传感器的距离;根据多个对象与所述深度传感器的距离从多个对象中确定目标对象。Specifically, determining the target object closest to the photographing device in the environment according to the depth data includes: determining the distances between multiple objects in the environment and the depth sensor according to the depth data; Identify the target object.
其中,目标对象为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象;根据多个对象与深度传感器的距离从多个对象中确定目标对象,包括:根据深度传感器与拍摄装置之间的安装位置关系确定多个对象在拍摄画面中的位置;根据多个对象与深度传感器的距离和多个对象在拍摄画面中的位置从多个对象中确定目标对象。Wherein, the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen; the target object is determined from the multiple objects according to the distances between the multiple objects and the depth sensor, including : Determine the positions of multiple objects in the shooting screen according to the installation position relationship between the depth sensor and the shooting device; determine the target from the multiple objects according to the distances between the multiple objects and the depth sensor and the positions of the multiple objects in the shooting screen object.
例如,根据前文所述,如图3所示,云台系统的处理器通过深度传感器301获取到深度图像,即深度数据302后,可以利用矩阵卷积运算结合各类梯度算子可以轻松获取点云图像,即深度图像中的各个对象的轮廓信息或轮廓数据。根据轮廓信息的中心点,可以是轮廓信息外接矩形框的中心点对应的深度数据,确定该轮廓信息对应的对象距离主动式3D深度传感器的物理距离。当然也可以选择轮廓信息中的其它点,如左上角、右下角等的点来代表轮廓信息,从而根据该点对应的深度数据确定与主动式3D深度传感器之间的距离。在确定出距离后,可以确定出距离主动式3D深度传感器最近的轮廓信息对应的对象。当由多个对象都距离主动式3D深度传感器最近(那么可以直接视为该多个对象都距离相机最近,可以忽略其中的误差),则可以继续根据主动式3D深度传感器与相机之间的安装位置关系,可以确定出主动式3D深度传感器对应的坐标系,即深度数据所在的坐标系,与相机坐标系之间的映射关系。通过这种映射关系,可以将深度数据映射到相机坐标系中,即确定在相机对应的拍摄画面中的位置。由此,在确定了距离主动式3D深度传感器最近的多个对象后,可以确定出这些对象的上述轮廓信息中心点在拍摄画面中的位置,并确定是否距离拍摄画面中预设位置最近,选择距离拍摄画面中预设位置最近的中心点,从而反推出该中心点对应的轮廓信息,以及对应的目标对象。For example, according to the foregoing, as shown in FIG. 3 , the processor of the gimbal system obtains the depth image through the depth sensor 301 , that is, after the depth data 302 , the matrix convolution operation combined with various gradient operators can be used to easily obtain points Cloud image, that is, the contour information or contour data of each object in the depth image. According to the center point of the contour information, which may be the depth data corresponding to the center point of the rectangular frame circumscribing the contour information, the physical distance between the object corresponding to the contour information and the active 3D depth sensor is determined. Of course, other points in the contour information, such as the upper left corner, the lower right corner, etc., can also be selected to represent the contour information, so as to determine the distance to the active 3D depth sensor according to the depth data corresponding to the point. After the distance is determined, the object corresponding to the contour information closest to the active 3D depth sensor can be determined. When multiple objects are closest to the active 3D depth sensor (then it can be directly considered that the multiple objects are the closest to the camera, and the error can be ignored), then the installation between the active 3D depth sensor and the camera can be continued according to the The positional relationship can determine the mapping relationship between the coordinate system corresponding to the active 3D depth sensor, that is, the coordinate system where the depth data is located, and the camera coordinate system. Through this mapping relationship, the depth data can be mapped into the camera coordinate system, that is, the position in the shooting picture corresponding to the camera is determined. Therefore, after determining the objects closest to the active 3D depth sensor, the positions of the above-mentioned contour information center points of these objects in the shooting picture can be determined, and whether it is the closest to the preset position in the shooting picture can be determined by selecting The center point closest to the preset position in the shooting picture, so as to deduce the contour information corresponding to the center point and the corresponding target object.
需要说明的是,合理地利用点云图像中轮廓信息足够可以实现包含上述功能的很多功能。考虑到拍摄的人物等和背景不会在同一平面,因此根据点云图像中的轮廓信息即可获取焦点偏差,无需精确地获取人型或者其它形状,可以减少处理器处理时间,节省代码空间以及减少系统功耗的考虑,对于不需要复杂运算的功能不进行复杂的形状识别。此外,轮廓信息的获取也可以使用机器学习的相关方法直接获取指定形状的轮廓信息。It should be noted that reasonable use of the contour information in the point cloud image is sufficient to implement many functions including the above functions. Considering that the photographed people and the background will not be on the same plane, the focus deviation can be obtained according to the outline information in the point cloud image, without the need to accurately obtain the human shape or other shapes, which can reduce the processing time of the processor, save the code space and In consideration of reducing system power consumption, complex shape recognition is not performed for functions that do not require complex operations. In addition, the acquisition of contour information can also directly obtain the contour information of a specified shape by using a related method of machine learning.
具体的,根据多个对象与深度传感器的距离从多个对象中确定目标对象,包括:将多个对象中距离深度传感器最小的对象确定为目标对象。Specifically, determining the target object from the plurality of objects according to the distances between the plurality of objects and the depth sensor includes: determining the object with the smallest distance from the depth sensor among the plurality of objects as the target object.
由于前文已经阐述过相似内容,此处就不再赘述,仅说明:在确定出多个对象距离深度传感器的距离后,可以直接将距离深度传感器最小的对象作为目标对象,或者说,可以将距离深度传感器最小视为距离拍摄装置最小,距离深度传感器最小的对象也是距离拍摄装置最小的对象。Since the similar content has been described above, it will not be repeated here, but only to explain: after determining the distances of multiple objects from the depth sensor, the object with the smallest distance from the depth sensor can be directly used as the target object, or the distance can be used as the target object. The smallest depth sensor is considered the smallest distance to the camera, and the object that is the smallest distance to the depth sensor is also the smallest object to the camera.
为了能够更加精确地确定目标对象,还可以基于深度传感器与拍摄装置之间的安装位置关系,来确定多个对象中与拍摄装置之间的距离,从而选择目标对象。In order to be able to determine the target object more accurately, the distance between the multiple objects and the photographing device may also be determined based on the installation position relationship between the depth sensor and the photographing device, so as to select the target object.
具体的,根据多个对象与深度传感器的距离从多个对象中确定目标对象,包括:根据多个对象与深度传感器的距离和深度传感器与拍摄装置之间的安装位置关系确定多个对象与拍摄装置之间的距离;将多个对象中与拍摄装置之间的距离最小的对象确定为目标对象。Specifically, determining the target object from the multiple objects according to the distance between the multiple objects and the depth sensor includes: determining the multiple objects and the shooting device according to the distance between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the shooting device The distance between the devices; the object with the smallest distance from the photographing device among the multiple objects is determined as the target object.
由于前文已经阐述过相似内容,此处就不再赘述,仅说明:可以根据深度传感器与拍摄装置之间的安装位置关系,确定两者之间的物理位置关系,从而根据这个物理位置关系,来对多个对象与深度传感器的距离进行转换,转换为多个对象与拍摄装置的距离,并可以选择与拍摄装置的距离最小的对象为目标对象。例如,主动式3D深度传感器可以外置并固定在相机热靴上,那么可以根据这种位置关系以及多个对象与主动式3D深度传感器的距离,进行转换,如通过三角形公式,来确定多个对象与相机之间的距离。Since the similar content has been described above, it will not be repeated here, but only to explain: the physical position relationship between the depth sensor and the photographing device can be determined according to the installation position relationship between the two, so that according to this physical position relationship, the The distances between the multiple objects and the depth sensor are converted into distances between the multiple objects and the photographing device, and the object with the smallest distance from the photographing device can be selected as the target object. For example, the active 3D depth sensor can be externally placed and fixed on the camera hot shoe, then according to this positional relationship and the distance between multiple objects and the active 3D depth sensor, conversion can be performed, such as through the triangle formula, to determine multiple The distance between the object and the camera.
103:控制云台机构以调整拍摄装置的姿态以将目标对象调整至拍摄装置的拍摄画面中的预设位置。103: Control the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
其中,预设位置为拍摄画面的中心,也可以为拍摄画面的其它位置。预设位置是由用户确定的,用户可以通过拍摄装置,或者云台系统提供的其它展示屏来进行设定,用户可以在展示屏或者拍摄装置的屏幕上进行操作,选 择预设位置。由此,云台系统响应于这种操作,确定出预设位置。Wherein, the preset position is the center of the shooting screen, and may also be other positions on the shooting screen. The preset position is determined by the user, and the user can set it through the photographing device or other display screens provided by the PTZ system, and the user can operate on the display screen or the screen of the photographing device to select the preset position. Thereby, the pan/tilt system determines the preset position in response to this operation.
具体的,控制云台机构以调整所述拍摄装置的姿态,包括:根据目标对象在拍摄画面中的位置和预设位置之间的偏差控制云台机构以调整拍摄装置的姿态。Specifically, controlling the pan-tilt mechanism to adjust the posture of the photographing device includes: controlling the pan-tilt mechanism to adjust the posture of the photographing device according to the deviation between the position of the target object in the photographing screen and a preset position.
例如,根据前文所述,云台系统,如图3所示,可以根据目标对象的轮廓信息中心点在拍摄画面中的位置和预设位置得到的偏差进行校正,云台系统的控制器控制稳定器,如云台机构角度,以使调整相机的姿态,使得相机在被调整姿态后进行变焦。云台系统的控制器可以根据偏差对应的偏差值发出控制指令到云台机构进行角度调整和相机的变焦。如,控制器可以根据角度偏差控制云台机构的电机转动,如控制P轴电机转动,从而转动P itch轴,调整相机所在位置。For example, according to the foregoing, the pan-tilt system, as shown in Figure 3, can be corrected according to the deviation obtained between the position of the target object's contour information center point in the shooting picture and the preset position, and the controller of the pan-tilt system is controlled stably device, such as the angle of the gimbal mechanism, so as to adjust the posture of the camera, so that the camera zooms after the posture is adjusted. The controller of the gimbal system can send control commands to the gimbal mechanism to adjust the angle and zoom the camera according to the deviation value corresponding to the deviation. For example, the controller can control the motor rotation of the gimbal mechanism according to the angle deviation, such as controlling the rotation of the P-axis motor, thereby rotating the Pitch axis and adjusting the position of the camera.
相应的,也可以根据目标对象(如,其对应轮廓信息的中心点)在深度数据,或点云图像中的位置和预设位置在深度数据,或点云图像中的位置(可以通过上述两个坐标系的映射关系,进行反推,如从相机坐标系中的数据映射到深度传感器坐标系中的数据,即深度传感器坐标系中的深度数据)之间的偏差控制云台机构以调整拍摄装置的姿态,此处就不再赘述。Correspondingly, it can also be based on the depth data of the target object (for example, the center point of its corresponding contour information), or the position in the point cloud image and the preset position in the depth data, or the position in the point cloud image (which can be determined by the above two methods. The mapping relationship of each coordinate system is reversed, such as mapping from the data in the camera coordinate system to the data in the depth sensor coordinate system, that is, the depth data in the depth sensor coordinate system) Control the PTZ mechanism to adjust the shooting The posture of the device will not be repeated here.
为了能够快速地进行调整,还可以在获取到轮廓信息后,保留相机视场中间附近(如,视场中间一定预置区域内)的轮廓信息,算出对应轮廓信息的中心位置在拍摄画面中的位置。进而可以确定该位置和预设位置的偏差,从而进行调整,此处就不再赘述。In order to make adjustments quickly, after obtaining the contour information, the contour information near the middle of the camera's field of view (for example, within a certain preset area in the middle of the field of view) can be retained, and the center position of the corresponding contour information in the shooting picture can be calculated. Location. Then, the deviation between the position and the preset position can be determined, so as to make adjustment, which will not be repeated here.
除了上述将目标对象调整到预设位置,如自动对中(auto center),可以自动地对准中心点最近位置的突出轮廓。该功能可用于拍照或者自拍时辅助将希望位于视场中心的人物等,快速准确地置于矫正到中心。云台系统还可以实现以下功能:In addition to the above-mentioned adjustment of the target object to a preset position, such as automatic centering (auto center), the protruding contour at the nearest position of the center point can be automatically aligned. This function can be used to assist people who want to be in the center of the field of view when taking pictures or selfies, and quickly and accurately correct them to the center. The PTZ system can also realize the following functions:
具体的,该方法100还包括:根据深度数据确定拍摄装置的拍摄画面中占据的图像区域最大的对象;控制云台机构和/或拍摄装置的焦距以使位于拍摄画面中的预设位置并且使对象的整体全部位于拍摄画面中。Specifically, the method 100 further includes: determining, according to the depth data, an object occupying the largest image area in the shooting picture of the shooting device; The entirety of the subject is in the shooting frame.
例如,根据前文所述的方式,云台系统可以根据深度数据确定各个对象在拍摄画面中的位置,或者说各个对象在拍摄画面中的图像区域,从而确定各个图像区域的大小,从而控制云台机构和/或相机进行调整,使得图像区域最大的对象位于预设位置,就不再赘述。For example, according to the method described above, the pan-tilt system can determine the position of each object in the shooting screen according to the depth data, or the image area of each object in the shooting screen, so as to determine the size of each image area, so as to control the pan-tilt The mechanism and/or the camera are adjusted so that the object with the largest image area is located at the preset position, which will not be repeated here.
其中,控制云台机构和/或拍摄装置的焦距以使位于拍摄画面中的预设位置并且使对象的整体全部位于拍摄画面中,包括:若对象的整体不在拍摄画面中,控制拍摄装置的焦距以使对象的整体全部位于拍摄画面中;控制云台机构以调整拍摄装置的拍摄姿态以使对象位于拍摄画面中的预设位置。Wherein, controlling the pan/tilt mechanism and/or the focal length of the photographing device to be located at a preset position in the photographing screen and making the entire object lie in the photographing frame includes: if the entire object is not in the photographing frame, controlling the focal length of the photographing device So that the whole of the object is located in the shooting screen; the pan-tilt mechanism is controlled to adjust the shooting posture of the shooting device so that the object is located in the preset position in the shooting screen.
例如,根据前文所述,如果该图像区域最大的对象不完全在拍摄画面中,则可以通过控制相机的焦距进行调焦,将该对象的图像区域在拍摄画面中。通过控制云台机构调整相机姿态将该对象位置预设位置,此处就不再赘述。For example, according to the foregoing, if the object with the largest image area is not completely in the shooting screen, you can adjust the focus by controlling the focal length of the camera, so that the image area of the object is in the shooting screen. The position of the object is preset by adjusting the camera posture by controlling the pan-tilt mechanism, which will not be repeated here.
此外,云台系统通过云台机构,使得拍摄装置将图像区域最大的对象调整到预设位置后,可以针对该对象进行构图。云台系统可以计算该对象的轮廓信息中心和轮廓信息外接矩形的长宽,最后根据预置构图比例,如黄金分割比,计算云台机构各个轴的偏差,以及焦距的偏差,调整云台机构角度以及拍摄装置焦距,使得构图比例最接近所需要的比例(云台机构不移动,轮廓本身比例不合适,存在无法精确达到要求的可能,可以选择长宽或者对角线之一符合比例即可)。从而可以实现快速构图。In addition, the pan-tilt system uses the pan-tilt mechanism, so that after the photographing device adjusts the object with the largest image area to a preset position, it can compose a picture for the object. The gimbal system can calculate the contour information center of the object and the length and width of the circumscribed rectangle of the contour information. Finally, according to the preset composition ratio, such as the golden ratio, calculate the deviation of each axis of the gimbal mechanism and the deviation of the focal length, and adjust the gimbal mechanism. The angle and the focal length of the shooting device make the composition ratio the closest to the required ratio (the gimbal mechanism does not move, and the proportion of the outline itself is not appropriate. ). Thus, quick composition can be achieved.
具体的,该方法100还包括:确定拍摄装置画面中的对焦对象;根据深度数据确定对焦对象的距离;根据距离,控制拍摄装置对焦,使得拍摄装置对焦在对焦对象上。Specifically, the method 100 further includes: determining the focus object in the picture of the photographing device; determining the distance of the focusing object according to the depth data; controlling the photographing device to focus according to the distance, so that the photographing device focuses on the focusing object.
其中,确定拍摄装置画面中的对焦对象,包括:识别拍摄画面中的对焦对象。该对焦对象可以是如前文所述的目标对象,如人物。进一步还可以为人的特征部位。如人脸、五官,如鼻子、眼睛或嘴巴等。人的特征部位为可以目标对象的特征部位。Wherein, determining the focus object in the picture of the shooting device includes: identifying the focus object in the shooting picture. The focus object may be the target object described above, such as a person. Further, it can also be a characteristic part of a person. Such as human face, facial features, such as nose, eyes or mouth. The feature part of the person is the feature part of the target object.
例如,根据前文所述,云台系统通过云台机构将目标对象移动至拍摄画面的预设位置后,可以通过识别该对象的对焦对象,如鼻子,从而以此为焦点,即对焦对象。再通过前文所述的方式确定该焦点与相机之间的距离,从而进行对焦。For example, according to the above, after the gimbal system moves the target object to the preset position of the shooting screen through the gimbal mechanism, it can identify the focus object of the object, such as the nose, so as to focus on it, that is, the focus object. Then, the distance between the focal point and the camera is determined by the method described above, so as to focus.
需要说明是,没有对焦对象的时候,可以以预设位置作为焦点,进行对焦。It should be noted that when there is no focusing object, the preset position can be used as the focus to focus.
由此,云台系统可以开启特写选焦。可以根据对应的焦点距离(深度传感器已经获取)控制拍摄装置进行变焦。As a result, the gimbal system can enable close-up focus selection. The camera can be controlled to zoom according to the corresponding focal distance (which has been acquired by the depth sensor).
具体的,该方法100还包括:根据深度数据识别用户的特定手势;根据识别出来的特定手势,控制云台系统或者拍摄装置。Specifically, the method 100 further includes: recognizing the specific gesture of the user according to the depth data; and controlling the pan-tilt system or the photographing device according to the recognized specific gesture.
其中,特定手势是指预置好的手势,如双手打招呼的手势,如图4所示。The specific gesture refers to a preset gesture, such as a gesture of greeting with both hands, as shown in FIG. 4 .
例如,根据前文所述,用户可以通过云台系统的应用程序开启手势模式,则云台系统可以先识别该特定手势,如双手打招呼的手势。根据前文所述的方式,云台系统可以获取到该手势的轮廓信息,然后和预置手势的轮廓信息比较,是否相同,或者相似度超过阈值,则确定是特定手势,则可以开启该特定手势对应的功能,如唤醒云台系统的跟踪功能,或者拍照等等。For example, according to the foregoing, the user can enable the gesture mode through the application program of the PTZ system, and the PTZ system can first recognize the specific gesture, such as a gesture of greeting with both hands. According to the method described above, the PTZ system can obtain the outline information of the gesture, and then compare it with the outline information of the preset gesture to see if it is the same, or if the similarity exceeds the threshold, then it is determined to be a specific gesture, and the specific gesture can be enabled. Corresponding functions, such as waking up the tracking function of the PTZ system, or taking pictures, etc.
需要说明的是,云台系统可以设置多个预置手势,不同的预置手势可以对应不同的功能,云台系统在获取到用户的手势的轮廓信息后,可以与多个预置手势分别进行对比,确定对应的预置手势,以进行功能的唤醒。It should be noted that the PTZ system can set multiple preset gestures, and different preset gestures can correspond to different functions. After the PTZ system obtains the outline information of the user's gestures, it can be performed separately with multiple preset gestures. Compare and determine the corresponding preset gesture to wake up the function.
具体的,该方法100还包括:根据深度数据确定跟踪对象在拍摄装置的拍摄画面中的位置;根据位置控制云台机构以调整拍摄装置的姿态以使跟踪对象位于拍摄画面中。Specifically, the method 100 further includes: determining the position of the tracking object in the shooting screen of the shooting device according to the depth data; controlling the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in the shooting screen.
其中,跟踪对象可以为人、车等对象。The tracking object may be a person, a car, or the like.
例如,根据前文所述,可以根据手势来唤醒跟踪功能,也可以直接开启跟踪功能,进行跟踪对象的跟踪,如拍照或者摄像。开始跟踪后,云台系统可以前文所述的方式,通过深度数据来确定跟踪对象在拍摄画面中的位置,如预设位置,从而根据该位置控制云台机构来调整相机姿态,以进行跟踪拍摄。For example, according to the foregoing description, the tracking function can be awakened according to the gesture, or the tracking function can be directly enabled to track the tracking object, such as taking a picture or a video. After starting tracking, the gimbal system can use the depth data to determine the position of the tracking object in the shooting screen, such as a preset position, in the way described above, so as to control the gimbal mechanism according to the position to adjust the camera posture for tracking shooting. .
需要说明的是,可以先预置好跟踪对象为人,由此,在获取到深度数据后,可以根据对应的轮廓信息确定跟踪对象,可以将轮廓信息与预置跟踪对象的轮廓信息进行对比来确定,然后进行跟踪。其中,跟踪的拍照或者摄像均可以结合前文所述的功能来实现。It should be noted that the tracking object can be preset to be a person. Therefore, after obtaining the depth data, the tracking object can be determined according to the corresponding contour information, and the contour information can be compared with the contour information of the preset tracking object to determine. , and then trace. Wherein, the photographing or photographing of the tracking can be implemented in combination with the functions described above.
至于后续跟踪是利用主动式3D深度传感器计算结果还是利用原先非主动式传感器的结果,亦或是融合二者计算结果,可以由设定决定。As for the follow-up tracking, whether to use the calculation results of the active 3D depth sensor or the results of the original non-active sensor, or to fuse the calculation results of the two, can be determined by the settings.
本申请实施例提出了使用主动式3D深度传感器,处理其获取3D点云图像,即深度数据结合对稳定器和拍摄装置,如相机设备的控制实现自动对中、快速构图、特写选焦、手势识别等功能,再结合其主动式的特点增加目标跟踪功能的可靠性,另外,该类传感器可以采用更加简单的软件手段实现比图像处理更好的效果。The embodiment of the present application proposes to use an active 3D depth sensor to process it to obtain a 3D point cloud image, that is, to combine the depth data with the control of a stabilizer and a photographing device, such as a camera device, to realize automatic centering, quick composition, close-up focus selection, gestures Recognition and other functions, combined with its active characteristics, increase the reliability of the target tracking function. In addition, this type of sensor can use simpler software methods to achieve better results than image processing.
图5为本发明实施例提供的一种云台系统的控制装置的结构示意图;该装置500可以应用于云台系统中,例如,手持云台,云台系统包括用于承载拍摄 装置且用于调整拍摄装置的姿态的云台机构,拍摄装置用于对环境进行拍摄,该装置500可以执行上述的云台系统的控制方法。其中,该装置500包括:获取模块501、确定模块502以及控制模块503。以下针对各个模块的功能进行详细的阐述:5 is a schematic structural diagram of a control device of a pan-tilt system according to an embodiment of the present invention; the device 500 can be applied to a pan-tilt system, for example, a hand-held pan-tilt system. For a pan-tilt mechanism for adjusting the posture of the photographing device, the photographing device is used to photograph the environment, and the device 500 can execute the above-mentioned control method of the pan-tilt system. The apparatus 500 includes: an acquisition module 501 , a determination module 502 and a control module 503 . The functions of each module are described in detail below:
获取模块501,用于获取深度传感器对环境观测得到的深度数据。The acquiring module 501 is configured to acquire depth data obtained by the depth sensor observing the environment.
确定模块502,用于根据深度数据确定环境中与拍摄装置距离最近的目标对象。The determining module 502 is configured to determine, according to the depth data, a target object that is closest to the photographing device in the environment.
控制模块503,用于控制云台机构以调整拍摄装置的姿态以将目标对象调整至拍摄装置的拍摄画面中的预设位置。The control module 503 is configured to control the pan-tilt mechanism to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing image of the photographing device.
具体的,确定模块502,包括:第一确定单元,用于根据深度数据确定环境中多个对象与深度传感器的距离;第一确定单元,用于根据多个对象与深度传感器的距离从多个对象中确定目标对象。Specifically, the determining module 502 includes: a first determining unit, configured to determine the distances between multiple objects in the environment and the depth sensor according to the depth data; The target object is determined in the object.
其中,目标对象为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象;确定模块502,包括:第二确定单元,用于根据深度传感器与拍摄装置之间的安装位置关系确定多个对象在拍摄画面中的位置;第二确定单元,用于根据多个对象与深度传感器的距离和多个对象在拍摄画面中的位置从多个对象中确定目标对象。Wherein, the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen; the determining module 502 includes: a second determining unit, which is used for determining according to the depth sensor and the shooting device. The installation position relationship between the multiple objects determines the positions of the multiple objects in the shooting picture; the second determining unit is used for determining the target from the multiple objects according to the distances between the multiple objects and the depth sensor and the positions of the multiple objects in the shooting picture. object.
具体的,第一确定单元,用于:将多个对象中距离深度传感器最小的对象确定为目标对象。Specifically, the first determining unit is configured to: determine the object with the smallest distance from the depth sensor among the multiple objects as the target object.
具体的,第一确定单元,用于:根据多个对象与深度传感器的距离和深度传感器与拍摄装置之间的安装位置关系确定多个对象与拍摄装置之间的距离;将多个对象中与拍摄装置之间的距离最小的对象确定为目标对象。Specifically, the first determining unit is configured to: determine the distance between the multiple objects and the shooting device according to the distances between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the shooting device; The object with the smallest distance between the photographing devices is determined as the target object.
具体的,控制模块503,用于根据目标对象在拍摄画面中的位置和预设位置之间的偏差控制云台机构以调整拍摄装置的姿态。Specifically, the control module 503 is configured to control the pan-tilt mechanism according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
其中,目标对象为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象。The target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture.
其中,预设位置为拍摄画面的中心。The preset position is the center of the shooting screen.
其中,预设位置是由用户确定的。The preset position is determined by the user.
具体的,拍摄装置与深度传感器固定安装。Specifically, the photographing device is fixedly installed with the depth sensor.
具体的,云台机构包括用于安装拍摄装置的承载基座;深度传感器安装在承载基座或者拍摄装置上。Specifically, the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device.
其中,深度传感器包括双目视觉传感器、TOF传感器、毫米波雷达或激光雷达。Among them, the depth sensor includes binocular vision sensor, TOF sensor, millimeter wave radar or lidar.
其中,拍摄装置的光轴与深度传感器的光轴平行。Wherein, the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
此外,确定模块502,还用于:根据深度数据确定拍摄装置的拍摄画面中占据的图像区域最大的对象;控制模块503,还用于:控制云台机构和/或拍摄装置的焦距以使位于拍摄画面中的预设位置并且使对象的整体全部位于拍摄画面中。In addition, the determining module 502 is further configured to: determine the object that occupies the largest image area in the shooting picture of the shooting device according to the depth data; the control module 503 is further configured to: control the pan-tilt mechanism and/or the focal length of the shooting device to make A preset position in the shooting screen and the entirety of the subject is located in the shooting screen.
具体的,控制模块503,包括:第一控制单元,用于若对象的整体不在拍摄画面中,控制拍摄装置的焦距以使对象的整体全部位于拍摄画面中;第二控制单元,用于控制云台机构以调整拍摄装置的拍摄姿态以使对象位于拍摄画面中的预设位置。Specifically, the control module 503 includes: a first control unit for controlling the focal length of the photographing device if the whole of the object is not in the shooting picture, so that the whole of the object is in the shooting picture; a second control unit for controlling the cloud The stage mechanism is used to adjust the shooting posture of the shooting device so that the object is located at a preset position in the shooting picture.
此外,确定模块502,还用于:确定拍摄装置画面中的对焦对象;确定模块502,还用于:根据深度数据确定对焦对象的距离;控制模块503,还用于:根据距离,控制拍摄装置对焦,使得拍摄装置对焦在对焦对象上。In addition, the determining module 502 is also used for: determining the focus object in the picture of the photographing device; the determining module 502 is also used for: determining the distance of the focusing object according to the depth data; the control module 503 is also used for: controlling the shooting device according to the distance Focus so that the camera focuses on the subject in focus.
具体的,确定模块502,用于:识别拍摄画面中的对焦对象。Specifically, the determining module 502 is configured to: identify the focus object in the shooting picture.
此外,该装置500还包括:识别模块,用于根据深度数据识别用户的特定手势;控制模块503,还用于:根据识别出来的特定手势,控制云台系统或者拍摄装置。In addition, the device 500 further includes: an identification module for identifying a specific gesture of the user according to the depth data; and a control module 503 for controlling the pan-tilt system or the photographing device according to the identified specific gesture.
此外,确定模块502,还用于根据深度数据确定跟踪对象在拍摄装置的拍摄画面中的位置;控制模块503,还用于:根据位置控制云台机构以调整拍摄装置的姿态以使跟踪对象位于拍摄画面中。In addition, the determination module 502 is also used to determine the position of the tracking object in the shooting picture of the shooting device according to the depth data; the control module 503 is also used to: control the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in in the shooting screen.
在一个可能的设计中,图5所示云台系统的控制装置500的结构可实现为一电子设备,该电子设备可以是云台系统的控制设备。如图6所示,该云台系统的控制设备600可以包括:一个或多个处理器601、一个或多个存储器602。云台系统包括用于承载拍摄装置且用于调整拍摄装置的姿态的云台机构,拍摄装置用于对环境进行拍摄。其中,存储器602用于存储支持电子设备执行上述图1-图4所示实施例中提供的云台系统的控制方法的程序。处理器601被配置为用于执行存储器602中存储的程序。具体的,程序包括一条或多条计算机指令,其中,一条或多条计算机指令被处理器601执行时能够实现如下步骤:In a possible design, the structure of the control device 500 of the pan-tilt system shown in FIG. 5 may be implemented as an electronic device, and the electronic device may be a control device of the pan-tilt system. As shown in FIG. 6 , the control device 600 of the PTZ system may include: one or more processors 601 and one or more memories 602 . The pan/tilt system includes a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used for photographing the environment. The memory 602 is used to store a program that supports the electronic device to execute the control method of the pan-tilt system provided in the embodiments shown in FIG. 1 to FIG. 4 . The processor 601 is configured to execute programs stored in the memory 602 . Specifically, the program includes one or more computer instructions, wherein the one or more computer instructions can implement the following steps when executed by the processor 601:
运行存储器602中存储的计算机程序以实现:获取深度传感器对环境观测得到的深度数据;根据深度数据确定环境中与拍摄装置距离最近的目标对象; 控制云台机构以调整拍摄装置的姿态以将目标对象调整至拍摄装置的拍摄画面中的预设位置。Run the computer program stored in the memory 602 to achieve: obtain the depth data obtained by the depth sensor on the environment; determine the target object in the environment with the closest distance to the photographing device according to the depth data; control the pan-tilt mechanism to adjust the posture of the photographing device to move the target The subject is adjusted to a preset position in the photographing frame of the photographing device.
具体的,处理器601,具体用于:根据深度数据确定环境中多个对象与深度传感器的距离;根据多个对象与深度传感器的距离从多个对象中确定所述目标对象。Specifically, the processor 601 is specifically configured to: determine the distances between multiple objects in the environment and the depth sensor according to the depth data; and determine the target object from the multiple objects according to the distances between the multiple objects and the depth sensor.
其中,目标对象为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象;处理器601,具体用于:根据深度传感器与拍摄装置之间的安装位置关系确定多个对象在拍摄画面中的位置;根据多个对象深度传感器的距离和多个对象在拍摄画面中的位置从多个对象中确定目标对象。Wherein, the target object is the object that is closest to the shooting device in the environment and whose image area in the shooting screen is closest to the preset position in the shooting screen; the processor 601 is specifically used for: according to the installation position between the depth sensor and the shooting device The relationship determines the positions of the multiple objects in the shooting picture; the target object is determined from the multiple objects according to the distances of the depth sensors of the multiple objects and the positions of the multiple objects in the shooting picture.
具体的,处理器601,具体用于:将多个对象中距离深度传感器最小的对象确定为目标对象。Specifically, the processor 601 is specifically configured to: determine the object with the smallest distance from the depth sensor among the multiple objects as the target object.
具体的,处理器601,具体用于:根据多个对象与深度传感器的距离和深度传感器与拍摄装置之间的安装位置关系确定多个对象与拍摄装置之间的距离;将多个对象中与拍摄装置之间的距离最小的对象确定为目标对象。Specifically, the processor 601 is specifically configured to: determine the distance between the multiple objects and the photographing device according to the distances between the multiple objects and the depth sensor and the installation position relationship between the depth sensor and the photographing device; The object with the smallest distance between the photographing devices is determined as the target object.
具体的,处理器601,具体用于:根据目标对象在拍摄画面中的位置和预设位置之间的偏差控制云台机构以调整拍摄装置的姿态。Specifically, the processor 601 is specifically configured to: control the pan-tilt mechanism according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
其中,目标对象为环境中距离拍摄装置最近且其在拍摄画面中的图像区域最靠近拍摄画面中预设位置的对象。The target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to the preset position in the photographing picture.
其中,预设位置为拍摄画面的中心。The preset position is the center of the shooting screen.
其中,预设位置是由用户确定的。The preset position is determined by the user.
具体的,拍摄装置与深度传感器固定安装。Specifically, the photographing device is fixedly installed with the depth sensor.
具体的,云台机构包括用于安装拍摄装置的承载基座;深度传感器安装在承载基座或者拍摄装置上。Specifically, the pan-tilt mechanism includes a bearing base for installing the photographing device; the depth sensor is installed on the bearing base or the photographing device.
其中,深度传感器包括双目视觉传感器、TOF传感器、毫米波雷达或激光雷达。Among them, the depth sensor includes binocular vision sensor, TOF sensor, millimeter wave radar or lidar.
其中,拍摄装置的光轴与深度传感器的光轴平行。Wherein, the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
此外,处理器601,还用于:根据深度数据确定拍摄装置的拍摄画面中占据的图像区域最大的对象;控制云台机构和/或拍摄装置的焦距以使位于拍摄画面中的预设位置并且使对象的整体全部位于拍摄画面中。In addition, the processor 601 is further configured to: determine the object that occupies the largest image area in the photographing picture of the photographing device according to the depth data; control the pan-tilt mechanism and/or the focal length of the photographing device to be located at a preset position in the photographing picture and The entirety of the subject is in the shooting frame.
具体的,处理器601,具体用于:若对象的整体不在拍摄画面中,控制拍 摄装置的焦距以使对象的整体全部位于拍摄画面中;控制云台机构以调整拍摄装置的拍摄姿态以使对象位于拍摄画面中的预设位置。Specifically, the processor 601 is specifically configured to: if the whole of the object is not in the shooting screen, control the focal length of the shooting device so that the whole of the object is located in the shooting screen; control the pan-tilt mechanism to adjust the shooting posture of the shooting device to make the object at the preset position in the shooting screen.
此外,处理器601,还用于:确定拍摄装置画面中的对焦对象;根据深度数据确定对焦对象的距离;根据距离,控制拍摄装置对焦,使得拍摄装置对焦在对焦对象上。In addition, the processor 601 is further configured to: determine the focusing object in the picture of the photographing device; determine the distance of the focusing object according to the depth data; and control the focusing of the photographing device according to the distance, so that the photographing device focuses on the focusing object.
具体的,处理器601,具体用于:识别拍摄画面中的对焦对象。Specifically, the processor 601 is specifically configured to: identify the focus object in the shooting picture.
此外,处理器601,还用于:根据深度数据识别用户的特定手势;根据识别出来的特定手势,控制云台系统或者拍摄装置。In addition, the processor 601 is further configured to: recognize the specific gesture of the user according to the depth data; and control the pan-tilt system or the photographing device according to the recognized specific gesture.
此外,处理器601,还用于:根据深度数据确定跟踪对象在拍摄装置的拍摄画面中的位置;根据位置控制云台机构以调整拍摄装置的姿态以使跟踪对象位于拍摄画面中。In addition, the processor 601 is further configured to: determine the position of the tracking object in the shooting picture of the shooting device according to the depth data; and control the pan-tilt mechanism according to the position to adjust the posture of the shooting device so that the tracking object is located in the shooting picture.
另外,本发明实施例提供了一种计算机可读存储介质,存储介质为计算机可读存储介质,该计算机可读存储介质中存储有程序指令,程序指令用于实现上述图1-图4的方法。In addition, an embodiment of the present invention provides a computer-readable storage medium, where the storage medium is a computer-readable storage medium, and program instructions are stored in the computer-readable storage medium, and the program instructions are used to implement the above-mentioned methods of FIG. 1 to FIG. 4 . .
本发明实施例提供的一种云台系统,包括用于承载拍摄装置且用于调整拍摄装置的姿态的云台机构以及如图6所示的控制设备。A pan-tilt system provided by an embodiment of the present invention includes a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and a control device as shown in FIG. 6 .
以上各个实施例中的技术方案、技术特征在与本相冲突的情况下均可以单独,或者进行组合,只要未超出本领域技术人员的认知范围,均属于本申请保护范围内的等同实施例。The technical solutions and technical features in the above embodiments can be used alone or combined in the case of conflict with the present invention, as long as they do not exceed the cognitive scope of those skilled in the art, they all belong to the equivalent embodiments within the protection scope of the present application .
在本发明所提供的几个实施例中,应该理解到,所揭露的相关检测装置(例如:IMU)和方法,可以通过其它的方式实现。例如,以上所描述的遥控装置实施例仅仅是示意性的,例如,所述模块或单元的划分,仅仅为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,遥控装置或单元的间接耦合或通信连接,可以是电性,机械或其它的形式。In the several embodiments provided by the present invention, it should be understood that the disclosed related detection apparatus (eg, IMU) and method may be implemented in other manners. For example, the embodiments of the remote control device described above are only illustrative. For example, the division of the modules or units is only a logical function division. In actual implementation, there may be other division methods, such as multiple units or components. May be combined or may be integrated into another system, or some features may be omitted, or not implemented. On the other hand, the shown or discussed mutual coupling or direct coupling or communication connection may be through some interfaces, indirect coupling or communication connection of the remote control device or unit, which may be electrical, mechanical or other forms.
所述作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个网络单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。The units described as separate components may or may not be physically separated, and components displayed as units may or may not be physical units, that is, may be located in one place, or may be distributed to multiple network units. Some or all of the units may be selected according to actual needs to achieve the purpose of the solution in this embodiment.
另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中, 也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, or each unit may exist physically alone, or two or more units may be integrated into one unit. The above-mentioned integrated units may be implemented in the form of hardware, or may be implemented in the form of software functional units.
所述集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个计算机可读取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得计算机处理器(processor)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、移动硬盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、磁盘或者光盘等各种可以存储程序代码的介质。The integrated unit, if implemented in the form of a software functional unit and sold or used as an independent product, may be stored in a computer-readable storage medium. Based on such understanding, the technical solution of the present invention is essentially or the part that contributes to the prior art, or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions for causing a computer processor (processor) to execute all or part of the steps of the methods described in the various embodiments of the present invention. The aforementioned storage medium includes: U disk, mobile hard disk, Read-Only Memory (ROM, Read-Only Memory), Random Access Memory (RAM, Random Access Memory), magnetic disk or optical disk and other media that can store program codes.
以上所述仅为本发明的实施例,并非因此限制本发明的专利范围,凡是利用本发明说明书及附图内容所作的等效结构或等效流程变换,或直接或间接运用在其他相关的技术领域,均同理包括在本发明的专利保护范围内。The above descriptions are only the embodiments of the present invention, and are not intended to limit the scope of the present invention. Any equivalent structure or equivalent process transformation made by using the contents of the description and drawings of the present invention, or directly or indirectly applied to other related technologies Fields are similarly included in the scope of patent protection of the present invention.
最后应说明的是:以上各实施例仅用以说明本发明的技术方案,而非对其限制;尽管参照前述各实施例对本发明进行了详细的说明,本领域的普通技术人员应当理解:其依然可以对前述各实施例所记载的技术方案进行修改,或者对其中部分或者全部技术特征进行等同替换;而这些修改或者替换,并不使相应技术方案的本质脱离本发明各实施例技术方案的范围。Finally, it should be noted that the above embodiments are only used to illustrate the technical solutions of the present invention, but not to limit them; although the present invention has been described in detail with reference to the foregoing embodiments, those of ordinary skill in the art should understand that: The technical solutions described in the foregoing embodiments can still be modified, or some or all of the technical features thereof can be equivalently replaced; and these modifications or replacements do not make the essence of the corresponding technical solutions deviate from the technical solutions of the embodiments of the present invention. scope.

Claims (40)

  1. 一种云台系统的控制方法,其中,所述云台系统包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构,所述拍摄装置用于对环境进行拍摄,其特征在于,所述方法包括:A control method of a pan/tilt system, wherein the pan/tilt system includes a pan/tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used to photograph the environment, wherein That is, the method includes:
    获取深度传感器对所述环境观测得到的深度数据;acquiring depth data obtained by the depth sensor observing the environment;
    根据所述深度数据确定所述环境中与所述拍摄装置距离最近的目标对象;Determine the target object closest to the photographing device in the environment according to the depth data;
    控制所述云台机构以调整所述拍摄装置的姿态以将所述目标对象调整至所述拍摄装置的拍摄画面中的预设位置。The pan-tilt mechanism is controlled to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
  2. 根据权利要求1所述的方法,其特征在于,所述根据所述深度数据确定所述环境中与所述拍摄装置距离最近的目标对象,包括:The method according to claim 1, wherein the determining, according to the depth data, the target object that is closest to the photographing device in the environment comprises:
    根据所述深度数据确定所述环境中多个对象与所述深度传感器的距离;determining distances of a plurality of objects in the environment from the depth sensor according to the depth data;
    根据所述多个对象与所述深度传感器的距离从所述多个对象中确定所述目标对象。The target object is determined from the plurality of objects according to distances of the plurality of objects from the depth sensor.
  3. 根据权利要求2所述的方法,其特征在于,所述目标对象为所述环境中距离所述拍摄装置最近且其在拍摄画面中的图像区域最靠近所述拍摄画面中预设位置的对象;The method according to claim 2, wherein the target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to a preset position in the photographing picture;
    所述根据所述多个对象与所述深度传感器的距离从所述多个对象中确定所述目标对象,包括:The determining the target object from the plurality of objects according to the distances between the plurality of objects and the depth sensor includes:
    根据所述深度传感器与所述拍摄装置之间的安装位置关系确定所述多个对象在所述拍摄画面中的位置;Determine the positions of the plurality of objects in the shooting picture according to the installation position relationship between the depth sensor and the shooting device;
    根据所述多个对象与所述深度传感器的距离和所述多个对象在所述拍摄画面中的位置从所述多个对象中确定目标对象。A target object is determined from the plurality of objects according to the distances of the plurality of objects to the depth sensor and the positions of the plurality of objects in the photographing picture.
  4. 根据权利要求2所述的方法,其特征在于,所述根据所述多个对象与所述深度传感器的距离从所述多个对象中确定所述目标对象,包括:The method according to claim 2, wherein the determining the target object from the plurality of objects according to the distances between the plurality of objects and the depth sensor comprises:
    将所述多个对象中距离所述深度传感器最小的对象确定为所述目标对象。The object with the smallest distance from the depth sensor among the plurality of objects is determined as the target object.
  5. 根据权利要求2所述的方法,其特征在于,所述根据所述多个对象与所述深度传感器的距离从所述多个对象中确定所述目标对象,包括:The method according to claim 2, wherein the determining the target object from the plurality of objects according to the distances between the plurality of objects and the depth sensor comprises:
    根据所述多个对象与所述深度传感器的距离和所述深度传感器与所述拍摄装置之间的安装位置关系确定所述多个对象与拍摄装置之间的距离;determining the distances between the plurality of objects and the photographing device according to the distances of the plurality of objects and the depth sensor and the installation position relationship between the depth sensor and the photographing device;
    将多个对象中与拍摄装置之间的距离最小的对象确定为所述目标对象。An object with the smallest distance from the photographing device among the plurality of objects is determined as the target object.
  6. 根据权利要求1-5任一项所述的方法,其特征在于,所述控制所述云台机构以调整所述拍摄装置的姿态,包括:The method according to any one of claims 1-5, wherein the controlling the pan/tilt mechanism to adjust the posture of the photographing device comprises:
    根据所述目标对象在所述拍摄画面中的位置和所述预设位置之间的偏差控制所述云台机构以调整所述拍摄装置的姿态。The PTZ mechanism is controlled according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
  7. 根据权利要求1-6任一项所述的方法,其特征在于,所述目标对象为所述环境中距离所述拍摄装置最近且其在拍摄画面中的图像区域最靠近所述拍摄画面中预设位置的对象。The method according to any one of claims 1-6, wherein the target object is the closest to the photographing device in the environment and whose image area in the photographing picture is closest to the preset in the photographing picture The object to set the location to.
  8. 根据权利要求1-7任一项所述的方法,其特征在于,所述预设位置为所述拍摄画面的中心。The method according to any one of claims 1-7, wherein the preset position is the center of the shooting picture.
  9. 根据权利要求1-8任一项所述的方法,其特征在于,所述预设位置是由用户确定的。The method according to any one of claims 1-8, wherein the preset position is determined by a user.
  10. 根据权利要求1-9任一项所述的方法,其特征在于,所述拍摄装置与所述深度传感器固定安装。The method according to any one of claims 1-9, wherein the photographing device is fixedly installed with the depth sensor.
  11. 根据权利要求1-10任一项所述的方法,其特征在于,所述云台机构包括用于安装所述拍摄装置的承载基座;The method according to any one of claims 1-10, wherein the pan/tilt mechanism comprises a bearing base for installing the photographing device;
    所述深度传感器安装在所述承载基座或者所述拍摄装置上。The depth sensor is mounted on the carrying base or the photographing device.
  12. 根据权利要求1-11任一项所述的方法,其特征在于,所述深度传感器包括双目视觉传感器、TOF传感器、毫米波雷达或激光雷达。The method according to any one of claims 1-11, wherein the depth sensor comprises a binocular vision sensor, a TOF sensor, a millimeter-wave radar or a lidar.
  13. 根据权利要求1所述的方法,其特征在于,所述拍摄装置的光轴与所述深度传感器的光轴平行。The method according to claim 1, wherein the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
  14. 根据权利要求1-13任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-13, wherein the method further comprises:
    根据所述深度数据确定所述拍摄装置的拍摄画面中占据的图像区域最大的对象;Determine the object that occupies the largest image area in the shooting picture of the shooting device according to the depth data;
    控制所述云台机构和/或拍摄装置的焦距以使所述位于所述拍摄画面中的预设位置并且使所述对象的整体全部位于所述拍摄画面中。The focal length of the pan/tilt mechanism and/or the photographing device is controlled so that the preset position in the photographing picture and the entirety of the object are located in the photographing picture.
  15. 根据权利要求14所述的方法,其特征在于,所述控制所述云台机构和/或拍摄装置的焦距以使所述位于所述拍摄画面中的预设位置并且使所述对象的整体全部位于所述拍摄画面中,包括:The method according to claim 14, characterized in that the control of the pan/tilt mechanism and/or the focal length of the photographing device is performed so that the object is located at a preset position in the photographing screen and the whole of the object is completely Located in the shooting screen, including:
    若所述对象的整体不在所述拍摄画面中,控制拍摄装置的焦距以使所述对象的整体全部位于所述拍摄画面中;If the whole of the object is not in the shooting picture, controlling the focal length of the shooting device so that the whole of the object is located in the shooting picture;
    控制所述云台机构以调整所述拍摄装置的拍摄姿态以使所述对象位于所述拍摄画面中的预设位置。The pan-tilt mechanism is controlled to adjust the shooting posture of the shooting device so that the object is located at a preset position in the shooting picture.
  16. 根据权利要求1-15任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-15, wherein the method further comprises:
    确定拍摄装置画面中的对焦对象;Determine the focus object in the picture of the photographing device;
    根据所述深度数据确定所述对焦对象的距离;determining the distance of the focused object according to the depth data;
    根据所述距离,控制所述拍摄装置对焦,使得拍摄装置对焦在所述对焦对象上。According to the distance, the photographing device is controlled to focus, so that the photographing device focuses on the focusing object.
  17. 根据权利要求16所述的方法,其特征在于,所述确定拍摄装置画面中的对焦对象,包括:The method according to claim 16, wherein the determining the focus object in the picture of the photographing device comprises:
    识别所述拍摄画面中的对焦对象。Identifying a focus object in the shooting picture.
  18. 根据权利要求1-17任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-17, wherein the method further comprises:
    根据所述深度数据识别用户的特定手势;Identify the user's specific gesture based on the depth data;
    根据识别出来的特定手势,控制所述云台系统或者拍摄装置。The pan-tilt system or the photographing device is controlled according to the recognized specific gesture.
  19. 根据权利要求1-18任一项所述的方法,其特征在于,所述方法还包括:The method according to any one of claims 1-18, wherein the method further comprises:
    根据所述深度数据确定跟踪对象在拍摄装置的拍摄画面中的位置;Determine the position of the tracking object in the shooting picture of the shooting device according to the depth data;
    根据所述位置控制所述云台机构以调整所述拍摄装置的姿态以使跟踪对象位于所述拍摄画面中。The pan-tilt mechanism is controlled according to the position to adjust the posture of the photographing device so that the tracking object is located in the photographing screen.
  20. 一种云台系统的控制设备,其特征在于,所述云台系统包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构,所述拍摄装置用于对环境进行拍摄;所述控制设备还包括:存储器和处理器;A control device of a pan-tilt system, characterized in that the pan-tilt system comprises a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and the photographing device is used to photograph the environment; The control device further includes: a memory and a processor;
    所述存储器,用于存储计算机程序;the memory for storing computer programs;
    所述处理器,用于调用并执行所述计算机程序,以执行以下步骤:The processor is used to call and execute the computer program to perform the following steps:
    获取深度传感器对所述环境观测得到的深度数据;acquiring depth data obtained by the depth sensor observing the environment;
    根据所述深度数据确定所述环境中与所述拍摄装置距离最近的目标对象;Determine the target object closest to the photographing device in the environment according to the depth data;
    控制所述云台机构以调整所述拍摄装置的姿态以将所述目标对象调整至所述拍摄装置的拍摄画面中的预设位置。The pan-tilt mechanism is controlled to adjust the posture of the photographing device to adjust the target object to a preset position in the photographing screen of the photographing device.
  21. 根据权利要求20所述的控制设备,其特征在于,所述处理器,具体用于:The control device according to claim 20, wherein the processor is specifically configured to:
    根据所述深度数据确定所述环境中多个对象与所述深度传感器的距离;determining distances of a plurality of objects in the environment from the depth sensor according to the depth data;
    根据所述多个对象与所述深度传感器的距离从所述多个对象中确定所述目标对象。The target object is determined from the plurality of objects according to distances of the plurality of objects from the depth sensor.
  22. 根据权利要求21所述的控制设备,其特征在于,所述目标对象为所述环境中距离所述拍摄装置最近且其在拍摄画面中的图像区域最靠近所述拍摄画面中预设位置的对象;The control device according to claim 21, wherein the target object is an object in the environment that is closest to the photographing device and whose image area in the photographing picture is closest to a preset position in the photographing picture ;
    所述处理器,具体用于:The processor is specifically used for:
    根据所述深度传感器与所述拍摄装置之间的安装位置关系确定所述多个对象在所述拍摄画面中的位置;Determine the positions of the plurality of objects in the shooting picture according to the installation position relationship between the depth sensor and the shooting device;
    根据所述多个对象深度传感器的距离和所述多个对象在所述拍摄画面中的位置从所述多个对象中确定目标对象。A target object is determined from the plurality of objects according to the distances of the plurality of object depth sensors and the positions of the plurality of objects in the photographing picture.
  23. 根据权利要求21所述的控制设备,其特征在于,所述处理器,具体用于:The control device according to claim 21, wherein the processor is specifically configured to:
    将所述多个对象中距离所述深度传感器最小的对象确定为所述目标对象。The object with the smallest distance from the depth sensor among the plurality of objects is determined as the target object.
  24. 根据权利要求21所述的控制设备,其特征在于,所述处理器,具体用于:The control device according to claim 21, wherein the processor is specifically configured to:
    根据所述多个对象与所述深度传感器的距离和所述深度传感器与所述拍摄装置之间的安装位置关系确定所述多个对象与拍摄装置之间的距离;determining the distances between the plurality of objects and the photographing device according to the distances of the plurality of objects and the depth sensor and the installation position relationship between the depth sensor and the photographing device;
    将多个对象中与拍摄装置之间的距离最小的对象确定为所述目标对象。An object with the smallest distance from the photographing device among the plurality of objects is determined as the target object.
  25. 根据权利要求20-24任一项所述的控制设备,其特征在于,所述处理器,具体用于:The control device according to any one of claims 20-24, wherein the processor is specifically configured to:
    根据所述目标对象在所述拍摄画面中的位置和所述预设位置之间的偏差控制所述云台机构以调整所述拍摄装置的姿态。The PTZ mechanism is controlled according to the deviation between the position of the target object in the shooting picture and the preset position to adjust the posture of the shooting device.
  26. 根据权利要求20-25任一项所述的控制设备,其特征在于,所述目标对象为所述环境中距离所述拍摄装置最近且其在拍摄画面中的图像区域最靠近所述拍摄画面中预设位置的对象。The control device according to any one of claims 20-25, wherein the target object is the closest to the photographing device in the environment and whose image area in the photographing picture is closest to the photographing picture Objects in preset positions.
  27. 根据权利要求20-26任一项所述的控制设备,其特征在于,所述预设位置为所述拍摄画面的中心。The control device according to any one of claims 20-26, wherein the preset position is the center of the shooting picture.
  28. 根据权利要求20-27任一项所述的控制设备,其特征在于,所述预设位置是由用户确定的。The control device according to any one of claims 20-27, wherein the preset position is determined by a user.
  29. 根据权利要求20-28任一项所述的控制设备,其特征在于,所述拍摄装置与所述深度传感器固定安装。The control device according to any one of claims 20-28, wherein the photographing device is fixedly installed with the depth sensor.
  30. 根据权利要求20-29任一项所述的控制设备,其特征在于,所述云台机构包括用于安装所述拍摄装置的承载基座;The control device according to any one of claims 20-29, wherein the pan-tilt mechanism comprises a bearing base for installing the photographing device;
    所述深度传感器安装在所述承载基座或者所述拍摄装置上。The depth sensor is mounted on the carrying base or the photographing device.
  31. 根据权利要求20-30任一项所述的控制设备,其特征在于,所述深度传感器包括双目视觉传感器、TOF传感器、毫米波雷达或激光雷达。The control device according to any one of claims 20-30, wherein the depth sensor comprises a binocular vision sensor, a TOF sensor, a millimeter-wave radar or a lidar.
  32. 根据权利要求20所述的控制设备,其特征在于,所述拍摄装置的光轴与所述深度传感器的光轴平行。The control device according to claim 20, wherein the optical axis of the photographing device is parallel to the optical axis of the depth sensor.
  33. 根据权利要求20-32任一项所述的控制设备,其特征在于,所述处理器,还用于:The control device according to any one of claims 20-32, wherein the processor is further configured to:
    根据所述深度数据确定所述拍摄装置的拍摄画面中占据的图像区域最大的对象;Determine the object that occupies the largest image area in the shooting picture of the shooting device according to the depth data;
    控制所述云台机构和/或拍摄装置的焦距以使所述位于所述拍摄画面中的预设位置并且使所述对象的整体全部位于所述拍摄画面中。The focal length of the pan/tilt mechanism and/or the photographing device is controlled so that the preset position in the photographing picture and the entirety of the object are located in the photographing picture.
  34. 根据权利要求33所述的控制设备,其特征在于,所述处理器,具体用于:The control device according to claim 33, wherein the processor is specifically configured to:
    若所述对象的整体不在所述拍摄画面中,控制拍摄装置的焦距以使所述对象的整体全部位于所述拍摄画面中;If the whole of the object is not in the shooting picture, controlling the focal length of the shooting device so that the whole of the object is located in the shooting picture;
    控制所述云台机构以调整所述拍摄装置的拍摄姿态以使所述对象位于所述拍摄画面中的预设位置。The pan-tilt mechanism is controlled to adjust the shooting posture of the shooting device so that the object is located at a preset position in the shooting picture.
  35. 根据权利要求20-34任一项所述的控制设备,其特征在于,所述处理器,还用于:The control device according to any one of claims 20-34, wherein the processor is further configured to:
    确定拍摄装置画面中的对焦对象;Determine the focus object in the picture of the photographing device;
    根据所述深度数据确定所述对焦对象的距离;determining the distance of the focused object according to the depth data;
    根据所述距离,控制所述拍摄装置对焦,使得拍摄装置对焦在所述对焦对象上。According to the distance, the photographing device is controlled to focus, so that the photographing device focuses on the focusing object.
  36. 根据权利要求35所述的控制设备,其特征在于,所述处理器,具体用于:识别所述拍摄画面中的对焦对象。The control device according to claim 35, wherein the processor is specifically configured to: identify the focus object in the shooting picture.
  37. 根据权利要求20-36任一项所述的控制设备,其特征在于,所述处理器,还用于:The control device according to any one of claims 20-36, wherein the processor is further configured to:
    根据所述深度数据识别用户的特定手势;Identify the user's specific gesture based on the depth data;
    根据识别出来的特定手势,控制所述云台系统或者拍摄装置。According to the identified specific gesture, the pan-tilt system or the photographing device is controlled.
  38. 根据权利要求20-37任一项所述的控制设备,其特征在于,所述处理器,还用于:The control device according to any one of claims 20-37, wherein the processor is further configured to:
    根据所述深度数据确定跟踪对象在拍摄装置的拍摄画面中的位置;Determine the position of the tracking object in the shooting picture of the shooting device according to the depth data;
    根据所述位置控制所述云台机构以调整所述拍摄装置的姿态以使跟踪对象位于所述拍摄画面中。The pan-tilt mechanism is controlled according to the position to adjust the posture of the photographing device so that the tracking object is located in the photographing screen.
  39. 一种云台系统,其特征在于,包括用于承载拍摄装置且用于调整所述拍摄装置的姿态的云台机构以及如权利要求20-38任一项控制设备。A pan-tilt system is characterized by comprising a pan-tilt mechanism for carrying a photographing device and for adjusting the posture of the photographing device, and a control device according to any one of claims 20-38.
  40. 一种计算机可读存储介质,其特征在于,所述存储介质为计算机可读存储介质,该计算机可读存储介质中存储有程序指令,所述程序指令用于实现权利要求1-19中任意一项所述的云台系统的控制方法。A computer-readable storage medium, characterized in that the storage medium is a computer-readable storage medium, and program instructions are stored in the computer-readable storage medium, and the program instructions are used to implement any one of claims 1-19 The control method of the PTZ system described in item.
PCT/CN2020/141711 2020-12-30 2020-12-30 Control method and control device for platform system, platform system, and storage medium WO2022141271A1 (en)

Priority Applications (2)

Application Number Priority Date Filing Date Title
CN202080069083.3A CN114556904A (en) 2020-12-30 2020-12-30 Control method and control device of holder system, holder system and storage medium
PCT/CN2020/141711 WO2022141271A1 (en) 2020-12-30 2020-12-30 Control method and control device for platform system, platform system, and storage medium

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
PCT/CN2020/141711 WO2022141271A1 (en) 2020-12-30 2020-12-30 Control method and control device for platform system, platform system, and storage medium

Publications (1)

Publication Number Publication Date
WO2022141271A1 true WO2022141271A1 (en) 2022-07-07

Family

ID=81668180

Family Applications (1)

Application Number Title Priority Date Filing Date
PCT/CN2020/141711 WO2022141271A1 (en) 2020-12-30 2020-12-30 Control method and control device for platform system, platform system, and storage medium

Country Status (2)

Country Link
CN (1) CN114556904A (en)
WO (1) WO2022141271A1 (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117557167A (en) * 2024-01-03 2024-02-13 微网优联科技(成都)有限公司 Production quality management method and system of cradle head machine

Families Citing this family (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116614614B (en) * 2023-07-19 2023-10-31 中国科学院自动化研究所 Adaptive distance control method, device and storage medium for three-dimensional camera

Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN107749952A (en) * 2017-11-09 2018-03-02 睿魔智能科技(东莞)有限公司 A method and system for intelligent unmanned photography based on deep learning
KR20180024454A (en) * 2016-08-30 2018-03-08 주식회사 코이노시스템 linked phone type selfie apparatus
CN108737718A (en) * 2018-03-21 2018-11-02 北京猎户星空科技有限公司 Image pickup method, device and smart machine
CN110291771A (en) * 2018-07-23 2019-09-27 深圳市大疆创新科技有限公司 A kind of depth information acquisition method and moveable platform of target object
CN111213364A (en) * 2018-12-21 2020-05-29 深圳市大疆创新科技有限公司 Shooting equipment control method, shooting equipment control device and shooting equipment
CN111246110A (en) * 2020-02-06 2020-06-05 浙江大华技术股份有限公司 Image output method and device, storage medium and electronic device
CN111862168A (en) * 2020-06-22 2020-10-30 上海摩象网络科技有限公司 Target object tracking method and device and double-camera system
CN112154650A (en) * 2019-08-13 2020-12-29 深圳市大疆创新科技有限公司 Focusing control method and device for shooting device and unmanned aerial vehicle

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN105957109A (en) * 2016-04-29 2016-09-21 北京博瑞爱飞科技发展有限公司 Target tracking method and device
CN108574822B (en) * 2017-03-08 2021-01-29 华为技术有限公司 Method for realizing target tracking, pan-tilt camera and monitoring platform
CN109765939A (en) * 2018-12-21 2019-05-17 中国科学院自动化研究所南京人工智能芯片创新研究院 Cloud platform control method, device and the storage medium of unmanned plane
CN111932588B (en) * 2020-08-07 2024-01-30 浙江大学 A tracking method for airborne UAV multi-target tracking system based on deep learning

Patent Citations (8)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
KR20180024454A (en) * 2016-08-30 2018-03-08 주식회사 코이노시스템 linked phone type selfie apparatus
CN107749952A (en) * 2017-11-09 2018-03-02 睿魔智能科技(东莞)有限公司 A method and system for intelligent unmanned photography based on deep learning
CN108737718A (en) * 2018-03-21 2018-11-02 北京猎户星空科技有限公司 Image pickup method, device and smart machine
CN110291771A (en) * 2018-07-23 2019-09-27 深圳市大疆创新科技有限公司 A kind of depth information acquisition method and moveable platform of target object
CN111213364A (en) * 2018-12-21 2020-05-29 深圳市大疆创新科技有限公司 Shooting equipment control method, shooting equipment control device and shooting equipment
CN112154650A (en) * 2019-08-13 2020-12-29 深圳市大疆创新科技有限公司 Focusing control method and device for shooting device and unmanned aerial vehicle
CN111246110A (en) * 2020-02-06 2020-06-05 浙江大华技术股份有限公司 Image output method and device, storage medium and electronic device
CN111862168A (en) * 2020-06-22 2020-10-30 上海摩象网络科技有限公司 Target object tracking method and device and double-camera system

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117557167A (en) * 2024-01-03 2024-02-13 微网优联科技(成都)有限公司 Production quality management method and system of cradle head machine
CN117557167B (en) * 2024-01-03 2024-03-19 微网优联科技(成都)有限公司 Production quality management method and system of cradle head machine

Also Published As

Publication number Publication date
CN114556904A (en) 2022-05-27

Similar Documents

Publication Publication Date Title
WO2018214078A1 (en) Photographing control method and device
US9927948B2 (en) Image display apparatus and image display method
EP3188467B1 (en) Method for image capturing using unmanned image capturing device and electronic device supporting the same
WO2020107372A1 (en) Control method and apparatus for photographing device, and device and storage medium
WO2017020150A1 (en) Image processing method, device and camera
WO2022100265A1 (en) Camera calibration method, apparatus, and system
US20210152750A1 (en) Information processing apparatus and method for controlling the same
CN106973221B (en) Unmanned aerial vehicle camera shooting method and system based on aesthetic evaluation
CN111479065A (en) Camera field angle synchronization method and device and image shooting method and device
WO2021127888A1 (en) Control method, smart glasses, mobile platform, gimbal, control system, and computer-readable storage medium
WO2020124517A1 (en) Photographing equipment control method, photographing equipment control device and photographing equipment
WO2019227333A1 (en) Group photograph photographing method and apparatus
WO2021168838A1 (en) Position information determining method, device, and storage medium
WO2020239094A1 (en) Focusing method and apparatus, aerial photography camera, and unmanned aerial vehicle
EP3621292B1 (en) Electronic device for obtaining images by controlling frame rate for external moving object through point of interest, and operating method thereof
JP2011166305A (en) Image processing apparatus and imaging apparatus
CN113647094A (en) Electronic device, method, and computer-readable medium for providing out-of-focus imaging effects in video
CN113805607B (en) Unmanned aerial vehicle shooting method and device, unmanned aerial vehicle and storage medium
US20230164442A1 (en) Method for providing image and electronic device supporting same
WO2022141271A1 (en) Control method and control device for platform system, platform system, and storage medium
CN114641983A (en) System and method for obtaining intelligent panoramic image
US10649460B2 (en) Interactive robots positionable for optimal interactions
CN110139038B (en) An autonomous surround shooting method, device and unmanned aerial vehicle
CN111093022A (en) Image shooting method, device, terminal and computer storage medium
US11949984B2 (en) Electronic device that performs a driving operation of a second camera based on a determination that a tracked object is leaving the field of view of a moveable first camera having a lesser angle of view than the second camera, method for controlling the same, and recording medium of recording program

Legal Events

Date Code Title Description
121 Ep: the epo has been informed by wipo that ep was designated in this application

Ref document number: 20967611

Country of ref document: EP

Kind code of ref document: A1

NENP Non-entry into the national phase

Ref country code: DE

122 Ep: pct application non-entry in european phase

Ref document number: 20967611

Country of ref document: EP

Kind code of ref document: A1