CN107426507A - Video image splicing device and splicing method thereof

Info

Publication number: CN107426507A
Application number: CN201610349672.2A
Authority: CN
Inventors: 王寒光; 王旭光
Original assignee: Suzhou Institute of Nano Tech and Nano Bionics of CAS
Current assignee: China Science Fusion Perception Intelligence Research Institute Suzhou Industrial Park Co ltd
Priority date: 2016-05-24
Filing date: 2016-05-24
Publication date: 2017-12-01

Abstract

The present invention provides a video image splicing device and a splicing method thereof. The video image splicing device includes: an image acquisition module for acquiring a target video image and a video image to be transformed, where the target video image partially overlaps the video image to be transformed; an image preprocessing module for performing image transformation preprocessing on the received video image to be transformed to obtain a preprocessed video image; and an image splicing acceleration module for performing accelerated splicing of the overlapping region of the received target video image and the preprocessed video image to obtain a spliced video image. Because it includes the image splicing acceleration module, the proposed device can perform real-time video acquisition, splicing, and display on an embedded architecture.

Description

Video image splicing device and splicing method thereof

Technical field

The invention relates to the field of video image processing, and in particular to a video image splicing device and a splicing method thereof.

Background

With the rapid development of computer technology, digital image processing is being studied and applied ever more deeply and widely across many fields. Video is a continuous, dynamic collection of images; because it carries more detailed information, it has attracted considerable attention from academia and industry. In video processing, multiple video streams with overlapping regions are usually spliced together to obtain high-resolution information about a large scene. Video splicing faces many challenges, however, chiefly real-time processing and the ghosting produced by moving objects.

Two kinds of solution are currently used to meet these challenges: one takes more constraints into account in the algorithm; the other relies on hardware acceleration, for example powerful servers or GPUs. Although both can improve the efficiency or quality of video processing in some respects, the first struggles to meet real-time requirements because of its algorithmic complexity, and the second depends too heavily on hardware and is unsuitable for resource-constrained scenarios such as embedded systems.

Summary of the invention

To solve the above problems, the present invention proposes a video image splicing device and a splicing method thereof that enable real-time video acquisition, splicing, and display on an embedded architecture.

The specific technical solution proposed by the present invention is a video image splicing device comprising: an image acquisition module for acquiring a target video image and a video image to be transformed, where the target video image partially overlaps the video image to be transformed; an image preprocessing module for performing image transformation preprocessing on the received video image to be transformed to obtain a preprocessed video image; and an image splicing acceleration module for performing accelerated splicing of the overlapping region of the received target video image and the preprocessed video image to obtain a spliced video image.

Further, the image splicing acceleration module includes: a first image format conversion unit for converting the format of the target video image and of the preprocessed video image; a filtering unit for filtering the format-converted target video image and preprocessed video image to obtain a first filtered image and a second filtered image, respectively; a downsampling unit for downsampling the first filtered image and the second filtered image to construct a first Gaussian pyramid and a second Gaussian pyramid, respectively; a difference unit for taking the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid to obtain a first Laplacian pyramid and a second Laplacian pyramid; a calculation unit for computing, layer by layer, a weighted average of the values of the first Laplacian pyramid that correspond to the overlapping region and the values of the second Laplacian pyramid that correspond to the overlapping region, to obtain a weighted-average Laplacian pyramid; an upsampling unit for upsampling the weighted-average Laplacian pyramid to obtain a video image awaiting inverse format conversion; and a second image format conversion unit for performing inverse format conversion on that video image to obtain the spliced video image.

Further, the image preprocessing module includes: a feature extraction unit for extracting feature points of the target video image and of the video image to be transformed; a feature matching unit for matching the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance and for computing a homography matrix from the matches; and an image transformation unit for using the homography matrix to perform image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image.

Further, the device also includes a storage unit (DDR) connected between the image preprocessing module and the image splicing acceleration module; the DDR stores the target video image and the preprocessed video image.

Further, the device also includes a display module connected to the image splicing acceleration module; the display module displays the spliced video image.

The present invention also provides a method for splicing video images, comprising:

acquiring a target video image and a video image to be transformed, where the target video image partially overlaps the video image to be transformed;

performing image transformation preprocessing on the video image to be transformed to obtain a preprocessed video image;

splicing the overlapping region of the target video image and the preprocessed video image to obtain a spliced video image.

Further, splicing the overlapping region of the target video image and the preprocessed video image includes:

converting the format of the target video image and of the preprocessed video image;

filtering the format-converted target video image and preprocessed video image with a Gaussian kernel to obtain a first filtered image and a second filtered image, respectively;

downsampling the first filtered image and the second filtered image to construct a first Gaussian pyramid and a second Gaussian pyramid, respectively;

taking the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid to obtain a first Laplacian pyramid and a second Laplacian pyramid;

computing, layer by layer, a weighted average of the values of the first Laplacian pyramid that correspond to the overlapping region and the values of the second Laplacian pyramid that correspond to the overlapping region, to obtain a weighted-average Laplacian pyramid;

upsampling the weighted-average Laplacian pyramid to obtain a video image awaiting inverse format conversion;

performing inverse format conversion on that video image to obtain the spliced video image.

Further, performing image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image includes:

extracting feature points of the target video image and of the video image to be transformed;

matching the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance;

computing a homography matrix from the matches;

using the homography matrix to perform image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image.
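The last of these steps, applying the homography, can be illustrated with a minimal sketch. The patent does not say how the warp is implemented, so everything here is an assumption for illustration: the function names `apply_homography` and `warp_nearest`, the nested-list image representation, nearest-neighbour sampling, and the example matrix (a pure 2-pixel horizontal shift).

```python
def apply_homography(H, x, y):
    """Map pixel (x, y) through a 3x3 homography H given as nested lists."""
    xh = H[0][0] * x + H[0][1] * y + H[0][2]
    yh = H[1][0] * x + H[1][1] * y + H[1][2]
    w = H[2][0] * x + H[2][1] * y + H[2][2]
    if w == 0:
        raise ValueError("point maps to infinity")
    # Divide by the homogeneous coordinate to return to pixel coordinates.
    return xh / w, yh / w


def warp_nearest(image, H_inv, out_w, out_h, fill=0):
    """Warp `image` by inverse mapping: for every output pixel, look up the
    source pixel through the inverse homography (nearest neighbour)."""
    src_h, src_w = len(image), len(image[0])
    out = [[fill] * out_w for _ in range(out_h)]
    for v in range(out_h):
        for u in range(out_w):
            x, y = apply_homography(H_inv, u, v)
            xi, yi = int(round(x)), int(round(y))
            if 0 <= xi < src_w and 0 <= yi < src_h:
                out[v][u] = image[yi][xi]
    return out


# Hypothetical example: a homography that shifts the image 2 pixels right.
H = [[1, 0, 2], [0, 1, 0], [0, 0, 1]]
H_inv = [[1, 0, -2], [0, 1, 0], [0, 0, 1]]
warped = warp_nearest([[1, 2, 3]], H_inv, out_w=5, out_h=1)
```

Inverse mapping is used so that every output pixel receives a value; forward mapping of source pixels would leave holes wherever the transform stretches the image.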

Further, after the image transformation preprocessing has been performed on the video image to be transformed to obtain the preprocessed video image, the target video image and the preprocessed video image are stored in the DDR;

before the overlapping region of the target video image and the preprocessed video image is spliced, the target video image and the preprocessed video image are read from the DDR.

Further, after the spliced video image is obtained, the spliced video image is displayed.

In the video image splicing device and splicing method proposed by the present invention, the device includes an image splicing acceleration module that is a user-defined IP core, so the video images can be spliced according to a user-defined image splicing algorithm. In addition, the VDMA unit, the image splicing acceleration module, and the output unit are implemented in an FPGA. Through this hardware/software co-design, real-time video acquisition, splicing, and display are achieved on an embedded architecture.

Brief description of the drawings

The above and other aspects, features, and advantages of embodiments of the present invention will become clearer from the following description taken in conjunction with the accompanying drawings, in which:

Fig. 1 is a schematic diagram of the circuit structure of the video image splicing device;

Fig. 2 is a schematic diagram of the circuit structure of the image preprocessing module;

Fig. 3 is a schematic diagram of the circuit structure of the programmable logic (PL);

Fig. 4 is a schematic diagram of the circuit structure of the image splicing acceleration module;

Fig. 5 is a flowchart of the video image splicing method;

Fig. 6 is a flowchart of step S2;

Fig. 7 is a flowchart of step S4.

Detailed description

Embodiments of the present invention are described in detail below with reference to the accompanying drawings. The invention may, however, be embodied in many different forms and should not be construed as limited to the specific embodiments set forth here. Rather, these embodiments are provided to explain the principles of the invention and its practical application, so that others skilled in the art can understand the various embodiments of the invention and the various modifications suited to a particular intended use.

The video image splicing device provided by the present invention is an embedded video image splicing device based on the Xilinx ZYNQ platform. It includes a processing system (PS) and programmable logic (PL); here the PS is the ARM processor. In the present invention the PS and the PL are integrated on a single chip.

Referring to Fig. 1, the video image splicing device of this embodiment includes an image acquisition module 10, the PS, the PL, and a storage unit (DDR) 30 connected between the PS and the PL. Preferably, the DDR 30 is a double data rate synchronous dynamic random-access memory. The PS includes an image preprocessing module 20, and the PL includes an image splicing acceleration module 40 and a display module 50.

The image acquisition module 10 acquires the target video image and the video image to be transformed, where the target video image partially overlaps the video image to be transformed; for example, the image acquisition module 10 is a camera. The image acquisition module 10 is connected to the image preprocessing module 20, for example through a USB interface. The image preprocessing module 20 performs image transformation preprocessing on the received video image to be transformed to obtain the preprocessed video image. The DDR 30 is connected between the image preprocessing module 20 and the image splicing acceleration module 40 and stores the target video image and the preprocessed video image. The image splicing acceleration module 40 is connected to the DDR 30 and, in turn, to the display module 50; it performs accelerated splicing of the overlapping region of the target video image and the preprocessed video image to obtain the spliced video image. The image splicing acceleration module 40 is a user-defined IP core. The display module 50 displays the spliced video image.
To obtain a spliced video image of good quality, the area of the overlapping region is 20% to 80% of the area of the smaller of the target video image and the video image to be transformed; taking the amount of computation into account as well, the area of the overlapping region is preferably 40% to 60% of that smaller area.

Referring to Fig. 2, the image preprocessing module 20 includes a feature extraction unit 21, a feature matching unit 22, and an image transformation unit 23 connected in sequence. The feature extraction unit 21 is connected to the image acquisition module 10; it receives the target video image and the video image to be transformed and extracts the feature points of each. The feature matching unit 22 matches the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance and computes a homography matrix from the matches. The image transformation unit 23 uses the homography matrix to perform image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image. The image transformation unit 23 is also connected to the DDR 30 and sends the preprocessed video image to the DDR 30 for storage.
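The overlap constraint given above (20% to 80% of the smaller image's area, preferably 40% to 60%) comes down to simple rectangle arithmetic. The sketch below assumes, purely for illustration, that both images are axis-aligned rectangles `(x, y, width, height)` in a shared coordinate frame; after a general homography the transformed image is a quadrilateral, so a real check would need polygon intersection. The function names are hypothetical.

```python
def overlap_fraction(rect_a, rect_b):
    """Overlap area divided by the area of the smaller rectangle.

    Rectangles are (x, y, width, height) in a shared coordinate frame
    (an assumption; the patent does not define a representation).
    """
    ax, ay, aw, ah = rect_a
    bx, by, bw, bh = rect_b
    overlap_w = max(0, min(ax + aw, bx + bw) - max(ax, bx))
    overlap_h = max(0, min(ay + ah, by + bh) - max(ay, by))
    smaller_area = min(aw * ah, bw * bh)
    return (overlap_w * overlap_h) / smaller_area


def overlap_ok(rect_a, rect_b, low=0.20, high=0.80):
    """True if the overlap lies in [low, high]; the defaults are the patent's
    general range, and low=0.40, high=0.60 gives the preferred range."""
    return low <= overlap_fraction(rect_a, rect_b) <= high


# Two 640x480 frames, the second shifted 320 pixels to the right: 50% overlap.
target = (0, 0, 640, 480)
shifted = (320, 0, 640, 480)
fraction = overlap_fraction(target, shifted)
```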

Referring to Fig. 3, the PL further includes a VDMA unit 41 and an output unit 42. The VDMA unit 41 is connected between the DDR 30 and the image splicing acceleration module 40. It communicates with the DDR 30 through the HP interface, reads the target video image and the preprocessed video image from the DDR 30, and transfers them to the image splicing acceleration module 40 through an AXI4-Stream interface. The output unit 42 is connected between the image splicing acceleration module 40 and the display module 50 and converts the format of the spliced video image so that it matches the display format of the display module 50. For example, when the display module 50 is an HDMI display and the spliced video image that the output unit 42 receives from the image splicing acceleration module 40 is in RGB color mode, the output unit 42 converts it to YCbCr before it is shown on the display module 50.
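As a concrete illustration of the RGB-to-YCbCr conversion performed by the output unit 42, the sketch below uses the full-range BT.601 coefficients (the JFIF convention). This is an assumption: the patent does not state which conversion matrix or value range the output unit uses, and HDMI links often carry limited-range YCbCr instead. The function name is hypothetical.

```python
def rgb_to_ycbcr(r, g, b):
    """Convert one 8-bit RGB pixel to 8-bit YCbCr.

    Assumes full-range BT.601 (JFIF) coefficients; the patent does not
    specify the matrix or range actually used by the output unit.
    """
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128 + 0.5 * r - 0.418688 * g - 0.081312 * b

    def clamp(value):
        # Round to the nearest integer and keep the result in 0..255.
        return max(0, min(255, int(round(value))))

    return clamp(y), clamp(cb), clamp(cr)


white = rgb_to_ycbcr(255, 255, 255)
black = rgb_to_ycbcr(0, 0, 0)
```

Neutral colors carry no chroma, so both white and black map to Cb = Cr = 128 and differ only in Y.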

In this embodiment, the VDMA unit 41, the image splicing acceleration module 40, and the output unit 42 are implemented in an FPGA. They can of course be implemented in other ways; the FPGA is given only as an example and is not intended to limit the invention.

Referring to Fig. 4, the image splicing acceleration module 40 includes a first image format conversion unit 100, a filtering unit 101, a downsampling unit 102, a difference unit 103, a calculation unit 104, an upsampling unit 105, and a second image format conversion unit 106 connected in sequence. The first image format conversion unit 100 is also connected to the VDMA unit 41, and the second image format conversion unit 106 is connected between the upsampling unit 105 and the output unit 42.

The first image format conversion unit 100 converts the target video image and the preprocessed video image from AXIvideo format to Mat format. The filtering unit 101 filters the Mat-format target video image and preprocessed video image to obtain a first filtered image and a second filtered image, respectively. The downsampling unit 102 downsamples the first filtered image and the second filtered image to construct a first Gaussian pyramid and a second Gaussian pyramid, respectively. The difference unit 103 takes the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid to obtain a first Laplacian pyramid and a second Laplacian pyramid. The calculation unit 104 computes, layer by layer, a weighted average of the values of the first Laplacian pyramid that correspond to the overlapping region and the values of the second Laplacian pyramid that correspond to the overlapping region, to obtain a weighted-average Laplacian pyramid. The upsampling unit 105 upsamples the received weighted-average Laplacian pyramid to obtain a video image awaiting inverse format conversion. The second image format conversion unit 106 performs inverse format conversion on that video image to obtain the spliced video image; that is, it converts the spliced video image from Mat format to AXIvideo format.

Referring to Fig. 5, this embodiment also provides a splicing method for the video image splicing device. The method includes:

Step S1. Acquire the target video image and the video image to be transformed, where the target video image partially overlaps the video image to be transformed. To obtain a spliced video image of good quality, the area of the overlapping region is 20% to 80% of the area of the smaller of the target video image and the video image to be transformed; taking the amount of computation into account as well, the area of the overlapping region is preferably 40% to 60% of that smaller area.

Step S2. Perform image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image.

Referring to Fig. 6, step S2 specifically includes:

S21. Extract the feature points of the target video image and of the video image to be transformed, using the ORB (Oriented FAST and Rotated BRIEF) algorithm for feature point extraction.

S22. Match the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance.

S23. Compute the homography matrix from the matches.

S24. Use the homography matrix to perform image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image.
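Step S22 can be sketched as brute-force nearest-neighbour matching of binary descriptors, such as those ORB produces. The patent states only that matching uses Hamming distance; the distance threshold `max_distance`, the ratio test, and the representation of each descriptor as a Python integer are assumptions added for illustration, and the function names are hypothetical. Estimating the homography from the resulting matches (step S23) is not shown.

```python
def hamming(a: int, b: int) -> int:
    """Number of differing bits between two binary descriptors."""
    return bin(a ^ b).count("1")


def match_descriptors(target_desc, source_desc, max_distance=64, ratio=0.8):
    """Match each target descriptor to its nearest source descriptor.

    Returns (target_index, source_index, distance) tuples. `max_distance`
    and the ratio test are illustrative filters, not taken from the patent.
    """
    matches = []
    for i, descriptor in enumerate(target_desc):
        ranked = sorted((hamming(descriptor, s), j)
                        for j, s in enumerate(source_desc))
        if not ranked:
            continue
        best_dist, best_j = ranked[0]
        if best_dist > max_distance:
            continue  # nearest neighbour is still too different
        if len(ranked) > 1 and best_dist >= ratio * ranked[1][0]:
            continue  # ambiguous: second-best is almost as close
        matches.append((i, best_j, best_dist))
    return matches


# Toy 8-bit descriptors; real ORB descriptors are 256 bits long.
target_descriptors = [0b10110010, 0b00001111]
source_descriptors = [0b00001110, 0b10110011, 0b11111111]
matches = match_descriptors(target_descriptors, source_descriptors)
```

Hamming distance suits binary descriptors because it reduces to an XOR followed by a bit count, both of which are cheap on embedded processors.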

Referring again to Fig. 5, step S3. Store the target video image and the preprocessed video image in the DDR 30.

Step S4. Splice the overlapping region of the target video image and the preprocessed video image to obtain the spliced video image.

Referring to Fig. 7, step S4 specifically includes:

S41. Read the target video image and the preprocessed video image from the DDR 30 and convert the format of each; for example, convert the target video image and the preprocessed video image from AXIvideo format to Mat format.

S42. Filter the format-converted target video image and preprocessed video image with a Gaussian kernel to obtain a first filtered image and a second filtered image, respectively.

S43. Downsample the first filtered image and the second filtered image to construct a first Gaussian pyramid and a second Gaussian pyramid, respectively.

S44. Take the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid to obtain a first Laplacian pyramid and a second Laplacian pyramid.

S45. Compute, layer by layer, a weighted average of the values of the first Laplacian pyramid that correspond to the overlapping region and the values of the second Laplacian pyramid that correspond to the overlapping region, to obtain a weighted-average Laplacian pyramid.

S46. Upsample the weighted-average Laplacian pyramid to obtain a video image awaiting inverse format conversion; this video image is in Mat format.

S47. Perform inverse format conversion on that video image to obtain the spliced video image, which is in AXIvideo format.
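Steps S42 to S46 together form a Laplacian pyramid blend. The sketch below shows the idea on a single row of pixels from the overlapping region so that it stays short; a real implementation works on 2D images and, in the patent, runs in the FPGA. Several choices are assumptions, not taken from the patent: the 5-tap binomial kernel standing in for the Gaussian kernel, border clamping, upsampling by sample repetition followed by blurring, three pyramid levels, and per-pixel weights that are themselves reduced with a Gaussian pyramid. All function names are hypothetical.

```python
KERNEL = (1 / 16, 4 / 16, 6 / 16, 4 / 16, 1 / 16)  # binomial ~ Gaussian


def blur(signal):
    """Filter with KERNEL, clamping indices at the borders (S42)."""
    n = len(signal)
    return [sum(w * signal[min(max(i + k - 2, 0), n - 1)]
                for k, w in enumerate(KERNEL)) for i in range(n)]


def downsample(signal):
    """Blur, then keep every second sample (S43)."""
    return blur(signal)[::2]


def upsample(signal, size):
    """Repeat samples up to `size`, then blur to smooth the steps (S46)."""
    last = len(signal) - 1
    return blur([signal[min(i // 2, last)] for i in range(size)])


def gaussian_pyramid(signal, levels):
    pyramid = [[float(v) for v in signal]]
    for _ in range(levels - 1):
        pyramid.append(downsample(pyramid[-1]))
    return pyramid


def laplacian_pyramid(signal, levels):
    """Difference between each Gaussian layer and the upsampled next one (S44)."""
    g = gaussian_pyramid(signal, levels)
    lap = [[a - b for a, b in zip(g[k], upsample(g[k + 1], len(g[k])))]
           for k in range(levels - 1)]
    lap.append(g[-1])  # the coarsest layer keeps the low-pass residual
    return lap


def collapse(lap):
    """Rebuild a signal by upsampling and adding from coarse to fine (S46)."""
    out = lap[-1]
    for layer in reversed(lap[:-1]):
        out = [a + b for a, b in zip(layer, upsample(out, len(layer)))]
    return out


def blend(row_a, row_b, weight_a, levels=3):
    """Layer-by-layer weighted average of two Laplacian pyramids (S45)."""
    la = laplacian_pyramid(row_a, levels)
    lb = laplacian_pyramid(row_b, levels)
    gw = gaussian_pyramid(weight_a, levels)
    merged = [[w * x + (1.0 - w) * y for x, y, w in zip(xa, xb, ww)]
              for xa, xb, ww in zip(la, lb, gw)]
    return collapse(merged)


# One 16-pixel row of the overlap: image A is dark, image B is bright, and
# A's weight ramps from 1 at the left edge down to 0 at the right edge.
ramp = [1 - i / 15 for i in range(16)]
blended = blend([100.0] * 16, [200.0] * 16, ramp)
```

Blending each frequency band separately lets low-frequency content, such as an exposure difference between the two cameras, transition over a wide region while fine detail transitions over a narrow one, which hides the seam better than a single weighted average of the pixel values.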

Referring again to Fig. 5, step S5. Display the spliced video image.

The above describes only specific embodiments of the present application. It should be noted that those of ordinary skill in the art can make further improvements and refinements without departing from the principles of the present application, and such improvements and refinements shall also be regarded as falling within the scope of protection of the present application.

Claims

1. A video image splicing device, characterized by comprising:

an image acquisition module, configured to acquire a target video image and a video image to be transformed, where the target video image partially overlaps the video image to be transformed;

an image preprocessing module, configured to perform image transformation preprocessing on the received video image to be transformed, so as to obtain a preprocessed video image;

an image splicing acceleration module, configured to splice the overlapping region of the received target video image and the preprocessed video image to obtain a spliced video image.

2. The video image splicing device according to claim 1, characterized in that the image splicing acceleration module comprises:

a first image format conversion unit, configured to perform format conversion on the target video image and the preprocessed video image respectively;

a filtering unit, configured to respectively filter the format-converted target video image and the preprocessed video image to obtain a first filtered image and a second filtered image;

a downsampling unit, configured to downsample the first filtered image and the second filtered image, respectively, to construct a first Gaussian pyramid and a second Gaussian pyramid;

a difference unit, configured to take the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid, so as to obtain a first Laplacian pyramid and a second Laplacian pyramid;

a calculation unit, configured to compute, layer by layer, a weighted average of the values of the first Laplacian pyramid corresponding to the overlapping region and the values of the second Laplacian pyramid corresponding to the overlapping region, to obtain a weighted-average Laplacian pyramid;

an upsampling unit, configured to upsample the weighted-average Laplacian pyramid, so as to obtain a video image awaiting inverse format conversion;

a second image format conversion unit, configured to perform inverse format conversion on the video image awaiting inverse format conversion to obtain the spliced video image.

3. The video image splicing device according to claim 1, characterized in that the image preprocessing module comprises:

a feature extraction unit, configured to extract feature points of the target video image and of the video image to be transformed;

a feature matching unit, configured to match the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance and to compute a homography matrix from the matches;

an image transformation unit, configured to use the homography matrix to perform image transformation preprocessing on the video image to be transformed, so as to obtain the preprocessed video image.

4. The video image splicing device according to claim 1, further comprising a storage unit connected between the image preprocessing module and the image splicing acceleration module, the storage unit being used to store the target video image and the preprocessed video image.

5. The video image splicing device according to claim 1, further comprising a display module connected to the image splicing acceleration module, the display module being used to display the spliced video images.

6. A method for splicing video images, characterized by comprising:

acquiring a target video image and a video image to be transformed, where the target video image partially overlaps the video image to be transformed;

performing image transformation preprocessing on the video image to be transformed to obtain a preprocessed video image;

splicing the overlapping region of the target video image and the preprocessed video image to obtain a spliced video image.

7. The method for splicing video images according to claim 6, characterized in that splicing the overlapping region of the target video image and the preprocessed video image comprises:

performing format conversion on the target video image and the preprocessed video image respectively;

filtering the format-converted target video image and the preprocessed video image respectively with a Gaussian kernel to obtain a first filtered image and a second filtered image;

downsampling the first filtered image and the second filtered image respectively to construct a first Gaussian pyramid and a second Gaussian pyramid;

taking the difference between adjacent layers of the first Gaussian pyramid and between adjacent layers of the second Gaussian pyramid to obtain a first Laplacian pyramid and a second Laplacian pyramid;

computing, layer by layer, a weighted average of the values of the first Laplacian pyramid corresponding to the overlapping region and the values of the second Laplacian pyramid corresponding to the overlapping region, to obtain a weighted-average Laplacian pyramid;

upsampling the weighted-average Laplacian pyramid to obtain a video image awaiting inverse format conversion;

performing inverse format conversion on the video image awaiting inverse format conversion to obtain the spliced video image.

8. The method for splicing video images according to claim 6, characterized in that performing image transformation preprocessing on the video image to be transformed to obtain the preprocessed video image comprises:

extracting feature points of the target video image and of the video image to be transformed;

matching the feature points of the target video image against the feature points of the video image to be transformed according to Hamming distance;

computing a homography matrix from the matches;

using the homography matrix to perform image transformation preprocessing on the video image to be transformed, so as to obtain the preprocessed video image.

9. The method for splicing video images according to claim 6, characterized in that, after the image transformation preprocessing has been performed on the video image to be transformed to obtain the preprocessed video image, the target video image and the preprocessed video image are stored in a storage unit;

before the overlapping region of the target video image and the preprocessed video image is spliced, the target video image and the preprocessed video image are read from the storage unit.

10. The method for splicing video images according to claim 6, characterized in that, after the spliced video image is obtained, the spliced video image is displayed.