Nothing Special   »   [go: up one dir, main page]

CN115984124A - A neuromorphic pulse signal denoising and super-resolution method and device - Google Patents

A neuromorphic pulse signal denoising and super-resolution method and device Download PDF

Info

Publication number
CN115984124A
CN115984124A CN202211543963.7A CN202211543963A CN115984124A CN 115984124 A CN115984124 A CN 115984124A CN 202211543963 A CN202211543963 A CN 202211543963A CN 115984124 A CN115984124 A CN 115984124A
Authority
CN
China
Prior art keywords
pulse
resolution
denoising
super
data
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202211543963.7A
Other languages
Chinese (zh)
Inventor
施柏鑫
段沛奇
马逸
周鑫渝
施新宇
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Original Assignee
Peking University
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Peking University filed Critical Peking University
Priority to CN202211543963.7A priority Critical patent/CN115984124A/en
Publication of CN115984124A publication Critical patent/CN115984124A/en
Pending legal-status Critical Current

Links

Images

Classifications

    • YGENERAL TAGGING OF NEW TECHNOLOGICAL DEVELOPMENTS; GENERAL TAGGING OF CROSS-SECTIONAL TECHNOLOGIES SPANNING OVER SEVERAL SECTIONS OF THE IPC; TECHNICAL SUBJECTS COVERED BY FORMER USPC CROSS-REFERENCE ART COLLECTIONS [XRACs] AND DIGESTS
    • Y02TECHNOLOGIES OR APPLICATIONS FOR MITIGATION OR ADAPTATION AGAINST CLIMATE CHANGE
    • Y02DCLIMATE CHANGE MITIGATION TECHNOLOGIES IN INFORMATION AND COMMUNICATION TECHNOLOGIES [ICT], I.E. INFORMATION AND COMMUNICATION TECHNOLOGIES AIMING AT THE REDUCTION OF THEIR OWN ENERGY USE
    • Y02D10/00Energy efficient computing, e.g. low power processors, power management or thermal management

Landscapes

  • Closed-Circuit Television Systems (AREA)

Abstract

本发明公开了一种神经形态脉冲信号去噪和超分辨方法及装置,通过在显示屏中设置不同分辨率的相同视频,并用脉冲相机来拍摄显示屏,从而得到不同分辨率的真实脉冲数据对,用实拍数据集作为训练集,避免了由于仿真数据与真实数据的差距太大而导致训练后的网络对真实数据不兼容的问题,解决了脉冲信号仿真器无法准确生成事件数据的问题。同时使用深度学习的方法,利用3D‑UNet网络模型来学习脉冲信号去噪和超分辨率重建的端到端的映射模型,在输入只有脉冲序列的情况下,就可以有效实现对事件的去噪和超分辨任务,避免了现有方法依赖视频帧和IMU信息,省去了求解光流信息的过程,节省了大量的运行时间,极大的提升了处理速度。

Figure 202211543963

The invention discloses a neuromorphic pulse signal denoising and super-resolution method and device. By setting the same video with different resolutions in the display screen and using a pulse camera to shoot the display screen, real pulse data pairs with different resolutions can be obtained. , using the real shot data set as the training set avoids the problem that the trained network is not compatible with the real data due to the large gap between the simulated data and the real data, and solves the problem that the pulse signal simulator cannot accurately generate event data. At the same time, using the deep learning method, the 3D-UNet network model is used to learn the end-to-end mapping model of pulse signal denoising and super-resolution reconstruction. When the input is only the pulse sequence, it can effectively realize the denoising and super-resolution of the event. The super-resolution task avoids the dependence of existing methods on video frames and IMU information, saves the process of solving optical flow information, saves a lot of running time, and greatly improves the processing speed.

Figure 202211543963

Description

一种神经形态脉冲信号去噪和超分辨方法及装置A neuromorphic pulse signal denoising and super-resolution method and device

技术领域technical field

本发明涉及计算机视觉技术领域,尤其涉及一种神经形态脉冲信号去噪和超分辨方法及装置。The invention relates to the technical field of computer vision, in particular to a neuromorphic pulse signal denoising and super-resolution method and device.

背景技术Background technique

随着计算机技术发展,计算机算力逐渐加强,机器学习、深度学习技术快速进步,计算机视觉相关技术逐渐应用到各个场景,例如手机相机的人脸检测、修图美图、夜间拍照等功能,无人驾驶中的行人检测、道路识别,移动支付与车站身份检测的人脸识别,或是机器人的同步定位与建图任务等。随着大数据、智能化时代的来临,越来越多的应用场景需要计算视觉技术的支持,海量的视频、图像数据亟待处理,更凸显底层视觉任务的重要意义。由此,底层图像处理技术的不可替代性及其对于更高语义层次任务的重要意义,受到社会广泛关注。低噪声、低模糊、高空间分辨率、高时间分辨率、高动态范围等特性的成像,作为计算摄像学的基础任务,其发展对于其他计算机视觉技术来说极其重要。With the development of computer technology, computer computing power is gradually strengthened, machine learning and deep learning technology are advancing rapidly, and computer vision-related technologies are gradually applied to various scenarios, such as face detection of mobile phone cameras, retouching and beautifying pictures, and taking pictures at night. Pedestrian detection and road recognition in human driving, face recognition for mobile payment and station identity detection, or simultaneous positioning and mapping tasks for robots, etc. With the advent of the era of big data and intelligence, more and more application scenarios need the support of computational vision technology. Massive video and image data need to be processed urgently, which highlights the importance of underlying visual tasks. Therefore, the irreplaceability of the underlying image processing technology and its significance for higher semantic level tasks have attracted widespread attention from the society. Imaging with low noise, low blur, high spatial resolution, high temporal resolution, and high dynamic range is the basic task of computational photography, and its development is extremely important for other computer vision technologies.

然而经过数十年的发展,传统数字相机进入了人们生活的各个领域。随着近年来人工智能的研究热潮到来,传统数字相机在解决自动驾驶、无人机控制、智能机器人等应用领域的视觉问题时显得无能为力。其原因在于,这些新兴应用对于高速运动的捕捉具有很高的要求,而传统数字相机固定帧率的采样方式,在面对高速运动时只能产生模糊的图像或视频。近年来逐渐热门的仿照生物视网膜成像原理的神经形态脉冲传感器,以其高动态范围、高时间分辨率等优势,已进入众多视觉分析应用领域。然而,高噪声和低分辨率的缺点制约了脉冲相机在工业视觉领域的应用。However, after decades of development, traditional digital cameras have entered various fields of people's lives. With the advent of the research boom of artificial intelligence in recent years, traditional digital cameras are powerless to solve the visual problems in the application fields such as automatic driving, drone control, and intelligent robots. The reason is that these emerging applications have high requirements for the capture of high-speed motion, while the sampling method of fixed frame rate of traditional digital cameras can only produce blurred images or videos in the face of high-speed motion. In recent years, the neuromorphic pulse sensor imitating the principle of biological retinal imaging has gradually become popular. With its advantages of high dynamic range and high temporal resolution, it has entered many visual analysis applications. However, the shortcomings of high noise and low resolution restrict the application of pulse cameras in the field of industrial vision.

相比传统数字相机,脉冲相机抛弃了“帧”和“曝光”的概念,每个像素独立地对光枪进行感知并积分,当光强的积分超过阈值时发放一个脉冲,并以二进制的形式传递出来,0表示该时刻该像素没有脉冲,1表示该时刻该像素输出一个脉冲。随着光强变化不断产生的脉冲构成了脉冲序列。不同于传统2D的图片或视频序列,触发的脉冲时间序列以三维时空点云的形式呈现。由于脉冲相机特殊的成像原理,以及现有的传感器制造工艺水平的局限性,当前的脉冲相机存在噪声大,空间分辨率低等问题,这制约了脉冲相机在工业视觉领域的应用。当脉冲相机用于目标跟踪、物体检测等任务时,可能会造成特征失真或缺失,导致结果出现较大的退化。当脉冲相机用于高帧率图像生成、图像去模糊、图像高动态范围恢复的任务时,可能会出现细节纹理丢失,视觉体验较差等问题。Compared with traditional digital cameras, pulse cameras abandon the concept of "frame" and "exposure". Each pixel perceives and integrates the light gun independently. Passed out, 0 means that the pixel has no pulse at this moment, and 1 means that the pixel outputs a pulse at this moment. Pulse trains are generated continuously as the light intensity varies. Unlike traditional 2D pictures or video sequences, the triggered pulse time series is presented in the form of a 3D spatiotemporal point cloud. Due to the special imaging principle of the pulse camera and the limitation of the existing sensor manufacturing process level, the current pulse camera has problems such as high noise and low spatial resolution, which restrict the application of the pulse camera in the field of industrial vision. When impulsive cameras are used for tasks such as target tracking and object detection, features may be distorted or missing, leading to large degradation in results. When the pulse camera is used for the tasks of high frame rate image generation, image deblurring, and image high dynamic range restoration, problems such as loss of detail texture and poor visual experience may occur.

针对脉冲信号的去噪问题,目前还没有相关方法。而与脉冲相机同属于神经形态相机的事件相机的信号去噪问题,目前主要有以下三种方法来解决:方法1)基于局部时空块内事件信号的时空关联性来去除噪声事件,如Super Resolve Dynamic Scene fromContinuous Spike Streams。方法2)通过利用DVS中同步记录的视频帧和相机运动信息来预测局部时空块内的事件是否为噪声的概率,从而标注训练样本,进而基于神经网络学习事件噪声分类网络,然后对事件信号进行噪声去除,如Event Probability Mask(EPM)andEvent Denoising Convolutional Neural Network(EDnCNN)for NeuromorphicCameras)。方法3)搭建事件相机和传统相机的混合相机系统,通过计算时空梯度来建立图像信号和事件信号的关系,从而利用图像信号低噪声高分辨率的特征并通过引导滤波的方式来提升事件信号的质量(Joint Filtering of Intensity Images and NeuromorphicEvents for High-Resolution Noise-Robust Imaging)。但是这三种方法都存在自身的问题,方法1)和方法2)无法处理所拍运动场景较为复杂的事件信号,无法实现事件的超分辨率处理,且只能消除被标注为噪声的事件,不能对未触发的事件进行恢复;方法3)的性能依赖图像信号的质量,且需要计算局部时空块的光流信息,处理速度较慢。For the denoising problem of pulse signal, there is no related method at present. The signal denoising problem of the event camera, which belongs to the same neuromorphic camera as the pulse camera, currently has the following three methods to solve: Method 1) Remove noise events based on the spatio-temporal correlation of event signals in local spatio-temporal blocks, such as Super Resolve Dynamic Scene fromContinuous Spike Streams. Method 2) By using the video frame and camera motion information recorded synchronously in DVS to predict the probability of whether the event in the local spatio-temporal block is noise or not, so as to mark the training samples, and then learn the event noise classification network based on the neural network, and then analyze the event signal Noise removal, such as Event Probability Mask (EPM) and Event Denoising Convolutional Neural Network (EDnCNN) for NeuromorphicCameras). Method 3) Build a hybrid camera system of an event camera and a traditional camera, and establish the relationship between the image signal and the event signal by calculating the spatio-temporal gradient, so as to use the low-noise and high-resolution features of the image signal and improve the accuracy of the event signal through guided filtering. Quality (Joint Filtering of Intensity Images and Neuromorphic Events for High-Resolution Noise-Robust Imaging). However, these three methods have their own problems. Method 1) and method 2) cannot handle the complex event signals of the captured motion scenes, cannot achieve super-resolution processing of events, and can only eliminate events marked as noise. Untriggered events cannot be restored; the performance of method 3) depends on the quality of the image signal, and the optical flow information of the local space-time block needs to be calculated, and the processing speed is relatively slow.

针对脉冲信号的上述超分辨率问题,目前主要有两种方法来解决:方法4)基于图像强度和脉冲信号之间的时空关联,建立由运动光流引导完成的由低分辨率脉冲信号到高分辨率图像的超分辨率算法。方法5)利用脉冲信号仿真器形成数据集,基于深度学习网络来学习低分辨率脉冲信号到高分辨率图像的映射网络。然而,方法4)由于包含了光流估计和逐像素估计的过程,因此超分辨率速度极慢。方法5)由于已有的脉冲信号仿真器难以仿真真实脉冲信号的噪声和高时间精度,因此所训练的网络缺乏对真实脉冲信号的兼容性。For the above-mentioned super-resolution problem of pulse signal, there are currently two main methods to solve it: Method 4) Based on the spatio-temporal correlation between image intensity and pulse signal, establish a low-resolution pulse signal to high-resolution image guided by moving optical flow. Super-resolution algorithms for high-resolution images. Method 5) Use a pulse signal simulator to form a data set, and learn a mapping network from a low-resolution pulse signal to a high-resolution image based on a deep learning network. However, method 4) is extremely slow for super-resolution due to the process of optical flow estimation and pixel-by-pixel estimation. Method 5) Since the existing pulse signal simulators are difficult to simulate the noise and high time accuracy of real pulse signals, the trained network lacks compatibility with real pulse signals.

发明内容Contents of the invention

本发明针对现有技术的缺陷,提出一种基于真实样本采集和深度学习的神经形态脉冲信号去噪和超分辨方法,通过用脉冲相机同步拍摄不同分辨率的相同场景来得到用于网络训练的大量真实数据集,解决了脉冲信号仿真器无法准确生成事件数据的问题;同时利用3D-UNet网络模型来学习脉冲信号去噪和超分辨率重建的端到端的映射模型,避免了现有方法依赖视频帧和IMU信息,省去了求解光流信息的过程,节省了大量的运行时间。Aiming at the defects of the prior art, the present invention proposes a neuromorphic pulse signal denoising and super-resolution method based on real sample collection and deep learning, and obtains images for network training by synchronously shooting the same scene with different resolutions with a pulse camera A large number of real data sets solve the problem that the pulse signal simulator cannot accurately generate event data; at the same time, the 3D-UNet network model is used to learn the end-to-end mapping model of pulse signal denoising and super-resolution reconstruction, avoiding the dependence of existing methods Video frame and IMU information saves the process of solving optical flow information and saves a lot of running time.

为了实现上述目的,本发明提供如下技术方案:In order to achieve the above object, the present invention provides the following technical solutions:

一方面,本发明提供了一种神经形态脉冲信号去噪和超分辨方法,包括以下步骤:In one aspect, the present invention provides a neuromorphic pulse signal denoising and super-resolution method, comprising the following steps:

S1、训练数据采集:通过用脉冲相机同步拍摄不同空间分辨率的相同场景,从而得到真实的训练数据集,利用显示屏同步展示不同分辨率的运动视频,然后在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集;S1. Training data collection: The real training data set is obtained by synchronously shooting the same scene with different spatial resolutions with the pulse camera, and the display screen is used to synchronously display motion videos with different resolutions, and then intercept from the data captured by the pulse camera Generate pulse data with different resolutions, and finally form a complete RGB frame + multi-resolution pulse data set;

S2、脉冲数据转换:采用3D卷积神经网络对事件信息进行Encoder-Decoder处理;S2. Pulse data conversion: use 3D convolutional neural network to perform Encoder-Decoder processing on event information;

S3、脉冲去噪和空间上采样:基于L2范数的卷积神经网络在学习中得到事件信号的去噪模型,求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;S3. Pulse denoising and spatial upsampling: The convolutional neural network based on the L2 norm obtains the denoising model of the event signal during learning, obtains the optimal solution of the denoising model, and outputs denoising and upsampling in the form of 3D tensor The reconstructed image afterwards;

S4、脉冲信号重分布:将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。S4. Pulse signal redistribution: Redistribute the pulses of the reconstructed image in the form of 3D tensor by distributing time stamps at equal intervals to restore high-resolution pulse signals.

进一步地,步骤S1的每组拍摄到的脉冲数据包含的信息组合为{RGB帧,S1A,S1B,S2,时间戳序列},{RGB帧,S1A,S2,时间戳序列}的信息组合用来完成2倍超分辨网络的训练,{RGB帧,S1A,S1B,时间戳序列}的信息组合用来完成去噪网络的训练。Further, the information contained in each group of captured pulse data in step S1 is combined into {RGB frame, S1A, S1B, S2, time stamp sequence}, and the information combination of {RGB frame, S1A, S2, time stamp sequence} is used to Complete the training of the 2-fold super-resolution network, and the information combination of {RGB frame, S1A, S1B, time stamp sequence} is used to complete the training of the denoising network.

进一步地,步骤S1的原始脉冲数据中,每一秒包含40000个H×W的0-1矩阵,以25μs的时间精度记录每个像素上是否有脉冲,0表示没有脉冲,1表示有脉冲。Further, the original pulse data in step S1 contains 40,000 H×W 0-1 matrices per second, and records whether there is a pulse on each pixel with a time accuracy of 25 μs. 0 means no pulse, and 1 means there is a pulse.

进一步地,步骤S2中,Encoder-Decoder处理前先对原始的脉冲信号利用基于脉冲间隔的方法进行图像重建预处理。Further, in step S2, before the Encoder-Decoder process, the original pulse signal is pre-processed for image reconstruction using a pulse interval-based method.

进一步地,预处理过程为:利用每个像素上前后两个相邻脉冲的时间间隔来表示该时刻光强的倒数,从而形成每个时刻初步的重建图像,即每秒重建出40000帧初步的图像,以作为后面网络的输入。Further, the preprocessing process is: use the time interval of two adjacent pulses on each pixel to represent the reciprocal of the light intensity at that moment, so as to form a preliminary reconstructed image at each moment, that is, reconstruct 40,000 frames of preliminary images per second. image, as the input of the subsequent network.

进一步地,步骤S3中去噪模型最优解表示为:Further, the optimal solution of the denoising model in step S3 is expressed as:

Figure BDA0003970856450000041
Figure BDA0003970856450000041

其中S是受到噪声污染的输入和输出训练数据,Ω是求得的去噪模型。where S is the input and output training data contaminated by noise, and Ω is the obtained denoising model.

进一步地,步骤S3利用3D UNet的结构来同时实现去噪和超分辨任务,在2倍超分辨网络中,3D UNet每个层级增加了3D反卷积层来进行跨层级特征融合,以实现分辨率的放大。Further, step S3 utilizes the structure of 3D UNet to achieve denoising and super-resolution tasks at the same time. In the 2x super-resolution network, 3D deconvolution layers are added to each level of 3D UNet for cross-level feature fusion to achieve resolution rate magnification.

进一步地,在训练期间,从采集到的数据中生成了24000个LR-HR脉冲对作为训练集;Benchsize设定为8,并训练了100个epoch;优化器为ADAM,损失函数loss由权重比为1:0.005的Charbonnier loss和TV loss组合而成;初始学习率为0.001,每50个周期衰减0.5倍。Further, during the training period, 24,000 LR-HR pulse pairs were generated from the collected data as the training set; the benchmark size was set to 8, and 100 epochs were trained; the optimizer was ADAM, and the loss function loss was determined by the weight ratio It is a combination of Charbonnier loss and TV loss of 1:0.005; the initial learning rate is 0.001, and the decay is 0.5 times every 50 cycles.

另一方面,本发明还提供了一种神经形态脉冲信号去噪和超分辨装置,包括显示屏和脉冲相机,以及以下模块以实现上述任一项所述的方法:On the other hand, the present invention also provides a neuromorphic pulse signal denoising and super-resolution device, including a display screen and a pulse camera, and the following modules to realize the method described in any one of the above:

训练数据采集模块:用于通过脉冲相机同步拍摄不同空间分辨率的相同场景从而得到真实的训练数据集,并利用显示屏同步展示不同分辨率的运动视频在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集;Training data acquisition module: used to capture the same scene with different spatial resolutions synchronously through the pulse camera to obtain a real training data set, and use the display screen to display motion videos with different resolutions synchronously, and intercept different data from the data captured by the pulse camera resolution pulse data, and finally form a complete RGB frame + multi-resolution pulse data set;

脉冲数据转换模块:用于采用3D卷积神经网络对事件信息进行Encoder-Decoder处理;Pulse data conversion module: used for Encoder-Decoder processing of event information using 3D convolutional neural network;

脉冲去噪和空间上采样模块:用于获得事件信号的去噪模型并求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;Pulse denoising and spatial upsampling module: used to obtain the denoising model of the event signal and obtain the optimal solution of the denoising model, and output the reconstructed image in the form of 3D tensor after denoising and upsampling;

脉冲信号重分布模块:用于将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。Pulse signal redistribution module: It is used to redistribute the pulses of the reconstructed image in the form of 3D tensor by distributing time stamps at equal intervals, and restore high-resolution pulse signals.

又一方面,本发明还提供了一种设备,包括处理器、通信接口、存储器和通信总线,所述处理器、所述通信接口、所述存储器通过所述通信总线完成相互间的通信;其中:In another aspect, the present invention also provides a device, including a processor, a communication interface, a memory, and a communication bus, and the processor, the communication interface, and the memory complete mutual communication through the communication bus; wherein :

所述存储器,用于存放计算机程序;The memory is used to store computer programs;

所述处理器,用于执行所述存储器上所存放的程序时,实现上述任一项所述的方法。The processor is configured to implement the method described in any one of the above when executing the program stored in the memory.

与现有技术相比,本发明的有益效果为:Compared with prior art, the beneficial effect of the present invention is:

本发明的神经形态脉冲信号去噪和超分辨方法和装置,通过在显示屏中设置不同分辨率的相同视频,并用脉冲相机来拍摄显示屏,从而得到不同分辨率的真实脉冲数据对,用实拍数据集作为训练集,避免了由于仿真数据与真实数据的差距太大而导致训练后的网络对真实数据不兼容的问题,解决了脉冲信号仿真器无法准确生成事件数据的问题。同时使用深度学习的方法,利用3D-UNet网络模型来学习脉冲信号去噪和超分辨率重建的端到端的映射模型,在输入只有脉冲序列的情况下,就可以有效实现对事件的去噪和超分辨任务,避免了现有方法依赖视频帧和IMU信息,省去了求解光流信息的过程,节省了大量的运行时间,极大的提升了处理速度。In the neuromorphic pulse signal denoising and super-resolution method and device of the present invention, the same video with different resolutions is set in the display screen, and the pulse camera is used to shoot the display screen, thereby obtaining real pulse data pairs with different resolutions. Taking the data set as the training set avoids the problem that the trained network is not compatible with the real data due to the large gap between the simulation data and the real data, and solves the problem that the pulse signal simulator cannot accurately generate event data. At the same time, using the method of deep learning, the 3D-UNet network model is used to learn the end-to-end mapping model of pulse signal denoising and super-resolution reconstruction. When the input is only the pulse sequence, the denoising and event denoising can be effectively realized. The super-resolution task avoids the dependence of existing methods on video frames and IMU information, saves the process of solving optical flow information, saves a lot of running time, and greatly improves the processing speed.

附图说明Description of drawings

为了更清楚地说明本申请实施例或现有技术中的技术方案,下面将对实施例中所需要使用的附图作简单地介绍。显而易见地,下面描述中的附图仅仅是本发明中记载的一些实施例,对于本领域普通技术人员来讲,还可以根据这些附图获得其他的附图。In order to more clearly illustrate the embodiments of the present application or the technical solutions in the prior art, the following will briefly introduce the drawings required in the embodiments. Apparently, the drawings in the following description are only some embodiments described in the present invention, and those skilled in the art can also obtain other drawings according to these drawings.

图1为本发明实施例提供的神经形态脉冲信号去噪和超分辨方法流程图。Fig. 1 is a flowchart of a neuromorphic pulse signal denoising and super-resolution method provided by an embodiment of the present invention.

图2为本发明实施例提供的训练数据采集装置示意图。Fig. 2 is a schematic diagram of a training data collection device provided by an embodiment of the present invention.

图3为本发明实施例提供的拍摄系统示意图。Fig. 3 is a schematic diagram of a shooting system provided by an embodiment of the present invention.

图4为本发明实施例提供的显示器中的三个视窗视角。Fig. 4 shows three viewing angles of windows in the display provided by the embodiment of the present invention.

具体实施方式Detailed ways

为了更好地理解本技术方案,下面结合附图对本发明的方法做详细的说明。In order to better understand the technical solution, the method of the present invention will be described in detail below in conjunction with the accompanying drawings.

本发明的神经形态脉冲信号去噪和超分辨方法,如图1所示,包括训练数据采集、脉冲数据转换、脉冲信号空间上采样和脉冲信号重分布步骤,具体如下:The neuromorphic pulse signal denoising and super-resolution method of the present invention, as shown in Figure 1, includes training data acquisition, pulse data conversion, pulse signal space upsampling and pulse signal redistribution steps, specifically as follows:

S1、训练数据采集S1. Training data collection

由于现有的仿真器无法精准的仿真事件数据的分布模型,因此,本发明提出通过用脉冲相机(如Vidar)同步拍摄不同空间分辨率的相同场景,从而得到真实的“低分辨率-高分辨率”训练数据集,利用显示屏同步展示不同分辨率的运动视频,然后在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集。每组拍摄到的脉冲数据包含的信息组合为{RGB帧,S1A,S1B,S2,时间戳序列},{RGB帧,S1A,S2,时间戳序列}的信息组合用来完成2倍超分辨网络的训练,{RGB帧,S1A,S1B,时间戳序列}的信息组合用来完成去噪网络的训练。Since the existing emulators cannot accurately simulate the distribution model of event data, the present invention proposes to use a pulse camera (such as Vidar) to simultaneously capture the same scene with different spatial resolutions, thereby obtaining a real "low resolution-high resolution Rate" training data set, using the display screen to display motion videos of different resolutions synchronously, and then intercepting pulse data of different resolutions from the data captured by the pulse camera, and finally forming a complete RGB frame + multi-resolution pulse data set. The combination of information contained in each group of captured pulse data is {RGB frame, S1A, S1B, S2, time stamp sequence}, and the information combination of {RGB frame, S1A, S2, time stamp sequence} is used to complete the 2 times super-resolution network The training of {RGB frame, S1A, S1B, timestamp sequence} information combination is used to complete the training of the denoising network.

S2、脉冲数据转换S2, pulse data conversion

由于本发明采用3D卷积神经网络对事件信息进行Encoder-Decoder处理,为了是输入信息中包含更多的空间图像信息,本发明首先对原始的脉冲信号进行预处理。在原始脉冲数据中,每一秒包含40000个H×W的0-1矩阵,以25μs的时间精度记录每个像素上是否有脉冲(0表示没有脉冲,1表示有脉冲)。本发明利用TFI图像重建方法,利用每个像素上前后两个相邻脉冲的时间间隔的来表示该时刻光强的倒数(如图4中(a)所示),从而形成每个时刻初步的重建图像,即每秒重建出40000帧初步的图像,以作为后面网络的输入。作为ground truth的高分辨率脉冲序列也需要做这一预处理操作。Since the present invention uses a 3D convolutional neural network to perform Encoder-Decoder processing on event information, in order to include more spatial image information in the input information, the present invention first preprocesses the original pulse signal. In the original pulse data, each second contains 40,000 H×W 0-1 matrices, and records whether there is a pulse on each pixel with a time accuracy of 25 μs (0 means no pulse, 1 means there is a pulse). The present invention utilizes the TFI image reconstruction method to represent the reciprocal of the light intensity at this moment by using the time interval of two adjacent pulses before and after each pixel (as shown in (a) in Figure 4), thereby forming a preliminary Reconstruct the image, that is, reconstruct 40,000 frames of preliminary images per second as the input of the subsequent network. The high-resolution pulse sequence as the ground truth also needs to do this preprocessing operation.

S3、脉冲去噪和空间上采样S3, pulse denoising and spatial upsampling

虽然步骤S1中采集到的低分辨率和高分辨率数据都是受到噪声污染的,但是由于脉冲信号的噪声基本符合高斯分布的规律,因此,本发明采用基于L2范数的卷积神经网络在学习中得到事件信号的去噪模型,求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;最优解求解公式如下:Although the low-resolution and high-resolution data collected in step S1 are polluted by noise, since the noise of the pulse signal basically conforms to the law of Gaussian distribution, the present invention adopts the convolutional neural network based on the L2 norm in the The denoising model of the event signal is obtained during the learning, the optimal solution of the denoising model is obtained, and the reconstructed image after denoising and upsampling in the form of 3D tensor is output; the optimal solution solution formula is as follows:

Figure BDA0003970856450000071
Figure BDA0003970856450000071

其中S是受到噪声污染的输入和输出训练数据,Ω是求得的去噪模型。where S is the input and output training data contaminated by noise, and Ω is the obtained denoising model.

因此,本发明仅利用真实的有噪声数据就可以训练出可去噪的网络模型。如图4所示,本发明利用3D UNet的结构来同时实现去噪和超分辨任务。Therefore, the present invention can train a denoising network model only by using real noisy data. As shown in Figure 4, the present invention utilizes the structure of 3D UNet to simultaneously achieve denoising and super-resolution tasks.

S4、脉冲信号重分布S4. Pulse signal redistribution

由于网络输出的去噪和上采样之后的事件以3D tensor的形式输出,因此,必须将tensor中的数转化成脉冲的表达形成,才能最终实现输入是事件输出也是事件的功能。由于预处理的时候本发明基于脉冲间隔初步重建了图像,因此输出图像的每个像素的值表示该像素所在时间点在时间轴上前后两个脉冲的间隔,基于这一原理,可以通过输出结果还原二进制模式的脉冲信号。具体地,本发明将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。Since the denoising and upsampling events of the network output are output in the form of a 3D tensor, the number in the tensor must be converted into a pulse expression to finally realize the function that the input is an event and the output is also an event. Since the present invention initially reconstructs the image based on the pulse interval during preprocessing, the value of each pixel of the output image represents the interval of two pulses before and after the time point of the pixel on the time axis. Based on this principle, the output result can be Restores the pulse signal in binary mode. Specifically, the present invention redistributes the pulses of the reconstructed images in the form of 3D tensor by distributing time stamps at equal intervals to restore high-resolution pulse signals.

本发明采用步骤S1中拍摄到的不同分辨率的真实数据对神经网络进行训练,具体训练过程如下:The present invention adopts the real data of different resolutions captured in step S1 to train the neural network, and the specific training process is as follows:

(1)拍摄训练数据集(1) Shooting training data set

a)在网上下载公开的高速慢放视频数据集。a) Download the public high-speed slow-motion video dataset from the Internet.

b)重新合成新的视频,视频的每一帧包含多个时间同步内容相同但空间分辨率不同的局部视窗,对应1倍,2倍分辨率(如果后续有更大分辨率的脉冲相机,也可以增加到4倍和8倍)。b) Re-synthesize a new video, each frame of the video contains multiple local windows with the same time synchronization content but different spatial resolutions, corresponding to 1 times and 2 times the resolution (if there is a pulse camera with a larger resolution in the future, also can be increased to 4 times and 8 times).

c)搭建如下图所示的数据采集装置,如图2所示,包括显示器和神经形态脉冲相机(或时间相机),还可包括水平仪和瞄准装置等,如图3所示,要确保事件拍摄的视角正对且平行于显示器。c) Build a data acquisition device as shown in the figure below, as shown in Figure 2, including a display and a neuromorphic pulse camera (or time camera), and may also include a level and aiming device, as shown in Figure 3, to ensure event shooting The viewing angle is directly and parallel to the display.

d)拍摄数据:开始进行拍摄时,除显示屏外,室内环境光源全部关闭,以减小外部环境对数据拍摄的影响。然后拍摄处理后的视频。d) Shooting data: When shooting starts, except for the display screen, all indoor ambient light sources are turned off to reduce the impact of the external environment on data shooting. Then take the processed video.

e)处理脉冲数据:对每一组拍摄到的脉冲数据,依次截取局部区域,形成独立的不同分辨率的脉冲数据,分别对应为不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集,每组包含的信息组合为{RGB帧,S1A,S1B,S2,时间戳序列}。在下面的网络训练中,本发明采用{RGB帧,S1A,S2,时间戳序列}的组合来完成2倍超分辨网络的训练,用{RGB帧,S1A,S1B,时间戳序列}组合来完成去噪网络的训练。e) Processing pulse data: For each group of captured pulse data, intercept local areas in turn to form independent pulse data with different resolutions, corresponding to pulse data with different resolutions, and finally form a complete RGB frame + multiple Datasets of resolution pulses, each group contains information combined as {RGB frame, S1A, S1B, S2, timestamp sequence}. In the following network training, the present invention uses the combination of {RGB frame, S1A, S2, time stamp sequence} to complete the training of the 2 times super-resolution network, and completes it with the combination of {RGB frame, S1A, S1B, time stamp sequence} Training of the denoising network.

(2)神经网络的训练(2) Training of neural network

a)事件信息预处理:在训练去噪和上采样网络时,LR(低动态范围)和HR(高动态范围)事件都被合并到一个32通道事件tensor中以完成有监督训练。在每个通道中每个像素对该时间区间内的事件进行求和。我们还尝试了不同的通道号,发现32个通道具有最佳性能。a) Event information preprocessing: When training denoising and upsampling networks, both LR (low dynamic range) and HR (high dynamic range) events are merged into a 32-channel event tensor to complete supervised training. The events in that time interval are summed per pixel in each channel. We also experimented with different channel numbers and found that 32 channels gave the best performance.

b)整个网络主要模块是3D UNet,在2倍超分辨网络中,网络为3D UNet每个层级的跳跃连接增加了3D反卷积层,以及跨层级特征融合,以实现分辨率的放大。在去噪网络中,不需要添加上述的反卷积网络。输入和输出的脉冲信号先采用基于脉冲间隔的方法进行初步的图像重建预处理,输出的tensor要四舍五入取整数值,便得到了超分去噪之后的重建图像,然后通过均等间隔分配时间戳的方式进行脉冲重新分配,可以还原出高分辨率的脉冲信号。b) The main module of the entire network is 3D UNet. In the 2x super-resolution network, the network adds a 3D deconvolution layer to the skip connection of each level of 3D UNet, and cross-level feature fusion to achieve resolution amplification. In the denoising network, there is no need to add the above-mentioned deconvolution network. The input and output pulse signals are pre-processed for preliminary image reconstruction based on the pulse interval method, and the output tensor is rounded to an integer value to obtain the reconstructed image after super-resolution and denoising, and then distribute time stamps at equal intervals. The pulse redistribution is carried out in this way, and the high-resolution pulse signal can be restored.

c)在训练期间,我们从采集到的数据中生成了24000个LR-HR脉冲对作为训练集。Benchsize设定为8,并训练了100个epoch。优化器为ADAM,loss为权重比为1:0.005的Charbonnier loss和TV loss组合而成的损失函数,初始学习率为0.001,每50个周期衰减0.5倍。使用PyTorch 1.6和NVIDIA 2080Ti GPU共花费了大约12个小时。c) During training, we generated 24,000 LR-HR pulse pairs from the collected data as a training set. Benchsize is set to 8 and trained for 100 epochs. The optimizer is ADAM, the loss is a loss function composed of Charbonnier loss and TV loss with a weight ratio of 1:0.005, the initial learning rate is 0.001, and the decay is 0.5 times every 50 cycles. It took a total of about 12 hours using PyTorch 1.6 and an NVIDIA 2080Ti GPU.

d)在测试过程中,仅需要输入实拍的脉冲序列就可以进行去噪和超分辨。d) During the test, denoising and super-resolution can be performed only by inputting the real shot pulse sequence.

相应于上述本发明实施例提供的方法,本发明提供了一种神经形态脉冲信号去噪和超分辨装置,包括显示屏和脉冲相机,以及以下模块以实现上述实施例任一项所述的方法:Corresponding to the method provided by the above-mentioned embodiments of the present invention, the present invention provides a neuromorphic pulse signal denoising and super-resolution device, including a display screen, a pulse camera, and the following modules to implement the method described in any one of the above-mentioned embodiments :

训练数据采集模块:用于通过脉冲相机同步拍摄不同空间分辨率的相同场景从而得到真实的训练数据集,并利用显示屏同步展示不同分辨率的运动视频在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集;Training data acquisition module: used to capture the same scene with different spatial resolutions synchronously through the pulse camera to obtain a real training data set, and use the display screen to display motion videos with different resolutions synchronously, and intercept different data from the data captured by the pulse camera resolution pulse data, and finally form a complete RGB frame + multi-resolution pulse data set;

脉冲数据转换模块:用于采用3D卷积神经网络对事件信息进行Encoder-Decoder处理;Pulse data conversion module: used for Encoder-Decoder processing of event information using 3D convolutional neural network;

脉冲去噪和空间上采样模块:用于获得事件信号的去噪模型并求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;Pulse denoising and spatial upsampling module: used to obtain the denoising model of the event signal and obtain the optimal solution of the denoising model, and output the reconstructed image in the form of 3D tensor after denoising and upsampling;

脉冲信号重分布模块:用于将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。Pulse signal redistribution module: It is used to redistribute the pulses of the reconstructed image in the form of 3D tensor by distributing time stamps at equal intervals, and restore high-resolution pulse signals.

本发明的方法或装置在应用时,可采用以下步骤:When the method or device of the present invention is applied, the following steps can be adopted:

a)在网上下载公开的高速慢放视频数据集,该数据集包含45个视频序列对应所有彩色帧,帧率调整为30fps,每个帧的空间分辨率为1280×720。a) Download the public high-speed slow-motion video dataset from the Internet, which contains 45 video sequences corresponding to all color frames, the frame rate is adjusted to 30fps, and the spatial resolution of each frame is 1280×720.

b)如图4为例,重新合成对应的新的45个视频,每个视频的帧率调整360fps,分辨率为1280×720。视频的每一帧包含3个时间同步内容相同但空间分辨率不同的局部视窗,其中最大的一个视窗分辨率为720×720,最小的两个视窗分辨率为360×360。为方便在拍摄时有充足的时间播放视频且开始相机拍摄,每个视频的开头和结尾帧都静置两秒。b) As shown in Figure 4 as an example, 45 corresponding new videos are re-synthesized, the frame rate of each video is adjusted to 360fps, and the resolution is 1280×720. Each frame of the video contains three local windows with the same time synchronization content but different spatial resolutions. The resolution of the largest window is 720×720, and the resolution of the smallest two windows is 360×360. In order to allow enough time to play the video and start the camera capture when shooting, the beginning and end frames of each video are paused for two seconds.

c)搭建系统如图2所示的数据采集装置:本实施例中,显示屏的型号为ASUSPG259QNR,分辨率为1920×1080,刷新率为360Hz。型号为VidarOne,分辨率为400×250,镜头为F/1.4的事件相机水平放置在显示屏的正前方约180cm处。为了确保拍摄的视角正对且平行于显示器。c) Building the data collection device shown in Figure 2: In this embodiment, the model of the display screen is ASUSPG259QNR, the resolution is 1920×1080, and the refresh rate is 360 Hz. The model is VidarOne, the resolution is 400×250, and the event camera with F/1.4 lens is placed horizontally about 180cm in front of the display screen. In order to ensure that the shooting angle is facing and parallel to the monitor.

d)相机视角和显示屏显示区域配准:如图3所示,在配准时,显示器中心点设置十字靶心,相机正前方(由面包板限定)放置瞄准点,并通过三点一线的方法最终确保相机平面和显示器平面平行,且中心点的连线垂直于两个平面。并通过在相机和显示器上放置水平仪来限制相机和显示屏的水平旋转角相同。从而确保相机视角和显示屏显示区域完全配准。配准后事件相机的视角如图4所示,视角内分别对应了显示器中的三个视窗的区域,其中最大视窗对应在脉冲相机分辨率为240×240,最小的两个视窗对应的分辨率为120×120。d) Registration of the camera angle of view and the display area of the display screen: as shown in Figure 3, during registration, a cross bullseye is set at the center of the display, an aiming point is placed directly in front of the camera (limited by the breadboard), and the three-point-one-line method is used to Finally, make sure that the camera plane and the display plane are parallel, and the line connecting the center point is perpendicular to the two planes. And limit the horizontal rotation angle of the camera and the display to be the same by placing a spirit level on the camera and the display. This ensures perfect registration of the camera viewing angle and the display area of the display. The angle of view of the event camera after registration is shown in Figure 4. The angle of view corresponds to the area of the three windows in the display. The largest window corresponds to the resolution of the pulse camera at 240×240, and the smallest two windows correspond to the resolution of 120×120.

e)开始进行拍摄时,除显示屏外,室内环境光源全部关闭,以减小外部环境对数据拍摄的影响。然后依次拍摄45个处理后的视频。e) When shooting starts, except for the display screen, all indoor ambient light sources are turned off to reduce the impact of the external environment on data shooting. Then 45 processed videos were taken sequentially.

f)处理脉冲数据:实现完成脉冲数据和彩色视频的时间配准。利用在拍摄是预留的标记来完成开始和结束的时间点对齐。对每一组拍摄到的事件数据,依照上图中空间坐标位置,依次截取局部区域,形成独立的不同分辨率的事件数据,分别对应为S1A,S1B,S2三个数据。本发明最终形成一个完整的45组RGB帧+多分辨率脉冲的数据集,每组包含的信息组合为{RGB帧,S1A,S1B,S2,时间戳序列}。在下面的网络训练中,本发明采用{S1A,S2,时间戳序列}的组合来完成2倍超分辨网络的训练,用{S1A,S1B,时间戳序列}组合来完成去噪网络的训练。f) Process pulse data: realize the time registration of pulse data and color video. Start and end timing alignment is done using the markers reserved during the shoot. For each group of captured event data, according to the spatial coordinate position in the above figure, the local area is sequentially intercepted to form independent event data with different resolutions, which correspond to three data of S1A, S1B, and S2 respectively. The present invention finally forms a complete data set of 45 groups of RGB frames + multi-resolution pulses, and the information contained in each group is combined into {RGB frame, S1A, S1B, S2, time stamp sequence}. In the following network training, the present invention uses the combination of {S1A, S2, time stamp sequence} to complete the training of the 2-fold super-resolution network, and uses the combination of {S1A, S1B, time stamp sequence} to complete the training of the denoising network.

综上,相比现有技术,本发明先用实拍数据集作为训练集,避免了由于仿真数据与真实数据的差距太大而导致训练后的网络对真实数据不兼容的问题。并通过在显示屏中设置不同分辨率的相同视频,并用脉冲相机来拍摄显示屏,从而得到不同分辨率的真实脉冲数据对。同时使用深度学习的方法,在输入只有脉冲序列的情况下,就可以有效实现对事件的去噪和超分辨任务,且极大的提升了处理速度。To sum up, compared with the prior art, the present invention first uses the real-shot data set as the training set, which avoids the problem that the trained network is not compatible with the real data due to the large gap between the simulation data and the real data. And by setting the same video with different resolutions in the display screen, and using the pulse camera to shoot the display screen, real pulse data pairs with different resolutions can be obtained. At the same time, using the method of deep learning, when the input is only the pulse sequence, the task of denoising and super-resolution of events can be effectively realized, and the processing speed is greatly improved.

相应于上述本发明实施例提供的方法,本发明实施例还提供了一种电子设备,包括:处理器、通信接口、存储器和通信总线,其中,处理器、通信接口、存储器通过通信总线完成相互间的通信;Corresponding to the methods provided by the above-mentioned embodiments of the present invention, the embodiments of the present invention also provide an electronic device, including: a processor, a communication interface, a memory, and a communication bus, wherein the processor, the communication interface, and the memory complete the mutual communication via the communication bus. communication between

存储器,用于存放计算机程序;memory for storing computer programs;

处理器,用于执行存储器上所存放的程序时,实现上述本发明实施例提供的方法流程。The processor is configured to implement the flow of the method provided by the foregoing embodiments of the present invention when executing the program stored in the memory.

上述控制设备设备中提到的通信总线可以是外设部件互连标准(PeripheralComponent Interconnect,PCI)总线或扩展工业标准结构(Extended Industry StandardArchitecture,EISA)总线等。该通信总线可以分为地址总线、数据总线、控制总线等。为便于表示,图中仅用一条粗线表示,但并不表示仅有一根总线或一种类型的总线。The communication bus mentioned in the above control device may be a Peripheral Component Interconnect (PCI) bus or an Extended Industry Standard Architecture (EISA) bus or the like. The communication bus can be divided into an address bus, a data bus, a control bus, and the like. For ease of representation, only one thick line is used in the figure, but it does not mean that there is only one bus or one type of bus.

通信接口用于上述电子设备与其他设备之间的通信。The communication interface is used for communication between the electronic device and other devices.

存储器可以包括随机存取存储器(Random Access Memory,RAM),也可以包括非易失性存储器(Non-Volatile Memory,NVM),例如至少一个磁盘存储器。可选的,存储器还可以是至少一个位于远离前述处理器的存储装置。The memory may include a random access memory (Random Access Memory, RAM), and may also include a non-volatile memory (Non-Volatile Memory, NVM), such as at least one disk memory. Optionally, the memory may also be at least one storage device located far away from the aforementioned processor.

上述的处理器可以是通用处理器,包括中央处理器(Central Processing Unit,CPU)、网络处理器(Network Processor,NP)等;还可以是数字信号处理器(Digital SignalProcessing,DSP)、专用集成电路(Application Specific Integrated Circuit,ASIC)、现场可编程门阵列(Field-Programmable Gate Array,FPGA)或者其他可编程逻辑器件、分立门或者晶体管逻辑器件、分立硬件组件。The above-mentioned processor can be a general-purpose processor, including a central processing unit (Central Processing Unit, CPU), a network processor (Network Processor, NP), etc.; it can also be a digital signal processor (Digital Signal Processing, DSP), an application-specific integrated circuit (Application Specific Integrated Circuit, ASIC), Field-Programmable Gate Array (Field-Programmable Gate Array, FPGA) or other programmable logic devices, discrete gate or transistor logic devices, discrete hardware components.

在本发明提供的又一实施例中,还提供了一种计算机可读存储介质,该计算机可读存储介质内存储有计算机程序,所述计算机程序被处理器执行时实现上述本发明实施例提供的任一方法的步骤。In yet another embodiment provided by the present invention, a computer-readable storage medium is also provided, and a computer program is stored in the computer-readable storage medium, and when the computer program is executed by a processor, the above-mentioned embodiment of the present invention is implemented. steps in any of the methods.

在本发明提供的又一实施例中,还提供了一种包含指令的计算机程序产品,当其在计算机上运行时,使得计算机执行上述本发明实施例提供的任一方法的步骤。In yet another embodiment provided by the present invention, a computer program product containing instructions is also provided, and when it is run on a computer, it causes the computer to execute the steps of any method provided by the above-mentioned embodiments of the present invention.

在上述实施例中,可以全部或部分地通过软件、硬件、固件或者其任意组合来实现。当使用软件实现时,可以全部或部分地以计算机程序产品的形式实现。所述计算机程序产品包括一个或多个计算机指令。在计算机上加载和执行所述计算机程序指令时,全部或部分地产生按照本发明实施例所述的流程或功能。所述计算机可以是通用计算机、专用计算机、计算机网络、或者其他可编程装置。所述计算机指令可以存储在计算机可读存储介质中,或者从一个计算机可读存储介质向另一个计算机可读存储介质传输,例如,所述计算机指令可以从一个网站站点、计算机、服务器或数据中心通过有线(例如同轴电缆、光纤、数字终端设备线(DSL))或无线(例如红外、无线、微波等)方式向另一个网站站点、计算机、服务器或数据中心进行传输。所述计算机可读存储介质可以是计算机能够存取的任何可用介质或者是包含一个或多个可用介质集成的服务器、数据中心等数据存储设备。所述可用介质可以是磁性介质,(例如,软盘、硬盘、磁带)、光介质(例如,DVD)、或者半导体介质(例如固态硬盘Solid State Disk(SSD))等。In the above embodiments, all or part of them may be implemented by software, hardware, firmware or any combination thereof. When implemented using software, it may be implemented in whole or in part in the form of a computer program product. The computer program product includes one or more computer instructions. When the computer program instructions are loaded and executed on the computer, the processes or functions according to the embodiments of the present invention will be generated in whole or in part. The computer can be a general purpose computer, a special purpose computer, a computer network, or other programmable devices. The computer instructions may be stored in or transmitted from one computer-readable storage medium to another computer-readable storage medium, for example, the computer instructions may be transmitted from a website, computer, server or data center Transmission to another website site, computer, server, or data center by wired (eg, coaxial cable, optical fiber, digital terminal equipment line (DSL)) or wireless (eg, infrared, wireless, microwave, etc.). The computer-readable storage medium may be any available medium that can be accessed by a computer, or a data storage device such as a server or a data center integrated with one or more available media. The available medium may be a magnetic medium (for example, a floppy disk, a hard disk, or a magnetic tape), an optical medium (for example, DVD), or a semiconductor medium (for example, a Solid State Disk (SSD)).

需要说明的是,在本文中,诸如第一和第二等之类的关系术语仅仅用来将一个实体或者操作与另一个实体或操作区分开来,而不一定要求或者暗示这些实体或操作之间存在任何这种实际的关系或者顺序。而且,术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含,从而使得包括一系列要素的过程、方法、物品或者设备不仅包括那些要素,而且还包括没有明确列出的其他要素,或者是还包括为这种过程、方法、物品或者设备所固有的要素。在没有更多限制的情况下,由语句“包括一个……”限定的要素,并不排除在包括所述要素的过程、方法、物品或者设备中还存在另外的相同要素。It should be noted that in this article, relational terms such as first and second are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that there is a relationship between these entities or operations. There is no such actual relationship or order between them. Furthermore, the term "comprises", "comprises" or any other variation thereof is intended to cover a non-exclusive inclusion such that a process, method, article, or apparatus comprising a set of elements includes not only those elements, but also includes elements not expressly listed. other elements of or also include elements inherent in such a process, method, article, or device. Without further limitations, an element defined by the phrase "comprising a ..." does not exclude the presence of additional identical elements in the process, method, article or apparatus comprising said element.

本说明书中的各个实施例均采用相关的方式描述,各个实施例之间相同相似的部分互相参见即可,每个实施例重点说明的都是与其他实施例的不同之处。尤其,对于装置实施例、电子设备实施例、计算机可读存储介质实施例和计算机程序产品实施例而言,由于其基本相似于方法实施例,所以描述的比较简单,相关之处参见方法实施例的部分说明即可。Each embodiment in this specification is described in a related manner, the same and similar parts of each embodiment can be referred to each other, and each embodiment focuses on the differences from other embodiments. In particular, for apparatus embodiments, electronic device embodiments, computer-readable storage medium embodiments, and computer program product embodiments, since they are basically similar to method embodiments, the description is relatively simple. For related information, refer to method embodiments Part of the description is sufficient.

以上所述仅为本发明的较佳实施例而已,并非用于限定本发明的保护范围。凡在本发明的精神和原则之内所作的任何修改、等同替换、改进等,均包含在本发明的保护范围内。The above descriptions are only preferred embodiments of the present invention, and are not intended to limit the protection scope of the present invention. Any modification, equivalent replacement, improvement, etc. made within the spirit and principles of the present invention are included in the protection scope of the present invention.

Claims (10)

1.一种神经形态脉冲信号去噪和超分辨方法,其特征在于,包括以下步骤:1. a neuromorphic pulse signal denoising and super-resolution method, is characterized in that, comprises the following steps: S1、训练数据采集:通过用脉冲相机同步拍摄不同空间分辨率的相同场景,从而得到真实的训练数据集,利用显示屏同步展示不同分辨率的运动视频,然后在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集;S1. Training data collection: The real training data set is obtained by synchronously shooting the same scene with different spatial resolutions with the pulse camera, and the display screen is used to synchronously display motion videos with different resolutions, and then intercept from the data captured by the pulse camera Generate pulse data with different resolutions, and finally form a complete RGB frame + multi-resolution pulse data set; S2、脉冲数据转换:采用3D卷积神经网络对事件信息进行Encoder-Decoder处理;S2. Pulse data conversion: use 3D convolutional neural network to perform Encoder-Decoder processing on event information; S3、脉冲去噪和空间上采样:基于L2范数的卷积神经网络在学习中得到事件信号的去噪模型,求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;S3. Pulse denoising and spatial upsampling: The convolutional neural network based on the L2 norm obtains the denoising model of the event signal during learning, obtains the optimal solution of the denoising model, and outputs denoising and upsampling in the form of 3D tensor The reconstructed image afterwards; S4、脉冲信号重分布:将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。S4. Pulse signal redistribution: Redistribute the pulses of the reconstructed image in the form of 3D tensor by distributing time stamps at equal intervals to restore high-resolution pulse signals. 2.根据权利要求1所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,步骤S1的每组拍摄到的脉冲数据包含的信息组合为{RGB帧,S1A,S1B,S2,时间戳序列},{RGB帧,S1A,S2,时间戳序列}的信息组合用来完成2倍超分辨网络的训练,{RGB帧,S1A,S1B,时间戳序列}的信息组合用来完成去噪网络的训练。2. The neuromorphic pulse signal denoising and super-resolution method according to claim 1, wherein the information contained in the pulse data captured by each group of step S1 is combined into {RGB frame, S1A, S1B, S2, time Stamp sequence}, {RGB frame, S1A, S2, time stamp sequence} information combination is used to complete the training of 2 times super-resolution network, and the information combination of {RGB frame, S1A, S1B, time stamp sequence} is used to complete denoising Network training. 3.根据权利要求1所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,步骤S1的原始脉冲数据中,每一秒包含40000个H×W的0-1矩阵,以25μs的时间精度记录每个像素上是否有脉冲,0表示没有脉冲,1表示有脉冲。3. The neuromorphic pulse signal denoising and super-resolution method according to claim 1, characterized in that, in the original pulse data of step S1, each second contains 40000 H×W 0-1 matrices, with a time of 25 μs Time accuracy records whether there is a pulse on each pixel, 0 means no pulse, 1 means there is a pulse. 4.根据权利要求3所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,步骤S2中,Encoder-Decoder处理前先对原始的脉冲信号利用基于脉冲间隔的方法进行图像重建预处理。4. The neuromorphic pulse signal denoising and super-resolution method according to claim 3, characterized in that, in step S2, before the Encoder-Decoder process, the original pulse signal is used to perform image reconstruction preprocessing based on the pulse interval method . 5.根据权利要求4所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,预处理过程为:利用每个像素上前后两个相邻脉冲的时间间隔来表示该时刻光强的倒数,从而形成每个时刻初步的重建图像,即每秒重建出40000帧初步的图像,以作为后面网络的输入。5. neuromorphic pulse signal denoising and super-resolution method according to claim 4, is characterized in that, pretreatment process is: utilize the time interval of two adjacent pulses before and after each pixel to represent the light intensity at this moment Count down to form a preliminary reconstructed image at each moment, that is, reconstruct 40,000 frames of preliminary images per second as the input of the subsequent network. 6.根据权利要求1所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,步骤S3中去噪模型最优解表示为:6. neuromorphic pulse signal denoising and super-resolution method according to claim 1, is characterized in that, in the step S3, the optimal solution of the denoising model is expressed as:
Figure FDA0003970856440000021
Figure FDA0003970856440000021
其中S是受到噪声污染的输入和输出训练数据,Ω是求得的去噪模型。where S is the input and output training data contaminated by noise, and Ω is the obtained denoising model.
7.根据权利要求1所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,步骤S3利用3D UNet的结构来同时实现去噪和超分辨任务,在2倍超分辨网络中,3D UNet每个层级增加了3D反卷积层来进行跨层级特征融合,以实现分辨率的放大。7. neuromorphic pulse signal denoising and super-resolution method according to claim 1, is characterized in that, step S3 utilizes the structure of 3D UNet to realize denoising and super-resolution task simultaneously, in 2 times super-resolution network, 3D Each level of UNet adds a 3D deconvolution layer for cross-level feature fusion to achieve resolution amplification. 8.根据权利要求1所述的神经形态脉冲信号去噪和超分辨方法,其特征在于,在训练期间,从采集到的数据中生成了24000个LR-HR脉冲对作为训练集;Benchsize设定为8,并训练了100个epoch;优化器为ADAM,损失函数loss由权重比为1:0.005的Charbonnierloss和TVloss组合而成;初始学习率为0.001,每50个周期衰减0.5倍。8. neuromorphic pulse signal denoising and super-resolution method according to claim 1, is characterized in that, during training, has generated 24000 LR-HR pulse pairs as training set from the data gathered; Benchsize setting is 8, and trained for 100 epochs; the optimizer is ADAM, and the loss function loss is composed of Charbonnierloss and TVloss with a weight ratio of 1:0.005; the initial learning rate is 0.001, and the decay is 0.5 times every 50 cycles. 9.一种神经形态脉冲信号去噪和超分辨装置,其特征在于,包括显示屏和脉冲相机,以及以下模块以实现权利要求1-8任一项所述的方法:9. A neuromorphic pulse signal denoising and super-resolution device, characterized in that it includes a display screen and a pulse camera, and the following modules to realize the method according to any one of claims 1-8: 训练数据采集模块:用于通过脉冲相机同步拍摄不同空间分辨率的相同场景从而得到真实的训练数据集,并利用显示屏同步展示不同分辨率的运动视频在脉冲相机拍到的数据中截取出不同分辨率的脉冲数据,最终形成一个完整的RGB帧+多分辨率脉冲的数据集;Training data acquisition module: used to capture the same scene with different spatial resolutions synchronously through the pulse camera to obtain a real training data set, and use the display screen to display motion videos with different resolutions synchronously, and intercept different data from the data captured by the pulse camera resolution pulse data, and finally form a complete RGB frame + multi-resolution pulse data set; 脉冲数据转换模块:用于采用3D卷积神经网络对事件信息进行Encoder-Decoder处理;Pulse data conversion module: used for Encoder-Decoder processing of event information using 3D convolutional neural network; 脉冲去噪和空间上采样模块:用于获得事件信号的去噪模型并求得去噪模型的最优解,输出3D tensor形式的去噪和上采样之后的重建图像;Pulse denoising and spatial upsampling module: used to obtain the denoising model of the event signal and obtain the optimal solution of the denoising model, and output the reconstructed image in the form of 3D tensor after denoising and upsampling; 脉冲信号重分布模块:用于将3D tensor形式的重建图像通过均等间隔分配时间戳的方式进行脉冲重新分配,还原出高分辨率的脉冲信号。Pulse signal redistribution module: It is used to redistribute the pulses of the reconstructed image in the form of 3D tensor by distributing time stamps at equal intervals, and restore high-resolution pulse signals. 10.一种设备,其特征在于,包括处理器、通信接口、存储器和通信总线,所述处理器、所述通信接口、所述存储器通过所述通信总线完成相互间的通信;其特征在于,10. A kind of equipment, it is characterized in that, comprises processor, communication interface, memory and communication bus, described processor, described communication interface, described memory complete mutual communication through described communication bus; It is characterized in that, 所述存储器,用于存放计算机程序;The memory is used to store computer programs; 所述处理器,用于执行所述存储器上所存放的程序时,实现权利要求1-8任一项所述的方法。The processor is configured to implement the method according to any one of claims 1-8 when executing the program stored in the memory.
CN202211543963.7A 2022-11-29 2022-11-29 A neuromorphic pulse signal denoising and super-resolution method and device Pending CN115984124A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202211543963.7A CN115984124A (en) 2022-11-29 2022-11-29 A neuromorphic pulse signal denoising and super-resolution method and device

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202211543963.7A CN115984124A (en) 2022-11-29 2022-11-29 A neuromorphic pulse signal denoising and super-resolution method and device

Publications (1)

Publication Number Publication Date
CN115984124A true CN115984124A (en) 2023-04-18

Family

ID=85971353

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202211543963.7A Pending CN115984124A (en) 2022-11-29 2022-11-29 A neuromorphic pulse signal denoising and super-resolution method and device

Country Status (1)

Country Link
CN (1) CN115984124A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116389912A (en) * 2023-04-24 2023-07-04 北京大学 Method of reconstructing high frame rate and high dynamic range video by combining pulse camera with common camera

Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110969092A (en) * 2019-11-05 2020-04-07 国网河北省电力有限公司电力科学研究院 Pulse signal denoising method, device and terminal equipment
CN112699956A (en) * 2021-01-08 2021-04-23 西安交通大学 Neural morphology visual target classification method based on improved impulse neural network

Patent Citations (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN110969092A (en) * 2019-11-05 2020-04-07 国网河北省电力有限公司电力科学研究院 Pulse signal denoising method, device and terminal equipment
CN112699956A (en) * 2021-01-08 2021-04-23 西安交通大学 Neural morphology visual target classification method based on improved impulse neural network

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
PEIQI DUAN等: "EventZoom: Learning to Denoise and Super Resolve Neuromorphic Events", 《CVPR》, pages 12824 - 12833 *

Cited By (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN116389912A (en) * 2023-04-24 2023-07-04 北京大学 Method of reconstructing high frame rate and high dynamic range video by combining pulse camera with common camera
CN116389912B (en) * 2023-04-24 2023-10-10 北京大学 A method for reconstructing high frame rate and high dynamic range videos using pulse cameras combined with ordinary cameras

Similar Documents

Publication Publication Date Title
CN109671023B (en) A Super-resolution Reconstruction Method of Face Image
An et al. TR-MISR: Multiimage super-resolution based on feature fusion with transformers
Peng et al. LVE-S2D: Low-light video enhancement from static to dynamic
CN114494050B (en) A self-supervised video deblurring and image interpolation method based on event camera
CN110852961A (en) Real-time video denoising method and system based on convolutional neural network
CN109711364A (en) A kind of facial image super-resolution reconstruction method, device and computer equipment
Xiang et al. Learning super-resolution reconstruction for high temporal resolution spike stream
Zhao et al. Reconstructing clear image for high-speed motion scene with a retina-inspired spike camera
Duan et al. Guided event filtering: Synergy between intensity images and neuromorphic events for high performance imaging
CN111798395B (en) Event camera image reconstruction method and system based on TV constraint
CN114245007B (en) High-frame-rate video synthesis method, device, equipment and storage medium
CN110415344A (en) Motion compensation method based on event camera
CN112987026A (en) Event field synthetic aperture imaging algorithm based on hybrid neural network
CN111028302B (en) Compressed object imaging method and system based on deep learning
CN112001843A (en) Infrared image super-resolution reconstruction method based on deep learning
CN115880177A (en) Full-resolution low-light image enhancement method for aggregating context and enhancing details
Zhang et al. Fine-grained video deblurring with event camera
CN115048954B (en) A retina-like target detection method, device, storage medium and terminal
CN110866453B (en) Real-time crowd steady state recognition method and device based on convolutional neural network
CN115984124A (en) A neuromorphic pulse signal denoising and super-resolution method and device
CN109788270A (en) 3D-360 degree panorama image generation method and device
Yuan et al. Unsupervised real image super-resolution via knowledge distillation network
Duan LED: A large-scale real-world paired dataset for event camera denoising
CN112819742B (en) Event field synthetic aperture imaging method based on convolutional neural network
Li et al. Generalizing event-based HDR imaging to various exposures

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination
RJ01 Rejection of invention patent application after publication

Application publication date: 20230418