Nothing Special   »   [go: up one dir, main page]

CN116705046A - Sound pickup method, device, non-volatile storage medium and terminal equipment - Google Patents

Sound pickup method, device, non-volatile storage medium and terminal equipment Download PDF

Info

Publication number
CN116705046A
CN116705046A CN202210182711.XA CN202210182711A CN116705046A CN 116705046 A CN116705046 A CN 116705046A CN 202210182711 A CN202210182711 A CN 202210182711A CN 116705046 A CN116705046 A CN 116705046A
Authority
CN
China
Prior art keywords
sound
target
sound signal
signal
microphone array
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN202210182711.XA
Other languages
Chinese (zh)
Inventor
康洪涛
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Beijing Hexmeet Technology Co ltd
Original Assignee
Beijing Hexmeet Technology Co ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Beijing Hexmeet Technology Co ltd filed Critical Beijing Hexmeet Technology Co ltd
Priority to CN202210182711.XA priority Critical patent/CN116705046A/en
Publication of CN116705046A publication Critical patent/CN116705046A/en
Pending legal-status Critical Current

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0264Noise filtering characterised by the type of parameter measurement, e.g. correlation techniques, zero crossing techniques or predictive techniques
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/27Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
    • G10L25/30Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique using neural networks
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/48Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use
    • G10L25/51Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 specially adapted for particular use for comparison or discrimination
    • HELECTRICITY
    • H04ELECTRIC COMMUNICATION TECHNIQUE
    • H04RLOUDSPEAKERS, MICROPHONES, GRAMOPHONE PICK-UPS OR LIKE ACOUSTIC ELECTROMECHANICAL TRANSDUCERS; DEAF-AID SETS; PUBLIC ADDRESS SYSTEMS
    • H04R1/00Details of transducers, loudspeakers or microphones
    • H04R1/20Arrangements for obtaining desired frequency or directional characteristics
    • H04R1/32Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only
    • H04R1/40Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers
    • H04R1/406Arrangements for obtaining desired frequency or directional characteristics for obtaining desired directional characteristic only by combining a number of identical transducers microphones
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L21/00Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
    • G10L21/02Speech enhancement, e.g. noise reduction or echo cancellation
    • G10L21/0208Noise filtering
    • G10L21/0216Noise filtering characterised by the method used for estimating noise
    • G10L2021/02161Number of inputs available containing the signal or the noise to be suppressed
    • G10L2021/02166Microphone arrays; Beamforming

Landscapes

  • Engineering & Computer Science (AREA)
  • Health & Medical Sciences (AREA)
  • Acoustics & Sound (AREA)
  • Physics & Mathematics (AREA)
  • Signal Processing (AREA)
  • Human Computer Interaction (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Computational Linguistics (AREA)
  • Multimedia (AREA)
  • Quality & Reliability (AREA)
  • Otolaryngology (AREA)
  • Artificial Intelligence (AREA)
  • Evolutionary Computation (AREA)
  • Circuit For Audible Band Transducer (AREA)

Abstract

The application discloses a pickup method, a pickup device, a nonvolatile storage medium and terminal equipment. Wherein the method comprises the following steps: collecting sound signals using a microphone array comprising a plurality of microphones; detecting a target azimuth angle corresponding to the sound signal acquired by the microphone array; and under the condition that the target azimuth angle is in the preset angle interval, performing signal enhancement processing on the sound signal to obtain a target sound signal. The application solves the technical problem that the sound processing result is not ideal because the sound pickup equipment cannot accurately distinguish the sound needing to be enhanced.

Description

拾音方法、装置、非易失性存储介质及终端设备Sound pickup method, device, non-volatile storage medium and terminal equipment

技术领域technical field

本发明涉及声音设备领域,具体而言,涉及一种拾音方法、装置、非易失性存储介质及终端设备。The present invention relates to the field of sound equipment, in particular to a sound pickup method, device, non-volatile storage medium and terminal equipment.

背景技术Background technique

麦克风如何清晰拾音以及如何甄别麦克风采集到的声音是有价值的声音还是噪音及其他类型的需要抑制的声音是业界面临的一个问题。How to pick up sound clearly by the microphone and how to distinguish whether the sound collected by the microphone is valuable sound or noise and other types of sound that need to be suppressed is a problem faced by the industry.

当声音采集设备进行收音时,声音采集设备的麦克风会采集到不同类型、不同对象的声音,然而声音采集设备的使用者可能只需要麦克风采集到自己的声音即可,而将其他声音抑制掉,现有的利用麦克风拾音的方法和系统,在存在噪声和混响的情况下,算法性能下降明显而且语音的失真较大,同时当存在两个及以上说话人同时讲话时,效果也不理想。When the sound collection device collects sound, the microphone of the sound collection device will collect sounds of different types and objects. However, the user of the sound collection device may only need the microphone to collect his own voice, and suppress other sounds. In the existing methods and systems for using microphones to pick up sound, in the presence of noise and reverberation, the performance of the algorithm drops significantly and the distortion of the speech is relatively large. At the same time, when there are two or more speakers speaking at the same time, the effect is not ideal .

针对上述的问题,目前尚未提出有效的解决方案。For the above problems, no effective solution has been proposed yet.

发明内容Contents of the invention

本发明实施例提供了一种拾音方法、装置、非易失性存储介质及终端设备,以至少解决拾音设备无法准确区分需要增强的声音导致声音处理结果不理想的技术问题。Embodiments of the present invention provide a sound pickup method, device, non-volatile storage medium, and terminal equipment to at least solve the technical problem that the sound pickup equipment cannot accurately distinguish the sound that needs to be enhanced, resulting in unsatisfactory sound processing results.

根据本发明实施例的一个方面,提供了一种拾音方法,包括:采用包括多个麦克风的麦克风阵列采集声音信号;检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角;在所述目标方位角位于预设角度区间内的情况下,对所述声音信号进行信号增强处理,得到目标声音信号。According to an aspect of an embodiment of the present invention, there is provided a sound pickup method, including: collecting sound signals by using a microphone array including a plurality of microphones; detecting the corresponding target azimuth angle when the sound signals are collected by the microphone array; When the target azimuth angle is within the preset angle interval, signal enhancement processing is performed on the sound signal to obtain the target sound signal.

可选地,所述检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角,包括:将所述声音信号转换为数字信号;将所述数字信号输入卷积神经网络CNN模型,得到所述数字信号对应的目标方位角,其中,所述CNN模型用于对声音方向进行定位。Optionally, the detection of the corresponding target azimuth angle when the sound signal is collected by the microphone array includes: converting the sound signal into a digital signal; inputting the digital signal into a convolutional neural network (CNN) model, The target azimuth angle corresponding to the digital signal is obtained, wherein the CNN model is used to locate the sound direction.

可选地,所述将所述数字信号输入卷积神经网络CNN模型,得到所述数字信号对应的目标方位角,包括:采用所述CNN模型的特征提取模块,从所述数字信号中提取广义互相关性特征和滤波器组特征;采用所述CNN模型的定位模块,基于所述广义互相关性特征和所述滤波器组特征,确定所述数字信号对应的目标方位角。Optionally, the inputting the digital signal into the convolutional neural network (CNN) model to obtain the target azimuth angle corresponding to the digital signal includes: using the feature extraction module of the CNN model to extract generalized Cross-correlation features and filter bank features; using the positioning module of the CNN model to determine the target azimuth angle corresponding to the digital signal based on the generalized cross-correlation features and the filter bank features.

可选地,所述采用所述CNN模型的定位模块基于所述广义互相关性特征和所述滤波器组特征,确定所述数字信号对应的目标方位角,包括:采用所述CNN模型的所述定位模块,基于所述广义互相关性特征和所述滤波器组特征,确定所述麦克风阵列从任意一个方位角采集所述声音信号的概率值;采用所述定位模块,基于所述概率值,确定所述数字信号对应的目标方位角。Optionally, the positioning module using the CNN model determines the target azimuth angle corresponding to the digital signal based on the generalized cross-correlation feature and the filter bank feature, including: using the CNN model The positioning module, based on the generalized cross-correlation feature and the filter bank feature, determines the probability value that the microphone array collects the sound signal from any azimuth angle; using the positioning module, based on the probability value , to determine the target azimuth angle corresponding to the digital signal.

可选地,所述麦克风阵列中包括的多个麦克风为多个指向性麦克风,多个所述指向性麦克风依次位于正多边形的多个顶点上。Optionally, the multiple microphones included in the microphone array are multiple directional microphones, and the multiple directional microphones are sequentially located on multiple vertices of a regular polygon.

可选地,所述方法还包括:输出所述目标声音信号至目标设备,其中,所述目标设备用于播放或者存储所述目标声音信号。Optionally, the method further includes: outputting the target sound signal to a target device, wherein the target device is used to play or store the target sound signal.

可选地,所述方法还包括:在所述目标方位角位于所述预设角度区间以外的情况下,对所述声音信号进行抑制,得到噪声信号;输出所述噪声信号至所述目标设备。Optionally, the method further includes: when the target azimuth is outside the preset angle range, suppressing the sound signal to obtain a noise signal; outputting the noise signal to the target device .

可选地,在采用包括多个麦克风的麦克风阵列采集声音信号之前,所述方法还包括:展示第一提示信息,其中,所述第一提示信息用于提示所述预设角度区间;在所述目标方位角位于所述预设角度区间内的情况下,所述方法还包括:展示第二提示信息,其中,所述第二提示信息用于提示所述声音信号对应的所述目标方位角位于所述预设角度区间内;在所述目标方位角位于所述预设角度区间以外的情况下,所述方法还包括:展示第三提示信息,其中,所述第三提示信息用于提示所述声音信号对应的所述目标方位角位于所述预设角度区间以外。Optionally, before adopting a microphone array including a plurality of microphones to collect sound signals, the method further includes: presenting first prompt information, wherein the first prompt information is used to prompt the preset angle interval; When the target azimuth is within the preset angle interval, the method further includes: displaying second prompt information, wherein the second prompt information is used to prompt the target azimuth corresponding to the sound signal located within the preset angle interval; when the target azimuth is outside the preset angle interval, the method further includes: displaying third prompt information, wherein the third prompt information is used to prompt The target azimuth angle corresponding to the sound signal is outside the preset angle range.

根据本发明实施例的另一方面,还提供了一种拾音方法,包括:接收声音信号,其中,所述声音信号由包括多个麦克风的麦克风阵列采集得到;检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角;在所述目标方位角位于预设角度区间内的情况下,对所述声音信号进行信号增强处理,得到目标声音信号;输出所述目标声音信号至目标设备,其中,所述目标设备用于播放或者存储所述目标声音信号。According to another aspect of the embodiments of the present invention, there is also provided a sound pickup method, including: receiving a sound signal, wherein the sound signal is collected by a microphone array including a plurality of microphones; detecting that the sound signal is collected by the When the microphone array collects the corresponding target azimuth angle; when the target azimuth angle is within the preset angle interval, perform signal enhancement processing on the sound signal to obtain the target sound signal; output the target sound signal to the target device, wherein the target device is used to play or store the target sound signal.

根据本发明实施例的另一方面,还提供了一种拾音装置,包括:采集模块,用于采用包括多个麦克风的麦克风阵列采集声音信号;第一检测模块,用于检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角;第一增强模块,用于在所述目标方位角位于预设角度区间内的情况下,对所述声音信号进行信号增强处理,得到目标声音信号。According to another aspect of the embodiments of the present invention, there is also provided a sound pickup device, including: a collection module, configured to collect sound signals using a microphone array including a plurality of microphones; a first detection module, used to detect the sound signals The corresponding target azimuth angle when collected by the microphone array; the first enhancement module is used to perform signal enhancement processing on the sound signal to obtain the target sound when the target azimuth angle is within a preset angle interval Signal.

根据本发明实施例的另一方面,还提供了一种拾音装置,包括:接收模块,用于接收声音信号,其中,所述声音信号由包括多个麦克风的麦克风阵列采集得到;第二检测模块,用于检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角;第二增强模块,用于在所述目标方位角位于预设角度区间内的情况下,对所述声音信号进行信号增强处理,得到目标声音信号;输出模块,用于输出所述目标声音信号至目标设备,其中,所述目标设备用于播放或者存储所述目标声音信号。According to another aspect of the embodiments of the present invention, there is also provided a sound pickup device, including: a receiving module for receiving a sound signal, wherein the sound signal is collected by a microphone array including a plurality of microphones; the second detection A module for detecting the corresponding target azimuth angle when the sound signal is collected by the microphone array; a second enhancement module for detecting the sound signal when the target azimuth angle is within a preset angle interval The signal is subjected to signal enhancement processing to obtain a target sound signal; an output module is configured to output the target sound signal to a target device, wherein the target device is used to play or store the target sound signal.

根据本发明实施例的又一方面,还提供了一种非易失性存储介质,所述非易失性存储介质包括存储的程序,其中,在所述程序运行时控制所述非易失性存储介质所在设备执行上述任意一项所述拾音方法。According to still another aspect of the embodiments of the present invention, there is also provided a non-volatile storage medium, the non-volatile storage medium includes a stored program, wherein the non-volatile memory is controlled when the program is running. The device where the storage medium is located executes any one of the sound pickup methods described above.

根据本发明实施例的再一方面,还提供了一种终端设备,所述终端设备包括处理器,所述处理器用于运行程序,其中,所述程序运行时执行上述任意一项所述拾音方法。According to still another aspect of the embodiments of the present invention, there is also provided a terminal device, the terminal device includes a processor, and the processor is used to run a program, wherein, when the program is running, it executes any one of the above audio pickup method.

在本发明实施例中,通过采用包括多个麦克风的麦克风阵列采集声音信号;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号,达到了增强来自预设角度区间内的声音的目的,从而实现了提高拾音设备的拾音效果的技术效果,进而解决了拾音设备无法准确区分需要增强的声音导致声音处理结果不理想的技术问题。In the embodiment of the present invention, the sound signal is collected by using a microphone array including a plurality of microphones; the corresponding target azimuth angle is detected when the sound signal is collected by the microphone array; The sound signal is subjected to signal enhancement processing to obtain the target sound signal, which achieves the purpose of enhancing the sound from the preset angle range, thereby achieving the technical effect of improving the sound pickup effect of the sound pickup device, and further solving the problem that the sound pickup device cannot be accurately distinguished Technical issues that require enhanced sound resulting in suboptimal sound processing results.

附图说明Description of drawings

此处所说明的附图用来提供对本发明的进一步理解,构成本申请的一部分,本发明的示意性实施例及其说明用于解释本发明,并不构成对本发明的不当限定。在附图中:The accompanying drawings described here are used to provide a further understanding of the present invention and constitute a part of the application. The schematic embodiments of the present invention and their descriptions are used to explain the present invention and do not constitute improper limitations to the present invention. In the attached picture:

图1示出了一种用于实现拾音方法的计算机终端的硬件结构框图;Fig. 1 shows a kind of block diagram of the hardware structure of the computer terminal that is used to realize sound pickup method;

图2是根据本发明实施例提供的拾音方法一的流程示意图;Fig. 2 is a schematic flow chart of sound pickup method 1 provided according to an embodiment of the present invention;

图3是根据本发明可选实施方式提供的三角形麦克风阵列的示意图;3 is a schematic diagram of a triangular microphone array provided according to an optional embodiment of the present invention;

图4是根据本发明可选实施方式提供的CNN模型的结构示意图;Fig. 4 is a schematic structural diagram of a CNN model provided according to an optional embodiment of the present invention;

图5是根据本发明可选实施方式提供的终端设备的拾音流程示意图;FIG. 5 is a schematic diagram of a sound pickup process of a terminal device provided according to an optional embodiment of the present invention;

图6是根据本发明实施例提供的拾音方法二的流程示意图;FIG. 6 is a schematic flow diagram of a sound pickup method 2 provided according to an embodiment of the present invention;

图7是根据本发明实施例提供的拾音装置一的结构框图;Fig. 7 is a structural block diagram of a sound pickup device 1 provided according to an embodiment of the present invention;

图8是根据本发明实施例提供的拾音装置二的结构框图。Fig. 8 is a structural block diagram of a second sound pickup device provided according to an embodiment of the present invention.

具体实施方式Detailed ways

为了使本技术领域的人员更好地理解本发明方案,下面将结合本发明实施例中的附图,对本发明实施例中的技术方案进行清楚、完整地描述,显然,所描述的实施例仅仅是本发明一部分的实施例,而不是全部的实施例。基于本发明中的实施例,本领域普通技术人员在没有做出创造性劳动前提下所获得的所有其他实施例,都应当属于本发明保护的范围。In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is an embodiment of a part of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts shall fall within the protection scope of the present invention.

需要说明的是,本发明的说明书和权利要求书及上述附图中的术语“第一”、“第二”等是用于区别类似的对象,而不必用于描述特定的顺序或先后次序。应该理解这样使用的数据在适当情况下可以互换,以便这里描述的本发明的实施例能够以除了在这里图示或描述的那些以外的顺序实施。此外,术语“包括”和“具有”以及他们的任何变形,意图在于覆盖不排他的包含,例如,包含了一系列步骤或单元的过程、方法、系统、产品或设备不必限于清楚地列出的那些步骤或单元,而是可包括没有清楚地列出的或对于这些过程、方法、产品或设备固有的其它步骤或单元。It should be noted that the terms "first" and "second" in the description and claims of the present invention and the above drawings are used to distinguish similar objects, but not necessarily used to describe a specific sequence or sequence. It is to be understood that the data so used are interchangeable under appropriate circumstances such that the embodiments of the invention described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", as well as any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or device comprising a sequence of steps or elements is not necessarily limited to the expressly listed instead, may include other steps or elements not explicitly listed or inherent to the process, method, product or apparatus.

根据本发明实施例,提供了一种拾音方法的实施例,需要说明的是,在附图的流程图示出的步骤可以在诸如一组计算机可执行指令的计算机系统中执行,并且,虽然在流程图中示出了逻辑顺序,但是在某些情况下,可以以不同于此处的顺序执行所示出或描述的步骤。According to an embodiment of the present invention, an embodiment of a sound pickup method is provided. It should be noted that the steps shown in the flowcharts of the accompanying drawings can be executed in a computer system such as a set of computer-executable instructions, and, although In the flowcharts, a logical order is shown, but in some cases the steps shown or described may be performed in an order different from that shown or described herein.

本申请实施例一所提供的方法实施例可以在移动终端、计算机终端或者类似的运算装置中执行。图1示出了一种用于实现拾音方法的计算机终端的硬件结构框图。如图1所示,计算机终端10可以包括一个或多个(图中采用102a、102b,……,102n来示出)处理器(处理器可以包括但不限于微处理器MCU或可编程逻辑器件FPGA等的处理装置)、用于存储数据的存储器104。除此以外,还可以包括:显示器、输入/输出接口(I/O接口)、通用串行总线(USB)端口(可以作为BUS总线的端口中的一个端口被包括)、网络接口、电源和/或相机。本领域普通技术人员可以理解,图1所示的结构仅为示意,其并不对上述电子装置的结构造成限定。例如,计算机终端10还可包括比图1中所示更多或者更少的组件,或者具有与图1所示不同的配置。The method embodiment provided in Embodiment 1 of the present application may be executed in a mobile terminal, a computer terminal, or a similar computing device. Fig. 1 shows a block diagram of the hardware structure of a computer terminal for realizing the sound pickup method. As shown in Figure 1, the computer terminal 10 may include one or more (shown by 102a, 102b, ..., 102n in the figure) processors (processors may include but not limited to microprocessors MCU or programmable logic devices Processing device such as FPGA), memory 104 for storing data. In addition, it can also include: a display, an input/output interface (I/O interface), a universal serial bus (USB) port (which can be included as one of the ports of the BUS bus), a network interface, a power supply, and/or or camera. Those of ordinary skill in the art can understand that the structure shown in FIG. 1 is only a schematic diagram, and it does not limit the structure of the above-mentioned electronic device. For example, computer terminal 10 may also include more or fewer components than shown in FIG. 1 , or have a different configuration than that shown in FIG. 1 .

应当注意到的是上述一个或多个处理器和/或其他数据处理电路在本文中通常可以被称为“数据处理电路”。该数据处理电路可以全部或部分的体现为软件、硬件、固件或其他任意组合。此外,数据处理电路可为单个独立的处理模块,或全部或部分的结合到计算机终端10中的其他元件中的任意一个内。如本申请实施例中所涉及到的,该数据处理电路作为一种处理器控制(例如与接口连接的可变电阻终端路径的选择)。It should be noted that the one or more processors and/or other data processing circuits described above may generally be referred to herein as "data processing circuits". The data processing circuit may be implemented in whole or in part as software, hardware, firmware or other arbitrary combinations. In addition, the data processing circuit can be a single independent processing module, or be fully or partially integrated into any of the other components in the computer terminal 10 . As mentioned in the embodiment of the present application, the data processing circuit is used as a processor control (for example, the selection of the terminal path of the variable resistor connected to the interface).

存储器104可用于存储应用软件的软件程序以及模块,如本发明实施例中的拾音方法对应的程序指令/数据存储装置,处理器通过运行存储在存储器104内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的应用程序的拾音方法。存储器104可包括高速随机存储器,还可包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器104可进一步包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至计算机终端10。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。The memory 104 can be used to store software programs and modules of application software, such as the program instruction/data storage device corresponding to the sound pickup method in the embodiment of the present invention, and the processor executes each program by running the software programs and modules stored in the memory 104. A functional application and data processing, that is, to realize the sound pickup method of the above-mentioned application program. The memory 104 may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some instances, the memory 104 may further include a memory that is remotely located relative to the processor, and these remote memories may be connected to the computer terminal 10 through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

显示器可以例如触摸屏式的液晶显示器(LCD),该液晶显示器可使得用户能够与计算机终端10的用户界面进行交互。The display can be, for example, a touch-screen liquid crystal display (LCD), which enables a user to interact with the user interface of the computer terminal 10 .

图2是根据本发明实施例提供的拾音方法一的流程示意图,如图2所示,该方法包括如下步骤:Fig. 2 is a schematic flow chart of a sound pickup method 1 provided according to an embodiment of the present invention. As shown in Fig. 2 , the method includes the following steps:

步骤S202,采用包括多个麦克风的麦克风阵列采集声音信号。需要说明的是,包括多个麦克风的麦克风阵列可以位于声音采集现场的声音采集设备中,声音采集设备可以为供用户使用的手持式话筒,或者固定在桌面上的会议话筒。Step S202, using a microphone array including a plurality of microphones to collect sound signals. It should be noted that the microphone array including multiple microphones can be located in the sound collection device at the sound collection site, and the sound collection device can be a hand-held microphone for users, or a conference microphone fixed on a table.

一种可能的应用场景中,包括麦克风阵列的声音采集设备可以位于会议室中,设备的使用者希望主要收集话筒面前的使用者的声音,而忽视或者抑制会议室中的其他声音。但是麦克风阵列采集到的声音信号可能是来自话筒面前的使用者,也可能是来自会议室中的其他方向,因此需要对此进行甄别。In a possible application scenario, the sound collection device including the microphone array may be located in a conference room, and the user of the device wishes to mainly collect the voice of the user in front of the microphone, while ignoring or suppressing other sounds in the conference room. However, the sound signal collected by the microphone array may come from the user in front of the microphone or from other directions in the meeting room, so it needs to be screened.

步骤S204,检测声音信号被麦克风阵列采集到时对应的目标方位角。本步骤中,目标方位角即指示了声音信号传来的方向,即声音信号沿目标方位角传到麦克风阵列并被麦克风阵列采集到。可选地,表示目标方位角可以以麦克风阵列位于的声音采集设备为中心,以水平面为坐标系所在的平面,建立平面直角坐标系或者极坐标系,进而将目标方位角表示在上述的坐标系中。可选地,目标方位角的取值范围可以在0°至360°之间变化。Step S204, detecting the corresponding target azimuth angle when the sound signal is collected by the microphone array. In this step, the target azimuth indicates the direction from which the sound signal is transmitted, that is, the sound signal is transmitted to the microphone array along the target azimuth and collected by the microphone array. Optionally, the target azimuth can be expressed with the sound collection device where the microphone array is located as the center, and the horizontal plane as the plane where the coordinate system is located, to establish a plane Cartesian or polar coordinate system, and then express the target azimuth in the above coordinate system middle. Optionally, the value range of the target azimuth angle may vary from 0° to 360°.

步骤S206,在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号。Step S206, when the target azimuth angle is within the preset angle interval, perform signal enhancement processing on the sound signal to obtain the target sound signal.

麦克风阵列可以实现全角度范围的声音拾取,本步骤可以进一步实现对麦克风阵列采集到的声音信号进行甄别,以确定目标方位角对应声音信号是否为用户希望采集并使用的声音。可选地,当声音信号位于预设角度区间内时,可以认为该声音即为需要采集的声音,此时对该声音信号进行增强处理,以最大化地提升声音效果。The microphone array can realize sound pickup in a full range of angles. This step can further realize the screening of the sound signals collected by the microphone array to determine whether the sound signal corresponding to the target azimuth angle is the sound that the user wants to collect and use. Optionally, when the sound signal is within the preset angle interval, the sound can be considered as the sound to be collected, and at this time, the sound signal is enhanced to maximize the sound effect.

通过上述步骤,通过采用包括多个麦克风的麦克风阵列采集声音信号;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号,达到了增强来自预设角度区间内的声音的目的,从而实现了提高拾音设备的拾音效果的技术效果,进而解决了拾音设备无法准确区分需要增强的声音导致声音处理结果不理想的技术问题。Through the above steps, the sound signal is collected by using a microphone array comprising a plurality of microphones; the corresponding target azimuth angle is detected when the sound signal is collected by the microphone array; when the target azimuth angle is within a preset angle interval, the sound signal is performed Signal enhancement processing, to obtain the target sound signal, to achieve the purpose of enhancing the sound from the preset angle range, thereby achieving the technical effect of improving the sound pickup effect of the sound pickup device, and then solving the problem that the sound pickup device cannot accurately distinguish the sound that needs to be enhanced Sound Technical issues that cause unsatisfactory sound processing results.

上述方法可以适用于如下场景中,需要说明的是,如下场景仅为说明性地举例,本实施例述及的拾音方法还可以应用于其他应用场景中。The foregoing method may be applicable to the following scenarios. It should be noted that the following scenarios are only illustrative examples, and the sound pickup method mentioned in this embodiment may also be applied to other application scenarios.

在示例性的应用场景中,声音采集设备可以是会议室中话筒,设备中包括麦克风阵列,用户希望话筒重点采集话筒面前的发言人的声音,而避免采集到会议室内其他人或者物的声音。此时,话筒可以将预设角度区间设置为发言人所在的角度区间,然后采集传入麦克风阵列中的声音信号,根据声音信号判断该声音是否来自于预设角度区间。在该声音对应的目标方位角在预设角度区间内时,可以认为该声音即为发言人发出的声音,因此对该声音信号进行增强,得到目标声音信号。In an exemplary application scenario, the sound collection device may be a microphone in a conference room, and the device includes a microphone array. The user hopes that the microphone will focus on collecting the voice of the speaker in front of the microphone, and avoid collecting the voices of other people or objects in the conference room. At this time, the microphone can set the preset angle interval as the angle interval where the speaker is located, and then collect the sound signal transmitted into the microphone array, and judge whether the sound comes from the preset angle interval according to the sound signal. When the target azimuth angle corresponding to the sound is within the preset angle interval, the sound can be considered to be the sound of the speaker, so the sound signal is enhanced to obtain the target sound signal.

作为一种可选的实施例,在目标方位角位于预设角度区间以外的情况下,还可以选择对声音信号进行抑制,得到噪声信号,即将会议室中来自其他方向的声音进行抑制或者屏蔽,以保证最后采集并输出给扬声器的声音信号为主要来自发言人的声音。通过本可选的实施例,可以实现对需要保留的声音的甄选,结合上述增强声音信号以得到目标声音信号的实施例,可以实现增强发言人的声音并抑制其他方向上的声音,将位于预定角度区间内的发言人的声音突出出来,提高对其声音的呈现效果。As an optional embodiment, when the target azimuth is outside the preset angle range, you can also choose to suppress the sound signal to obtain a noise signal, that is, to suppress or shield the sound from other directions in the meeting room, To ensure that the final sound signal collected and output to the loudspeaker is mainly the sound from the speaker. Through this optional embodiment, the selection of the sound that needs to be retained can be realized. Combining with the above-mentioned embodiment of enhancing the sound signal to obtain the target sound signal, it can be realized that the voice of the speaker can be enhanced and the sound in other directions can be suppressed. The voice of the speaker in the angular range is highlighted, improving the rendering effect of his voice.

可选地,在对声音信号进行增强或者抑制之后,还可以将目标声音信号或者噪声信号输出至用于播放或者存储目标声音信号目标设备。其中,目标设备可以是现场的扬声器,还可以是远端的服务器或者远端的其他终端设备。Optionally, after the sound signal is enhanced or suppressed, the target sound signal or the noise signal may also be output to a target device for playing or storing the target sound signal. Wherein, the target device may be a live speaker, or a remote server or other remote terminal devices.

例如,在目标设备包括扬声器,且会议室现场的声音采集设备、扬声器集成在一个会议终端设备上的情况下,会议终端设备中包括处理器,处理器可以从声音采集设备中获取麦克风阵列采集的声音信号,然后声音信号进行数据处理,计算得到声音信号对应的目标方位角,然后与该会议终端设备预设的角度区间进行对比,判断目标方位角是否位于预设角度区间内;在该声音位于预设角度区间内的情况下,处理器增强该声音信号,得到目标声音信号,并将目标声音信号输出至扬声器;在该声音位于预设角度区间以外的情况下,处理器抑制该声音信号得到噪声信号,然后同样输出至扬声器;最后可以由扬声器同时播放增强后的目标声音信号以及抑制后的噪声信号,实现对来自预设角度区间的声音的突出,提高其声音呈现效果。For example, when the target device includes a loudspeaker, and the on-site sound collection device in the meeting room and the loudspeaker are integrated on a conference terminal device, the conference terminal device includes a processor, and the processor can obtain the audio data collected by the microphone array from the sound collection device. sound signal, and then perform data processing on the sound signal to calculate the target azimuth angle corresponding to the sound signal, and then compare it with the preset angle range of the conference terminal device to determine whether the target azimuth angle is within the preset angle range; In the case of the preset angle interval, the processor enhances the sound signal to obtain the target sound signal, and outputs the target sound signal to the speaker; when the sound is outside the preset angle interval, the processor suppresses the sound signal to obtain The noise signal is then also output to the speaker; finally, the enhanced target sound signal and the suppressed noise signal can be played simultaneously by the speaker to highlight the sound from the preset angle range and improve its sound presentation effect.

又例如,目标设备可以是远端的服务器,声音采集设备收集到声音信号后,可以将该声音信号交由本地的处理器处理,得到增强或者抑制的信号,然后以视频流或者音频流的形式发送给远端的服务器进行转发或者保存;或者声音采集设备直接将声音信号上传服务器或者发送给远端的其他终端设备,由其他终端设备对声音信号进行处理和保存。For another example, the target device can be a remote server. After the sound collection device collects the sound signal, the sound signal can be processed by a local processor to obtain an enhanced or suppressed signal, and then the signal can be transmitted in the form of a video stream or an audio stream. Send to the remote server for forwarding or saving; or the sound collection device directly uploads the sound signal to the server or sends it to other remote terminal devices, and the other terminal devices process and save the sound signal.

作为一种可选的实施例,在采用包括多个麦克风的麦克风阵列采集声音信号之前,还可以包括如下步骤:展示第一提示信息,其中,第一提示信息用于提示预设角度区间;在目标方位角位于预设角度区间内的情况下,方法还包括:展示第二提示信息,其中,第二提示信息用于提示声音信号对应的目标方位角位于预设角度区间内;在目标方位角位于预设角度区间以外的情况下,方法还包括:展示第三提示信息,其中,第三提示信息用于提示声音信号对应的目标方位角位于预设角度区间以外。As an optional embodiment, before adopting a microphone array including a plurality of microphones to collect sound signals, the following steps may also be included: displaying first prompt information, wherein the first prompt information is used to prompt a preset angle interval; When the target azimuth is within the preset angle interval, the method further includes: displaying second prompt information, wherein the second prompt information is used to prompt that the target azimuth corresponding to the sound signal is within the preset angle interval; If it is outside the preset angle interval, the method further includes: displaying third prompt information, wherein the third prompt information is used to prompt that the target azimuth angle corresponding to the sound signal is outside the preset angle interval.

本可选的实施例中,可以在声音采集设备上设置一个用于展示提示信息的装置,例如设置一个环形提示灯,声音采集设备可以控制提示灯只亮环上的一部分区域,该部分区域可以对应预设角度区间,该种情况下可以认为设备在展示第一提示信息,用于指示终端设备增强声音的角度范围,提示发言人沿该角度对准声音采集设备进行发言。进一步地,还可以在声音采集设备上设置其他指示灯来在发言人进行试音的时候提示其是否在预设角度区间的范围内对准了声音采集设备,例如,当指示灯闪烁绿色信号时,其在展示第二提示信息,当指示灯闪烁红色信号时,其在展示第三提示信息。In this optional embodiment, a device for displaying prompt information can be set on the sound collection device, such as a ring-shaped prompt light, and the sound collection device can control the prompt light to only light up a part of the area on the ring, which can be Corresponding to the preset angle range, in this case, it can be considered that the device is displaying the first prompt information, which is used to instruct the terminal device to enhance the angle range of the sound, and remind the speaker to speak at the sound collection device along this angle. Further, other indicator lights can also be set on the sound collection device to remind the speaker whether it is aimed at the sound collection device within the range of the preset angle interval when the speaker is performing a sound test, for example, when the indicator light flashes a green signal , it is displaying the second prompt information, and when the indicator light flashes red signal, it is displaying the third prompt information.

作为一种可选的实施例,可以将声音采集设备上的环形提示灯设置为可响应用户的手势交互,根据手势交互动作改变预设角度区间,将声音采集设备对应的预设角度区间修改为目标角度区间。本可选实施例提供的方法可以提高声音采集设备的功能灵活性,在用户发现自己的声音没有被加强时,通过触摸环形提示灯的区域,将环形提示灯上点亮的对应预设角度区间的位置通过拖拽手势移动到对应自己方位的位置,声音采集设备可以根据该变化,将预设角度区间调整为目标角度区间,然后让整个拾音设备增强来自目标角度区间的声音,而抑制来自其他角度区间的声音。As an optional embodiment, the ring-shaped prompt light on the sound collection device can be set to respond to the user's gesture interaction, change the preset angle interval according to the gesture interaction action, and modify the preset angle interval corresponding to the sound collection device to Target angle range. The method provided in this optional embodiment can improve the functional flexibility of the sound collection device. When the user finds that his voice has not been enhanced, the user can touch the area of the ring warning light to lighten the corresponding preset angle range on the ring warning light. The position of the sound pickup device can be moved to the position corresponding to its position by dragging gestures. According to the change, the sound collection device can adjust the preset angle range to the target angle range, and then let the entire sound pickup device enhance the sound from the target angle range, while suppressing the sound from the target angle range. Sounds from other angles.

作为一种可选的实施例,麦克风阵列中包括的多个麦克风可以为多个指向性麦克风,多个指向性麦克风依次位于正多边形的多个顶点上。指向性麦克风即为对不同方位传来的声音的敏感性不同的麦克风,例如,可以采用心形指向性麦克风,心形指向性麦克风只拾取来自其正前方的声音,而尽可能忽略来自其侧方和后方的声音。进一步地,可以将多个指向性麦克风设置在正多边形的多个顶点上,并依次错开其拾音角度范围,让每个指向性麦克风的拾音范围互不重叠并互相补充,实现对360°范围的全方位拾音,采用该种设置方式,可以提高整体麦克风阵列的拾音效果,拾音的效果更清晰。图3是根据本发明可选实施方式提供的三角形麦克风阵列的示意图,如图3所示,在麦克风阵列中包括三个心形指向性麦克风的情况下,可以将三个心形指向性麦克风设置在等边三角形的三个顶点上,并使其正方向依次成120°角度差指向不同的方位,其中每个心形指向性麦克风的拾音范围为以正前方为轴正负偏差60°,每个心形指向性麦克风拾音范围总计120°,三个心形指向性麦克风实现360°的全方位拾音。本领域技术人员可以理解的,在将多个心形指向性麦克风构成的形状设置为其他正多边形时,可以增加心形指向性麦克风的数量并调整参数改变每个心形指向性麦克风的拾音角度范围,避免拾音范围重叠。As an optional embodiment, the multiple microphones included in the microphone array may be multiple directional microphones, and the multiple directional microphones are sequentially located on multiple vertices of a regular polygon. A directional microphone is a microphone with different sensitivities to sounds coming from different directions. For example, a cardioid directional microphone can be used. Side and rear sounds. Further, multiple directional microphones can be arranged on multiple vertices of the regular polygon, and their pickup angle ranges are staggered in turn, so that the pickup ranges of each directional microphone do not overlap and complement each other, realizing 360° The range of omnidirectional sound pickup, using this setting method, can improve the sound pickup effect of the overall microphone array, and the sound pickup effect is clearer. 3 is a schematic diagram of a triangular microphone array provided according to an optional embodiment of the present invention. As shown in FIG. On the three vertices of an equilateral triangle, and make its positive direction point to different directions with an angle difference of 120° in turn, the pickup range of each cardioid directional microphone is a positive and negative deviation of 60° with the front as the axis, Each cardioid microphone picks up a total of 120°, and three cardioid microphones achieve 360° omnidirectional pickup. Those skilled in the art can understand that when the shape formed by a plurality of cardioid directional microphones is set to other regular polygons, the number of cardioid directional microphones can be increased and the parameters can be adjusted to change the sound pickup of each cardioid directional microphone. Angle range to avoid overlap of pickup range.

作为一种可选的实施例,检测声音信号被麦克风阵列采集到时对应的目标方位角,可以包括如下步骤:将声音信号转换为数字信号;将数字信号输入卷积神经网络CNN模型,得到数字信号对应的目标方位角,其中,CNN模型用于对声音方向进行定位。本实施例中,麦克风阵列采集到声音信号后,可以经过ADC模数转换器将模拟信号转换为多通道数字信号,作为CNN模型的输入。As an optional embodiment, detecting the corresponding target azimuth angle when the sound signal is collected by the microphone array may include the following steps: converting the sound signal into a digital signal; inputting the digital signal into the convolutional neural network CNN model to obtain a digital The target azimuth angle corresponding to the signal, where the CNN model is used to locate the direction of the sound. In this embodiment, after the sound signal is collected by the microphone array, the analog signal can be converted into a multi-channel digital signal through an ADC analog-to-digital converter, which can be used as the input of the CNN model.

现有常规的声源定位方法仅支持单声源,且在有噪声和混响时定位不准确。为解决这个问题,在本可选的实施例中采用了基于卷积神经网络CNN模型的多声源检测与定位算法,基于CNN模型进行多声源的检测和定位,可以支持两个及以上声源的定位,提升了存在噪声和混响的情况下的声源定位的性能。其中,CNN模型可以采用基于CNN的四层卷积的模型结构,进一步地,还可以根据产品特性或者应用场景,对CNN模型的层数、结构或者参数进行一定调整,例如可以增加卷积层数,采用五层卷积的模型结构。The existing conventional sound source localization method only supports a single sound source, and the localization is inaccurate when there is noise and reverberation. In order to solve this problem, in this optional embodiment, a multi-sound source detection and localization algorithm based on the convolutional neural network (CNN) model is adopted, and multi-sound source detection and localization are performed based on the CNN model, which can support two or more sound sources. Source localization improves the performance of sound source localization in the presence of noise and reverberation. Among them, the CNN model can adopt a model structure based on CNN's four-layer convolution. Further, according to product characteristics or application scenarios, certain adjustments can be made to the number of layers, structure or parameters of the CNN model, for example, the number of convolution layers can be increased. , using a five-layer convolutional model structure.

作为一种可选的实施例,将数字信号输入卷积神经网络CNN模型,得到数字信号对应的目标方位角,可以采用如下方式:采用CNN模型的特征提取模块,从数字信号中提取广义互相关性特征和滤波器组特征;采用CNN模型的定位模块,基于广义互相关性特征和滤波器组特征,确定数字信号对应的目标方位角。其中,广义互相关性特征即GeneralizedCross Correlation,简称GCC;滤波器组特征即filter bank,简称FB,反映了信号的幅频特性,即不同频带对应的幅值。需要说明的是,输入CNN模型的数字信号至少来自于麦克风阵列中的两个麦克风,即麦克风阵列中的至少两个麦克风分别采集到声音信号后,将声音信号转换为数字信号并输入CNN模型,CNN模型基于此提取数字信号中的广义互相关性特征以及滤波器组特征,并将特征输入定位模块,确定对应的目标方位角。可选地,定位模块可以采用多层卷积结构。As an optional embodiment, the digital signal is input into the CNN model of the convolutional neural network to obtain the target azimuth angle corresponding to the digital signal, and the following method can be adopted: the feature extraction module of the CNN model is used to extract the generalized cross-correlation from the digital signal characteristics and filter bank features; the positioning module of the CNN model is used to determine the target azimuth angle corresponding to the digital signal based on the generalized cross-correlation features and filter bank features. Among them, the generalized cross correlation feature is Generalized Cross Correlation, referred to as GCC; the filter bank feature is filter bank, referred to as FB, which reflects the amplitude-frequency characteristics of the signal, that is, the amplitude corresponding to different frequency bands. It should be noted that the digital signal input to the CNN model comes from at least two microphones in the microphone array, that is, after at least two microphones in the microphone array respectively collect the sound signal, the sound signal is converted into a digital signal and input to the CNN model, Based on this, the CNN model extracts the generalized cross-correlation features and filter bank features in the digital signal, and inputs the features into the positioning module to determine the corresponding target azimuth. Optionally, the localization module may adopt a multi-layer convolutional structure.

作为一种可选的实施例,采用CNN模型的定位模块基于广义互相关性特征和滤波器组特征,确定数字信号对应的目标方位角,可以采用CNN模型的定位模块,基于广义互相关性特征和滤波器组特征,确定麦克风阵列从任意一个方位角采集声音信号的概率值;采用定位模块,基于概率值,确定数字信号对应的目标方位角。As an optional embodiment, the positioning module of the CNN model is used to determine the target azimuth angle corresponding to the digital signal based on the generalized cross-correlation feature and the filter bank feature. and filter bank features to determine the probability value of the sound signal collected by the microphone array from any azimuth angle; the positioning module is used to determine the target azimuth angle corresponding to the digital signal based on the probability value.

图4是根据本发明可选实施方式提供的CNN模型的结构示意图,如图4所示,CNN模型首先从数字信号中提取GCC和FB,其中GCCFB的特征维度为51×40×6维;进一步地,将GCCFB依次输入五层卷积层,然后输出该数字信号的方位角度为0°至360°范围内的可能性,其中DOA表示采用最大似然法进行概率估计。Fig. 4 is a schematic structural diagram of the CNN model provided according to an optional embodiment of the present invention. As shown in Fig. 4, the CNN model first extracts GCC and FB from the digital signal, wherein the feature dimension of GCCFB is 51 * 40 * 6 dimensions; further Specifically, the GCCFB is sequentially input into five convolutional layers, and then outputs the possibility that the azimuth angle of the digital signal is in the range of 0° to 360°, where DOA means that the maximum likelihood method is used for probability estimation.

图5是根据本发明可选实施方式提供的终端设备的拾音流程示意图,具体的,终端设备的拾音流程可以包括如下步骤:FIG. 5 is a schematic diagram of a sound pickup process of a terminal device according to an optional embodiment of the present invention. Specifically, the sound pickup process of a terminal device may include the following steps:

S51,配置终端设备的预设角度区间,其中终端设备可以包括声音采集设备、处理器、扬声器。S51. Configure a preset angle interval of the terminal device, where the terminal device may include a sound collection device, a processor, and a speaker.

S52,声音采集设备中的包括多个麦克风的麦克风阵列采集到声音信号并输入到处理器中,处理器将声音信号处理为数字信号。S52. The microphone array including multiple microphones in the sound collection device collects the sound signal and inputs it to the processor, and the processor processes the sound signal into a digital signal.

S53,处理器的CNN模型提取数字信号的特征并输入卷积层,对声音信号的来源方位进行检测和定位。S53, the CNN model of the processor extracts the feature of the digital signal and inputs it into the convolution layer to detect and locate the source direction of the sound signal.

S54,判断声音信号对应的方位角是否在预设角度区间以内,若是,则前往步骤S55,若不在区间范围内,则前往步骤S56。S54, judging whether the azimuth angle corresponding to the sound signal is within the preset angle range, if yes, proceed to step S55, and if not within the range, proceed to step S56.

S55,对数字信号进行加强,得到目标声音信号。S55. Strengthen the digital signal to obtain the target sound signal.

S56,对数字信号进行抑制,得到噪声信号。S56, suppressing the digital signal to obtain a noise signal.

S57,输出处理后的音频信号至目标设备,其中,目标设备可以是终端设备中的扬声器,也可以是与终端设备连接的远端服务器,或者其他用于播放或者存储音频数据的远程终端设备。S57. Output the processed audio signal to the target device, where the target device may be a speaker in the terminal device, or a remote server connected to the terminal device, or other remote terminal devices for playing or storing audio data.

图6是根据本发明实施例提供的拾音方法二的流程示意图,如图6所示,该方法包括如下步骤:Fig. 6 is a schematic flow chart of the sound pickup method 2 provided according to an embodiment of the present invention. As shown in Fig. 6, the method includes the following steps:

步骤S602,接收声音信号,其中,所述声音信号由包括多个麦克风的麦克风阵列采集得到。Step S602, receiving a sound signal, wherein the sound signal is collected by a microphone array including a plurality of microphones.

步骤S604,检测所述声音信号被所述麦克风阵列采集到时对应的目标方位角。Step S604, detecting the corresponding target azimuth angle when the sound signal is collected by the microphone array.

步骤S606,在所述目标方位角位于预设角度区间内的情况下,对所述声音信号进行信号增强处理,得到目标声音信号。Step S606, if the target azimuth angle is within the preset angle range, perform signal enhancement processing on the sound signal to obtain the target sound signal.

步骤S608,输出所述目标声音信号至目标设备,其中,所述目标设备用于播放或者存储所述目标声音信号。Step S608, outputting the target sound signal to a target device, wherein the target device is used to play or store the target sound signal.

通过上述步骤,达到了增强来自预设角度区间内的声音的目的,从而实现了提高拾音设备的拾音效果的技术效果,进而解决了拾音设备无法准确区分需要增强的声音导致声音处理结果不理想的技术问题。Through the above steps, the purpose of enhancing the sound from the preset angle range is achieved, thereby achieving the technical effect of improving the sound pickup effect of the sound pickup device, and further solving the sound processing results caused by the sound pickup device being unable to accurately distinguish the sound that needs to be enhanced Not ideal technical issues.

根据本发明实施例,还提供了一种用于实施上述拾音方法一的拾音装置一,图7是根据本发明实施例提供的拾音装置一的结构框图,如图7所示,该拾音装置一70包括:采集模块72,第一检测模块74和第一增强模块76,下面对该拾音装置一进行说明。According to an embodiment of the present invention, a sound pickup device 1 for implementing the above sound pickup method 1 is also provided. FIG. 7 is a structural block diagram of a sound pickup device 1 provided according to an embodiment of the present invention. As shown in FIG. 7 , the The first sound pickup device 70 includes: an acquisition module 72 , a first detection module 74 and a first enhancement module 76 , and the first sound pickup device will be described below.

采集模块72,用于采用包括多个麦克风的麦克风阵列采集声音信号;Acquisition module 72, for adopting the microphone array that comprises a plurality of microphones to collect sound signal;

第一检测模块74,连接于上述采集模块72,用于检测声音信号被麦克风阵列采集到时对应的目标方位角;The first detection module 74 is connected to the above-mentioned acquisition module 72, and is used to detect the corresponding target azimuth angle when the sound signal is collected by the microphone array;

第一增强模块76,连接于上述第一检测模块74,用于在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号。The first enhancement module 76 is connected to the above-mentioned first detection module 74, and is used for performing signal enhancement processing on the sound signal to obtain the target sound signal when the target azimuth angle is within the preset angle interval.

此处需要说明的是,上述采集模块72,第一检测模块74和第一增强模块76对应于实施例中的步骤S202至步骤S206,三个模块与对应的步骤所实现的实例和应用场景相同,但不限于上述实施例所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例提供的计算机终端10中。It should be noted here that the acquisition module 72, the first detection module 74 and the first enhancement module 76 correspond to steps S202 to S206 in the embodiment, and the examples and application scenarios implemented by the three modules are the same as those of the corresponding steps , but not limited to the content disclosed in the above embodiments. It should be noted that, as a part of the device, the above modules can run in the computer terminal 10 provided in the embodiment.

根据本发明实施例,还提供了一种用于实施上述拾音方法二的拾音装置二,图8是根据本发明实施例提供的拾音装置二的结构框图,如图8所示,该拾音装置二80包括:接收模块82,第二检测模块84,第二增强模块86和输出模块88,下面对该拾音装置二80进行说明。According to an embodiment of the present invention, there is also provided a sound pickup device 2 for implementing the above sound pickup method 2. FIG. 8 is a structural block diagram of a sound pickup device 2 provided according to an embodiment of the present invention. As shown in FIG. 8 , the The second sound pickup device 80 includes: a receiving module 82 , a second detection module 84 , a second enhancement module 86 and an output module 88 , and the second sound pickup device 80 will be described below.

接收模块82,用于接收声音信号,其中,声音信号由包括多个麦克风的麦克风阵列采集得到;The receiving module 82 is configured to receive a sound signal, wherein the sound signal is collected by a microphone array comprising a plurality of microphones;

第二检测模块84,连接于上述接收模块82,用于检测声音信号被麦克风阵列采集到时对应的目标方位角;The second detection module 84 is connected to the above-mentioned receiving module 82, and is used to detect the corresponding target azimuth angle when the sound signal is collected by the microphone array;

第二增强模块86,连接于上述第二检测模块84,用于在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号;The second enhancement module 86 is connected to the above-mentioned second detection module 84, and is used to perform signal enhancement processing on the sound signal to obtain the target sound signal when the target azimuth angle is within the preset angle interval;

输出模块88,连接于上述第二增强模块86,用于输出目标声音信号至目标设备,其中,目标设备用于播放或者存储目标声音信号。The output module 88 is connected to the above-mentioned second enhancement module 86, and is used to output the target sound signal to the target device, wherein the target device is used to play or store the target sound signal.

此处需要说明的是,上述接收模块82,第二检测模块84,第二增强模块86和输出模块88对应于实施例中的步骤S602至步骤S608,多个模块与对应的步骤所实现的实例和应用场景相同,但不限于上述实施例所公开的内容。需要说明的是,上述模块作为装置的一部分可以运行在实施例提供的计算机终端10中。It should be noted here that the above receiving module 82, the second detection module 84, the second enhancement module 86 and the output module 88 correspond to steps S602 to S608 in the embodiment, and the examples realized by multiple modules and corresponding steps It is the same as the application scenario, but not limited to the content disclosed in the above embodiments. It should be noted that, as a part of the device, the above modules can run in the computer terminal 10 provided in the embodiment.

本发明的实施例可以提供一种终端设备,可选地,在本实施例中,上述终端设备可以位于计算机网络的多个网络设备中的至少一个网络设备。该终端设备包括存储器和处理器。An embodiment of the present invention may provide a terminal device. Optionally, in this embodiment, the terminal device may be located in at least one network device among multiple network devices in a computer network. The terminal device includes a memory and a processor.

其中,存储器可用于存储软件程序以及模块,如本发明实施例中的拾音方法和装置对应的程序指令/模块,处理器通过运行存储在存储器内的软件程序以及模块,从而执行各种功能应用以及数据处理,即实现上述的拾音方法。存储器可包括高速随机存储器,还可以包括非易失性存储器,如一个或者多个磁性存储装置、闪存、或者其他非易失性固态存储器。在一些实例中,存储器可进一步包括相对于处理器远程设置的存储器,这些远程存储器可以通过网络连接至计算机终端。上述网络的实例包括但不限于互联网、企业内部网、局域网、移动通信网及其组合。Wherein, the memory can be used to store software programs and modules, such as the program instructions/modules corresponding to the sound pickup method and device in the embodiment of the present invention, and the processor executes various functional applications by running the software programs and modules stored in the memory. And data processing, promptly realize above-mentioned pickup method. The memory may include high-speed random access memory, and may also include non-volatile memory, such as one or more magnetic storage devices, flash memory, or other non-volatile solid-state memory. In some instances, the memory may further include a memory located remotely from the processor, and these remote memories may be connected to the computer terminal through a network. Examples of the aforementioned networks include, but are not limited to, the Internet, intranets, local area networks, mobile communication networks, and combinations thereof.

处理器可以通过传输装置调用存储器存储的信息及应用程序,以执行下述步骤:采用包括多个麦克风的麦克风阵列采集声音信号;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号。The processor can call the information stored in the memory and the application program through the transmission device to perform the following steps: adopt a microphone array including a plurality of microphones to collect sound signals; detect the corresponding target azimuth angle when the sound signal is collected by the microphone array; When the azimuth is within the preset angle range, signal enhancement processing is performed on the sound signal to obtain the target sound signal.

处理器还可以通过传输装置调用存储器存储的信息及应用程序,以执行下述步骤:接收声音信号,其中,声音信号由包括多个麦克风的麦克风阵列采集得到;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号;输出目标声音信号至目标设备,其中,目标设备用于播放或者存储目标声音信号。The processor can also call the information stored in the memory and the application program through the transmission device to perform the following steps: receive the sound signal, wherein the sound signal is collected by a microphone array including a plurality of microphones; detect when the sound signal is collected by the microphone array Corresponding target azimuth; when the target azimuth is within the preset angle range, perform signal enhancement processing on the sound signal to obtain the target sound signal; output the target sound signal to the target device, wherein the target device is used for playback or storage Target sound signal.

本领域普通技术人员可以理解上述实施例的各种方法中的全部或部分步骤是可以通过程序来指令终端设备相关的硬件来完成,该程序可以存储于一非易失性存储介质中,存储介质可以包括:闪存盘、只读存储器(Read-Only Memory,ROM)、随机存取器(RandomAccess Memory,RAM)、磁盘或光盘等。Those of ordinary skill in the art can understand that all or part of the steps in the various methods of the above embodiments can be completed by instructing the hardware related to the terminal device through a program, and the program can be stored in a non-volatile storage medium, the storage medium It may include: a flash disk, a read-only memory (Read-Only Memory, ROM), a random access device (Random Access Memory, RAM), a magnetic disk or an optical disk, and the like.

本发明的实施例还提供了一种非易失性存储介质。可选地,在本实施例中,上述非易失性存储介质可以用于保存上述实施例所提供的拾音方法所执行的程序代码。The embodiment of the present invention also provides a non-volatile storage medium. Optionally, in this embodiment, the above-mentioned non-volatile storage medium may be used to store program codes executed by the sound pickup method provided in the above-mentioned embodiment.

可选地,在本实施例中,上述非易失性存储介质可以位于计算机网络中计算机终端群中的任意一个计算机终端中,或者位于移动终端群中的任意一个移动终端中。Optionally, in this embodiment, the above-mentioned non-volatile storage medium may be located in any computer terminal in the group of computer terminals in the computer network, or in any mobile terminal in the group of mobile terminals.

可选地,在本实施例中,非易失性存储介质被设置为存储用于执行以下步骤的程序代码:采用包括多个麦克风的麦克风阵列采集声音信号;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号。Optionally, in this embodiment, the non-volatile storage medium is configured to store program codes for performing the following steps: collecting sound signals by using a microphone array including a plurality of microphones; detecting when the sound signal is collected by the microphone array Corresponding target azimuth; when the target azimuth is within the preset angle range, perform signal enhancement processing on the sound signal to obtain the target sound signal.

可选地,在本实施例中,非易失性存储介质被设置为存储用于执行以下步骤的程序代码:接收声音信号,其中,声音信号由包括多个麦克风的麦克风阵列采集得到;检测声音信号被麦克风阵列采集到时对应的目标方位角;在目标方位角位于预设角度区间内的情况下,对声音信号进行信号增强处理,得到目标声音信号;输出目标声音信号至目标设备,其中,目标设备用于播放或者存储目标声音信号。Optionally, in this embodiment, the non-volatile storage medium is configured to store program codes for performing the following steps: receiving a sound signal, wherein the sound signal is collected by a microphone array including a plurality of microphones; detecting the sound The corresponding target azimuth angle when the signal is collected by the microphone array; when the target azimuth angle is within the preset angle range, perform signal enhancement processing on the sound signal to obtain the target sound signal; output the target sound signal to the target device, wherein, The target device is used to play or store the target sound signal.

上述本发明实施例序号仅仅为了描述,不代表实施例的优劣。The serial numbers of the above embodiments of the present invention are for description only, and do not represent the advantages and disadvantages of the embodiments.

在本发明的上述实施例中,对各个实施例的描述都各有侧重,某个实施例中没有详述的部分,可以参见其他实施例的相关描述。In the above-mentioned embodiments of the present invention, the descriptions of each embodiment have their own emphases, and for parts not described in detail in a certain embodiment, reference may be made to relevant descriptions of other embodiments.

在本申请所提供的几个实施例中,应该理解到,所揭露的技术内容,可通过其它的方式实现。其中,以上所描述的装置实施例仅仅是示意性的,例如单元的划分,可以为一种逻辑功能划分,实际实现时可以有另外的划分方式,例如多个单元或组件可以结合或者可以集成到另一个系统,或一些特征可以忽略,或不执行。另一点,所显示或讨论的相互之间的耦合或直接耦合或通信连接可以是通过一些接口,单元或模块的间接耦合或通信连接,可以是电性或其它的形式。In the several embodiments provided in this application, it should be understood that the disclosed technical content can be realized in other ways. Wherein, the device embodiments described above are only illustrative. For example, the division of units can be a logical function division. In actual implementation, there may be other division methods. For example, multiple units or components can be combined or integrated into Another system, or some features may be ignored, or not implemented. In another point, the mutual coupling or direct coupling or communication connection shown or discussed may be through some interfaces, and the indirect coupling or communication connection of units or modules may be in electrical or other forms.

作为分离部件说明的单元可以是或者也可以不是物理上分开的,作为单元显示的部件可以是或者也可以不是物理单元,即可以位于一个地方,或者也可以分布到多个单元上。可以根据实际的需要选择其中的部分或者全部单元来实现本实施例方案的目的。A unit described as a separate component may or may not be physically separated, and a component shown as a unit may or may not be a physical unit, that is, it may be located in one place, or may be distributed over multiple units. Part or all of the units can be selected according to actual needs to achieve the purpose of the solution of this embodiment.

另外,在本发明各个实施例中的各功能单元可以集成在一个处理单元中,也可以是各个单元单独物理存在,也可以两个或两个以上单元集成在一个单元中。上述集成的单元既可以采用硬件的形式实现,也可以采用软件功能单元的形式实现。In addition, each functional unit in each embodiment of the present invention may be integrated into one processing unit, each unit may exist separately physically, or two or more units may be integrated into one unit. The above-mentioned integrated units can be implemented in the form of hardware or in the form of software functional units.

集成的单元如果以软件功能单元的形式实现并作为独立的产品销售或使用时,可以存储在一个非易失性取存储介质中。基于这样的理解,本发明的技术方案本质上或者说对现有技术做出贡献的部分或者该技术方案的全部或部分可以以软件产品的形式体现出来,该计算机软件产品存储在一个存储介质中,包括若干指令用以使得一台计算机设备(可为个人计算机、服务器或者网络设备等)执行本发明各个实施例所述方法的全部或部分步骤。而前述的存储介质包括:U盘、只读存储器(ROM,Read-Only Memory)、随机存取存储器(RAM,Random Access Memory)、移动硬盘、磁碟或者光盘等各种可以存储程序代码的介质。If an integrated unit is realized in the form of a software function unit and sold or used as an independent product, it can be stored in a non-volatile storage medium. Based on such an understanding, the essence of the technical solution of the present invention or the part that contributes to the prior art or all or part of the technical solution can be embodied in the form of a software product, and the computer software product is stored in a storage medium , including several instructions to make a computer device (which may be a personal computer, a server, or a network device, etc.) execute all or part of the steps of the methods described in various embodiments of the present invention. The aforementioned storage media include: U disk, read-only memory (ROM, Read-Only Memory), random access memory (RAM, Random Access Memory), mobile hard disk, magnetic disk or optical disk and other media that can store program codes. .

以上所述仅是本发明的优选实施方式,应当指出,对于本技术领域的普通技术人员来说,在不脱离本发明原理的前提下,还可以做出若干改进和润饰,这些改进和润饰也应视为本发明的保护范围。The above is only a preferred embodiment of the present invention, it should be pointed out that, for those of ordinary skill in the art, without departing from the principle of the present invention, some improvements and modifications can also be made, and these improvements and modifications can also be made. It should be regarded as the protection scope of the present invention.

Claims (13)

1. A sound pickup method, comprising:
collecting sound signals using a microphone array comprising a plurality of microphones;
detecting a target azimuth angle corresponding to the sound signal acquired by the microphone array;
and under the condition that the target azimuth angle is in a preset angle interval, performing signal enhancement processing on the sound signal to obtain a target sound signal.
2. The method of claim 1, wherein detecting a target azimuth angle corresponding to when the sound signal is acquired by the microphone array comprises:
converting the sound signal into a digital signal;
and inputting the digital signal into a convolutional neural network CNN model to obtain a target azimuth angle corresponding to the digital signal, wherein the CNN model is used for positioning the sound direction.
3. The method of claim 2, wherein inputting the digital signal into a convolutional neural network CNN model to obtain a target azimuth corresponding to the digital signal comprises:
extracting generalized cross-correlation features and filter bank features from the digital signals by adopting a feature extraction module of the CNN model;
and determining a target azimuth angle corresponding to the digital signal based on the generalized cross-correlation characteristic and the filter bank characteristic by adopting a positioning module of the CNN model.
4. The method of claim 3, wherein the determining, by the positioning module employing the CNN model, a target azimuth corresponding to the digital signal based on the generalized cross-correlation feature and the filter bank feature comprises:
determining a probability value of the microphone array for acquiring the sound signal from any azimuth angle based on the generalized cross-correlation feature and the filter bank feature by adopting the positioning module of the CNN model;
and determining a target azimuth angle corresponding to the digital signal based on the probability value by adopting the positioning module.
5. The method of claim 1, wherein the plurality of microphones included in the microphone array are a plurality of directional microphones, and wherein the plurality of directional microphones are sequentially positioned on a plurality of vertices of a regular polygon.
6. The method according to claim 1, wherein the method further comprises:
and outputting the target sound signal to target equipment, wherein the target equipment is used for playing or storing the target sound signal.
7. The method of claim 6, wherein the method further comprises:
under the condition that the target azimuth angle is outside the preset angle interval, the sound signal is restrained to obtain a noise signal;
and outputting the noise signal to the target equipment.
8. The method of claim 1, wherein the step of determining the position of the substrate comprises,
before collecting sound signals with a microphone array comprising a plurality of microphones, the method further comprises:
displaying first prompt information, wherein the first prompt information is used for prompting the preset angle interval;
in the case that the target azimuth is within the preset angle interval, the method further includes: displaying second prompt information, wherein the second prompt information is used for prompting that the target azimuth corresponding to the sound signal is located in the preset angle interval;
in the case that the target azimuth angle is outside the preset angle interval, the method further includes: and displaying third prompt information, wherein the third prompt information is used for prompting that the target azimuth corresponding to the sound signal is located outside the preset angle interval.
9. A sound pickup method, comprising:
receiving a sound signal, wherein the sound signal is acquired by a microphone array comprising a plurality of microphones;
detecting a target azimuth angle corresponding to the sound signal acquired by the microphone array;
under the condition that the target azimuth angle is in a preset angle interval, carrying out signal enhancement processing on the sound signal to obtain a target sound signal;
and outputting the target sound signal to target equipment, wherein the target equipment is used for playing or storing the target sound signal.
10. A sound pickup apparatus, comprising:
the system comprises an acquisition module, a sound signal acquisition module and a sound signal acquisition module, wherein the acquisition module is used for acquiring sound signals by adopting a microphone array comprising a plurality of microphones;
the first detection module is used for detecting a target azimuth angle corresponding to the sound signal acquired by the microphone array;
the first enhancement module is used for carrying out signal enhancement processing on the sound signal under the condition that the target azimuth angle is in a preset angle interval to obtain a target sound signal.
11. A sound pickup apparatus, comprising:
the receiving module is used for receiving sound signals, wherein the sound signals are acquired by a microphone array comprising a plurality of microphones;
the second detection module is used for detecting a target azimuth angle corresponding to the sound signal acquired by the microphone array;
the second enhancement module is used for carrying out signal enhancement processing on the sound signal to obtain a target sound signal under the condition that the target azimuth angle is in a preset angle interval;
and the output module is used for outputting the target sound signal to target equipment, wherein the target equipment is used for playing or storing the target sound signal.
12. A non-volatile storage medium, characterized in that the non-volatile storage medium comprises a stored program, wherein the device in which the non-volatile storage medium is located is controlled to perform the pick-up method according to any one of claims 1 to 9 when the program is run.
13. A terminal device, characterized in that the terminal device comprises a processor for running a program, wherein the program is run to perform the sound pick-up method according to any one of claims 1 to 9.
CN202210182711.XA 2022-02-25 2022-02-25 Sound pickup method, device, non-volatile storage medium and terminal equipment Pending CN116705046A (en)

Priority Applications (1)

Application Number Priority Date Filing Date Title
CN202210182711.XA CN116705046A (en) 2022-02-25 2022-02-25 Sound pickup method, device, non-volatile storage medium and terminal equipment

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
CN202210182711.XA CN116705046A (en) 2022-02-25 2022-02-25 Sound pickup method, device, non-volatile storage medium and terminal equipment

Publications (1)

Publication Number Publication Date
CN116705046A true CN116705046A (en) 2023-09-05

Family

ID=87843805

Family Applications (1)

Application Number Title Priority Date Filing Date
CN202210182711.XA Pending CN116705046A (en) 2022-02-25 2022-02-25 Sound pickup method, device, non-volatile storage medium and terminal equipment

Country Status (1)

Country Link
CN (1) CN116705046A (en)

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117768816A (en) * 2023-11-15 2024-03-26 兴科迪科技(泰州)有限公司 Method and device for realizing sound collection based on small-size PCBA

Cited By (1)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
CN117768816A (en) * 2023-11-15 2024-03-26 兴科迪科技(泰州)有限公司 Method and device for realizing sound collection based on small-size PCBA

Similar Documents

Publication Publication Date Title
US12112094B2 (en) Devices with enhanced audio
US11302341B2 (en) Microphone array based pickup method and system
US20170330563A1 (en) Processing Speech from Distributed Microphones
EP2974253B1 (en) Normalization of soundfield orientations based on auditory scene analysis
US11019306B2 (en) Combining installed audio-visual sensors with ad-hoc mobile audio-visual sensors for smart meeting rooms
US20100123785A1 (en) Graphic Control for Directional Audio Input
CN113203988B (en) Sound source positioning method and device
CN107360387A (en) Video recording method and device and terminal equipment
US20140241702A1 (en) Dynamic audio perspective change during video playback
US20210092514A1 (en) Methods and systems for recording mixed audio signal and reproducing directional audio
JP2017028608A (en) Video conference terminal equipment
CN107333093A (en) A kind of sound processing method, device, terminal and computer-readable recording medium
CN104185116A (en) Automatic acoustic radiation mode determining method
CN110035372A (en) Output control method and device of sound amplification system, sound amplification system and computer equipment
WO2020234015A1 (en) An apparatus and associated methods for capture of spatial audio
CN116705046A (en) Sound pickup method, device, non-volatile storage medium and terminal equipment
CN113542466A (en) Audio processing method, electronic device and storage medium
CN110351629A (en) A kind of reception method, audio signal reception device and terminal
CN112735455A (en) Method and device for processing sound information
WO2022161446A1 (en) Control method and apparatus, and electronic device
CN115484431A (en) Video processing method, device, equipment and storage medium for video conference
WO2021004067A1 (en) Display device
CN114765031A (en) Radio reception device, radio reception method, terminal and computer readable storage medium
US12028178B2 (en) Conferencing session facilitation systems and methods using virtual assistant systems and artificial intelligence algorithms
US20240381045A1 (en) Multi-device localization

Legal Events

Date Code Title Description
PB01 Publication
PB01 Publication
SE01 Entry into force of request for substantive examination
SE01 Entry into force of request for substantive examination