CN1131473A - 在速率可变的声码器中选择编码速率的方法和装置 - Google Patents
在速率可变的声码器中选择编码速率的方法和装置 Download PDFInfo
- Publication number
- CN1131473A CN1131473A CN95190717A CN95190717A CN1131473A CN 1131473 A CN1131473 A CN 1131473A CN 95190717 A CN95190717 A CN 95190717A CN 95190717 A CN95190717 A CN 95190717A CN 1131473 A CN1131473 A CN 1131473A
- Authority
- CN
- China
- Prior art keywords
- value
- rate
- subband energy
- code rate
- energy values
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Granted
Links
- 238000000034 method Methods 0.000 title claims abstract description 31
- 206010038743 Restlessness Diseases 0.000 claims description 33
- 238000005311 autocorrelation function Methods 0.000 claims description 11
- 238000004364 calculation method Methods 0.000 claims 11
- 230000005236 sound signal Effects 0.000 description 10
- 206010019133 Hangover Diseases 0.000 description 6
- 241000282344 Mellivora capensis Species 0.000 description 5
- 230000006835 compression Effects 0.000 description 5
- 238000007906 compression Methods 0.000 description 5
- 238000001514 detection method Methods 0.000 description 5
- 238000001914 filtration Methods 0.000 description 5
- 238000004891 communication Methods 0.000 description 3
- 238000005516 engineering process Methods 0.000 description 3
- 230000000694 effects Effects 0.000 description 2
- 230000000737 periodic effect Effects 0.000 description 2
- 230000000052 comparative effect Effects 0.000 description 1
- 239000004744 fabric Substances 0.000 description 1
- 238000007689 inspection Methods 0.000 description 1
Images
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
- G10L19/0208—Subband vocoders
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/02—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders
- G10L19/0204—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using spectral analysis, e.g. transform vocoders or subband vocoders using subband decomposition
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/22—Mode decision, i.e. based on audio signal content versus external parameters
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/16—Vocoder architecture
- G10L19/18—Vocoders using multiple modes
- G10L19/24—Variable rate codecs, e.g. for generating different qualities using a scalable representation such as hierarchical encoding or layered encoding
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Speech or voice signal processing techniques to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
- G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
- G10L19/08—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters
- G10L19/10—Determination or coding of the excitation function; Determination or coding of the long-term prediction parameters the excitation function being a multipulse excitation
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Signal Processing (AREA)
- Quality & Reliability (AREA)
- Spectroscopy & Molecular Physics (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
- Compression Or Coding Systems Of Tv Signals (AREA)
- Two-Way Televisions, Distribution Of Moving Picture Or The Like (AREA)
- Dc Digital Transmission (AREA)
Abstract
本发明提供一种降低把低能量非嗓音话音作为背景噪声进行编码的概率的方法。用数字副带滤波器(4)和(6)把输入信号分成副带,在副带速率判定部件(12)和(14)中把这些副带中的能量与一组阈值比较,然后在编码速率选择器(16)内检查这些比较结果,通过这些步骤来确定编码速率。用这种方法,可以把非嗓音话音与背景噪声区别开。本发明还提供一种用输入信号的信噪比设置阈值电平的装置,本发明还提供一种用速率可变的声码器对音乐进行编码的方法,它通过检查输入信号的周期性以把音乐与背景噪声区别开来。
Description
本发明涉及一种声码器。本发明尤其涉及在速率可变的声码器中确定话音编码速率的新颖的和经改进的方法。
速率可变话音压缩系统一般在开始进行编码之前使用一些速率确定算法。这种速率确定算法把较高的比特率编码方法赋予了有话音出现的音频信号段,把较低的比特率编码方法赋予无声段。在这种方法中,可以实现较低的平均比特率,而重新构成的话音仍保持较高质量。因此,为了有效地进行工作,速率可变的话音声码器需要一种健全的速率确定算法,以能在各种背景噪声环境中区别话音和无声。
在1991年6月11日申请的,名称为“速率可变的声码器”的待批美国专利申请No.07/713/661中揭示了这样一种速率可变的话音压缩系统或速率可变的声码器,该专利申请已转让给本发明的受让人,援引在此,以作参考。在这种速率可变的声码器的特定的实现方法中,用码激励线性预测编码技术(CELP)以根据话音活动性的程度确定的几种速率中的一种速率对输入话音进行编码。话音的活动性程度根据除了有声话音之外还可以包含背景噪声的输入音频样值内的能量来确定。为了使声码器在各种背景噪声下都提供高质量的声音编码,需要一种合适的调整阈值的技术来补偿背景噪声对速率判定算法的影响。
声码器一般用在诸如蜂窝电话等通信设备或个人通信设备中,以对转换成数字形式进行传输的模拟音频信号进行数字信号压缩。在可以使用蜂窝电话或个人通信设备的移动的环境中,高的背景噪声能量使得用基于信号能量的速率确定算法难以把低能量的非嗓音声音从低背景噪声中区分开来。因此,经常对非嗓音声音以较低的比特率进行编码,声音质量下降,诸如“s”、“x”、“ch”、“sh”、“t”等辅音在重新构成的话音中被丢失。
根据仅把背景噪声能量作为依据的速率判定的声码器在设定阈值时没有考虑信号相对于背景噪声的强度。当背景噪声提高时,根据仅把背景噪声作为依据的声码器必然会一起压缩阈值。如果信号电平仍然维持不变,但设置阈值电平的校正方法是把信号电平与背景噪声电平一起提升,那么,压缩阈值电平不是最佳的解决方法。在速率可变的声码器内需要另一种考虑了信号强度的设置阈值电平的方法。
剩余的决定性的问题是在通过基于背景噪声能量的速率判定声码器来播放音乐时产生的。当人在说话时,他们必须暂停以便呼吸,这可以把阈值重新设置到适当的背景噪声电平上。然而,在通过声码器传输时,在音乐持续的情况下,没有暂停发生,并且阈值将持续提高,一直到开始对音乐以小于全速率的速率进行编码。在这种情况下,速率可变的编码器把音乐与背景噪声混为一谈。
本发明是一种新颖的和经改进的在速率可变的声码器内确定编码速率的方法和装置。本发明的第一个目的是提供一种方法,用这种方法可降低把低能量的非嗓音话音作为背景噪声进行编码的概率。在本发明中,把输入信号滤波成高频分量和低频分量。然后单独地对输入信号的滤波信号进行分析,以检测是否有话音的存在。因为非嗓音话音有高频分量,所以相对于高频带来说其强度与背景噪声相比的区别比在整个频带上与背景噪声相比的区别来得更大。
本发明的第二个目的是提供一种装置,这种装置在设置阈值时考虑了信号能量以及背景噪声能量。在本发明中,根据输入信号的信噪比(SNR)的估计值来设定声音检测阈值。在一个典型的实施例中,把在存在话音期间的信号能量估计为最大信号能量,把在无声期间的背景噪声能量估计为最小信号能量。
本发明的第三个目的是提供一种通过速率可变的声码器对音乐进行编码的方法。在一个典型的实施例中,速率选择装置检测阈值电平上升的连续帧的数量,并检查帧数的周期。如果输入信号是有周期性的,这表示存在音乐。如果检测到有音乐存在,那么把阈值设置到以全速率对信号进行编码的电平上。
通过下面结合附图的详细描述,本发明的特征、目的和优点将变得更明显,在整个描述中相同的参考字符表示相同的部件。
图1是本发明的方框图。
参见图1,把输入信号S(n)提供给副带能量计算部件4和副带能量计算部件6。输入信号S(n)包含音频信号和背景噪声。音频信号一般为话音,但也可以是音乐。在一个典型的实施例中,以每二十毫秒帧160样值的形式提供S(n)。在一个典型的实施例中,输入信号S(n)的频率分量从0kHz到4kHz,大约与人的话音信号的带宽相似。
在一个典型的实施例中,把4kHz的输入信号S(n)滤波成两个分立的副带。这个分立的副带分别在0到2kHz和2kHz到4kHz之间。在一个典型的实施例中,可以用副带滤波器把输入信号分成副带,这种设计在已有技术中属于熟知的技术,并且在1994年2月1日提交的,名称为“频率选择自适应滤波”的美国专利申请No.08/189,819中有详细的描述,该申请已转让给本发明的受让人,援引在此以作参考。
对于低通滤波器,副带滤波器的脉冲响应表示为hL(n),对于高通滤波器,副带滤波器的脉冲响应表示为hH(n)。可以如现有技术中所熟知的那样,简单地取副带滤波器输出的样值平方之和计算得到的信号的所产生的副带分量的能量,给出RL(0)和RH(0)值。
在一个较佳实施例中,当把输入信号S(n)提供给副带能量计算部件4时,如下计算输入帧的低频分量的能量值RL(0):
其它在副带能量计算部件6内用相似的方式计算高频能量RH(0)。
可以在减小计算负荷之前计算副带滤波器的自相关函数的值。另外,把计算得到的一些RS(i)值在对输入信号S(n)进行编码时的另一些计算中使用,这进一步减轻了本发明的编码速率选择的方法的纯计算负荷。例如,运算LPC滤波器抽头值需要计算一组输入信号自相关系数。
对LPC滤波器抽头值的计算在现有技术中是众所周知的,并且在上面提到美国专利申请08/004,484中有详细的描述。如果一种是用需要十个抽头的LPC滤波器对话音进行编码,除了在对信号进行编码所用的之外,仅需要计算i值从11到L-1的RS(i)值,因为,i值从0到10的RS(i)在计算LPC滤波器抽头值时已经使用了。在一个典型的实施例中,副带滤波器具有17个抽头,L=17。
副带能量计算部件4向副带速率判定部件12提供计算得到的RL(0)值,副带能量计算部件6向副带速率判定部件14提供计算得到的RH(0)值。速率判定部件12把RL(0)值与两个预定的阈值TL1/2和TLfull作比较,把根据比较结果选定建议的编码速率RATEL。速率的选定方式如下:RATEL=八分之一速率 RL(0)≤TL1/2 (4)RATEL=半速率 TL1/2<RL(0)≤TLfull (5)RATEL=全速率 RL(0)>TLfull (6)副带速率判定部件14以相似的方式工作,并根据高频能量值RH(0)和一组不同的阈值TH1/2和THfull来选择一建议的编码速率。副带速率判定部件12把其建议的编码速率RATEL提供给编码速率选择部件16,副带速率判定部件14把其建议的编码速率RATEH提供给编码速率选择部件16。在一个典型的实施例中,编码速率选择部件16选择两个建议的速率中较高的一个速率,并把较高的速率作为选出的编码速率(ENCODING RATE)提供。
副带能量计算部件4还把低频能量值RL(0)提供给阈值修正部件8,计算下一输入帧的阈值TL1/2和TLfull。相似地,副带能量计算部件6把高频能量值RH(0)提供给阈值修正部件10,计算下一输入帧的阈值TL1/2和Tlfull。
阈值修正部件8接收低频能量值RL(0),并确定S(n)是否含有背景噪声或音频信号。在一个典型的实现方法中,阈值修正部件8确定是否有音频信号存在的方法是检查归一化自相关函数NACF,它由下式给出: 其中,e(n)为话音质量的特性分量残留信号,它由LPC滤波器滤波输入信号S(n)引起。
由LPC滤波器对信号滤波的设计在现有技术中是众所周知的,并且在上面提及的美国专利申请08/004,484中有详细的描述。LPC滤波器对输入信号S(n)进行滤波,除去话音质量特性分量的相互影响。把NACF与阈值比较,确定是否出现了音频信号。如果NACF大于预定的阈值,它指示输入帧具有表示诸如话音或音乐的音频信号存在的周期性特征。请注意,当一部分话音和音乐不是周期性时,表现出NACF的值较小,背景噪声一般决不会显示出周期性,因此NACF几乎总是表现出较小的值。
如果确定S(n)包含背景噪声,NACF值小于阈值TH1,那末把值RL(0)用于更新当前背景噪声估计值BGNL的值。在一个典型的实施例中,TH1为0.35。把RL(0)与当前的背景噪声估计值BGNL比较。如果RL(0)小于BGNL,那末不管NACF的值如何,总把背景噪声估计值BGNL设置成等于RL(0)值。
背景噪声估计值只有在NACF小于阈值TH1时才增加。如果RL(0)大于BGNL,并且NACF小于TH1,那么把背景噪声能量BGNL设置成α1*BGNL,其中,α1为大于1的数字。在一个典型的实施例中,α1等于1.03。只要NACF小于阈值TH1,并且RL(0)大于BGNL的当前值,那末BGNL就继续增加,直到BGNL到达预定的最大值BGNmax,在该点上,背景估计值BGNL被设置到BGN-max。
如果NACF值超过第二预定值TH2表示检测到音频信号,则更新信号能量估计值SL。在一个典型的实施例中,TH2被设置成0.5。把RL(0)的值与当前低通信号能量估计值SL比较。如果RL(0)大于当前SL值,则把SL设置成等于RL(0)。如果RL(0)小于当前SL值,而且仅在NACF大于TH2时,把SL设置成等于α2*SL。在一个典型的实施例中,α2被设置为0.96。
然后,阈值修正部件8根据下面的等式8计算信噪比估计值:
=0,对SNRL≤20,
=7,对SNRL≥55。 (10)其中nint是把小数值四舍五入到最近的整数的函数。
然后阈值修正部件8根据信噪比指数ISNRL选择或计算两个换算系数kL1/2/和kLfull。下面的表1提供了一个典型的换算值查找表: 表1
ISNRL KL1/2 Klfull
0 7.0 9.0
1 7.0 12.6
2 8.0 17.0
3 8.6 18.5
4 8.9 19.4
5 9.4 20.9
6 11.0 25.5
7 15.8 39.8这两个值用于根据下面式子计算选择速率的阈值:
TL1/2=KL1/2*BGNL (11)和
TLfull=KLfull*BGNL (12)其中,TL1/2为低频半速率阈值,TLfull为低频全速率阈值。
阈值修正部件8向速率判定部件12提供修正后的阈值TL1/2和TLfull。阈值修正部件10以相似的方式工作,并向副带速率判定部件14提供阈值TH1/2和THfull。
音频信号能量估计值S的初始值(S可以是SL或SH)如下进行设置。把初始信号能量估计值SINIT设置到-18.0dBm0,其中3.17dBm0表示全正弦波的信号强度,在一个典型的实施例中,它是一个幅度范围从-8031到8031的数字正弦波。SINIT一直被使用,直到确定出现了有声信号。
开始检测有声信号的方法是把NACF值与一阈值比较,当NACF在预定的连续数帧超过该阈值时,则确定出现了有声信号。在一个典型的实施例中,NACF必须连续10帧超过阈值。在这个条件得到满足后,在前10帧把信号能量估计值S设置到最大信号能量。
最初把背景噪声估计值BGNL的初始值设置成BGNmax。只要接收到的副带帧能量小于BGNmax,就把背景噪声估计值复位到接收到的副带能量电平值上,并如上所述产生背景噪声BGNL估计值。
在一个较佳实施例中,当跟了一串全速率话音帧时产生释放延迟情况,则检测低速率帧。在一个典型的实施例中,当在对四个连续的话音帧以全速率进行编码后跟一幅把编码速率设置到小于全速率的速率,并且计算得到的信噪比小于预定最小的SNR的帧时,把该帧的编码速率设置到全速率。在一个典型的实施例中,如在公式8中定义的那样,预定最小SNR为27.5dB。
在一较佳实施例中,释放延迟的帧数是信噪比的函数。在一个典型的实施例中,释放延迟的帧数如下确定:
释放延迟帧数=1 22.5<SNR<27.5 (13)
释放延迟帧数=2 SNR≤22.5 (14)
释放延迟帧数=0 SNR≥27.5 (15)
本发明还提供一种检测是否有音乐存在的方法,如上所述音乐缺少可以测量背景噪声以进行复位的暂停。该检测音乐是否存在的方法假设在通话开始时没有出现音乐。这可以使本发明的编码速率选择装置适当地估计初始背景噪声能量BGNinit。因为音乐不象背景噪声具有周期性的特征,本发明检查NACF的值来区别音乐和背景噪声。本发明的音乐检测方法根据下式计算平均NACF:
如果背景噪声BGN对预定的帧数T已经增加,并且NAC-FAVE超过了预定阈值,那么检测到了音乐,把背景噪声BGN复位到BGNinit。应注意,为了使该方法可行,必须把值T设置得足够小,以使编码速率不低于全速率。因此,T值应当设置成有声信号和BGNinit的函数。
提供了上面对较佳实施例的描述能使本技术领域的熟练人员实现或使用本发明。对于本技术领域的熟练人员来说对这些实施例的各种变化是容易的,此处限定的一般原理可以应用于其它实施例而无需创造性技能。因此,本发明并不限于此处所示的实施例,它被赋予与由此处的原理和新颖的特征相一致的最宽的范围。
Claims (30)
1.一种为速率可变声码器确定编码速率的装置,其特征在于,包含:
副带能量计算装置,用于接收输入信号,根据预定的副带能量计算公式确定多个副带能量值;
速率确定装置,用于接收所述多个副带能量值,根据所述多个副带能量值确定所述编码速率。
3.如权利要求1所述的装置,其特征在于,进一步包含设置在所述副带能量计算装置和所述速率确定装置之间的阈值计算装置,用于接收所述副带能量值,根据多个副带能量值确定一组编码速率阈值。
4.如权利要求3所述的装置,其特征在于,所述阈值计算装置根据所述多个副带能量值确定信噪比。
5.如权利要求4所述的装置,其特征在于,所述阈值计算装置根据所述信噪比确定换算值。
6.如权利要求5所述的装置,其特征在于,阈值计算装置通过把背景噪声估计值与所述换算值相乘来确定至少一个阈值。
7.如权利要求1所述的装置,其特征在于,所述速率确定装置把所述多个副带能量值中的至少一个与至少一个的阈值比较以确定所述编码速率。
8.如权利要求6所述的装置,其特征在于,所述速率确定装置把所述多个副带能量值中的至少一个与所述至少一个阈值比较以确定所述编码速率。
9.如权利要求1所述的装置,其特征在于,所述速率确定装置确定多个建议的编码速率,每个建议的编码速率对应于所述多个副带能量值中的每一个值,所述速率确定装置根据所述多个建议的编码速率确定所述编码速率。
10.一种确定速率可变的声码器的编码速率的装置,其特征在于,包含:
信噪比装置,用于接收输入信号,根据所述输入信号确定信噪比值;
速率确定装置,接收所述信噪比值,根据所述信噪比值确定所述编码速率。
11.一种确定速率可变的声码器的编码速率的装置,其特征在于,包含:
副带能量计算器,它接收输入信号,并根据预定的副带能量计算公式确定多个副带能量值;
速率选择器,它接收所述多个副带能量值,并根据所述多个副带能量值选择所述编码速率。
13.如权利要求11所述的装置,其特征在于,进一步包含设置在所述副带能量计算器和所述速率选择器之间的阈值计算器,接收所述副带能量值,并根据多个副带能量值确定一组编码速率阈值。
14.如权利要求13所述的装置,其特征在于,所述阈值计算器根据所述多个副带能量值确定信噪比值。
15.如权利要求14所述的装置,其特征在于,所述阈值计算器根据所述信噪比确定换算值。
16.如权利要求15所述的装置,其特征在于,阈值计算器通过把背景噪声估计值与所述换算值相乘来确定至少一个阈值。
17.如权利要求11所述的装置,其特征在于,所述速率选择器把所述多个副带能量值中的至少一个值与至少一个的阈值比较,确定所述编码速率。
18.如权利要求16所述的装置,其特征在于,所述速率选择器把所述多个副带能量值中的至少一个值与所述至少一个的阈值比较,确定所述编码速率。
19.如权利要求11所述的装置,其特征在于,所述速率选择器确定多个建议的编码速率,各建议的编码速率对应于各所述副带能量值,所述速率选择器根据所述多个建议的编码速率确定所述编码速率。
20.一种确定速率可变的声码器的编码速率的装置,其特征在于,包含:
信噪比计算器,它接收输入信号,并根据所述输入信号确定信噪比值;
速率选择器,它接收所述信噪比值,并根据所述信噪比值选择所述编码速率。
21.一种确定速率可变的声码器的编码速率的方法,其特征在于,包含下列步骤:
接收输入信号;
根据预定的副带能量计算公式确定多个副带能量值;和
根据所述多个副带能量值确定所述编码速率。
23.如权利要求21所述的方法,其特征在于,进一步包含下列步骤,根据多个副带能量值确定一组编码速率阈值。
24.如权利要求23所述的方法,其特征在于,所述确定一组编码速率阈值的步骤根据所述多个副带能量值确定信噪比值。
25.如权利要求24所述的方法,其特征在于,所述确定一组编码速率阈值的步骤根据所述信噪比值确定换算值。
26.如权利要求25所述的方法,其特征在于,所述确定一组编码速率阈值的步骤通过把背景噪声估计值与所述换算值相乘来确定所述速率阈值。
27.如权利要求21所述的方法,其特征在于,所述确定所述编码速率的步骤把所述多个副带能量值中的至少一个值与至少一个的阈值比较,确定所述编码速率。
28.如权利要求26所述的方法,其特征在于,所述确定所述编码速率的步骤把所述多个副带能量值中的至少一个值与所述至少一个的阈值比较,以确定所述编码速率。
29.如权利要注21所述的方法,其特征在于,进一步包含下列步骤:根据各所述多个副带能量值产生建议的编码速率,所述确定编码速率的步骤选择所述建议的编码速率中的一个。
30.一种确定速率可变的声码器的编码速率的方法,其特征在于,包含下列步骤:
接收输入信号;
根据所述输入信号确定信噪比值;和
根据所述信噪比值确定所述编码速率。
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
US288,413 | 1994-08-10 | ||
US08/288,413 US5742734A (en) | 1994-08-10 | 1994-08-10 | Encoding rate selection in a variable rate vocoder |
Related Child Applications (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100016631A Division CN100508028C (zh) | 1994-08-10 | 1995-08-01 | 将释放延迟帧添加到由声码器编码的多个帧的方法和装置 |
CNA2004100016646A Division CN1512488A (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNB2004100016650A Division CN1320521C (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
Publications (2)
Publication Number | Publication Date |
---|---|
CN1131473A true CN1131473A (zh) | 1996-09-18 |
CN1168071C CN1168071C (zh) | 2004-09-22 |
Family
ID=23106989
Family Applications (5)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004100016646A Pending CN1512488A (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNB951907174A Expired - Lifetime CN1168071C (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNB2004100016650A Expired - Lifetime CN1320521C (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNA2006101003869A Pending CN1945696A (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNB2004100016631A Expired - Lifetime CN100508028C (zh) | 1994-08-10 | 1995-08-01 | 将释放延迟帧添加到由声码器编码的多个帧的方法和装置 |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNA2004100016646A Pending CN1512488A (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
Family Applications After (3)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CNB2004100016650A Expired - Lifetime CN1320521C (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNA2006101003869A Pending CN1945696A (zh) | 1994-08-10 | 1995-08-01 | 在速率可变的声码器中选择编码速率的方法和装置 |
CNB2004100016631A Expired - Lifetime CN100508028C (zh) | 1994-08-10 | 1995-08-01 | 将释放延迟帧添加到由声码器编码的多个帧的方法和装置 |
Country Status (20)
Country | Link |
---|---|
US (1) | US5742734A (zh) |
EP (6) | EP1530201B1 (zh) |
JP (8) | JP3502101B2 (zh) |
KR (3) | KR100455225B1 (zh) |
CN (5) | CN1512488A (zh) |
AT (5) | ATE358871T1 (zh) |
AU (1) | AU711401B2 (zh) |
BR (2) | BR9506036A (zh) |
CA (3) | CA2488918C (zh) |
DE (5) | DE69530066T2 (zh) |
DK (3) | DK0728350T3 (zh) |
ES (5) | ES2240602T5 (zh) |
FI (5) | FI117993B (zh) |
HK (2) | HK1015185A1 (zh) |
IL (1) | IL114874A (zh) |
MX (1) | MX9600920A (zh) |
PT (3) | PT728350E (zh) |
TW (1) | TW277189B (zh) |
WO (1) | WO1996005592A1 (zh) |
ZA (1) | ZA956081B (zh) |
Cited By (4)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
WO2008086700A1 (fr) * | 2007-01-05 | 2008-07-24 | Huawei Technologies Co., Ltd. | Procédé commandé par la source et système pour coder la fréquence d'un signal audio |
CN1815558B (zh) * | 1998-11-13 | 2010-09-29 | 高通股份有限公司 | 语音中非话音部分的低数据位速率编码 |
CN103366755A (zh) * | 2009-02-16 | 2013-10-23 | 韩国电子通信研究院 | 对音频信号进行编码和解码的方法和设备 |
CN105830154A (zh) * | 2013-12-19 | 2016-08-03 | 瑞典爱立信有限公司 | 估计音频信号中的背景噪声 |
Families Citing this family (61)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US6389010B1 (en) | 1995-10-05 | 2002-05-14 | Intermec Ip Corp. | Hierarchical data collection network supporting packetized voice communications among wireless terminals and telephones |
US7924783B1 (en) | 1994-05-06 | 2011-04-12 | Broadcom Corporation | Hierarchical communications system |
TW271524B (zh) | 1994-08-05 | 1996-03-01 | Qualcomm Inc | |
US5742734A (en) † | 1994-08-10 | 1998-04-21 | Qualcomm Incorporated | Encoding rate selection in a variable rate vocoder |
US6292476B1 (en) * | 1997-04-16 | 2001-09-18 | Qualcomm Inc. | Method and apparatus for providing variable rate data in a communications system using non-orthogonal overflow channels |
JPH09162837A (ja) * | 1995-11-22 | 1997-06-20 | Internatl Business Mach Corp <Ibm> | 圧縮方式を動的に変更する通信方法及び装置 |
JPH09185397A (ja) * | 1995-12-28 | 1997-07-15 | Olympus Optical Co Ltd | 音声情報記録装置 |
US5794199A (en) * | 1996-01-29 | 1998-08-11 | Texas Instruments Incorporated | Method and system for improved discontinuous speech transmission |
FI964975A (fi) * | 1996-12-12 | 1998-06-13 | Nokia Mobile Phones Ltd | Menetelmä ja laite puheen koodaamiseksi |
US6510208B1 (en) * | 1997-01-20 | 2003-01-21 | Sony Corporation | Telephone apparatus with audio recording function and audio recording method telephone apparatus with audio recording function |
US6202046B1 (en) | 1997-01-23 | 2001-03-13 | Kabushiki Kaisha Toshiba | Background noise/speech classification method |
US5920834A (en) * | 1997-01-31 | 1999-07-06 | Qualcomm Incorporated | Echo canceller with talk state determination to control speech processor functional elements in a digital telephone system |
DE19742944B4 (de) * | 1997-09-29 | 2008-03-27 | Infineon Technologies Ag | Verfahren zum Aufzeichnen eines digitalisierten Audiosignals |
US7072832B1 (en) * | 1998-08-24 | 2006-07-04 | Mindspeed Technologies, Inc. | System for speech encoding having an adaptive encoding arrangement |
US6240386B1 (en) | 1998-08-24 | 2001-05-29 | Conexant Systems, Inc. | Speech codec employing noise classification for noise compensation |
US6393074B1 (en) | 1998-12-31 | 2002-05-21 | Texas Instruments Incorporated | Decoding system for variable-rate convolutionally-coded data sequence |
JP2000244384A (ja) * | 1999-02-18 | 2000-09-08 | Mitsubishi Electric Corp | 移動通信端末装置及び移動通信端末装置における音声符号化レート決定方法 |
US6397177B1 (en) * | 1999-03-10 | 2002-05-28 | Samsung Electronics, Co., Ltd. | Speech-encoding rate decision apparatus and method in a variable rate |
WO2000069139A2 (en) * | 1999-05-10 | 2000-11-16 | Nokia Corporation | Header compression |
US7127390B1 (en) | 2000-02-08 | 2006-10-24 | Mindspeed Technologies, Inc. | Rate determination coding |
US6898566B1 (en) * | 2000-08-16 | 2005-05-24 | Mindspeed Technologies, Inc. | Using signal to noise ratio of a speech signal to adjust thresholds for extracting speech parameters for coding the speech signal |
US6640208B1 (en) * | 2000-09-12 | 2003-10-28 | Motorola, Inc. | Voiced/unvoiced speech classifier |
US6745012B1 (en) * | 2000-11-17 | 2004-06-01 | Telefonaktiebolaget Lm Ericsson (Publ) | Adaptive data compression in a wireless telecommunications system |
US7120134B2 (en) | 2001-02-15 | 2006-10-10 | Qualcomm, Incorporated | Reverse link channel architecture for a wireless communication system |
EP1470550B1 (en) * | 2002-01-30 | 2008-09-03 | Matsushita Electric Industrial Co., Ltd. | Audio encoding and decoding device and methods thereof |
US7657427B2 (en) | 2002-10-11 | 2010-02-02 | Nokia Corporation | Methods and devices for source controlled variable bit-rate wideband speech coding |
KR100841096B1 (ko) * | 2002-10-14 | 2008-06-25 | 리얼네트웍스아시아퍼시픽 주식회사 | 음성 코덱에 대한 디지털 오디오 신호의 전처리 방법 |
US7602722B2 (en) * | 2002-12-04 | 2009-10-13 | Nortel Networks Limited | Mobile assisted fast scheduling for the reverse link |
KR100754439B1 (ko) | 2003-01-09 | 2007-08-31 | 와이더댄 주식회사 | 이동 전화상의 체감 음질을 향상시키기 위한 디지털오디오 신호의 전처리 방법 |
EP3336843B1 (en) * | 2004-05-14 | 2021-06-23 | Panasonic Intellectual Property Corporation of America | Speech coding method and speech coding apparatus |
CN1295678C (zh) * | 2004-05-18 | 2007-01-17 | 中国科学院声学研究所 | 子带自适应谷点降噪系统和方法 |
KR100657916B1 (ko) | 2004-12-01 | 2006-12-14 | 삼성전자주식회사 | 주파수 대역간의 유사도를 이용한 오디오 신호 처리 장치및 방법 |
US20060224381A1 (en) * | 2005-04-04 | 2006-10-05 | Nokia Corporation | Detecting speech frames belonging to a low energy sequence |
KR100757858B1 (ko) * | 2005-09-30 | 2007-09-11 | 와이더댄 주식회사 | 선택적 인코딩 시스템 및 상기 선택적 인코딩 시스템의동작 방법 |
KR100717058B1 (ko) * | 2005-11-28 | 2007-05-14 | 삼성전자주식회사 | 고주파 성분 복원 방법 및 그 장치 |
CN101213589B (zh) * | 2006-01-12 | 2011-04-27 | 松下电器产业株式会社 | 对象声音分析装置和对象声音分析方法 |
EP1984911A4 (en) * | 2006-01-18 | 2012-03-14 | Lg Electronics Inc | DEVICE AND METHOD FOR SIGNAL CODING AND DECODING |
US8204754B2 (en) | 2006-02-10 | 2012-06-19 | Telefonaktiebolaget L M Ericsson (Publ) | System and method for an improved voice detector |
US8920343B2 (en) | 2006-03-23 | 2014-12-30 | Michael Edward Sabatino | Apparatus for acquiring and processing of physiological auditory signals |
CN100483509C (zh) * | 2006-12-05 | 2009-04-29 | 华为技术有限公司 | 声音信号分类方法和装置 |
JPWO2009038115A1 (ja) * | 2007-09-21 | 2011-01-06 | 日本電気株式会社 | 音声符号化装置、音声符号化方法及びプログラム |
WO2009038170A1 (ja) * | 2007-09-21 | 2009-03-26 | Nec Corporation | 音声処理装置、音声処理方法、プログラム及び音楽・メロディ配信システム |
US20090099851A1 (en) * | 2007-10-11 | 2009-04-16 | Broadcom Corporation | Adaptive bit pool allocation in sub-band coding |
US8560307B2 (en) * | 2008-01-28 | 2013-10-15 | Qualcomm Incorporated | Systems, methods, and apparatus for context suppression using receivers |
CN101335000B (zh) | 2008-03-26 | 2010-04-21 | 华为技术有限公司 | 编码的方法及装置 |
CN102576528A (zh) | 2009-10-19 | 2012-07-11 | 瑞典爱立信有限公司 | 用于语音活动检测的检测器和方法 |
US9047878B2 (en) * | 2010-11-24 | 2015-06-02 | JVC Kenwood Corporation | Speech determination apparatus and speech determination method |
CN102985969B (zh) * | 2010-12-14 | 2014-12-10 | 松下电器(美国)知识产权公司 | 编码装置、解码装置和编码方法、解码方法 |
US8990074B2 (en) | 2011-05-24 | 2015-03-24 | Qualcomm Incorporated | Noise-robust speech coding mode classification |
US8666753B2 (en) * | 2011-12-12 | 2014-03-04 | Motorola Mobility Llc | Apparatus and method for audio encoding |
US9263054B2 (en) * | 2013-02-21 | 2016-02-16 | Qualcomm Incorporated | Systems and methods for controlling an average encoding rate for speech signal encoding |
US9564136B2 (en) | 2014-03-06 | 2017-02-07 | Dts, Inc. | Post-encoding bitrate reduction of multiple object audio |
JP6250140B2 (ja) * | 2014-03-24 | 2017-12-20 | 日本電信電話株式会社 | 符号化方法、符号化装置、プログラム、および記録媒体 |
KR102061316B1 (ko) * | 2014-07-28 | 2019-12-31 | 니폰 덴신 덴와 가부시끼가이샤 | 부호화 방법, 장치, 프로그램 및 기록 매체 |
ES2869141T3 (es) * | 2014-07-29 | 2021-10-25 | Ericsson Telefon Ab L M | Estimación de ruido de fondo en señales de audio |
KR101619293B1 (ko) | 2014-11-12 | 2016-05-11 | 현대오트론 주식회사 | 전원 반도체의 제어 방법 및 제어 장치 |
CN107742521B (zh) * | 2016-08-10 | 2021-08-13 | 华为技术有限公司 | 多声道信号的编码方法和编码器 |
EP3751567B1 (en) | 2019-06-10 | 2022-01-26 | Axis AB | A method, a computer program, an encoder and a monitoring device |
CN110992963B (zh) * | 2019-12-10 | 2023-09-29 | 腾讯科技(深圳)有限公司 | 网络通话方法、装置、计算机设备及存储介质 |
WO2021253235A1 (zh) * | 2020-06-16 | 2021-12-23 | 华为技术有限公司 | 语音活动检测方法和装置 |
CN113611325B (zh) * | 2021-04-26 | 2023-07-04 | 珠海市杰理科技股份有限公司 | 基于清浊音实现的语音信号变速方法、装置和音频设备 |
Family Cites Families (74)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
US3633107A (en) * | 1970-06-04 | 1972-01-04 | Bell Telephone Labor Inc | Adaptive signal processor for diversity radio receivers |
JPS5017711A (zh) * | 1973-06-15 | 1975-02-25 | ||
US4076958A (en) * | 1976-09-13 | 1978-02-28 | E-Systems, Inc. | Signal synthesizer spectrum contour scaler |
US4214125A (en) * | 1977-01-21 | 1980-07-22 | Forrest S. Mozer | Method and apparatus for speech synthesizing |
CA1123955A (en) * | 1978-03-30 | 1982-05-18 | Tetsu Taguchi | Speech analysis and synthesis apparatus |
DE3023375C1 (zh) * | 1980-06-23 | 1987-12-03 | Siemens Ag, 1000 Berlin Und 8000 Muenchen, De | |
JPS57177197A (en) * | 1981-04-24 | 1982-10-30 | Hitachi Ltd | Pick-up system for sound section |
USRE32580E (en) * | 1981-12-01 | 1988-01-19 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech coder |
JPS6011360B2 (ja) * | 1981-12-15 | 1985-03-25 | ケイディディ株式会社 | 音声符号化方式 |
US4535472A (en) * | 1982-11-05 | 1985-08-13 | At&T Bell Laboratories | Adaptive bit allocator |
DE3276651D1 (en) * | 1982-11-26 | 1987-07-30 | Ibm | Speech signal coding method and apparatus |
DE3370423D1 (en) * | 1983-06-07 | 1987-04-23 | Ibm | Process for activity detection in a voice transmission system |
US4672670A (en) * | 1983-07-26 | 1987-06-09 | Advanced Micro Devices, Inc. | Apparatus and methods for coding, decoding, analyzing and synthesizing a signal |
EP0163829B1 (en) * | 1984-03-21 | 1989-08-23 | Nippon Telegraph And Telephone Corporation | Speech signal processing system |
DE3412430A1 (de) * | 1984-04-03 | 1985-10-03 | Nixdorf Computer Ag, 4790 Paderborn | Schalteranordnung |
EP0167364A1 (en) * | 1984-07-06 | 1986-01-08 | AT&T Corp. | Speech-silence detection with subband coding |
FR2577084B1 (fr) * | 1985-02-01 | 1987-03-20 | Trt Telecom Radio Electr | Systeme de bancs de filtres d'analyse et de synthese d'un signal |
US4856068A (en) * | 1985-03-18 | 1989-08-08 | Massachusetts Institute Of Technology | Audio pre-processing methods and apparatus |
US4885790A (en) * | 1985-03-18 | 1989-12-05 | Massachusetts Institute Of Technology | Processing of acoustic waveforms |
US4630304A (en) * | 1985-07-01 | 1986-12-16 | Motorola, Inc. | Automatic background noise estimator for a noise suppression system |
US4827517A (en) * | 1985-12-26 | 1989-05-02 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech processor using arbitrary excitation coding |
US4797929A (en) * | 1986-01-03 | 1989-01-10 | Motorola, Inc. | Word recognition in a speech recognition system using data reduced word templates |
CA1299750C (en) * | 1986-01-03 | 1992-04-28 | Ira Alan Gerson | Optimal method of data reduction in a speech recognition system |
US4899384A (en) * | 1986-08-25 | 1990-02-06 | Ibm Corporation | Table controlled dynamic bit allocation in a variable rate sub-band speech coder |
US4771465A (en) * | 1986-09-11 | 1988-09-13 | American Telephone And Telegraph Company, At&T Bell Laboratories | Digital speech sinusoidal vocoder with transmission of only subset of harmonics |
US4797925A (en) * | 1986-09-26 | 1989-01-10 | Bell Communications Research, Inc. | Method for coding speech at low bit rates |
US4903301A (en) * | 1987-02-27 | 1990-02-20 | Hitachi, Ltd. | Method and system for transmitting variable rate speech signal |
US5054072A (en) * | 1987-04-02 | 1991-10-01 | Massachusetts Institute Of Technology | Coding of acoustic waveforms |
US4868867A (en) * | 1987-04-06 | 1989-09-19 | Voicecraft Inc. | Vector excitation speech or audio coder for transmission or storage |
US4890327A (en) * | 1987-06-03 | 1989-12-26 | Itt Corporation | Multi-rate digital voice coder apparatus |
US4899385A (en) * | 1987-06-26 | 1990-02-06 | American Telephone And Telegraph Company | Code excited linear predictive vocoder |
CA1337217C (en) * | 1987-08-28 | 1995-10-03 | Daniel Kenneth Freeman | Speech coding |
JPS6491200A (en) * | 1987-10-02 | 1989-04-10 | Fujitsu Ltd | Voice analysis system and voice synthesization system |
US4852179A (en) * | 1987-10-05 | 1989-07-25 | Motorola, Inc. | Variable frame rate, fixed bit rate vocoding method |
US4817157A (en) * | 1988-01-07 | 1989-03-28 | Motorola, Inc. | Digital speech coder having improved vector excitation source |
US4897832A (en) † | 1988-01-18 | 1990-01-30 | Oki Electric Industry Co., Ltd. | Digital speech interpolation system and speech detector |
DE3871369D1 (de) * | 1988-03-08 | 1992-06-25 | Ibm | Verfahren und einrichtung zur sprachkodierung mit niedriger datenrate. |
DE3883519T2 (de) * | 1988-03-08 | 1994-03-17 | Ibm | Verfahren und Einrichtung zur Sprachkodierung mit mehreren Datenraten. |
ES2047664T3 (es) * | 1988-03-11 | 1994-03-01 | British Telecomm | Deteccion de actividad de voz. |
US5023910A (en) * | 1988-04-08 | 1991-06-11 | At&T Bell Laboratories | Vector quantization in a harmonic speech coding arrangement |
US4864561A (en) * | 1988-06-20 | 1989-09-05 | American Telephone And Telegraph Company | Technique for improved subjective performance in a communication system using attenuated noise-fill |
JPH0783315B2 (ja) * | 1988-09-26 | 1995-09-06 | 富士通株式会社 | 可変レート音声信号符号化方式 |
CA1321645C (en) * | 1988-09-28 | 1993-08-24 | Akira Ichikawa | Method and system for voice coding based on vector quantization |
JP3033060B2 (ja) * | 1988-12-22 | 2000-04-17 | 国際電信電話株式会社 | 音声予測符号化・復号化方式 |
US5222189A (en) * | 1989-01-27 | 1993-06-22 | Dolby Laboratories Licensing Corporation | Low time-delay transform coder, decoder, and encoder/decoder for high-quality audio |
EP0392126B1 (en) * | 1989-04-11 | 1994-07-20 | International Business Machines Corporation | Fast pitch tracking process for LTP-based speech coders |
JPH0754434B2 (ja) * | 1989-05-08 | 1995-06-07 | 松下電器産業株式会社 | 音声認識装置 |
US5060269A (en) * | 1989-05-18 | 1991-10-22 | General Electric Company | Hybrid switched multi-pulse/stochastic speech coding technique |
GB2235354A (en) * | 1989-08-16 | 1991-02-27 | Philips Electronic Associated | Speech coding/encoding using celp |
US5054075A (en) * | 1989-09-05 | 1991-10-01 | Motorola, Inc. | Subband decoding method and apparatus |
US5185800A (en) * | 1989-10-13 | 1993-02-09 | Centre National D'etudes Des Telecommunications | Bit allocation device for transformed digital audio broadcasting signals with adaptive quantization based on psychoauditive criterion |
US5307441A (en) † | 1989-11-29 | 1994-04-26 | Comsat Corporation | Wear-toll quality 4.8 kbps speech codec |
JP3004664B2 (ja) * | 1989-12-21 | 2000-01-31 | 株式会社東芝 | 可変レート符号化方法 |
JP2861238B2 (ja) * | 1990-04-20 | 1999-02-24 | ソニー株式会社 | ディジタル信号符号化方法 |
JP2751564B2 (ja) * | 1990-05-25 | 1998-05-18 | ソニー株式会社 | ディジタル信号符号化装置 |
US5103459B1 (en) * | 1990-06-25 | 1999-07-06 | Qualcomm Inc | System and method for generating signal waveforms in a cdma cellular telephone system |
JPH04100099A (ja) * | 1990-08-20 | 1992-04-02 | Nippon Telegr & Teleph Corp <Ntt> | 音声検出装置 |
JPH04157817A (ja) * | 1990-10-20 | 1992-05-29 | Fujitsu Ltd | 可変レート符号化装置 |
US5206884A (en) * | 1990-10-25 | 1993-04-27 | Comsat | Transform domain quantization technique for adaptive predictive coding |
JP2906646B2 (ja) * | 1990-11-09 | 1999-06-21 | 松下電器産業株式会社 | 音声帯域分割符号化装置 |
US5317672A (en) * | 1991-03-05 | 1994-05-31 | Picturetel Corporation | Variable bit rate speech encoder |
KR940001861B1 (ko) * | 1991-04-12 | 1994-03-09 | 삼성전자 주식회사 | 오디오 대역신호의 음성/음악 판별장치 |
US5187745A (en) * | 1991-06-27 | 1993-02-16 | Motorola, Inc. | Efficient codebook search for CELP vocoders |
DE69233397T2 (de) * | 1991-06-11 | 2005-08-11 | Qualcomm, Inc., San Diego | Vorrichtung und Methode zur Maskierung von Fehlern in Datenrahmen |
JP2705377B2 (ja) * | 1991-07-31 | 1998-01-28 | 松下電器産業株式会社 | 帯域分割符号化方法 |
EP0525774B1 (en) * | 1991-07-31 | 1997-02-26 | Matsushita Electric Industrial Co., Ltd. | Digital audio signal coding system and method therefor |
US5410632A (en) † | 1991-12-23 | 1995-04-25 | Motorola, Inc. | Variable hangover time in a voice activity detector |
JP3088838B2 (ja) * | 1992-04-09 | 2000-09-18 | シャープ株式会社 | 音楽検出回路及び該回路を用いた音声信号入力装置 |
JP2976701B2 (ja) * | 1992-06-24 | 1999-11-10 | 日本電気株式会社 | 量子化ビット数割当方法 |
US5341456A (en) * | 1992-12-02 | 1994-08-23 | Qualcomm Incorporated | Method for determining speech encoding rate in a variable rate vocoder |
US5457769A (en) * | 1993-03-30 | 1995-10-10 | Earmark, Inc. | Method and apparatus for detecting the presence of human voice signals in audio signals |
US5644596A (en) † | 1994-02-01 | 1997-07-01 | Qualcomm Incorporated | Method and apparatus for frequency selective adaptive filtering |
US5742734A (en) † | 1994-08-10 | 1998-04-21 | Qualcomm Incorporated | Encoding rate selection in a variable rate vocoder |
US6134215A (en) | 1996-04-02 | 2000-10-17 | Qualcomm Incorpoated | Using orthogonal waveforms to enable multiple transmitters to share a single CDM channel |
-
1994
- 1994-08-10 US US08/288,413 patent/US5742734A/en not_active Expired - Lifetime
-
1995
- 1995-07-08 TW TW084107075A patent/TW277189B/zh not_active IP Right Cessation
- 1995-07-20 ZA ZA956081A patent/ZA956081B/xx unknown
- 1995-08-01 EP EP05001938A patent/EP1530201B1/en not_active Expired - Lifetime
- 1995-08-01 AU AU32751/95A patent/AU711401B2/en not_active Expired
- 1995-08-01 BR BR9506036A patent/BR9506036A/pt not_active Application Discontinuation
- 1995-08-01 CN CNA2004100016646A patent/CN1512488A/zh active Pending
- 1995-08-01 EP EP02009465A patent/EP1233408B1/en not_active Expired - Lifetime
- 1995-08-01 CN CNB951907174A patent/CN1168071C/zh not_active Expired - Lifetime
- 1995-08-01 KR KR10-2003-7005884A patent/KR100455225B1/ko not_active IP Right Cessation
- 1995-08-01 AT AT05001938T patent/ATE358871T1/de not_active IP Right Cessation
- 1995-08-01 MX MX9600920A patent/MX9600920A/es unknown
- 1995-08-01 PT PT95929372T patent/PT728350E/pt unknown
- 1995-08-01 AT AT02009467T patent/ATE298124T1/de active
- 1995-08-01 EP EP04003180A patent/EP1424686A3/en not_active Ceased
- 1995-08-01 WO PCT/US1995/009830 patent/WO1996005592A1/en active IP Right Grant
- 1995-08-01 DE DE69530066T patent/DE69530066T2/de not_active Expired - Lifetime
- 1995-08-01 EP EP02009467A patent/EP1239465B2/en not_active Expired - Lifetime
- 1995-08-01 KR KR1019960701839A patent/KR100455826B1/ko not_active IP Right Cessation
- 1995-08-01 ES ES02009467T patent/ES2240602T5/es not_active Expired - Lifetime
- 1995-08-01 DK DK95929372T patent/DK0728350T3/da active
- 1995-08-01 DE DE69534285T patent/DE69534285T3/de not_active Expired - Lifetime
- 1995-08-01 DE DE69535452T patent/DE69535452T2/de not_active Expired - Lifetime
- 1995-08-01 CN CNB2004100016650A patent/CN1320521C/zh not_active Expired - Lifetime
- 1995-08-01 AT AT95929372T patent/ATE235734T1/de active
- 1995-08-01 ES ES06013824T patent/ES2299122T3/es not_active Expired - Lifetime
- 1995-08-01 CA CA2488918A patent/CA2488918C/en not_active Expired - Lifetime
- 1995-08-01 DK DK02009465T patent/DK1233408T3/da active
- 1995-08-01 CA CA2488921A patent/CA2488921C/en not_active Expired - Lifetime
- 1995-08-01 DE DE69535709T patent/DE69535709T2/de not_active Expired - Lifetime
- 1995-08-01 JP JP50740496A patent/JP3502101B2/ja not_active Expired - Lifetime
- 1995-08-01 EP EP95929372A patent/EP0728350B1/en not_active Expired - Lifetime
- 1995-08-01 ES ES02009465T patent/ES2233739T3/es not_active Expired - Lifetime
- 1995-08-01 PT PT02009465T patent/PT1233408E/pt unknown
- 1995-08-01 DE DE69533881T patent/DE69533881T2/de not_active Expired - Lifetime
- 1995-08-01 ES ES05001938T patent/ES2281854T3/es not_active Expired - Lifetime
- 1995-08-01 CA CA002171009A patent/CA2171009C/en not_active Expired - Lifetime
- 1995-08-01 EP EP06013824A patent/EP1703493B1/en not_active Expired - Lifetime
- 1995-08-01 KR KR10-2003-7005883A patent/KR20040004420A/ko not_active Application Discontinuation
- 1995-08-01 AT AT02009465T patent/ATE285620T1/de active
- 1995-08-01 CN CNA2006101003869A patent/CN1945696A/zh active Pending
- 1995-08-01 DK DK02009467.8T patent/DK1239465T4/da active
- 1995-08-01 CN CNB2004100016631A patent/CN100508028C/zh not_active Expired - Lifetime
- 1995-08-01 BR BRPI9510780-0A patent/BR9510780B1/pt not_active IP Right Cessation
- 1995-08-01 AT AT06013824T patent/ATE386321T1/de not_active IP Right Cessation
- 1995-08-01 ES ES95929372T patent/ES2194921T3/es not_active Expired - Lifetime
- 1995-08-01 PT PT02009467T patent/PT1239465E/pt unknown
- 1995-08-08 IL IL11487495A patent/IL114874A/xx not_active IP Right Cessation
-
1996
- 1996-03-08 FI FI961112A patent/FI117993B/fi not_active IP Right Cessation
-
1998
- 1998-12-28 HK HK98116184A patent/HK1015185A1/xx not_active IP Right Cessation
-
2003
- 2003-08-21 JP JP2003297413A patent/JP3927159B2/ja not_active Expired - Lifetime
- 2003-08-21 JP JP2003297412A patent/JP2004004971A/ja not_active Withdrawn
-
2005
- 2005-07-01 FI FI20050704A patent/FI122272B/fi not_active IP Right Cessation
- 2005-07-01 FI FI20050703A patent/FI123708B/fi not_active IP Right Cessation
- 2005-07-01 FI FI20050702A patent/FI122273B/fi not_active IP Right Cessation
- 2005-10-31 HK HK05109679A patent/HK1077911A1/xx not_active IP Right Cessation
-
2006
- 2006-12-07 FI FI20061084A patent/FI119085B/fi not_active IP Right Cessation
-
2007
- 2007-05-31 JP JP2007145735A patent/JP4680956B2/ja not_active Expired - Lifetime
- 2007-05-31 JP JP2007145736A patent/JP2007293355A/ja not_active Withdrawn
- 2007-05-31 JP JP2007145738A patent/JP4680958B2/ja not_active Expired - Lifetime
- 2007-05-31 JP JP2007145737A patent/JP4680957B2/ja not_active Expired - Lifetime
-
2011
- 2011-04-21 JP JP2011095137A patent/JP4870846B2/ja not_active Expired - Lifetime
Cited By (7)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
CN1815558B (zh) * | 1998-11-13 | 2010-09-29 | 高通股份有限公司 | 语音中非话音部分的低数据位速率编码 |
WO2008086700A1 (fr) * | 2007-01-05 | 2008-07-24 | Huawei Technologies Co., Ltd. | Procédé commandé par la source et système pour coder la fréquence d'un signal audio |
CN101217037B (zh) * | 2007-01-05 | 2011-09-14 | 华为技术有限公司 | 对音频信号的编码速率进行源控的方法和系统 |
CN103366755A (zh) * | 2009-02-16 | 2013-10-23 | 韩国电子通信研究院 | 对音频信号进行编码和解码的方法和设备 |
CN103366755B (zh) * | 2009-02-16 | 2016-05-18 | 韩国电子通信研究院 | 对音频信号进行编码和解码的方法和设备 |
CN105830154A (zh) * | 2013-12-19 | 2016-08-03 | 瑞典爱立信有限公司 | 估计音频信号中的背景噪声 |
CN105830154B (zh) * | 2013-12-19 | 2019-06-28 | 瑞典爱立信有限公司 | 估计音频信号中的背景噪声 |
Also Published As
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN1168071C (zh) | 在速率可变的声码器中选择编码速率的方法和装置 | |
CN1257486C (zh) | 用于将可感知相关信息保留在音频信号中的方法和设备 | |
EP2047457B1 (en) | Systems, methods, and apparatus for signal change detection | |
CN1244090C (zh) | 具备背景噪声再现的语音编码 | |
CN110998722A (zh) | 低复杂性密集瞬态事件检测和译码 | |
Cowing et al. | 16 kbps APC with hybrid quantization |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
C06 | Publication | ||
PB01 | Publication | ||
C10 | Entry into substantive examination | ||
SE01 | Entry into force of request for substantive examination | ||
C14 | Grant of patent or utility model | ||
GR01 | Patent grant | ||
C56 | Change in the name or address of the patentee | ||
CP03 | Change of name, title or address |
Address after: california Patentee after: Qualcomm Inc. Address before: california Patentee before: Qualcomm Inc. |
|
CX01 | Expiry of patent term |
Expiration termination date: 20150801 Granted publication date: 20040922 |
|
EXPY | Termination of patent right or utility model |