CA2301995C - High quality speech coder at low bit rates - Google Patents
High quality speech coder at low bit rates Download PDFInfo
- Publication number
- CA2301995C CA2301995C CA002301995A CA2301995A CA2301995C CA 2301995 C CA2301995 C CA 2301995C CA 002301995 A CA002301995 A CA 002301995A CA 2301995 A CA2301995 A CA 2301995A CA 2301995 C CA2301995 C CA 2301995C
- Authority
- CA
- Canada
- Prior art keywords
- input speech
- signal
- obtaining
- speech signal
- pitch prediction
- Prior art date
- Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
- Expired - Fee Related
Links
Landscapes
- Compression, Expansion, Code Conversion, And Decoders (AREA)
Abstract
In a speech coder, a spectral parameter computer obtaining a plurality of spectral parameters from an input speech signal and quantizes the obtained spectral parameters. Even at low bit rates, the speech coder requires relatively less computational effort and experiences less sound quality deterioration.
Claims (12)
PROPERTY OR PRIVILEGE IS CLAIMED ARE DEFINED AS FOLLOWS:
1. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position corresponding to a pulse position meeting a predetermined condition with respect to the computed pitch prediction signal, setting a pulse position retrieval range on the basis of a position obtained by shifting the obtained sample position by a predetermined number of samples, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range thus set, and outputting data of the retrieved best position.
2. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position meeting a predetermined condition with respect to the pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting a pulse position retrieval range for retrieving a pulse position on the basis of a position obtained by shifting the obtained sample position by a predetermined number of samples, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range thus set, and outputting data of the retrieved best position.
3. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position corresponding to a pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting pulse position candidates through shifting the obtained sample position by the pitch period on the basis of the position shifted by predetermined numbers of samples from the sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving the position candidates for a best position, and outputting data of the retrieved best position.
4. The speech coder according to one of claims 1, 2 and 3, wherein the excitation quantizer includes a codebook for jointly quantizing the amplitudes or polarities of a plurality of pulses.
5. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample position meeting a predetermined condition with respect to the computed pitch prediction signal, setting a plurality of pulse position retrieval ranges on the basis of positions obtained by shifting the obtained sample position by corresponding shift extents, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, making retrieval of the pulse position retrieval ranges to select a best combination of a shift extent and a pulse position, and outputting data of the selected best combination.
6. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting a plurality of pulse position retrieval ranges on the basis of positions obtained by shifting the obtained sample position by corresponding shift extents, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, making retrieval of the pulse position retrieval ranges to select a best combination of a shift extent and a pulse position, and outputting data of the selected best combination.
7. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude pulses, obtaining a sample pulse position meeting a predetermined condition with respect to the computed pitch prediction signal in a time interval equal to the pitch period from the forefront of a frame, setting pulse position candidates through shifting the obtained sample position by the pitch period on the basis of the position shifted by predetermined numbers of samples from the sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving the position candidates for a best position, and outputting data of the retrieved best position.
8. The speech coder according to one of claims 5, 6 and 7, wherein the excitation quantizer includes a codebook for jointly quantizing the amplitudes or polarities of a plurality of pulses.
9. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, a mode judging means for extracting a characteristic amount from the input speech signal, judging a plurality of modes from the extracted feature quantity, and outputting mode data, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and making pitch prediction, and an excitation quantizer for forming an excitation signal of the input speech signal with M non-zero amplitude signals, obtaining a sample position meeting a predetermined condition with respect to the pitch prediction signal when the mode data represents a predetermined mode, setting a pulse position retrieval range on the basis of the obtained sample position, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, retrieving a best position in the pulse position retrieval range, and outputting data of the retrieved best position.
10. The speech coder according to claim 9, wherein the feature quantity is an average pitch prediction gain.
11. The speech coder according to claim 9, wherein the mode judging means judges the modes on the basis of comparison of the average pitch prediction gain with a plurality of threshold values.
12. A speech coder comprising a spectral parameter computer for obtaining a plurality of spectral parameters from an input speech signal and quantizing the obtained spectral parameters, an adaptive codebook means for obtaining a delay corresponding to a pitch period from the input speech signal, computing a pitch prediction signal, and executing pitch prediction, and an excitation quantizer for obtaining a position meeting a predetermined condition with respect to the pitch prediction signal computed in the adaptive codebook means, setting a plurality of pulse position retrieval ranges for respective pulses constituting an excitation signal, extracting the polarity of a cross-correlation calculated between input speech signal and impulse response obtained from said spectral parameter computer, for each position candidate in the retrieval range, and retrieving the best positions of the pulses in the pulse position retrieval ranges.
Applications Claiming Priority (5)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
JP261121/1996 | 1996-08-26 | ||
JP26112196A JP3360545B2 (en) | 1996-08-26 | 1996-08-26 | Audio coding device |
JP307143/1996 | 1996-10-31 | ||
JP30714396A JP3471542B2 (en) | 1996-10-31 | 1996-10-31 | Audio coding device |
CA002213909A CA2213909C (en) | 1996-08-26 | 1997-08-25 | High quality speech coder at low bit rates |
Related Parent Applications (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002213909A Division CA2213909C (en) | 1996-08-26 | 1997-08-25 | High quality speech coder at low bit rates |
Publications (2)
Publication Number | Publication Date |
---|---|
CA2301995A1 CA2301995A1 (en) | 1998-02-26 |
CA2301995C true CA2301995C (en) | 2005-10-18 |
Family
ID=27170442
Family Applications (2)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002301994A Expired - Fee Related CA2301994C (en) | 1996-08-26 | 1997-08-25 | High quality speech coder at low bit rates |
CA002301995A Expired - Fee Related CA2301995C (en) | 1996-08-26 | 1997-08-25 | High quality speech coder at low bit rates |
Family Applications Before (1)
Application Number | Title | Priority Date | Filing Date |
---|---|---|---|
CA002301994A Expired - Fee Related CA2301994C (en) | 1996-08-26 | 1997-08-25 | High quality speech coder at low bit rates |
Country Status (1)
Country | Link |
---|---|
CA (2) | CA2301994C (en) |
-
1997
- 1997-08-25 CA CA002301994A patent/CA2301994C/en not_active Expired - Fee Related
- 1997-08-25 CA CA002301995A patent/CA2301995C/en not_active Expired - Fee Related
Also Published As
Publication number | Publication date |
---|---|
CA2301994A1 (en) | 1998-02-26 |
CA2301995A1 (en) | 1998-02-26 |
CA2301994C (en) | 2004-10-26 |
Similar Documents
Publication | Publication Date | Title |
---|---|---|
CN100583242C (en) | Method and apparatus for speech decoding | |
US5724480A (en) | Speech coding apparatus, speech decoding apparatus, speech coding and decoding method and a phase amplitude characteristic extracting apparatus for carrying out the method | |
EP0890943B1 (en) | Voice coding and decoding system | |
US6023672A (en) | Speech coder | |
EP0766232B1 (en) | Speech coding apparatus | |
EP1008982B1 (en) | Voice encoder, voice decoder, voice encoder/decoder, voice encoding method, voice decoding method and voice encoding/decoding method | |
RU93058657A (en) | VOCODER WITH VARIABLE CODING AND DATA TRANSFER | |
WO1993010624A3 (en) | Progressive transmission of vector quantized data | |
EP1162603B1 (en) | High quality speech coder at low bit rates | |
KR100323011B1 (en) | Pitch period extractor of audio signal | |
EP0849724A2 (en) | High quality speech coder and coding method | |
EP0275099B1 (en) | Voice analyzing and synthesizing apparatus | |
EP1098298B1 (en) | Speech coding with an orthogonal search | |
CA2301995C (en) | High quality speech coder at low bit rates | |
EP0696793B1 (en) | A speech coder | |
CA2336360A1 (en) | Speech coder | |
CN1139988A (en) | Burst excited linear prediction | |
US7272553B1 (en) | Varying pulse amplitude multi-pulse analysis speech processor and method | |
JPH0784600A (en) | Vector quantizing device | |
US4924517A (en) | Encoder of a multi-pulse type capable of controlling the number of excitation pulses | |
KR20040041731A (en) | Variable fixed codebook searching method in CELP speech codec, and apparatus thereof | |
JP2000235399A (en) | Voice signal coding method, decoding method | |
JP3984021B2 (en) | Speech / acoustic signal encoding method and electronic apparatus | |
KR100296409B1 (en) | Multi-pulse excitation voice coding method | |
JPH043878B2 (en) |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
EEER | Examination request | ||
MKLA | Lapsed |