ATE206841T1 - METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS
Info
- Publication number
- ATE206841T1
- Authority
- AT
- Austria
- Prior art keywords
- speech
- frames
- divided
- segments
- wavelet transformation
Classifications
- G—PHYSICS
  - G10—MUSICAL INSTRUMENTS; ACOUSTICS
    - G10L—SPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
      - G10L19/00—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis
        - G10L19/04—Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis using predictive techniques
          - G10L19/16—Vocoder architecture
            - G10L19/18—Vocoders using multiple modes
      - G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
        - G10L25/78—Detection of presence or absence of voice signals
          - G10L2025/783—Detection of presence or absence of voice signals based on threshold decision
            - G10L2025/786—Adaptive threshold
        - G10L25/27—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00 characterised by the analysis technique
        - G10L25/93—Discriminating between voiced and unvoiced parts of speech signals
Landscapes
- Engineering & Computer Science (AREA)
- Physics & Mathematics (AREA)
- Signal Processing (AREA)
- Health & Medical Sciences (AREA)
- Audiology, Speech & Language Pathology (AREA)
- Human Computer Interaction (AREA)
- Computational Linguistics (AREA)
- Acoustics & Sound (AREA)
- Multimedia (AREA)
- Time-Division Multiplex Systems (AREA)
- Compression, Expansion, Code Conversion, And Decoders (AREA)
- Machine Translation (AREA)
- Transmission Systems Not Characterized By The Medium Used For Transmission (AREA)
Abstract
The method classifies speech signals, in particular for the adaptive control of a speech encoding process that either reduces the bit rate at constant speech quality or increases the quality at constant bit rate. After the speech signal has been segmented, a wavelet transformation is calculated for each frame. Using adaptive thresholds, a set of parameters is derived that controls a state model. The speech frames are divided into sub-frames, and each sub-frame is assigned to one of several classes typical for speech encoding. The speech signal may be divided into segments of constant length. To reduce edge effects in the wavelet transformation, either the segment is mirrored at its boundaries or the wavelet transformation is calculated over smaller intervals. Preferably, the frames are shifted so that the segments overlap, or the segments are filled at their edges with previous or predicted sample values.
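The processing chain described in the abstract (constant-length segments, mirroring at segment boundaries, a wavelet transformation, an adaptive threshold, one class per sub-frame) can be illustrated with a minimal Python sketch. The abstract does not name the wavelet, the derived parameters, the classes, or the structure of the state model, so everything specific here is an assumption made for illustration: the Haar wavelet, the two band energies, the labels "pause", "voiced" and "unvoiced", and a plain decision rule standing in for the state model. All function and class names are hypothetical.

```python
import math

def split_frames(signal, frame_len):
    """Cut the signal into constant-length frames (a trailing partial frame is dropped)."""
    return [signal[i:i + frame_len]
            for i in range(0, len(signal) - frame_len + 1, frame_len)]

def reflect_pad(segment, pad):
    """Mirror `pad` samples at each boundary without repeating the edge sample.
    Requires pad < len(segment). Useful for wavelet filters longer than two taps;
    the Haar transform below works on sample pairs and needs no padding."""
    left = segment[pad:0:-1]
    right = segment[-2:-pad - 2:-1]
    return left + segment + right

def haar_dwt(x, levels):
    """Multi-level orthonormal Haar transform (assumed wavelet).
    Returns (approximation, [detail_level_1, detail_level_2, ...])."""
    approx, details = list(x), []
    for _ in range(levels):
        pairs = range(0, len(approx) - 1, 2)
        detail = [(approx[i] - approx[i + 1]) / math.sqrt(2) for i in pairs]
        approx = [(approx[i] + approx[i + 1]) / math.sqrt(2) for i in pairs]
        details.append(detail)
    return approx, details

def energy(coeffs):
    return sum(c * c for c in coeffs)

class AdaptiveThreshold:
    """Hypothetical adaptive threshold: a multiple of a tracked background level.
    The first value seen initialises the background level."""
    def __init__(self, factor=4.0, alpha=0.9):
        self.level, self.factor, self.alpha = None, factor, alpha

    def exceeded(self, value):
        if self.level is None:
            self.level = value
            return False
        active = value > self.factor * self.level
        if not active:  # adapt the background level only during inactive sub-frames
            self.level = self.alpha * self.level + (1 - self.alpha) * value
        return active

def classify(signal, frame_len, n_sub):
    """Return one list of sub-frame labels per frame (illustrative decision rule,
    not the state model of the patent)."""
    thr = AdaptiveThreshold()
    sub_len = frame_len // n_sub
    result = []
    for frame in split_frames(signal, frame_len):
        labels = []
        for s in range(0, sub_len * n_sub, sub_len):
            approx, details = haar_dwt(frame[s:s + sub_len], levels=2)
            fine = energy(details[0])                      # highest-frequency band
            coarse = energy(details[1]) + energy(approx)   # lower bands
            if not thr.exceeded(fine + coarse):
                labels.append("pause")
            elif fine > coarse:
                labels.append("unvoiced")
            else:
                labels.append("voiced")
        result.append(labels)
    return result
```

Called as `classify(signal, 64, 4)` on a synthetic signal made of 64 near-silent samples, 64 samples of a slow sine, and 64 samples alternating between +1 and -1, this rule labels the three frames as pause, voiced and unvoiced respectively. A real coder would replace the decision rule with a state model that also takes the previous sub-frame's class into account.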
Applications Claiming Priority (2)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
DE19523598 | 1995-06-30 | | |
DE19538852A DE19538852A1 (en) | 1995-06-30 | 1995-10-19 | Method and arrangement for classifying speech signals |
Publications (1)
Publication Number | Publication Date |
---|---|
ATE206841T1 (en) | 2001-10-15 |
Family
ID=26016384
Family Applications (1)
Application Number | Priority Date | Filing Date | Title |
---|---|---|---|
AT96104213T ATE206841T1 (en) | 1995-06-30 | 1996-03-16 | METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS |
Country Status (4)
Country | Link |
---|---|
EP (1) | EP0751495B1 (en) |
AT (1) | ATE206841T1 (en) |
ES (1) | ES2165933T3 (en) |
NO (1) | NO309831B1 (en) |
Families Citing this family (1)
Publication number | Priority date | Publication date | Assignee | Title |
---|---|---|---|---|
DE19716862A1 (en) * | 1997-04-22 | 1998-10-29 | Deutsche Telekom AG | Voice activity detection |
1996
- 1996-03-16: EP application EP96104213A, patent EP0751495B1; status: not active, Expired - Lifetime
- 1996-03-16: AT application AT96104213T, patent ATE206841T1; status: not active, IP Right Cessation
- 1996-03-16: ES application ES96104213T, patent ES2165933T3; status: not active, Expired - Lifetime
- 1996-04-24: NO application NO961636A, patent NO309831B1; status: not active, IP Right Cessation
Also Published As
Publication number | Publication date |
---|---|
EP0751495A2 (en) | 1997-01-02 |
NO961636L (en) | 1997-01-02 |
NO961636D0 (en) | 1996-04-24 |
EP0751495B1 (en) | 2001-10-10 |
EP0751495A3 (en) | 1998-04-15 |
NO309831B1 (en) | 2001-04-02 |
ES2165933T3 (en) | 2002-04-01 |
Similar Documents
Publication | Title | Publication Date |
---|---|---|
DE60117144D1 (en) | LANGUAGE TRANSMISSION SYSTEM AND METHOD FOR TREATING LOST DATA FRAMES | |
DE69431622D1 (en) | METHOD AND DEVICE FOR ENCODING DIGITAL SOUND ENCODED WITH MULTIPLE BITS BY SUBTRACTING AN ADAPTIVE SHAKE SIGNAL, INSERTING HIDDEN CHANNEL BITS AND FILTERING, AND ENCODING DEVICE FOR USE IN THIS PROCESS | |
DE60219351D1 (en) | SIGNAL MODIFICATION METHOD FOR EFFICIENT CODING OF LANGUAGE SIGNALS | |
CA2343661A1 (en) | Method and apparatus for improving the intelligibility of digitally compressed speech | |
DE602004006206D1 (en) | System and method for high quality extension and shortening of a digital audio signal | |
ATE364220T1 (en) | METHOD AND APPARATUS FOR CONCEALING FRAME LOSING OF PREDICTION CODED LANGUAGE USING WAVEFORM EXTRAPOLATION | |
DE69521254D1 (en) | METHOD FOR VOICE CODING | |
ATE205011T1 (en) | METHOD AND DEVICE FOR REPRODUCING VOICE SIGNALS AND METHOD FOR TRANSMITTING IT | |
CA2188369A1 (en) | Method and an arrangement for classifying speech signals | |
DE69619129D1 (en) | SYSTEM AND METHOD FOR DOUBLE MULTIPLE FREQUENCY DETECTION BY USING VARIABLE FRAME LENGTHS | |
CA2102099A1 (en) | Variable rate vocoder | |
ATE15415T1 (en) | METHOD AND DEVICE FOR REDUNDANCY-REDUCING DIGITAL SPEECH PROCESSING. | |
ATE298124T1 (en) | METHOD AND APPARATUS FOR SELECTING THE CODING RATE IN A VARIABLE RATE VOCODER | |
NO982393D0 (en) | Process for quality control of seismic data processing and method for processing vertical, seismic profile data | |
ATE362634T1 (en) | METHOD AND APPARATUS FOR DETERMINING A SYNTHETIC HIGHER BAND SIGNAL IN A VOICE ENCODER | |
AU4490296A (en) | Speech coding method using synthesis analysis | |
DE69614937D1 (en) | Method and system for speech recognition with reduced recognition time taking account of changes in background noise | |
DE59806874D1 (en) | METHOD FOR CODING AND / OR DECODING VOICE SIGNALS USING A LONG-TERM PREDICTION AND A MULTI-PULSE EXCITATION SIGNAL | |
DE59809897D1 (en) | Voice Activity Detection | |
MY111784A (en) | Method and apparatus for encoding/decoding of background sounds | |
ATE206841T1 (en) | METHOD AND ARRANGEMENT FOR CLASSIFYING VOICE SIGNALS | |
DE69719024D1 (en) | METHOD FOR DECODING DATA SIGNALS BY MEANS OF A FIXED LENGTH DECISION WINDOW | |
DE59410189D1 (en) | Methods and devices for the detection and control of mass flows and related values | |
DE68901376D1 (en) | METHOD AND DEVICE FOR THE AUTOMATIC CUTTING OF WINE GRAPES. | |
DE69214474D1 (en) | A method for the detection of microorganisms | |
Legal Events
Date | Code | Title | Description |
---|---|---|---|
 | REN | Ceased due to non-payment of the annual fee | |