Nothing Special   »   [go: up one dir, main page]

ATE343196T1 - VOICE ACTIVITY DETECTION USING COMPRESSED VOICE SIGNAL PARAMETERS - Google Patents

VOICE ACTIVITY DETECTION USING COMPRESSED VOICE SIGNAL PARAMETERS

Info

Publication number
ATE343196T1
ATE343196T1 AT04425031T AT04425031T ATE343196T1 AT E343196 T1 ATE343196 T1 AT E343196T1 AT 04425031 T AT04425031 T AT 04425031T AT 04425031 T AT04425031 T AT 04425031T AT E343196 T1 ATE343196 T1 AT E343196T1
Authority
AT
Austria
Prior art keywords
voice
vad
activity detection
analysis
signal parameters
Prior art date
Application number
AT04425031T
Other languages
German (de)
Inventor
Matteo Aldrovandi
Original Assignee
Siemens Spa Italiana
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Siemens Spa Italiana filed Critical Siemens Spa Italiana
Application granted granted Critical
Publication of ATE343196T1 publication Critical patent/ATE343196T1/en

Links

Classifications

    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00 - G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS TECHNIQUES OR SPEECH SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING TECHNIQUES; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signals analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signals, using source filter models or psychoacoustic analysis

Landscapes

  • Engineering & Computer Science (AREA)
  • Computational Linguistics (AREA)
  • Signal Processing (AREA)
  • Health & Medical Sciences (AREA)
  • Audiology, Speech & Language Pathology (AREA)
  • Human Computer Interaction (AREA)
  • Physics & Mathematics (AREA)
  • Acoustics & Sound (AREA)
  • Multimedia (AREA)
  • Mobile Radio Communication Systems (AREA)
  • Spectrometry And Color Measurement (AREA)
  • Reduction Or Emphasis Of Bandwidth Of Signals (AREA)
  • Alarm Systems (AREA)

Abstract

There is provided a voice activity detector (VAD) (40) for assisting the voice quality enhancement in the uplink direction of a mobile communication system in which the voice quality enhancement means (50) are embodied in the transcoding and rate adapting unit (TRAU) (2). The VAD (40) comprises means (41, 42) for performing both a spectral analysis and an energetic analysis on a received speech signal and means (43, 45) for processing the results of said analysis and taking a decision on audio segment nature. The VAD performs spectral analysis directly on the coded signal. <IMAGE>
AT04425031T 2004-01-22 2004-01-22 VOICE ACTIVITY DETECTION USING COMPRESSED VOICE SIGNAL PARAMETERS ATE343196T1 (en)

Applications Claiming Priority (1)

Application Number Priority Date Filing Date Title
EP04425031A EP1557820B1 (en) 2004-01-22 2004-01-22 Voice activity detection operating with compressed speech signal parameters

Publications (1)

Publication Number Publication Date
ATE343196T1 true ATE343196T1 (en) 2006-11-15

Family

ID=34626566

Family Applications (1)

Application Number Title Priority Date Filing Date
AT04425031T ATE343196T1 (en) 2004-01-22 2004-01-22 VOICE ACTIVITY DETECTION USING COMPRESSED VOICE SIGNAL PARAMETERS

Country Status (3)

Country Link
EP (1) EP1557820B1 (en)
AT (1) ATE343196T1 (en)
DE (1) DE602004002845T2 (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
US9972334B2 (en) * 2015-09-10 2018-05-15 Qualcomm Incorporated Decoder audio classification
CN107331393B (en) * 2017-08-15 2020-05-12 成都启英泰伦科技有限公司 Self-adaptive voice activity detection method

Family Cites Families (3)

* Cited by examiner, † Cited by third party
Publication number Priority date Publication date Assignee Title
JP3685812B2 (en) * 1993-06-29 2005-08-24 ソニー株式会社 Audio signal transmitter / receiver
SE501981C2 (en) * 1993-11-02 1995-07-03 Ericsson Telefon Ab L M Method and apparatus for discriminating between stationary and non-stationary signals
US5657422A (en) * 1994-01-28 1997-08-12 Lucent Technologies Inc. Voice activity detection driven noise remediator

Also Published As

Publication number Publication date
DE602004002845T2 (en) 2007-06-06
DE602004002845D1 (en) 2006-11-30
EP1557820A1 (en) 2005-07-27
EP1557820B1 (en) 2006-10-18

Similar Documents

Publication Publication Date Title
US10721661B2 (en) Wireless device connection handover
US9507772B2 (en) Instant translation system
CN112397083B (en) Voice processing method and related device
US20160275936A1 (en) Electronic devices and methods for compensating for environmental noise in text-to-speech applications
US7496387B2 (en) Wireless headset for use in speech recognition environment
US20070057798A1 (en) Vocalife line: a voice-operated device and system for saving lives in medical emergency
ATE339757T1 (en) METHOD AND DEVICE FOR VOICE ACTIVITY DETECTION
ATE362632T1 (en) MESSAGE TRANSMISSION DEVICE
CA2973512A1 (en) Voice recognition system and method of robot system
AU2003225928A1 (en) Method for robust voice recognition by analyzing redundant features of source signal
DK2040486T3 (en) Method and apparatus for microphone adaptation for portable directional hearing aid using the wearer&#39;s own voice
ATE410768T1 (en) SYSTEM AND METHOD FOR OPERATING A VOICE RECOGNITION SYSTEM IN A VEHICLE
NZ562182A (en) Method and apparatus for anti-sparseness filtering of a bandwidth extended speech prediction excitation signal
EP1923866A4 (en) Sound source separating device, speech recognizing device, portable telephone, and sound source separating method, and program
WO2006056972A3 (en) Method and apparatus for speaker spotting
EP1662841A3 (en) Acoustic system with automatic change-over
AU2003254288A1 (en) Distributed speech recognition with back-end voice activity detection apparatus and method
DE60038545D1 (en) ARRANGEMENT TO IMPROVE LANGUAGE QUALITY FOR VOICE OVER IP (VOIP) CALLS
CN110428806A (en) Interactive voice based on microphone signal wakes up electronic equipment, method and medium
JP2009178783A (en) Communication robot and its control method
HK1094913A1 (en) System and method for personalised text-to-voice synthesis
WO2019228329A1 (en) Personal hearing device, external sound processing device, and related computer program product
US10652397B2 (en) Terminal device and method for performing call function
KR102238979B1 (en) Pre-processing apparatus for speech recognition and method thereof
KR20190102454A (en) Method of controling volume with noise adaptiveness and device implementing thereof

Legal Events

Date Code Title Description
RER Ceased as to paragraph 5 lit. 3 law introducing patent treaties