Zhu et al., 2014 - Google Patents
Relationship between Chinese speech intelligibility and speech transmission index under reproduced general room conditionsZhu et al., 2014
View PDF- Document ID
- 2267629851188654949
- Author
- Zhu P
- Mo F
- Kang J
- Publication year
- Publication venue
- Acta Acustica united with Acustica
External Links
Snippet
The subjective Chinese (Mandarin) articulation scores of a total of 50 sound conditions, namely at 12 receiver positions in four rooms, were obtained by expert listeners based on in- situ measured binaural room impulse responses (BRIRs) and binaural technology for …
- 230000005540 biological transmission 0 title abstract description 16
Classifications
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/02—Speech enhancement, e.g. noise reduction or echo cancellation
- G10L21/0208—Noise filtering
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L21/00—Processing of the speech or voice signal to produce another audible or non-audible signal, e.g. visual or tactile, in order to modify its quality or its intelligibility
- G10L21/003—Changing voice quality, e.g. pitch or formants
- G10L21/007—Changing voice quality, e.g. pitch or formants characterised by the process used
- G10L21/013—Adapting to target pitch
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis
- G10L19/008—Multichannel audio signal coding or decoding, i.e. using interchannel correlation to reduce redundancies, e.g. joint-stereo, intensity-coding, matrixing
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/301—Automatic calibration of stereophonic sound system, e.g. with test microphone
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/48—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00 specially adapted for particular use
-
- H—ELECTRICITY
- H04—ELECTRIC COMMUNICATION TECHNIQUE
- H04S—STEREOPHONIC SYSTEMS
- H04S7/00—Indicating arrangements; Control arrangements, e.g. balance control
- H04S7/30—Control circuits for electronic adaptation of the sound field
- H04S7/305—Electronic adaptation of stereophonic audio signals to reverberation of the listening space
-
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
Similar Documents
Publication | Publication Date | Title |
---|---|---|
Zhu et al. | Relationship between Chinese speech intelligibility and speech transmission index under reproduced general room conditions | |
EP3751560B1 (en) | Automatic speech recognition system with integrated perceptual based adversarial audio attacks | |
Zhang et al. | Effects of telephone transmission on the performance of formant-trajectory-based forensic voice comparison–female voices | |
Xia et al. | Effects of reverberation and noise on speech intelligibility in normal-hearing and aided hearing-impaired listeners | |
Peng et al. | Relationship between Chinese speech intelligibility and speech transmission index in rooms based on auralization | |
Peng et al. | Chinese speech intelligibility and its relationship with the speech transmission index for children in elementary school classrooms | |
Monson et al. | Detection of high-frequency energy level changes in speech and singing | |
Zhu et al. | Experimental comparison of speech transmission index measurement in natural sound rooms and auditoria | |
Völk | Inter-and intra-individual variability in the blocked auditory canal transfer functions of three circum-aural headphones | |
Zhu et al. | Comparisons between simulated and in-situ measured speech intelligibility based on (binaural) room impulse responses | |
Zhu et al. | Influence of sound source characteristics in determining objective speech intelligibility metrics | |
Peng | Chinese word identification and sentence intelligibility in primary school classrooms | |
Sang et al. | Speech quality evaluation of a sparse coding shrinkage noise reduction algorithm with normal hearing and hearing impaired listeners | |
Brachmanski | Experimental comparison between speech transmission index (STI) and mean opinion scores (MOS) in rooms | |
Greenberg et al. | Report on Performance Results in the NIST 2010 Speaker Recognition Evaluation. | |
Kubo et al. | Effects of speaker's and listener's acoustic environments on speech intelligibility and annoyance | |
Jin et al. | The effect of noise envelope modulation on quality judgments of noisy speech | |
Brachmański | Estimation of logatom intelligibility with the STI method for polish speech transmitted via communication channels | |
CN113450780A (en) | Lombard effect classification method for auditory perception loudness space | |
Siyu et al. | Relationship between Chinese Mandarin intelligibility and speech transmission index STIPA under simulated tranmission conditions | |
Vaziri et al. | Evaluating noise suppression methods for recovering the Lombard speech from vocal output in an external noise field | |
Sudarsono et al. | Sound level calibration on soundscape reproduction using headphone | |
Kobayashi et al. | Bootstrap masker generation method for speech masking systems | |
Raitio et al. | On measuring the intelligibility of synthetic speech in noise—Do we need a realistic noise environment? | |
BRACHMAŃSKI | Objective measure for assessment of speech quality in rooms |