Speaker identification using row mean of DCT and Walsh Hadamard transform
Kekre et al., 2011
- Document ID
- 17410806764758275094
- Author
- Kekre H
- Kulkarni V
- Venkatraman S
- Priya A
- Narashiman S
- Publication year
- 2011
- Publication venue
- International Journal on Computer Science and Engineering
Snippet
In this paper, we propose a unique approach to text-dependent speaker identification using transform techniques such as the DCT (Discrete Cosine Transform) and the WHT (Walsh-Hadamard Transform). The feature vectors for identification are extracted using two different …
Classifications
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/52—Extraction of features or characteristics of the image by deriving mathematical or geometrical properties from the whole image
- G06K9/522—Frequency domain transformation; Autocorrelation
-
- G—PHYSICS
- G06—COMPUTING; CALCULATING; COUNTING
- G06K—RECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
- G06K9/00—Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
- G06K9/36—Image preprocessing, i.e. processing the image information without deciding about the identity of the image
- G06K9/46—Extraction of features or characteristics of the image
- G06K9/4642—Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms
- G06K9/4647—Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms summing image-intensity values; Projection and histogram analysis
Similar Documents
Publication | Title | Publication Date |
---|---|---|
Tiwari | MFCC and its applications in speaker recognition | |
Tolba | A high-performance text-independent speaker identification of Arabic speakers using a CHMM-based approach | |
Gill et al. | Vector quantization based speaker identification | |
Sumithra et al. | A study on feature extraction techniques for text independent speaker identification | |
Kekre et al. | Performance comparison of speaker recognition using vector quantization by LBG and KFCG | |
Kumar et al. | Speaker recognition using GMM | |
Kekre et al. | Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification | |
Kekre et al. | Speaker identification using row mean of DCT and Walsh Hadamard transform | |
Maazouzi et al. | MFCC and similarity measurements for speaker identification systems | |
Brunet et al. | Speaker recognition for mobile user authentication: An android solution | |
Tazi et al. | An hybrid front-end for robust speaker identification under noisy conditions | |
Kamble et al. | Emotion recognition for instantaneous Marathi spoken words | |
Goyal et al. | Issues and challenges of voice recognition in pervasive environment | |
Kekre et al. | Performance comparison of speaker identification using DCT, Walsh, Haar on full and row mean of spectrogram | |
Sukor et al. | Speaker identification system using MFCC procedure and noise reduction method | |
Kekre et al. | Speaker identification using row mean vector of spectrogram | |
Khetri et al. | Automatic speech recognition for Marathi isolated words | |
Limkar et al. | Speaker recognition using VQ and DTW | |
Nagaraja et al. | Mono and cross lingual speaker identification with the constraint of limited data | |
Sinha et al. | Why Eli Roth should not use TTS-systems for anonymization | |
Kekre et al. | Comparative Analysis of Speaker Identification using row mean of DFT, DCT, DST and Walsh Transforms | |
Kekre et al. | Speaker identification using frequency distribution in the transform domain | |
Chougule et al. | Speaker recognition in mismatch conditions: a feature level approach | |
Bansod et al. | Speaker Recognition using Marathi (Varhadi) Language | |
Samudre | Text-independent speaker identification using vector quantization |