Nothing Special   »   [go: up one dir, main page]

Kekre et al., 2011 - Google Patents

Speaker identification using row mean of DCT and walsh hadamard transform

Kekre et al., 2011

View PDF
Document ID
17410806764758275094
Author
Kekre H
Kulkarni V
Venkatraman S
Priya A
Narashiman S
Publication year
Publication venue
International Journal on Computer Science and Engineering

External Links

Snippet

In this paper we propose a unique approach to text dependent speaker identification using transformation techniques such as DCT (Discrete Cosine Transform) and WHT (Walsh and Hadamard Transform). The feature vectors for identification are extracted using two different …
Continue reading at citeseerx.ist.psu.edu (PDF) (other versions)

Classifications

    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • G06K9/52Extraction of features or characteristics of the image by deriving mathematical or geometrical properties from the whole image
    • G06K9/522Frequency domain transformation; Autocorrelation
    • GPHYSICS
    • G06COMPUTING; CALCULATING; COUNTING
    • G06KRECOGNITION OF DATA; PRESENTATION OF DATA; RECORD CARRIERS; HANDLING RECORD CARRIERS
    • G06K9/00Methods or arrangements for reading or recognising printed or written characters or for recognising patterns, e.g. fingerprints
    • G06K9/36Image preprocessing, i.e. processing the image information without deciding about the identity of the image
    • G06K9/46Extraction of features or characteristics of the image
    • G06K9/4642Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms
    • G06K9/4647Extraction of features or characteristics of the image by performing operations within image blocks or by using histograms summing image-intensity values; Projection and histogram analysis

Similar Documents

Publication Publication Date Title
Tiwari MFCC and its applications in speaker recognition
Tolba A high-performance text-independent speaker identification of Arabic speakers using a CHMM-based approach
Gill et al. Vector quantization based speaker identification
Sumithra et al. A study on feature extraction techniques for text independent speaker identification
Kekre et al. Performance comparison of speaker recognition using vector quantization by LBG and KFCG
Kumar et al. Speaker recognition using GMM
Kekre et al. Performance comparison of 2-D DCT on full/block spectrogram and 1-D DCT on row mean of spectrogram for speaker identification
Kekre et al. Speaker identification using row mean of DCT and walsh hadamard transform
Maazouzi et al. MFCC and similarity measurements for speaker identification systems
Brunet et al. Speaker recognition for mobile user authentication: An android solution
Tazi et al. An hybrid front-end for robust speaker identification under noisy conditions
Kamble et al. Emotion recognition for instantaneous Marathi spoken words
Goyal et al. Issues and challenges of voice recognition in pervasive environment
Kekre et al. Performance comparison of speaker identification using dct, walsh, haar on full and row mean of spectrogram
Sukor et al. Speaker identification system using MFCC procedure and noise reduction method
Kekre et al. Speaker identification using row mean vector of spectrogram
Khetri et al. Automatic speech recognition for marathi isolated words
Limkar et al. Speaker recognition using VQ and DTW
Nagaraja et al. Mono and cross lingual speaker identification with the constraint of limited data
Sinha et al. Why Eli Roth should not use TTS-systems for anonymization
Kekre et al. Comparative Analysis of Speaker Identification using row mean of DFT, DCT, DST and Walsh Transforms
Kekre et al. Speaker identification using frequency dsitribution in the transform domain
Chougule et al. Speaker recognition in mismatch conditions: a feature level approach
Bansod et al. Speaker Recognition using Marathi (Varhadi) Language
Samudre Text-independent speaker identification using vector quantization