DSVAE: Disentangled Representation Learning for Synthetic Speech Detection.

AllVideos Images Books Maps News Shopping

DSVAE: Interpretable Disentangled Representation for Synthetic ... - arXiv

Apr 6, 2023 · In this paper, we propose Disentangled Spectrogram Variational Auto Encoder (DSVAE) which is a two staged trained variational autoencoder that processes ...

[PDF] DSVAE: Interpretable Disentangled Representation for ...

www.semanticscholar.org › paper

Disentangled Spectrogram Variational Auto Encoder (DSVAE) is a two staged trained variational autoencoder that processes spectrograms of speech using ...

DSVAE: Disentangled Representation Learning for Synthetic ...

re.public.polimi.it › handle

Our experimental results show high accuracy (> 98%) on detecting synthetic speech from 6 known and 10 unknown speech synthesizers. Further, the visualization of ...

‪Ziyue Xiang‬ - ‪Google Scholar‬

scholar.google.com › citations

Co-authors ; Dsvae: Interpretable disentangled representation for synthetic speech detection. AKS Yadav, K Bhagtani, Z Xiang, P Bestagini, S Tubaro, EJ Delp.

Learning Disentangled Speech Representations - arxiv-sanity

arxiv-sanity-lite.com › ...

In this paper, we propose Disentangled Spectrogram Variational Auto Encoder (DSVAE) which is a two staged trained variational autoencoder that processes ...

Top 10 representations sorted by informativeness scores. We can ...

www.researchgate.net › figure › Top-10-...

... Disentangled representation learning methods leverage the idea that it is possible to divide learned representations into interpretable components [12, 70].

Where and What? Examining Interpretable Disentangled ...

www.researchgate.net › ... › Interpretation

... Disentangled representation learning methods leverage the idea that it is possible to divide learned representations into multiple explainable components [ ...

Kratika Bhagtani - Papers With Code

paperswithcode.com › author › kratika-b...

In this work, we examine bias in existing synthetic speech detectors to determine if they will unfairly target a particular gender, age and accent group.

‪Kratika Bhagtani‬ - ‪Google Scholar‬

scholar.google.de › citations

DSVAE: Disentangled Representation Learning for Synthetic Speech Detection. AKS Yadav, K Bhagtani, Z Xiang, P Bestagini, S Tubaro, EJ Delp. 2023 International ...

Ziyue Xiang - CatalyzeX

www.catalyzex.com › author

DSVAE also creates an activation map to highlight the spectrogram regions that discriminate synthetic and bona fide human speech signals. We evaluated the ...