An Attention-Based Joint Acoustic and Text on-Device End-To-End Model.

scholar.google.com › citations

… attention-based joint acoustic and text on-device end-to …
Sainath · Cited by 15

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

research.google › pubs › an-attention-bas...

We introduce a joint acoustic and text-only decoder (JATD) into the LAS decoder, which allows the LAS decoder to be trained on a much larger text-corporate.

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

ieeexplore.ieee.org › document

May 4, 2020 · We find that the JATD model obtains in a 3-10% relative improvement in WER compared to a LAS decoder trained only on supervised audio-text pairs ...

[PDF] an attention-based joint acoustic and text on-device end-to-end model

ronw.net › pubs › icassp2020-jatd

We find that the JATD model obtains in a 3-10% relative improvement in WER compared to a LAS decoder trained only on supervised audio-text pairs across a ...

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

www.semanticscholar.org › paper › An-...

A joint acoustic and text decoder (JATD) into the LAS decoder, which makes it possible to incorporate a much larger text corpus into training and obtains in ...

An Attention-Based Joint Acoustic And Text On-Device End-To-End Model

ieeetv.ieee.org › ondemand › ieee-icassp-...

Recently, we introduced a two-pass on-device end-to-end (E2E) speech recognition model, which runs RNN-T in the first-pass and then rescores/redecodes the ...

[PDF] An Attention-Based Joint Acoustic and Text On-Device End-to-End Model

sigport.org › pubdlcnt › pubdlcnt

May 6, 2020 · E2E models are trained on audio-text pairs, which is a fraction of data compared to a conventional ASR model. ○ E2E models lag behind ...

lingvo/PUBLICATIONS.md at master - GitHub

github.com › tensorflow › lingvo › blob

Strohman, “An attention-based joint acoustic and text on-device end-to-end model,” in Proc. IEEE International Conference on Acoustics, Speech, and Signal ...

Tara Sainath - Google Sites

sites.google.com › site › tsainath

Strohman, "An Attention-Based Joint Acoustic and Text On-Device End-to-End Model," in Proc. ICASSP, 2020. B. Li, S. Chang, T.N. Sainath, R. Pang, Y. He T ...

[2309.07369] Hybrid Attention-based Encoder-decoder Model for Efficient ...

arxiv.org › eess

Sep 14, 2023 · Our HAED model separates the acoustic and language models, allowing for the use of conventional text-based language model adaptation techniques.

Hybrid Attention-based Encoder-decoder Model for Efficient ...

arxiv.org › html

Sep 14, 2024 · In this work, we propose a novel hybrid attention-based encoder-decoder model that enables efficient text adaptation in an end-to-end speech ...

Scholarly articles for An Attention-Based Joint Acoustic and Text on-Device End-To-End Model.

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

[PDF] an attention-based joint acoustic and text on-device end-to-end model

An Attention-Based Joint Acoustic and Text on-Device End-To-End Model

An Attention-Based Joint Acoustic And Text On-Device End-To-End Model

[PDF] An Attention-Based Joint Acoustic and Text On-Device End-to-End Model

lingvo/PUBLICATIONS.md at master - GitHub

Tara Sainath - Google Sites

[2309.07369] Hybrid Attention-based Encoder-decoder Model for Efficient ...

Hybrid Attention-based Encoder-decoder Model for Efficient ...