Computer Science > Computation and Language

arXiv:2309.06706 (cs)

[Submitted on 13 Sep 2023 (v1), last revised 15 Feb 2024 (this version, v2)]

Title:Simultaneous Machine Translation with Large Language Models

Authors:Minghan Wang, Jinming Zhao, Thuy-Trang Vu, Fatemeh Shiri, Ehsan Shareghi, Gholamreza Haffari

View PDF

Abstract:Real-world simultaneous machine translation (SimulMT) systems face more challenges than just the quality-latency trade-off. They also need to address issues related to robustness with noisy input, processing long contexts, and flexibility for knowledge injection. These challenges demand models with strong language understanding and generation capabilities which may not often equipped by dedicated MT models. In this paper, we investigate the possibility of applying Large Language Models (LLM) to SimulMT tasks by using existing incremental-decoding methods with a newly proposed RALCP algorithm for latency reduction. We conducted experiments using the \texttt{Llama2-7b-chat} model on nine different languages from the MUST-C dataset. The results show that LLM outperforms dedicated MT models in terms of BLEU and LAAL metrics. Further analysis indicates that LLM has advantages in terms of tuning efficiency and robustness. However, it is important to note that the computational cost of LLM remains a significant obstacle to its application in SimulMT.\footnote{We will release our code, weights, and data with publication.}

Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:2309.06706 [cs.CL]
	(or arXiv:2309.06706v2 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2309.06706

Submission history

From: Minghan Wang [view email]
[v1] Wed, 13 Sep 2023 04:06:47 UTC (79 KB)
[v2] Thu, 15 Feb 2024 06:50:00 UTC (7,983 KB)

Computer Science > Computation and Language

Title:Simultaneous Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Simultaneous Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators