Document-Level Machine Translation with Large Language Models

Wang, Longyue; Lyu, Chenyang; Ji, Tianbo; Zhang, Zhirui; Yu, Dian; Shi, Shuming; Tu, Zhaopeng

Computer Science > Computation and Language

arXiv:2304.02210v1 (cs)

[Submitted on 5 Apr 2023 (this version), latest version 24 Oct 2023 (v2)]

Title:Document-Level Machine Translation with Large Language Models

Authors:Longyue Wang, Chenyang Lyu, Tianbo Ji, Zhirui Zhang, Dian Yu, Shuming Shi, Zhaopeng Tu

View PDF

Abstract:Large language models (LLMs) such as Chat-GPT can produce coherent, cohesive, relevant, and fluent answers for various natural language processing (NLP) tasks. Taking document-level machine translation (MT) as a testbed, this paper provides an in-depth evaluation of LLMs' ability on discourse modeling. The study fo-cuses on three aspects: 1) Effects of Discourse-Aware Prompts, where we investigate the impact of different prompts on document-level translation quality and discourse phenomena; 2) Comparison of Translation Models, where we compare the translation performance of Chat-GPT with commercial MT systems and advanced document-level MT methods; 3) Analysis of Discourse Modelling Abilities, where we further probe discourse knowledge encoded in LLMs and examine the impact of training techniques on discourse modeling. By evaluating a number of benchmarks, we surprisingly find that 1) leveraging their powerful long-text mod-eling capabilities, ChatGPT outperforms commercial MT systems in terms of human evaluation. 2) GPT-4 demonstrates a strong ability to explain discourse knowledge, even through it may select incorrect translation candidates in contrastive testing. 3) ChatGPT and GPT-4 have demonstrated superior performance and show potential to become a new and promising paradigm for document-level translation. This work highlights the challenges and opportunities of discourse modeling for LLMs, which we hope can inspire the future design and evaluation of LLMs.

Subjects:	Computation and Language (cs.CL); Artificial Intelligence (cs.AI)
Cite as:	arXiv:2304.02210 [cs.CL]
	(or arXiv:2304.02210v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.2304.02210

Submission history

From: Longyue Wang [view email]
[v1] Wed, 5 Apr 2023 03:49:06 UTC (1,563 KB)
[v2] Tue, 24 Oct 2023 14:00:21 UTC (799 KB)

Computer Science > Computation and Language

Title:Document-Level Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Document-Level Machine Translation with Large Language Models

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators