Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

Pu, Xiao; Pappas, Nikolaos; Henderson, James; Popescu-Belis, Andrei

Computer Science > Computation and Language

arXiv:1810.02614 (cs)

[Submitted on 5 Oct 2018]

Title:Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

Authors:Xiao Pu, Nikolaos Pappas, James Henderson, Andrei Popescu-Belis

View PDF

Abstract:This paper demonstrates that word sense disambiguation (WSD) can improve neural machine translation (NMT) by widening the source context considered when modeling the senses of potentially ambiguous words. We first introduce three adaptive clustering algorithms for WSD, based on k-means, Chinese restaurant processes, and random walks, which are then applied to large word contexts represented in a low-rank space and evaluated on SemEval shared-task data. We then learn word vectors jointly with sense vectors defined by our best WSD method, within a state-of-the-art NMT system. We show that the concatenation of these vectors, and the use of a sense selection mechanism based on the weighted average of sense vectors, outperforms several baselines including sense-aware ones. This is demonstrated by translation on five language pairs. The improvements are above one BLEU point over strong NMT baselines, +4% accuracy over all ambiguous nouns and verbs, or +20% when scored manually over several challenging words.

Comments:	To appear in TACL
Subjects:	Computation and Language (cs.CL)
Cite as:	arXiv:1810.02614 [cs.CL]
	(or arXiv:1810.02614v1 [cs.CL] for this version)
	https://doi.org/10.48550/arXiv.1810.02614

Submission history

From: Nikolaos Pappas [view email]
[v1] Fri, 5 Oct 2018 11:20:39 UTC (344 KB)

Full-text links:

Access Paper:

view license

Current browse context:

cs.CL

< prev | next >

new | recent | 1810

Change to browse by:

References & Citations

DBLP - CS Bibliography

listing | bibtex

Xiao Pu
Nikolaos Pappas
James Henderson
Andrei Popescu-Belis

export BibTeX citation

Computer Science > Computation and Language

Title:Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Computer Science > Computation and Language

Title:Integrating Weakly Supervised Word Sense Disambiguation into Neural Machine Translation

Submission history

Access Paper:

References & Citations

DBLP - CS Bibliography

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators