Adapting NMT to caption translation in Wikimedia Commons for low-resource languages
This paper presents a successful domain adaptation of a general neural machine
translation (NMT) system using a bilingual corpus created with captions for images in Wiki-
media Commons for the Spanish-Basque and English-Irish pairs.
Keywords: Machine Translation, Low-resource languages, Bilingual corpora, Language
resources from Wikipedia
Egileak (ixakideak):
Egileak:
Alberto Poncelas, Kepa Sarasola, Meghan Dowling, Andy Way, Gorka Labaka, Iñaki Alegria
Fitxategi publikoak:
Urtea:
2019
Artikuluaren erreferentzia:
Procesamiento del Lenguaje Natural, Revista no 63, septiembre de 2019, pp. 33-40