Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

Rachel Bawden; François Yvon

doi:10.48550/ARXIV.2303.01911

Communication Dans Un Congrès Année : 2023

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

(1) , (2)

1
2

Rachel Bawden

Fonction : Auteur
PersonId : 9441
IdHAL : rachel-bawden
ORCID : 0000-0001-9553-1768
IdRef : 233174591

Automatic Language Modelling and ANAlysis & Computational Humanities

François Yvon

Fonction : Auteur
PersonId : 5347
IdHAL : francois-yvon
ORCID : 0000-0002-7972-7442
IdRef : 057593531

Traitement du Langage Parlé - LISN

Résumé

The NLP community recently saw the release of a new large open-access multilingual language model, BLOOM (BigScience et al., 2022) covering 46 languages. We focus on BLOOM's multilingual ability by evaluating its machine translation performance across several datasets (WMT, Flores-101 and DiaBLa) and language pairs (high- and low-resourced). Our results show that 0-shot performance suffers from overgeneration and generating in the wrong language, but this is greatly improved in the few-shot setting, with very good results for a number of language pairs. We study several aspects including prompt design, model sizes, cross-lingual transfer and the use of discursive context.

Mots clés

Machine translation Evaluation Large language models LLMs MT NMT

Domaines

Informatique et langage [cs.CL]

Fichier principal

eamt23.pdf (254.9 Ko)

Origine	Fichiers produits par l'(les) auteur(s)
licence	Paternité - Partage selon les Conditions Initiales

Rachel Bawden : Connectez-vous pour contacter le contributeur

https://inria.hal.science/hal-04015863

Soumis le : mardi 9 mai 2023-14:26:12

Dernière modification le : mercredi 16 octobre 2024-14:42:09

Dates et versions

hal-04015863 , version 1 (06-03-2023)

hal-04015863 , version 2 (09-05-2023)

Licence

Paternité - Partage selon les Conditions Initiales

Identifiants

HAL Id : hal-04015863 , version 2
ARXIV : 2303.01911
DOI : 10.48550/ARXIV.2303.01911

Citer

Rachel Bawden, François Yvon. Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM. EAMT 2023 - 24th Annual Conference of the European Association for Machine Translation, Jun 2023, Tampere, Finland. ⟨10.48550/ARXIV.2303.01911⟩. ⟨hal-04015863v2⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CNRS INRIA CENTRALESUPELEC INRIA2 GENCI UNIV-PARIS-SACLAY ANR PRAIRIE-IA LISN GS-COMPUTER-SCIENCE LISN-TLP

438 Consultations

132 Téléchargements

Investigating the Translation Performance of a Large Multilingual Language Model: the Case of BLOOM

Résumé

Mots clés

Domaines

Dates et versions

Licence

Identifiants

Citer

Exporter

Collections

Altmetric

Partager