Spaces:
Paused
Paused
| # LASER: application to cross-lingual natural language inference | |
| This codes shows how to use the multilingual sentence embedding for | |
| cross-lingual NLI, using the XNLI corpus. | |
| We train a NLI classifier on the English MultiNLI corpus, optimizing | |
| the meta-parameters on the English XNLI development corpus. | |
| We then apply that classifier to the test set for all 14 transfer languages. | |
| The foreign languages development set is not used. | |
| ## Installation | |
| Just run `bash ./xnli.sh` | |
| which install XNLI and MultiNLI corpora, | |
| calculates the multilingual sentence embeddings, | |
| trains the classifier and displays results. | |
| The XNLI corpus is available [here](https://www.nyu.edu/projects/bowman/xnli/). | |
| ## Results | |
| You should get the following results for zero-short cross-lingual transfer. | |
| They slightly differ from those published in the initial version of the paper [2] | |
| due to the change to PyTorch 1.0 and variations in random number generation, new optimization of meta-parameters, etc. | |
| | en | fr | es | de | el | bg | ru | tr | ar | vi | th | zh | hi | sw | ur | | |
| |-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------|-------| | |
| | 74.65 | 72.26 | 73.15 | 72.48 | 72.73 | 73.35 | 71.08 | 69.84 | 70.48 | 71.94 | 69.20 | 71.38 | 65.95 | 62.14 | 61.82 | | |
| All numbers are accuracies on the test set | |
| ## References | |
| Details on the corpus are described in this paper: | |
| [1] Alexis Conneau, Guillaume Lample, Ruty Rinott, Adina Williams, Samuel R. Bowman, Holger Schwenk and Veselin Stoyanov, | |
| [*XNLI: Cross-lingual Sentence Understanding through Inference*](https://aclweb.org/anthology/D18-1269), | |
| EMNLP, 2018. | |
| Detailed system description: | |
| [2] Mikel Artetxe and Holger Schwenk, | |
| [*Massively Multilingual Sentence Embeddings for Zero-Shot Cross-Lingual Transfer and Beyond*](https://arxiv.org/pdf/1812.10464), | |
| arXiv, Dec 26 2018. | |