NAWATL
The Nawatl corpus PI-YALLI contains a set of Nawatl documents and three different static embedding models.
The corpus is split into 16 topics.
The corpus was manually compiled and processed by a team from Université d'Avignon (France), Universidad Veracruzana (Mexico) and independent researchers.
This corpus is suitable for training and testing systems that work on the Nawatl language.
New versions, containing more texts, will be released periodically.
The PI-YALLI corpus and its embeddings are distributed by the Laboratoire Informatique d'Avignon (France) under the LGPL license.
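
As an illustration of how static embeddings of this kind are typically used, here is a minimal Python sketch that parses vectors and finds the nearest neighbour of a token by cosine similarity. It assumes a word2vec-style text format (one token per line followed by its values, with an optional "vocab_size dim" header); the actual format of the PI-YALLI embedding files is not stated here and may differ. The tokens and values in SAMPLE are made-up placeholders, not data from the corpus, and the function names are hypothetical.

```python
import io
import math


def load_text_embeddings(handle):
    """Parse 'token v1 v2 ... vn' lines into a dict: token -> list of floats.

    Assumption: word2vec-style text format. Lines with fewer than three
    fields (blank lines, or a 'vocab_size dim' header) are skipped.
    """
    vectors = {}
    for line in handle:
        parts = line.strip().split()
        if len(parts) < 3:
            continue
        vectors[parts[0]] = [float(x) for x in parts[1:]]
    return vectors


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is null)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def nearest(vectors, query, k=3):
    """Return the k tokens most similar to `query`, most similar first."""
    q = vectors[query]
    scored = [(cosine(q, v), tok) for tok, v in vectors.items() if tok != query]
    return [tok for _, tok in sorted(scored, reverse=True)[:k]]


# Placeholder data only: three invented tokens with 2-dimensional vectors.
SAMPLE = "3 2\ntok_a 1.0 0.0\ntok_b 0.9 0.1\ntok_c 0.0 1.0\n"

vectors = load_text_embeddings(io.StringIO(SAMPLE))
print(nearest(vectors, "tok_a", k=1))
```

To try this on a real file, replace the io.StringIO(SAMPLE) handle with an open file handle for one of the distributed embedding files, provided the file follows the assumed text format.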
How to cite this corpus?
Contact: Juan-Manuel Torres-Moreno
http://lia.univ-avignon.fr / Université d'Avignon, France
juan *-* manuel *dot* torres *at* univ-avignon *dot* fr
Updated 20.06.25