Niger-Mali Audio Collection
This website shares the raw audio data distributed for the IWSLT 2022 Low-resource Speech Translation Track. The audio was web-crawled from the Studio Kalangou and Studio Tamani websites, with the authorization of Fondation Hirondelle. When using this data, please cite our paper and this website.
Before downloading the data, please fill out THIS FORM. A password for downloading the files will then be sent to you.
- Niger (Studio Kalangou)
  - Accented French:
    - 2019: 11-19, 12-19
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
  - Fulfulde:
    - 2019: 11-19, 12-19
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
  - Hausa:
    - 2019: 11-19, 12-19
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
  - Tamasheq:
    - 2019: 11-19, 12-19
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
  - Zarma:
    - 2019: 11-19, 12-19
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
- Mali (Studio Tamani)
  - Tamasheq:
    - 2020: 01-20, 02-20, 03-20, 04-20, 05-20, 06-20, 07-20, 08-20, 09-20, 10-20, 11-20, 12-20
    - 2021: 01-21, 02-21, 03-21, 04-21, 05-21, 06-21, 07-21, 08-21, 09-21
    - Pre-segmented version. This file is a filtered and segmented version of all the data above.
    - List of URLs (for wget downloading)
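For readers who prefer a script to wget, the URL lists above could be consumed along the following lines. This is only a sketch under assumptions: it supposes each list is a plain text file with one URL per line, and the names `read_url_list`, `local_name`, `download_all`, `urls.txt` and `audio/` are hypothetical. How the password is applied is not described on this page, so it is left out here.

```python
from pathlib import Path
from urllib.parse import urlparse
from urllib.request import urlretrieve


def read_url_list(list_path):
    """Return the non-empty, non-comment lines of a URL list file.

    Assumption: the list holds one URL per line.
    """
    urls = []
    for line in Path(list_path).read_text(encoding="utf-8").splitlines():
        line = line.strip()
        if line and not line.startswith("#"):
            urls.append(line)
    return urls


def local_name(url):
    """Derive a local file name from the last path component of a URL."""
    name = Path(urlparse(url).path).name
    return name or "index.html"


def download_all(list_path, dest_dir):
    """Download every listed URL into dest_dir, skipping files already present."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    fetched = []
    for url in read_url_list(list_path):
        target = dest / local_name(url)
        if not target.exists():
            urlretrieve(url, str(target))
            fetched.append(target)
    return fetched


if __name__ == "__main__":
    # Hypothetical file and folder names, for illustration only.
    for path in download_all("urls.txt", "audio"):
        print("downloaded", path)
```

Skipping files that already exist lets an interrupted run be restarted without fetching everything again. With wget itself, a list file can be passed through the `-i` option.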
Contact: For more information, please contact marcely.zanon-boito or yannick.esteve at univ-avignon.fr
All audio recordings are the property of Studio Kalangou, Studio Tamani and Fondation Hirondelle.
This work is licensed under a Creative Commons Attribution-NonCommercial-NoDerivs 3.0 Unported License.