Introduction
Sahara is a comprehensive benchmark for African NLP, part of our ACL 2025 paper, "Where Are We? Evaluating LLM Performance on African Languages". Africa's rich linguistic heritage remains underrepresented in NLP, largely due to historical policies that favor foreign languages and create significant data inequities. In the paper, we integrate theoretical insights on Africa's language landscape with an empirical evaluation using Sahara. Sahara is curated from large-scale, publicly accessible datasets capturing the continent's linguistic diversity. By systematically assessing the performance of leading large language models (LLMs) on Sahara, we demonstrate how policy-induced data variations directly impact model effectiveness across African languages. Our findings reveal that while a few languages perform reasonably well, many Indigenous languages remain marginalized due to sparse data. Sahara supports an impressive 517 languages and varieties, across 16 tasks, making it the most extensive and representative benchmark for African NLP.
Task CLusters
The benchmark comprises 16 tasks across four primary clusters, providing a robust framework for evaluation:
MCCR*
- - Context-based QA
- - General Knowledge
- - Mathematical Word Problems
- - Reading Comprehension
Text Classification
- - Cross-Lingual NLI
- - Language Identification
- - News Classification
- - Sentiment Analysis
- - Topic Classification
Text Generation
- - Machine Translation
- - Paraphrase
- - Summarization
- - Title Generation
Token-level
- - Named Entity Recognition
- - Phrase Chunking
- - Part of Speech Tagging
Citation
If you use the Sahara benchmark for your scientific publication, or if you find the resources in this website useful, please cite our paper.
@inproceedings{adebara-etal-2025-evaluating,
title = "Where Are We? Evaluating {LLM} Performance on {A}frican Languages",
author = "Adebara, Ife and
Toyin, Hawau Olamide and
Ghebremichael, Nahom Tesfu and
Elmadany, AbdelRahim A. and
Abdul-Mageed, Muhammad",
editor = "Che, Wanxiang and
Nabende, Joyce and
Shutova, Ekaterina and
Pilehvar, Mohammad Taher",
booktitle = "Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)",
month = jul,
year = "2025",
address = "Vienna, Austria",
publisher = "Association for Computational Linguistics",
url = "https://aclanthology.org/2025.acl-long.1572/",
pages = "32704--32731",
ISBN = "979-8-89176-251-0",
}