DFS Glossary Dataset
The expert-verified Digital Financial Services glossary, published as a structured, openly-licensed dataset in Amharic and Afaan Oromoo — open language infrastructure for two of Ethiopia's major languages.
Get the dataset
GitHub
Canonical source and version history. Propose new terms or report corrections through Issues.
View repository→Hugging Face
Ready for machine learning and NLP. Load both languages directly with the datasets library.
Open on Hugging Face→Zenodo
Permanently archived with a citable DOI for academic and research use.
10.5281/zenodo.20666618→How it was built
Expert-verified
Translated and reviewed by domain experts as part of the AKOFADA project.
Fidelity-first
Built from the original editable sources — never extracted from PDFs — to preserve Ge'ez (Amharic) character accuracy. Text is kept verbatim.
Cross-linked
The two languages share an English term as a join key, so Amharic and Afaan Oromoo entries align one-to-one.
Citation
Shega and AKOFADA (2026). DFS Glossary — Amharic & Afaan Oromoo (Version 1.0) [Data set]. Zenodo. https://doi.org/10.5281/zenodo.20666618
Licensed under Creative Commons Attribution 4.0 International (CC BY 4.0). Free to share and adapt, including commercially, with attribution. Produced by Shega · AKOFADA project · funded by the Gates Foundation.