Publications
Publications about some works that I have done or collaborated with. You can download the documents to read them in full.
Journal articles
2021
-
Tag-less back-translationMachine Translation, Dec 2021
-
A hybrid approach for improved low resource neural machine translation using monolingual dataEngineering Letters, Nov 2021
Conference and workshop papers
2023
-
HaVQA: A Dataset for Visual Question Answering and Multimodal Research inHausa LanguageIn Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
-
HausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-information for Multi-level Sexism ClassificationIn Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023
-
SemEval-2023 Task 12: Sentiment Analysis forAfrican Languages (AfriSenti-SemEval)In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023
-
AfricaNEWS: News Topic Classification for African languagesIn 4th Workshop on African Natural Language Processing, Jul 2023
-
MasakhaNEWS: News Topic Classification for African languagesIn Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, Nov 2023
2022
-
NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment AnalysisIn Proceedings of the Language Resources and Evaluation Conference, Jun 2022
-
Quantity vs. Quality of Monolingual Source Data in Automatic Text Translation: Can It Be Too Little If It Is Too Good?In 2022 IEEE Nigeria 4th International Conference on Disruptive Technologies for Sustainable Development (NIGERCON), Jun 2022
-
A Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models forAfrican News TranslationIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jul 2022
-
NECAT-CLWE: A Simple But Efficient Parallel Data Generation Approach for Unsupervised and Semi-Supervised Neural Machine TranslationIn 3rd Workshop on African Natural Language Processing, Jul 2022
-
The African Stopwords Project: Curating Stopwords for African LanguagesIn 3rd Workshop on African Natural Language Processing, Jul 2022
-
Domain-Specific Lexicon-Based Sentiment Analysis using Contextual Shifter PatternsIn Proceedings of the Sixth Workshop on Widening Natural Language Processing, Dec 2022
-
HERDPhobia: A Dataset for Hate Speech Detection against Fulani Herdsmen in NigeriaIn Proceedings of the Sixth Workshop on Widening Natural Language Processing, Dec 2022
-
MasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity RecognitionIn Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Dec 2022
-
Hausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine TranslationIn Proceedings of the Language Resources and Evaluation Conference, Jun 2022
-
Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African LanguagesIn Proceedings of the Seventh Conference on Machine Translation, Dec 2022
2019
-
HauWE: Hausa Words Embedding for Natural Language ProcessingIn 2019 2nd International Conference of the IEEE Nigeria Computer Chapter, NigeriaComputConf 2019, Dec 2019
Preprints
2023
-
AfriSenti: A Twitter Sentiment Analysis Benchmark for African LanguagesDec 2023
2022
-
BLOOM: A 176B-Parameter Open-Access Multilingual Language ModelDec 2022