Publications
My publications.
2025
- NAACLAfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African LanguagesIn Proceedings of the 2025 Conference of the Nations of the Americas Chapter of the Association for Computational Linguistics: Human Language Technologies (Volume 1: Long Papers), Apr 2025
- arXivBRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 LanguagesApr 2025
2024
- LREC-COLINGMitigating Translationese in Low-resource Languages: The Storyboard ApproachIn Proceedings of the 2024 Joint International Conference on Computational Linguistics, Language Resources and Evaluation (LREC-COLING 2024), May 2024
- SACAIRAnalysing Public Transport User Sentiment on Low Resource Multilingual DataIn Proceedings of the Fifth Southern African Conference for Artificial Intelligence Research, Jul 2024
2023
- ACLHaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa LanguageIn Findings of the Association for Computational Linguistics: ACL 2023, Jul 2023
- SemEvalHausaNLP at SemEval-2023 Task 10: Transfer Learning, Synthetic Data and Side-information for Multi-level Sexism ClassificationIn Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023
- ICCAITAnalyzing COVID-19 Vaccination Sentiments in Nigerian Cyberspace: Insights from a Manually Annotated Twitter DatasetIn Proceedings of the International Conference on Computing and Advances in Information Technology (ICCAIT 2023), Nov 2023
- ICCAITLeveraging Closed-Access Multilingual Embedding for Automatic Sentence Alignment in Low Resource LanguagesIn Proceedings of the International Conference on Computing and Advances in Information Technology (ICCAIT 2023), Nov 2023
- SemEvalSemEval-2023 Task 12: Sentiment Analysis for African Languages (AfriSenti-SemEval)In Proceedings of the The 17th International Workshop on Semantic Evaluation (SemEval-2023), Jul 2023
- IJCNLPMasakhaNEWS: News Topic Classification for African languagesIn Proceedings of the 13th International Joint Conference on Natural Language Processing and the 3rd Conference of the Asia-Pacific Chapter of the Association for Computational Linguistics, Nov 2023
- EMNLPAfriSenti: A Twitter Sentiment Analysis Benchmark for African LanguagesIn Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, Dec 2023
2022
- LRECNaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment AnalysisIn Proceedings of the Language Resources and Evaluation Conference, Jun 2022
- NAACLA Few Thousand Translations Go a Long Way! Leveraging Pre-trained Models for African News TranslationIn Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Jul 2022
- AfricaNLPNECAT-CLWE: A Simple But Efficient Parallel Data Generation Approach for Unsupervised and Semi-Supervised Neural Machine TranslationIn 3rd Workshop on African Natural Language Processing, Jul 2022
- AfricaNLPThe African Stopwords Project: Curating Stopwords for African LanguagesIn 3rd Workshop on African Natural Language Processing, Jul 2022
- WiNLPDomain-Specific Lexicon-Based Sentiment Analysis using Contextual Shifter PatternsIn Proceedings of the Sixth Workshop on Widening Natural Language Processing, Dec 2022
- WiNLPHERDPhobia: A Dataset for Hate Speech Detection against Fulani Herdsmen in NigeriaIn Proceedings of the Sixth Workshop on Widening Natural Language Processing, Dec 2022
- EMNLPMasakhaNER 2.0: Africa-centric Transfer Learning for Named Entity RecognitionIn Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing, Dec 2022
- LRECHausa Visual Genome: A Dataset for Multi-Modal English to Hausa Machine TranslationIn Proceedings of the Language Resources and Evaluation Conference, Jun 2022
- WMTSeparating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low-Resourced African LanguagesIn Proceedings of the Seventh Conference on Machine Translation, Dec 2022
2021
- IAENG ELA hybrid approach for improved low resource neural machine translation using monolingual dataEngineering Letters, Nov 2021
2019
- IEEEHauWE: Hausa Words Embedding for Natural Language ProcessingIn 2019 2nd International Conference of the IEEE Nigeria Computer Chapter, NigeriaComputConf 2019, Nov 2019