Neural Machine Translation for Code-Switching Challenge
Build models handling mixed-language conversations common in multilingual societies
Build Statement
Hundreds of millions of Africans communicate in multiple languages simultaneously, seamlessly code-switching within conversations in patterns that reflect education, emotion, and culture. Current translation systems fail catastrophically when encountering mixed-language input, excluding African users from global digital platforms and limiting local language technology adoption. A message mixing English and Swahili, or French and Wolof, becomes incomprehensible to standard NLP systems. Developers must create neural translation models that understand and translate code-switched conversations, handling the complexity of mixed-language communication that dominates African digital spaces, enabling true multilingual participation in the digital economy.
Full Description
The Neural Machine Translation for Code-Switching Challenge seeks innovative NLP solutions that handle the reality of multilingual communication in Africa, where speakers seamlessly switch between languages within single conversations. This challenge addresses the failure of traditional translation systems when faced with the mixed-language communication that dominates African social media, messaging, and even formal communications.
Participants will develop neural machine translation models that can process and translate code-switched text where speakers mix languages like English-Swahili, French-Wolof, Arabic-Berber, Portuguese-Kinyarwanda, or English-Yoruba within single sentences. The system must understand when code-switching is meaningful (emphasizing points, cultural concepts, borrowed terms) versus arbitrary, and preserve these nuances in translation.
Successful solutions will handle intra-sentential code-switching (within sentences), inter-sentential switching (between sentences), and tag-switching (inserting phrases). Models should work with limited parallel data for code-switched text, leverage monolingual data effectively, and maintain conversation context across messages. The system should support real-time translation for messaging applications and social media platforms.
We particularly value solutions that preserve cultural and emotional nuances in code-switching, handle spelling variations and informal language, work with African language pairs that have limited resources, and can be deployed on mobile devices. The platform should help break down language barriers while respecting the natural multilingual communication patterns of African societies.
Submission Requirements
• Submit up to 5 supporting links (documents, demos, repositories)
• Additional text content and explanations are supported
• Ensure all materials are accessible and properly formatted
• Review your submission before final submission
Online Submission
Submit your solution online