Neural Machine Translation for Code-Switching Challenge

Build models handling mixed-language conversations common in multilingual societies

Build Statement

Hundreds of millions of Africans communicate in multiple languages simultaneously, seamlessly code-switching within conversations in patterns that reflect education, emotion, and culture. Current translation systems fail catastrophically when encountering mixed-language input, excluding African users from global digital platforms and limiting local language technology adoption. A message mixing English and Swahili, or French and Wolof, becomes incomprehensible to standard NLP systems. Developers must create neural translation models that understand and translate code-switched conversations, handling the complexity of mixed-language communication that dominates African digital spaces, enabling true multilingual participation in the digital economy.

Full Description

The Neural Machine Translation for Code-Switching Challenge seeks innovative NLP solutions that handle the reality of multilingual communication in Africa, where speakers seamlessly switch between languages within single conversations. This challenge addresses the failure of traditional translation systems when faced with the mixed-language communication that dominates African social media, messaging, and even formal communications.

Participants will develop neural machine translation models that can process and translate code-switched text where speakers mix languages like English-Swahili, French-Wolof, Arabic-Berber, Portuguese-Kinyarwanda, or English-Yoruba within single sentences. The system must understand when code-switching is meaningful (emphasizing points, cultural concepts, borrowed terms) versus arbitrary, and preserve these nuances in translation.

Successful solutions will handle intra-sentential code-switching (within sentences), inter-sentential switching (between sentences), and tag-switching (inserting phrases). Models should work with limited parallel data for code-switched text, leverage monolingual data effectively, and maintain conversation context across messages. The system should support real-time translation for messaging applications and social media platforms.

We particularly value solutions that preserve cultural and emotional nuances in code-switching, handle spelling variations and informal language, work with African language pairs that have limited resources, and can be deployed on mobile devices. The platform should help break down language barriers while respecting the natural multilingual communication patterns of African societies.

Submission Requirements

• Submit up to 5 supporting links (documents, demos, repositories)

• Additional text content and explanations are supported

• Ensure all materials are accessible and properly formatted

• Review your submission before final submission

Online Submission

Submit your solution online

Deadline
December 30, 2025 at 12:00 AM
Prize Pool
$1,000 USD + Internship
Cash Prize
$1000
Organizer
Build54
Evaluation Criteria
Translation Quality 20%
Accuracy in translating code-switched text while preserving meaning
Code-Switch Handling 18%
Ability to detect and appropriately process language switches
Cultural Preservation 16%
Maintaining cultural nuances and context in translation
Language Coverage 14%
Number of African language pairs supported
Real-time Performance 12%
Speed suitable for messaging and conversation
Low-Resource Adaptation 10%
Performance with limited training data
Deployment Feasibility 10%
Ability to run on mobile devices or edge servers