Moroccan Arabic Datasets

Moroccan Darija, the colloquial Arabic dialect spoken in Morocco, has been the focus of several dataset initiatives aimed at supporting natural language processing (NLP) applications. Moroccan Darija, spoken by over 33.5 million people, is at the heart of our mission to create chatbots that communicate naturally and effectively. Our goal is simple: break down language barriers and make technology accessible to everyone. Below are some notable resources:

Standard languages like English or French often hinder effective communication for many users. Smartly’s fine-tuned AI models enable chatbots to understand and respond accurately in Darija, whether written in Arabic script or Latin transcription (Arabizi). This helps businesses build trust by offering clear and precise responses in their customers’ native language.

Repository: https://github.com/cmajoubi/Lexsense-Moroccan-Darija-Datasets

Leave a Reply

Your email address will not be published. Required fields are marked *