Saudi Arabic Conversations Dataset
50,000 synthetic customer service conversations in authentic Saudi Arabic dialects. Built for fine-tuning Arabic LLMs, chatbot training, and NLP research.
{
"id": "uuid",
"status": "completed",
"metadata": { "dialect": "Najdi", "sector": "Fintech", "sentiment": "Angry", "topic": "Transfer Failed" },
"conversation": [ { "role": "user", "content": "..." }, { "role": "agent", "content": "..." } ],
"slug": "transfer-failed-a1b2c3"
}
Visitors can browse real completed conversations and download only the first 500 examples, in the public preview format.
Public preview currently exposes 20 completed conversations · download is capped at the first 500 rows.
Drop the JSONL directly into your training pipeline. Format-ready for Hugging Face, Axolotl, and LLaMA-Factory.
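As a rough illustration of what loading the file might look like, the sketch below parses JSONL records shaped like the sample above and converts each one into a `messages` list. The mapping of the `agent` role to `assistant` and the choice to keep only `completed` records are assumptions made for this example, and the exact input format each training tool expects should be checked against that tool's documentation.

```python
import json
from typing import Iterable, Iterator

# Assumption: the dataset's "agent" role corresponds to the "assistant"
# role used by many chat fine-tuning formats.
ROLE_MAP = {"user": "user", "agent": "assistant"}


def parse_jsonl(lines: Iterable[str]) -> Iterator[dict]:
    """Yield one record per non-empty JSONL line."""
    for line in lines:
        line = line.strip()
        if line:
            yield json.loads(line)


def to_chat_example(record: dict) -> dict:
    """Convert one dataset record into a {"messages": [...]} example."""
    messages = [
        {"role": ROLE_MAP[turn["role"]], "content": turn["content"]}
        for turn in record["conversation"]
    ]
    return {"messages": messages}


# One line shaped like the sample record; the "..." contents are placeholders.
sample_line = json.dumps(
    {
        "id": "uuid",
        "status": "completed",
        "metadata": {
            "dialect": "Najdi",
            "sector": "Fintech",
            "sentiment": "Angry",
            "topic": "Transfer Failed",
        },
        "conversation": [
            {"role": "user", "content": "..."},
            {"role": "agent", "content": "..."},
        ],
        "slug": "transfer-failed-a1b2c3",
    },
    ensure_ascii=False,
)

# Assumption: only records marked "completed" are wanted for training.
examples = [
    to_chat_example(record)
    for record in parse_jsonl([sample_line])
    if record["status"] == "completed"
]
```

To read from disk instead, pass an open file handle to `parse_jsonl`, opened with `encoding="utf-8"` so the Arabic text is decoded correctly.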
Build Saudi customer service bots that actually sound local. Real dialect vocabulary, not translated MSA.
Sentiment analysis, dialect classification, named-entity extraction. Labeled metadata included per row.
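The per-row metadata can be turned into supervised examples for tasks such as dialect or sentiment classification. The following is a minimal sketch, assuming records follow the sample schema above; the helper name `to_labeled_rows`, the choice to use only the user turns as input text, and the second sample record (including its `Hejazi`, `Telecom`, `Neutral`, and `Billing Question` values) are hypothetical and used purely for illustration.

```python
from collections import Counter


def to_labeled_rows(records, label_field):
    """Pair each conversation's user text with one metadata field as its label."""
    rows = []
    for record in records:
        # Assumption: only the customer's turns are used as classifier input.
        user_text = "\n".join(
            turn["content"]
            for turn in record["conversation"]
            if turn["role"] == "user"
        )
        rows.append({"text": user_text, "label": record["metadata"][label_field]})
    return rows


# First record mirrors the sample schema; the second is made up for this example.
records = [
    {
        "metadata": {"dialect": "Najdi", "sector": "Fintech",
                     "sentiment": "Angry", "topic": "Transfer Failed"},
        "conversation": [
            {"role": "user", "content": "..."},
            {"role": "agent", "content": "..."},
        ],
    },
    {
        "metadata": {"dialect": "Hejazi", "sector": "Telecom",
                     "sentiment": "Neutral", "topic": "Billing Question"},
        "conversation": [
            {"role": "user", "content": "..."},
            {"role": "agent", "content": "..."},
        ],
    },
]

dialect_rows = to_labeled_rows(records, "dialect")
sentiment_rows = to_labeled_rows(records, "sentiment")

# Quick check of class balance before training a classifier.
dialect_counts = Counter(row["label"] for row in dialect_rows)
```

Checking the label counts first helps reveal class imbalance across dialects or sentiments before a split into training and evaluation sets.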
Message us on WhatsApp — we'll confirm and send the file directly.