• Home /
  • Papers /
  • Neural Machine Translation for Kurdish-English Language Pairs

Abstract

We present a transformer-based neural machine translation system specifically designed for Kurdish-English translation, incorporating morphological awareness and handling dialectal variations across Sorani and Kurmanji.

Keywords

Neural Machine Translation Kurdish Transformer Morphological Analysis

Related Datasets

Kurdish-English Parallel Translation Corpus

May 8, 2023 450 MB TSV, JSON

High-quality parallel corpus containing 500,000 sentence pairs for Kurdish-English translation, covering multiple domains and ensuring balanced representation of both Sorani and Kurmanji dialects.

Kurdish Morphological Analysis Dataset

April 20, 2023 85 MB CSV, XML, JSON

Comprehensive morphological analysis dataset containing 100,000 Kurdish words with detailed morphological breakdowns, POS tags, and inflectional information for both Sorani and Kurmanji dialects.

Citation

Salim, N., & Rashid, L. (2023). Neural Machine Translation for Kurdish-English Language Pairs. Computational Linguistics, 49(2), 23-41.

Publication Details

Authors 2 authors
Datasets 2 datasets