Automatic Speech Recognition for Kurdish Dialects
Authors
Speech Communication
Vol. 145
pp. 102-118
September 10, 2023
Member Organizations:
This research develops the first comprehensive ASR system for Kurdish, supporting both Sorani and Kurmanji dialects with domain adaptation techniques. Achieved 89.3% word accuracy on conversational speech.
Large-scale audio corpus containing 1,000 hours of Kurdish speech from 500+ speakers across different dialects, ages, and regions. Includes high-quality transcriptions and speaker metadata.
Comprehensive pronunciation dictionary for Kurdish containing phonetic transcriptions for 75,000 words, including stress patterns and dialectal variations using IPA notation.
Kareem, H., Ahmed, R., & Jamal, S. (2023). Automatic Speech Recognition for Kurdish Dialects. Speech Communication, 145, 102-118.