Development of Accent-Based Automatic Speech Recognition (ASR) for the Kashmiri Language
A Key Initiative for Low-Resource Language Technology
Project Overview
Funding Agency
J&K Science, Technology & Innovation Council
Sanctioned Budget
₹6 Lakhs
Duration
2 Years
Status
Ongoing
Principal Investigator
Co-Principal Investigator
Project Goal & Challenge
This project aims to develop an advanced **Accent-Based Automatic Speech Recognition (ASR)** system tailored for the **Kashmiri language**. The core challenge addressed is the significant phonetic and accentual diversity across different regions where Kashmiri is spoken. The research focuses on capturing and modelling these accent variations to build a robust, dialect-aware ASR system.
The system will be capable of accurately transcribing spontaneous Kashmiri speech irrespective of **regional accent differences**, moving beyond standard dialect recognition to full transcription capability.
Regional Dialect Focus
The research specifically models accent variations primarily from three major dialect groups, each with distinct pronunciation patterns, vocabulary influences, and phonological structures:
North Kashmiri
Bandipora, Kupwara, and parts of Baramulla
South Kashmiri
Anantnag, Kulgam, Shopian, Pulwama
Central Kashmiri
Srinagar, Budgam, Ganderbal
Methodology & Technology
The project involves several key technical components:
- Comprehensive Accented Speech Dataset Creation:Large-scale audio recordings from native speakers representing the regional dialects are collected.
- Data Annotation: Detailed phoneme-level and accent-specific annotations are performed on the collected dataset.
- Acoustic Modeling: Development of specialized acoustic models to handle phonetic variations.
- Language Model Development: Creation of robust language models using advanced deep neural architectures.
- Model Architectures:Utilizing modern techniques including Convolutional Neural Networks (CNNs), Recurrent Neural Networks (RNNs), and transformer-based models for state-of-the-art ASR performance.
Societal Impact & Future Applications
The system will serve as a foundational resource for building key applications for the Kashmiri community, contributing significantly to digital accessibility and language preservation:
- Kashmiri voice assistants and intelligent agents.
- Accurate speech-to-text services for media and journalism.
- Educational tools for language learning.
- E-governance applications to improve public service access.
- Advancing AI technologies for low-resource and underrepresented languages in India.