Data Contribution Portal

Your language data fuels our research. Help us build robust AI for Kashmiri.

Why Contribute?

The quality of Natural Language Processing (NLP) models depends entirely on the volume and quality of training data. Your contribution helps expand our corpus for **Speech Recognition**, **Machine Translation**, and **Lexical Resources**.

Guidelines for Submission

  • **Focus on Quality:** Submit original and grammatically correct Kashmiri text or any other related data.
  • **Datasets:** Submit datasets for developing NLP resources.
  • **Variety:** We welcome text from different domains (news, fiction, conversations).
  • **Licensing:** By submitting, you grant AI4Language a non-exclusive license to use this data for research purposes.

Submit Your Data (Text or File)

We use this only for potential follow-up questions.
Max size: 5MB. Allowed formats: **.txt, .doc, .docx, .pdf, .odt, .inpage.**