indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
-
Updated
Jan 2, 2024 - Jupyter Notebook
indicTranslate v1 - Machine Translation for 11 Indic languages. For latest v2, check: https://github.com/AI4Bharat/IndicTrans2
A pipeline for transliteration, spell correction, POS tagging and word sense disambiguation of Hinglish code mixed data to Hindi Devanagari script.
Vyākarana: A Colorless Green Benchmark for Syntactic Evaluation in Indic Languages
Non-contextual : Word2Vec, FastText Contextual : BERT, RoBERTa, ELECTRA, CamemBERT, Distil-BERT, XLM-RoBERTa Analyzed embedding models, used the best one to build a Flask web app for Hindi NER and data collection from user feedback, deployed on AWS.
Contextualized Topic Modeling using Zero-Shot Learning on Indic Languages (IndicCTM)
We have done cleaning on the Hindi dataset and removed the characters which are not required in it
KPT: Kannada Pre-trained Transformer
A production-ready, frugal, sovereign AI system that orchestrates India's open-source language models to achieve state-of-the-art reasoning on consumer hardware through Test-Time Compute (TTC) and Cognitive Serialization.
Efficient fine-tuning of Llama-3.2-1B-Instruct on the Bhojpuri language using Unsloth and LoRA. Includes a complete workflow for instruction tuning and dataset preparation.
AyushDhara AI is a voice-first DPI bridging the rural health gap. Using Bedrock RAG, it provides grounded AYUSH guidance and a Sentinel dashboard for proactive, ABDM-ready public health surveillance.
Add a description, image, and links to the indic-nlp topic page so that developers can more easily learn about it.
To associate your repository with the indic-nlp topic, visit your repo's landing page and select "manage topics."