iTRANSLIT is a deep learning based transliteration package for indic language
pip install itranslit
pytorch 1.7.0 or 1.7.0+
NB: No GPU need. It's CPU based.
| Language Name | Langauage Code |
|---|---|
| Bangla | bn |
| Gujarati | gu |
| Hindi | hi |
| Punjabi | pa |
| Sindhi | sd |
| Urdu | ur |
| Malayalam | ml |
| Tamil | ta |
from itranslit import Translit
translit = Translit('bn')
word = "aami"
output = translit.predict(word, topk=10)
print(output)- We used Google Dakshina Dataset
- Thanks to AI4Bharat for providing training notebook with details explanation
- We trained Google Dakshina lexicons train datasets for 10 epochs with batch size 128, 1e-3, embedding dim = 300, hidden dim = 512, lstm, used attention
- We evaluated our trained model with Google Dakshina lexicon test data using AI4Bharat evaluation script
- You can find evaluation summary here