Tried every approach across iterations 1-5.
Most of them failed terribly or were too GPU intensive, except two.
Deleted all of those iterations.
Iteration 6 is built upon the knowledge obtained from the failures of the previous 5 iterations.
Now the AIM is to make a robust transformer that can write poems nearly as good as Devkota's (although it can't).
Mistakes in previous approaches
no clean data
a poorly suited tokenizer
limited GPU
high perplexity
What to be improved
use cleaner data - collecting it myself now
tokenizer - using a custom tokenizer (a rough sketch follows this list)
limited GPU - using Colab for training and keeping resource usage in mind (see the training sketch below)
increase the number of epochs to reduce perplexity (the old models converged to lower perplexity at higher epoch counts; see the perplexity sketch below)
and the MOST IMPORTANT thing:
NO AI for code; this project is to understand the internals of an LLM.
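A minimal sketch of what the custom tokenizer could look like, assuming a character-level vocabulary built straight from the poem corpus (the class name `CharTokenizer` and the file `poems.txt` are placeholders, not actual project files):

```python
# Hypothetical character-level tokenizer for the poem corpus.
class CharTokenizer:
    def __init__(self, text):
        # build the vocabulary from every unique character in the corpus
        chars = sorted(set(text))
        self.stoi = {ch: i for i, ch in enumerate(chars)}
        self.itos = {i: ch for ch, i in self.stoi.items()}

    def encode(self, s):
        # map each character to its integer id
        return [self.stoi[ch] for ch in s]

    def decode(self, ids):
        # map integer ids back to characters
        return "".join(self.itos[i] for i in ids)


if __name__ == "__main__":
    corpus = open("poems.txt", encoding="utf-8").read()  # placeholder data file
    tok = CharTokenizer(corpus)
    ids = tok.encode(corpus[:50])
    print(ids[:10], tok.decode(ids[:10]))
```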
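And a hedged sketch of how training might stay within a Colab GPU budget, using gradient accumulation plus mixed precision; the tiny model, optimizer, and fake loader here are stand-ins, not the real training code:

```python
import torch
import torch.nn.functional as F

# toy stand-ins so the sketch runs on its own; the real project would plug in
# its transformer, optimizer, and DataLoader here
model = torch.nn.Linear(64, 100).cuda()
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)
loader = [(torch.randn(8, 64), torch.randint(0, 100, (8,))) for _ in range(16)]

scaler = torch.cuda.amp.GradScaler()   # mixed precision to save GPU memory
accum_steps = 4                        # effective batch = 8 * 4 = 32

for step, (x, y) in enumerate(loader):
    x, y = x.cuda(), y.cuda()
    with torch.cuda.amp.autocast():
        loss = F.cross_entropy(model(x), y) / accum_steps
    scaler.scale(loss).backward()      # gradients accumulate across small steps
    if (step + 1) % accum_steps == 0:
        scaler.step(optimizer)
        scaler.update()
        optimizer.zero_grad(set_to_none=True)
```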
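Perplexity is just the exponential of the average cross-entropy per token, so tracking it across more epochs is straightforward; this evaluation helper is a sketch and assumes the model returns (batch, seq, vocab) logits:

```python
import math
import torch
import torch.nn.functional as F

@torch.no_grad()
def perplexity(model, loader, device="cuda"):
    """exp(mean cross-entropy per token) over a held-out set; lower is better."""
    total_loss, total_tokens = 0.0, 0
    for x, y in loader:                      # x, y: (batch, seq) token ids
        x, y = x.to(device), y.to(device)
        logits = model(x)                    # assumed shape (batch, seq, vocab)
        loss = F.cross_entropy(
            logits.view(-1, logits.size(-1)), y.view(-1), reduction="sum"
        )
        total_loss += loss.item()
        total_tokens += y.numel()
    return math.exp(total_loss / total_tokens)
```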
Happy Coding:)