Hey community!
I've built a large language model from scratch with 3 million training tokens. The training was not fully complete due to compute limitations, but the model is producing somewhat coherent text. Have a go at it and let me know what changes I can make.
Hey community! I've built a large language model from scratch with 3 million training tokens. The training was not fully complete due to compute limitations, but the model is producing somewhat coherent text. Have a go at it and let me know what changes I can make.