Skip to content
Discussion options

You must be logged in to vote
Admin verified this answer by Bhavesh716 Apr 19, 2026

Hey! I trained it on a simple plain text file — anything works really, a book, some articles, or even Wikipedia dumps.

For beginners I'd suggest starting with a single book or story (~1MB). Clean, consistent text gives the best results for a small model like this.

Just drop it in data.txt and run train.py — that's it.

Replies: 1 comment

Comment options

You must be logged in to vote
0 replies
Answer verified by Admin Apr 19, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
Labels
None yet
2 participants