MiniGPT is a minimal yet functional GPT-style text generation project built from scratch using PyTorch and deployed via a FastAPI inference API.
It demonstrates a full mini-LLM lifecycle: training, saving checkpoints, and serving text generation via an HTTP endpoint.
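As a rough illustration of the training half of that lifecycle, the sketch below saves a checkpoint at the end of a training run. The `MiniGPT` class, its hyperparameters, the `checkpoint.pt` filename, and the checkpoint keys are illustrative assumptions, not the repository's exact code.

```python
import torch

# Illustrative sketch only: `MiniGPT`, its constructor arguments, and the
# checkpoint layout are assumptions about this project's structure.
from model import MiniGPT  # hypothetical import path

model = MiniGPT(vocab_size=8192, n_layer=6, n_head=6, n_embd=384)  # assumed hyperparameters
optimizer = torch.optim.AdamW(model.parameters(), lr=3e-4)

# ... training loop runs here ...

# Save everything needed to resume training or serve the model later.
torch.save(
    {
        "model_state_dict": model.state_dict(),
        "optimizer_state_dict": optimizer.state_dict(),
    },
    "checkpoint.pt",
)
```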
- 🧩 Custom GPT-style architecture implemented in PyTorch
- 🏋️ Trained on tokenized text data (Byte Pair Encoding vocabulary)
- 💾 Checkpoint saving & loading for model reuse
- ⚙️ FastAPI inference server exposing a `/generate` endpoint (see the sketch after this list)
- 🔥 GPU acceleration (if available) using `torch.cuda`
- 📦 Clean project structure with tokenizer, model, and inference neatly organized
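A minimal sketch of how these pieces could fit together is shown below: pick a device via `torch.cuda.is_available()`, load the saved checkpoint, and expose a `/generate` endpoint with FastAPI. The module names, hyperparameters, tokenizer helpers, `generate()` signature, and response field are assumptions, not the repository's exact implementation.

```python
# Serving sketch, assuming a MiniGPT model class, BPE tokenizer helpers
# encode()/decode(), and the checkpoint layout from the earlier snippet.
import torch
from fastapi import FastAPI
from pydantic import BaseModel

from model import MiniGPT              # hypothetical import
from tokenizer import encode, decode   # hypothetical BPE tokenizer helpers

device = "cuda" if torch.cuda.is_available() else "cpu"  # use the GPU if available

model = MiniGPT(vocab_size=8192, n_layer=6, n_head=6, n_embd=384)  # assumed hyperparameters
ckpt = torch.load("checkpoint.pt", map_location=device)
model.load_state_dict(ckpt["model_state_dict"])
model.to(device)
model.eval()

app = FastAPI()

class GenerateRequest(BaseModel):
    prompt: str
    max_new_tokens: int = 100

@app.post("/generate")
def generate(req: GenerateRequest):
    # Encode the prompt, autoregressively sample new tokens, and decode back to text.
    idx = torch.tensor([encode(req.prompt)], dtype=torch.long, device=device)
    with torch.no_grad():
        out = model.generate(idx, max_new_tokens=req.max_new_tokens)  # assumed method
    return {"generated_text": decode(out[0].tolist())}  # response field name is illustrative
```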
Start the inference server with Uvicorn:

```bash
uvicorn app:app --reload --port 8080
```
Then request a completion from the `/generate` endpoint:

```bash
curl -X POST "http://127.0.0.1:8080/generate" \
  -H "Content-Type: application/json" \
  -d '{
    "prompt": "I am a boy from",
    "max_new_tokens": 150
  }'
```
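The same request can be issued from Python; this small client is a sketch that assumes only the `requests` package, and the shape of the JSON it prints depends on how the server is implemented.

```python
import requests

# Send the same request as the curl example above.
resp = requests.post(
    "http://127.0.0.1:8080/generate",
    json={"prompt": "I am a boy from", "max_new_tokens": 150},
    timeout=60,
)
resp.raise_for_status()
print(resp.json())  # response schema depends on the server implementation
```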
