Skip to content

aashwinraj/MiniGPT

Folders and files

NameName
Last commit message
Last commit date

Latest commit

Β 

History

3 Commits
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 
Β 

Repository files navigation

🧠 MiniGPT β€” Lightweight Text Generation Model

MiniGPT is a minimal yet functional GPT-style text generation project built from scratch using PyTorch and deployed via a FastAPI inference API.

It demonstrates a full mini-LLM lifecycle β€” training, saving checkpoints, and serving text generation via an HTTP endpoint.


πŸš€ Features

  • 🧩 Custom GPT-style architecture implemented in PyTorch
  • πŸ‹οΈβ€β™‚οΈ Trained on tokenized text data (Byte Pair Encoding vocabulary)
  • πŸ’Ύ Checkpoint saving & loading for model reuse
  • βš™οΈ FastAPI inference server exposing a /generate endpoint
  • πŸ”₯ GPU acceleration (if available) using torch.cuda
  • πŸ“¦ Clean project structure with tokenizer, model, and inference neatly organized

uvicorn app:app --reload --port 8080 curl -X POST "http://127.0.0.1:8080/generate"
-H "Content-Type: application/json"
-d '{ "prompt": "I am a boy from", "max_new_tokens": 150 }' Screenshot 2025-11-04 161758

About

No description, website, or topics provided.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published