🧠 LLM Training and Inference

This project contains a small LLM (Large Language Model) that can be trained and used for text generation.

🚀 Setup

Before running anything, install dependencies:

pip install torch sentencepiece tqdm

🔗 Step 1: Tokenizer Preparation

Create a file named wiki.txt and add the your content (text) to it. Before training the model, you must prepare the tokenizer:

python tokenizer.py

This step ensures that the tokenizer is trained and ready to be used.

📈 Step 2: Training the Model

To train the model, run:

python main.py --DEVICE cuda --mode train

If you want to train on CPU, run:

python main.py --DEVICE cpu --mode train

🤖 Step 3: Running Inference

Once trained, you can generate text:

python main.py --DEVICE cuda --mode inference

If you trained the model on CPU, you can also infer on CPU:

python main.py --DEVICE cpu --mode inference

📚 Reference

For more information, visit: Ideami

Name		Name	Last commit message	Last commit date
Latest commit History 8 Commits
README.md		README.md
config.py		config.py
data_utils.py		data_utils.py
inference.py		inference.py
main.py		main.py
model.py		model.py
tokenizer.py		tokenizer.py
train.py		train.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

🧠 LLM Training and Inference

🚀 Setup

🔗 Step 1: Tokenizer Preparation

📈 Step 2: Training the Model

🤖 Step 3: Running Inference

📚 Reference

About

Uh oh!

Releases

Packages

Languages

frezazadeh/LLM-Like-Llama

Folders and files

Latest commit

History

Repository files navigation

🧠 LLM Training and Inference

🚀 Setup

🔗 Step 1: Tokenizer Preparation

📈 Step 2: Training the Model

🤖 Step 3: Running Inference

📚 Reference

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages