This repository contains my study notes and implementations based on the Datawhale Tiny-Universe project - "A comprehensive guide to building LLM systems from scratch" (《大模型白盒子构建指南》).
Directory: Task1/
- Focus: Understanding Qwen2 model architecture and internal mechanisms
- Key Components:
  - Model configuration and initialization
  - Decoder layer implementation
  - Attention mechanism, including GQA (Grouped Query Attention)
  - Position embeddings (RoPE, Rotary Position Embedding)
  - Forward pass walkthrough
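As a study aid for the GQA component above, here is a minimal numpy sketch of the idea: the model keeps fewer key/value heads than query heads and broadcasts each KV head to its group of query heads before standard scaled dot-product attention. Shapes and function names are illustrative, not the repo's actual code.

```python
import numpy as np

def softmax(x, axis=-1):
    e = np.exp(x - x.max(axis=axis, keepdims=True))
    return e / e.sum(axis=axis, keepdims=True)

def gqa(q, k, v):
    """Grouped Query Attention (illustrative sketch).

    q: (n_q_heads, seq, d); k, v: (n_kv_heads, seq, d),
    where n_q_heads is a multiple of n_kv_heads.
    """
    n_q, n_kv = q.shape[0], k.shape[0]
    assert n_q % n_kv == 0
    rep = n_q // n_kv
    # Repeat each KV head so every group of query heads shares one KV head
    k = np.repeat(k, rep, axis=0)
    v = np.repeat(v, rep, axis=0)
    scores = q @ k.transpose(0, 2, 1) / np.sqrt(q.shape[-1])
    return softmax(scores) @ v
```

With `n_kv_heads == n_q_heads` this reduces to ordinary multi-head attention; shrinking the KV head count cuts the KV-cache size at inference time, which is the motivation for GQA.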
Directory: Task2/
- Focus: Building and pretraining a Llama3-style model from scratch
- Key Components:
  - Model pretraining pipeline
  - Data preparation and tokenization
  - Training loop implementation
  - Model inference and text generation
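For the data-preparation step, a minimal sketch of what pretraining batches look like: a character-level vocabulary plus next-token-prediction pairs, where the target sequence is the input shifted by one position. This is a simplified illustration, not the project's actual tokenizer or dataloader.

```python
import numpy as np

def build_vocab(text):
    # Character-level vocabulary: string <-> integer id maps
    chars = sorted(set(text))
    stoi = {c: i for i, c in enumerate(chars)}
    itos = {i: c for c, i in stoi.items()}
    return stoi, itos

def make_batches(ids, block_size, batch_size, rng):
    # Next-token prediction: y is x shifted left by one position
    starts = rng.integers(0, len(ids) - block_size - 1, size=batch_size)
    x = np.stack([ids[s:s + block_size] for s in starts])
    y = np.stack([ids[s + 1:s + 1 + block_size] for s in starts])
    return x, y
```

The training loop then minimizes cross-entropy between the model's logits on `x` and the shifted targets `y`.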
Directory: Task3/
- Focus: Implementing a minimal Agent system using the ReAct paradigm
- Key Components:
  - ReAct (Reasoning + Acting) framework implementation
  - Tool integration (Google Search)
  - Agent planning and execution logic
  - System prompt engineering
- Architecture: Two-stage model calling for tool selection and response generation
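The two-stage calling pattern above can be sketched as a loop: the first model call plans and emits an `Action:`/`Action Input:` pair, the agent runs the tool, and a follow-up call with the `Observation:` appended generates the final response. The prompt format, `llm` callable, and `tools` dict are illustrative assumptions, not the project's exact interfaces.

```python
import re

REACT_PROMPT = (
    "Answer the question. You may use a tool.\n"
    "Format:\nThought: ...\nAction: <tool name>\nAction Input: <args>\n"
    "or, when you can answer:\nFinal Answer: ..."
)

def react_loop(llm, tools, question, max_turns=3):
    history = f"{REACT_PROMPT}\nQuestion: {question}\n"
    for _ in range(max_turns):
        reply = llm(history)                      # stage 1: reason / pick a tool
        if "Final Answer:" in reply:
            return reply.split("Final Answer:", 1)[1].strip()
        m = re.search(r"Action: (\w+)\s*\nAction Input: (.+)", reply)
        if not m:
            return reply.strip()                  # model answered without a tool
        tool, arg = m.group(1), m.group(2).strip()
        observation = tools[tool](arg)            # execute the chosen tool
        # stage 2: feed the observation back for response generation
        history += f"{reply}\nObservation: {observation}\n"
    return "Max turns exceeded."
```

In the real system, `llm` is a chat-model API call and `tools` would include the Google Search integration.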
Directory: Task4/
- Focus: Building a comprehensive evaluation system for LLMs
- Key Components:
  - Multiple evaluation modes (generative, discriminative, choice-based)
  - Multiple metrics (F1, ROUGE, BLEU, Accuracy)
  - Custom dataset evaluation support
  - Two-stage evaluation pipeline (inference + evaluation)
- Supported Tasks: Question answering, text generation, classification
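To make two of the listed metrics concrete, here is a small sketch of token-overlap F1 (the usual metric for generative QA) and exact-match accuracy (for classification/choice tasks). Function names are illustrative; the project's scorers also cover ROUGE and BLEU.

```python
from collections import Counter

def token_f1(pred, ref):
    # Token-overlap F1: harmonic mean of precision and recall over tokens
    p_tokens, r_tokens = pred.split(), ref.split()
    common = Counter(p_tokens) & Counter(r_tokens)
    overlap = sum(common.values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(p_tokens)
    recall = overlap / len(r_tokens)
    return 2 * precision * recall / (precision + recall)

def accuracy(preds, refs):
    # Exact-match accuracy for discriminative / choice-based evaluation
    return sum(p == r for p, r in zip(preds, refs)) / len(refs)
```

The two-stage pipeline first runs inference to collect predictions, then applies scorers like these against the references.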
This repository represents my personal learning journey through the fascinating world of Large Language Models.
Happy Learning! 🎉