The interactive CLI now supports directory browsing! Just paste a directory path and select from available datasets.
When you run:
python llamaforge_interactive.py

Before, a directory path was rejected:
[2/5] DATASET CONFIGURATION
[i] Supported formats: JSON, JSONL, CSV, TXT
> Dataset file path: ~/LlamaForge/testtraindata
[✗] ERROR: File not found: ~/LlamaForge/testtraindata

Now, the same path opens a dataset browser:
[2/5] ██░░░ > DATASET CONFIGURATION
[i] Supported formats: JSON, JSONL, CSV, TXT
Auto-structuring enabled for all formats
Tip: You can paste a directory path to browse files
> Dataset file path or directory: ~/LlamaForge/testtraindata
[✓] Directory found: /home/joker/LlamaForge/testtraindata
┌─ Available Datasets (3 found)
├─ 1. code_alpaca.jsonl (335.5 KB)
├─ 2. practical_coding.jsonl (1.8 KB)
└─ 3. sql_generation.jsonl (153.1 KB)
> Select dataset number (1-3) [1]: 1
▰▰▰▰▰ Loading dataset COMPLETE
[✓] Dataset selected: code_alpaca.jsonl (JSONL format)
- Directories: Shows all compatible files
- Files: Uses the file directly
- **~ expansion**: `~/LlamaForge/testtraindata` works!
- Shows file sizes (KB/MB)
- Lists up to 20 files (with an overflow indicator when there are more)
- Sorted alphabetically
- Beautiful tree-style display
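
The display features above can be sketched in a few lines of Python. This is a minimal illustration, not the code in `llamaforge_interactive.py`: the names `format_size` and `render_tree`, the KB/MB cutoff at 1024 KB, and the wording of the overflow line are all assumptions.

```python
def format_size(num_bytes):
    """Format a byte count as KB below 1 MB, otherwise as MB (assumed cutoff)."""
    kb = num_bytes / 1024
    if kb < 1024:
        return f"{kb:.1f} KB"
    return f"{kb / 1024:.1f} MB"


def render_tree(entries, limit=20):
    """Build the tree-style listing from (name, size_in_bytes) pairs.

    Entries are sorted alphabetically and at most `limit` are shown;
    the remainder is summarized in a single overflow line (hypothetical wording).
    """
    entries = sorted(entries, key=lambda entry: entry[0].lower())
    shown = entries[:limit]
    hidden = len(entries) - len(shown)
    lines = [f"┌─ Available Datasets ({len(entries)} found)"]
    for number, (name, size) in enumerate(shown, start=1):
        is_last = number == len(shown) and hidden == 0
        branch = "└─" if is_last else "├─"
        lines.append(f"{branch} {number}. {name} ({format_size(size)})")
    if hidden:
        lines.append(f"└─ ... and {hidden} more")
    return lines


if __name__ == "__main__":
    sample = [
        ("sql_generation.jsonl", 156774),
        ("code_alpaca.jsonl", 343552),
        ("practical_coding.jsonl", 1843),
    ]
    print("\n".join(render_tree(sample)))
```

The byte counts in the sample are made up so that the listing matches the sizes shown in the session above.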
Automatically finds:
- `.json` files
- `.jsonl` files
- `.csv` files
- `.txt` files
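
A rough sketch of how this discovery step could work is shown below. It assumes a non-recursive scan and case-insensitive extension matching; neither is confirmed by the tool, and `find_datasets` is a hypothetical name.

```python
from pathlib import Path

# Extensions listed above; matching is case-insensitive in this sketch (assumption).
SUPPORTED_EXTENSIONS = {".json", ".jsonl", ".csv", ".txt"}


def find_datasets(directory):
    """Return supported dataset files directly inside `directory`, sorted by name."""
    root = Path(directory).expanduser()
    candidates = (
        path
        for path in root.iterdir()
        if path.is_file() and path.suffix.lower() in SUPPORTED_EXTENSIONS
    )
    return sorted(candidates, key=lambda path: path.name.lower())
```

Subdirectories and files with other extensions are skipped, which is one way to get the "only shows valid dataset files" behavior.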
> Dataset file path or directory: testtraindata
┌─ Available Datasets (3 found)
├─ 1. code_alpaca.jsonl (335.5 KB)
├─ 2. practical_coding.jsonl (1.8 KB)
└─ 3. sql_generation.jsonl (153.1 KB)
> Select dataset number: 1
> Dataset file path or directory: examples/datasets
┌─ Available Datasets (4 found)
├─ 1. code_generation.jsonl (0.9 KB)
├─ 2. instruction_following.jsonl (1.1 KB)
├─ 3. qa_pairs.json (0.8 KB)
└─ 4. sentiment.csv (0.3 KB)
> Select dataset number: 2
> Dataset file path or directory: testtraindata/code_alpaca.jsonl
▰▰▰▰▰ Loading dataset COMPLETE
[✓] Dataset validated: JSONL format detected
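
One simple way to arrive at a message like "JSONL format detected" is to map the file extension to a format name. The sketch below assumes detection is purely extension-based; the actual tool may also inspect file contents, and `detect_format` is a hypothetical helper.

```python
from pathlib import Path

# Extension-to-format table for the four supported formats.
FORMAT_BY_EXTENSION = {
    ".json": "JSON",
    ".jsonl": "JSONL",
    ".csv": "CSV",
    ".txt": "TXT",
}


def detect_format(path):
    """Return the format name for `path`, or raise ValueError if unsupported."""
    suffix = Path(path).suffix.lower()
    if suffix not in FORMAT_BY_EXTENSION:
        raise ValueError(f"Unsupported dataset format: {suffix or '(no extension)'}")
    return FORMAT_BY_EXTENSION[suffix]
```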
> Dataset file path or directory: ~/LlamaForge/testtraindata
[✓] Directory found: /home/joker/LlamaForge/testtraindata
...
- No more typing full paths - Just paste the directory
- See file sizes - Know what you're selecting
- Quick browsing - View all available datasets at once
- Error prevention - Only shows valid dataset files
- Works everywhere - Supports relative, absolute, and ~ paths
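
Supporting relative, absolute, and `~` paths mostly comes down to normalizing the input before checking it. The following is a sketch under that assumption, using only `pathlib`; `classify_path` is a hypothetical name, not a function from the tool.

```python
from pathlib import Path


def classify_path(raw):
    """Return (kind, resolved_path), where kind is 'directory', 'file', or 'missing'."""
    # expanduser() handles a leading "~"; resolve() makes relative paths absolute.
    path = Path(raw.strip()).expanduser().resolve()
    if path.is_dir():
        return "directory", path
    if path.is_file():
        return "file", path
    return "missing", path
```

A `'directory'` result would lead to the browser, a `'file'` result to direct loading, and `'missing'` to the error path.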
python llamaforge_interactive.py
# At dataset prompt, type: testtraindata
# Select from the list

# At dataset prompt, type: examples/datasets
# Browse all example datasets

# In your terminal, use tab completion:
# testtraindata/<TAB>
# Then paste the full path

- Enter on directory: Browse files
- Number + Enter: Select file
- Ctrl+C: Cancel and exit
- Invalid path: Shows helpful suggestions
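
The number-selection step can be sketched as a small parser. This assumes that pressing Enter on an empty line picks the default shown in brackets (as in `[1]` in the session above) and that out-of-range or non-numeric input is rejected; `parse_selection` is a hypothetical helper.

```python
def parse_selection(raw, count, default=1):
    """Convert a 1-based menu choice into a 0-based index.

    Returns None when the input is not a number or is outside 1..count.
    An empty input falls back to `default`.
    """
    text = raw.strip()
    try:
        choice = int(text) if text else default
    except ValueError:
        return None
    if 1 <= choice <= count:
        return choice - 1
    return None
```

In an interactive loop, a `None` result would typically trigger a re-prompt, and Ctrl+C at Python's `input()` raises `KeyboardInterrupt`, which can be caught to exit cleanly.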
Try it now:
python llamaforge_interactive.py
- Select your Ollama model (e.g., `qwen2.5:3b`)
- Paste: `testtraindata` or `~/LlamaForge/testtraindata`
- Select: Dataset from the list
- Configure training parameters
- Start training!
No more path errors! Browse your datasets like a pro. 🎯