This directory stores all data files for the AI-Tutor system, including knowledge bases, user data, logs, etc.
data/
├── knowledge_bases/           # Knowledge base storage
└── user/                      # User activity data
    ├── solve/                 # Problem-solving results
    ├── question/              # Generated questions
    ├── research/              # Research reports and cache
    ├── co_writer/             # Interactive IdeaGen documents
    ├── notebook/              # Notebook records
    ├── guide/                 # Guided-learning sessions
    └── run_code_workspace/    # Code execution workspace
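The layout above can be created on a fresh checkout with a short script. This is a minimal sketch that assumes only the directory names shown in the tree; the function name `init_data_dirs` and its optional root argument are illustrative and not part of the project.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of AI-Tutor): create the data/ layout
# shown in the tree under a given root directory.
init_data_dirs() {
    local root="${1:-data}"   # assumption: the default root is ./data
    local sub

    mkdir -p "$root/knowledge_bases"
    for sub in solve question research co_writer notebook guide run_code_workspace; do
        mkdir -p "$root/user/$sub"
    done
}
```

Calling `init_data_dirs` with no argument creates `./data`; passing a path such as `init_data_dirs /tmp/demo/data` builds the same tree elsewhere. Because `mkdir -p` is used, running it again on an existing tree changes nothing.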
- knowledge_bases/: Stores all knowledge base data files for the AI-Tutor system.
- user/: Stores all user-generated data and output files.
- user/solve/: Problem-solving module output directory containing results from problem-solving tasks.
- user/question/: Question generation module output directory containing generated questions and validation results.
- user/research/: Research module output directory containing research reports and cache data.
- user/co_writer/: Co-Writer module output directory for interactive IdeaGen documents.
- user/notebook/: Notebook data storage containing notebook records and indices.
- user/guide/: Guided-learning module output directory containing learning session data.
- user/run_code_workspace/: Code execution workspace for temporarily storing files generated by code execution.
Data directory paths are configured in config/main.yaml:
paths:
  user_data_dir: "./data/user"
  knowledge_bases_dir: "./data/knowledge_bases"
  run_code_workspace_dir: "./data/user/run_code_workspace"

- Backup Important Data: Regularly back up knowledge_bases/ and important user data
- Version Control: Add the data/ directory to .gitignore to avoid committing large files
- Disk Space: Knowledge bases and user data can occupy significant disk space; clean up old data regularly
- Permission Management: Ensure the application has read/write permissions on the data directories
- Path Consistency: All modules use the unified path configuration; avoid hardcoded paths
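Two of the notes above, version control and permissions, can be checked mechanically. The following is a rough sketch rather than project tooling: the function names `ensure_gitignored` and `check_rw` are hypothetical, and the default `.gitignore` location is an assumption.

```shell
#!/usr/bin/env bash
# Hypothetical helpers (not part of AI-Tutor) illustrating the notes above.

# Append a pattern to a .gitignore file only if that exact line is missing.
ensure_gitignored() {
    local pattern="$1"
    local gitignore="${2:-.gitignore}"   # assumption: file lives in the current directory

    touch "$gitignore"
    if ! grep -qxF -- "$pattern" "$gitignore"; then
        # Add a newline first if the file does not already end with one.
        if [ -s "$gitignore" ] && [ -n "$(tail -c1 "$gitignore")" ]; then
            printf '\n' >> "$gitignore"
        fi
        printf '%s\n' "$pattern" >> "$gitignore"
    fi
}

# Succeed only if the directory exists and the current user can read and write it.
check_rw() {
    local dir="$1"
    [ -d "$dir" ] && [ -r "$dir" ] && [ -w "$dir" ]
}
```

For example, `ensure_gitignored "data/"` followed by `check_rw data/user || echo "data/user is not writable"` covers both notes. The `grep -x -F` combination matches the whole line literally, so an existing entry such as `data/cache/` does not count as already ignoring `data/`.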
- Knowledge Base Management: src/knowledge/ - Knowledge base creation, updates, and queries
- User Data: Each functional module automatically manages its corresponding user data directory
- Logging System: src/core/logging/ - Unified logging management
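# Clean old user data directories (customize based on your retention policy).
# "*_YYYYMMDD*" is a placeholder and matches nothing as written; replace it with
# your actual naming convention, e.g. with GNU find:
#   find data/user -regextype posix-extended -type d -regex '.*_[0-9]{8}.*' -mtime +30 -prune -exec rm -rf {} \;
# Preview the matches first by replacing "-prune -exec rm -rf {} \;" with "-print".
find data/user -type d -name "*_YYYYMMDD*" -mtime +30 -prune -exec rm -rf {} \;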
# Clean temporary workspace files
find data/user/run_code_workspace -type f -mtime +7 -delete

# Backup entire knowledge base directory
tar -czf knowledge_bases_backup_$(date +%Y%m%d).tar.gz data/knowledge_bases/

# Backup specific knowledge base (replace {kb_name} with the actual directory name)
tar -czf kb_backup.tar.gz data/knowledge_bases/{kb_name}/

# Restore knowledge base (run from the project root; the archive stores paths
# starting with data/knowledge_bases/, so extract into the current directory)
tar -xzf knowledge_bases_backup_20250101.tar.gz -C .
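Before restoring, it can help to confirm that an archive is readable and has the layout you expect. The sketch below assumes the backup was created from the project root, so its members start with `data/knowledge_bases/`; the function name `safe_restore` is hypothetical and not part of the project.

```shell
#!/usr/bin/env bash
# Hypothetical helper (not part of AI-Tutor): restore a knowledge base backup
# only if the archive lists cleanly and has the expected top-level path.
safe_restore() {
    local archive="$1"
    local dest="${2:-.}"   # assumption: default destination is the project root
    local listing

    # tar -tzf lists members without extracting; a corrupt archive fails here.
    if ! listing="$(tar -tzf "$archive" 2>/dev/null)"; then
        echo "refusing to restore: cannot read $archive" >&2
        return 1
    fi

    # Assumption: backups were made with paths relative to the project root.
    if ! grep -q '^data/knowledge_bases/' <<<"$listing"; then
        echo "refusing to restore: unexpected layout in $archive" >&2
        return 1
    fi

    mkdir -p "$dest"
    tar -xzf "$archive" -C "$dest"
}
```

Run from the project root, `safe_restore knowledge_bases_backup_20250101.tar.gz` extracts into `./data/knowledge_bases/`. Passing a scratch directory as the second argument lets you inspect the restored files before copying them over live data.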