🚀 Describe the new functionality needed
The following is a high-level plan for the v1 release of the Llama Stack. It is not meant to be a
comprehensive list of all tasks, but rather a guide to help us stay on track. Also, it only covers
what's missing from the current state of the codebase.
Phase 1: Foundation & Infrastructure
CI/CD & Tooling
- Enable integration tests for all APIs (post-training is missing)
Phase 2: API Completion & Stabilization
Missing Features
- Audit and list missing API endpoints
- Identify areas of API we want to change / improve - now is the time to break the API if needed
- Implement missing endpoints + implementations
- Validate consistency with REST standards (like we did for Pagination, this one needs some followup)
- File API initial implementation (feat: add s3 provider to files API #1950???)
- Better MCP support
API Design & Versioning
- Freeze and document contract (schema, fields, defaults) for /v1/ - we have the docs, we just need to freeze the contract
- API deprecation strategy planning - more of a nice-to-have?
Error Handling
- Implement consistent error model (exceptions, 422, 404, etc.) - are we doing this already?
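To make the "consistent error model" item more concrete, here is a minimal sketch of one way to map exceptions to HTTP status codes and a uniform JSON body. The class names, the `to_response` helper, and the `{"error": {"code", "message"}}` envelope are all hypothetical and only illustrate the idea; they do not describe what the codebase does today.

```python
class APIError(Exception):
    """Hypothetical base error; subclasses pin a status and a stable code."""

    status = 500
    code = "internal_error"

    def __init__(self, detail: str):
        super().__init__(detail)
        self.detail = detail


class NotFoundError(APIError):
    status = 404
    code = "not_found"


class ValidationFailedError(APIError):
    status = 422
    code = "validation_failed"


def to_response(exc: Exception) -> tuple[int, dict]:
    """Translate any exception into (status, body) with one envelope shape."""
    if isinstance(exc, APIError):
        return exc.status, {"error": {"code": exc.code, "message": exc.detail}}
    # Unknown exceptions become a generic 500 so internals are not leaked.
    return 500, {"error": {"code": "internal_error", "message": "internal server error"}}
```

With a single translation point like this, every endpoint would return the same body shape for 404, 422, and unexpected failures, which is the property the audit would need to check.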
Phase 3: Distribution and Provider Integration
Do we need to do anything here?
Phase 4: Testing & Type Safety
Type Safety
- Ensure mypy passes for 100% of codebase
Test Coverage
- Reach 80%+ test coverage for unit tests
Phase 5: Polishing & Release
Code Hygiene
- Idiomatic Python throughout the codebase (chore: enable pyupgrade fixes #1806)
Documentation
- Project README cleanup: setup, dev, and usage - but more importantly clear and concise
User Experience
- Add make dev / ./scripts/bootstrap.sh to start locally (Implement a 1 line utility to get started with a llama stack distro on local #1374)
- Revisit CLI UX for symmetry and consistency (especially for "llama stack build/run") - a good CLI experience is crucial for the adoption of the Llama Stack. (example: llama stack build and llama stack run are not symmetric for --image-name argument when venv is used #1131)
- Remove the ability to llama stack run a container (Remove the ability to run the server in container from "llama stack run" #2110)
- Rebuild a new UI - make it standalone so any UI launch can connect to any llama-stack server
- Implement XDG-compliant directory handling using the platformdirs library for configuration, data, and cache paths
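As a rough illustration of the XDG item above, the sketch below resolves config, data, and cache directories through `platformdirs` (`user_config_dir`, `user_data_dir`, `user_cache_dir`). The application name `llama-stack` and the `resolve_dirs` helper are assumptions made for the example, and the fallback branch is only there so the snippet runs where `platformdirs` is not installed.

```python
import os
from pathlib import Path

APP_NAME = "llama-stack"  # assumed name, for illustration only

try:
    import platformdirs
except ImportError:  # keep the sketch runnable without the dependency
    platformdirs = None


def _xdg(env_var: str, default: str) -> Path:
    """Fallback: honor the XDG variable if set, else its documented default."""
    base = os.environ.get(env_var) or str(Path.home() / default)
    return Path(base) / APP_NAME


def resolve_dirs() -> dict[str, Path]:
    """Return per-user config, data, and cache directories for the app."""
    if platformdirs is not None:
        return {
            "config": Path(platformdirs.user_config_dir(APP_NAME)),
            "data": Path(platformdirs.user_data_dir(APP_NAME)),
            "cache": Path(platformdirs.user_cache_dir(APP_NAME)),
        }
    return {
        "config": _xdg("XDG_CONFIG_HOME", ".config"),
        "data": _xdg("XDG_DATA_HOME", ".local/share"),
        "cache": _xdg("XDG_CACHE_HOME", ".cache"),
    }
```

Routing every path lookup through one helper like this would also make it easier to migrate existing files from whatever location is used today.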
Server - Security & Observability
- More RBAC capabilities?
- More auth capabilities? (Kubernetes: feat: Add Kubernetes authentication #1778)
- Add dependency vulnerability scanning - we have dependabot already, but is it enough?
- Enable CORS (Expose knobs to configure cross-origin resource sharing for server #2119)
- Enable multi-tenant capabilities (namespaces)
- Enable multi-user capabilities
- Quotas (feat(quota): add server‑side per‑client request quotas (requires auth) #2096)
- Repo scanning for vulnerabilities (with Snyk?)
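For the per-client request quotas item above, here is a small sketch of a fixed-window counter keyed by an authenticated client ID. This is one common technique chosen for illustration, not a description of what #2096 implements; `FixedWindowQuota`, its parameters, and the in-memory store are all hypothetical.

```python
import time
from typing import Callable


class FixedWindowQuota:
    """Allow at most max_requests per client within each time window."""

    def __init__(
        self,
        max_requests: int,
        window_seconds: float,
        clock: Callable[[], float] = time.monotonic,
    ):
        self.max_requests = max_requests
        self.window_seconds = window_seconds
        self._clock = clock  # injectable so tests can control time
        # client_id -> (window start time, requests counted in that window)
        self._windows: dict[str, tuple[float, int]] = {}

    def allow(self, client_id: str) -> bool:
        now = self._clock()
        start, count = self._windows.get(client_id, (now, 0))
        if now - start >= self.window_seconds:
            start, count = now, 0  # window expired, start a fresh one
        if count >= self.max_requests:
            self._windows[client_id] = (start, count)
            return False
        self._windows[client_id] = (start, count + 1)
        return True
```

A real server would likely keep these counters in shared storage rather than process memory, so that limits hold across multiple workers.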