Releases: spignelon/nollama
Nollama 0.5 release
NoLlama v0.5 - Multi-Provider LLM Support 🚀
🎉 Major Release: Complete Rewrite with LiteLLM Integration
NoLlama has been completely refactored to support 8 major LLM providers through a unified interface powered by LiteLLM.
✨ New Features
🌐 Multi-Provider Support
Access models from 8 major providers with a single interface:
- Google Gemini - Google AI Studio models
- Vertex AI - Google Cloud Platform models
- Groq - Ultra-fast inference with Llama, Mixtral, Gemma
- Anthropic - Claude Sonnet, Opus, and Haiku models
- OpenAI - GPT-4o, GPT-4, o1, and more
- DeepSeek - DeepSeek Chat, Coder, and Reasoner
- OpenRouter - Access to models from multiple providers
- Ollama - Local model support
🔍 Dynamic Model Discovery
- No more hardcoded models! Models are fetched automatically from each provider's API
- Always up-to-date with the latest available models
- Prevents errors from deprecated or removed models
⚡ Smart Model Selection
- Type-to-search with auto-completion as you type
- Vim-style search using
/model-namefor quick filtering - Browse hundreds of models effortlessly
- Intelligent partial matching and refinement
💬 Configurable Multi-Turn Context
- NEW:
MAX_MULTITURN_PAIRSsetting to limit conversation history - Keep last N conversation pairs to manage token usage
- Unlimited context by default
🎨 Enhanced UI/UX
- Suppressed verbose LiteLLM debug output for cleaner interface
- Improved error handling and user feedback
- Better status indicators during model fetching
📝 Configuration Improvements
Dual Configuration Support
- .env file for development/git clone users
~/.nollamafile for pip install users- Complete configuration template with all 8 providers
- Environment variable support for all settings
Advanced Settings
## Configurable multi-turn context window
# MAX_MULTITURN_PAIRS=0 # 0 = unlimited (default)
## Provider API keys
# GEMINI_API_KEY=your_key_here
# GROQ_API_KEY=your_key_here
# ANTHROPIC_API_KEY=your_key_here
# OPENAI_API_KEY=your_key_here
# And more...📚 Documentation Updates
Comprehensive README Rewrite
- Clear installation instructions for all platforms (Linux/macOS/Windows)
- Platform-specific configuration guides
- Detailed usage examples and workflows
- Updated roadmap with completed features
New Dependencies
litellm>=1.0.0- Multi-provider LLM supportpython-dotenv>=1.0.0- Environment variable managementprompt_toolkit>=3.0.0- Advanced terminal UI with search
🔧 Technical Improvements
- Removed dependency on
google-genai(replaced with LiteLLM) - Better error handling for API failures
- Improved message history management
- Cleaner code structure with separated concerns
- Python 3.10-3.13 support (dropped 3.9)
- Development status: Alpha → Beta
📦 Installation
pip install nollama --upgrade🚀 Quick Start
# Set up your API key
echo "GROQ_API_KEY=your_api_key" > ~/.nollama
# Run nollama
nollamaFull Changelog: v0.4...v0.5
Nollama 0.4 Release
Nollama 0.4 enhances your terminal experience for interacting with Google's AI models directly from your command line. This release significantly expands model support with the addition of new Gemma models and updates to the existing Gemini lineup, alongside several UI and technical improvements.
What's New
✨ Expanded Model Support: Added Gemma 3 27B and Gemma 3n E4B models to the selection
🚀 Updated Model Lineup: Now includes Gemini 2.0 Flash Lite and updated versions of other models
🏷️ Improved Model Display: Now shows user-friendly model names instead of technical model IDs
📦 Dependency Updates: All required modules updated to their latest versions for improved performance
🛠️ Enhanced Stability: Various bug fixes and code optimizations
Installation
pip install nollama
Get your free API key from Google AI Studio and start chatting with powerful language models through a lightweight terminal interface.
Usage
Run nollama in your terminal after installation. Select from our expanded model lineup and start your conversation!
NoLlama v0.3 Release
NoLlama 0.3 brings a streamlined terminal experience for interacting with Google's Gemini AI models directly from your command line. This release features support for the latest Gemini models including Gemini 2.5 Pro Preview and Gemini 2.5 Flash Preview, with enhanced conversation context handling.
What's New
- ✨ Added context window support for more coherent multi-turn conversations
- 🚀 Support for latest Gemini 2.0 and 2.5 models
- 💬 Real-time streaming of AI responses
- 🎨 Rich markdown rendering with syntax highlighting
- 🛠️ Improved error handling and stability
Installation
pip install nollama
Get your free API key from Google AI Studio and start chatting with powerful language models through a lightweight terminal interface.
Release v0.2.1
Updated g4f to the latest version, the old release wasn't working. This new release should work.
Release v0.1
Initial release of NoLlama, includes a Linux binary. A Windows binary will be added soon. For more details, check out the project README.