Skip to content

Releases: spignelon/nollama

Nollama 0.5 release

04 Feb 19:35
dd8d3d1

Choose a tag to compare

NoLlama v0.5 - Multi-Provider LLM Support 🚀

🎉 Major Release: Complete Rewrite with LiteLLM Integration

NoLlama has been completely refactored to support 8 major LLM providers through a unified interface powered by LiteLLM.

✨ New Features

🌐 Multi-Provider Support

Access models from 8 major providers with a single interface:

  • Google Gemini - Google AI Studio models
  • Vertex AI - Google Cloud Platform models
  • Groq - Ultra-fast inference with Llama, Mixtral, Gemma
  • Anthropic - Claude Sonnet, Opus, and Haiku models
  • OpenAI - GPT-4o, GPT-4, o1, and more
  • DeepSeek - DeepSeek Chat, Coder, and Reasoner
  • OpenRouter - Access to models from multiple providers
  • Ollama - Local model support

🔍 Dynamic Model Discovery

  • No more hardcoded models! Models are fetched automatically from each provider's API
  • Always up-to-date with the latest available models
  • Prevents errors from deprecated or removed models

⚡ Smart Model Selection

  • Type-to-search with auto-completion as you type
  • Vim-style search using /model-name for quick filtering
  • Browse hundreds of models effortlessly
  • Intelligent partial matching and refinement

💬 Configurable Multi-Turn Context

  • NEW: MAX_MULTITURN_PAIRS setting to limit conversation history
  • Keep last N conversation pairs to manage token usage
  • Unlimited context by default

🎨 Enhanced UI/UX

  • Suppressed verbose LiteLLM debug output for cleaner interface
  • Improved error handling and user feedback
  • Better status indicators during model fetching

📝 Configuration Improvements

Dual Configuration Support

  • .env file for development/git clone users
  • ~/.nollama file for pip install users
  • Complete configuration template with all 8 providers
  • Environment variable support for all settings

Advanced Settings

## Configurable multi-turn context window
# MAX_MULTITURN_PAIRS=0  # 0 = unlimited (default)

## Provider API keys
# GEMINI_API_KEY=your_key_here
# GROQ_API_KEY=your_key_here
# ANTHROPIC_API_KEY=your_key_here
# OPENAI_API_KEY=your_key_here
# And more...

📚 Documentation Updates

Comprehensive README Rewrite

  • Clear installation instructions for all platforms (Linux/macOS/Windows)
  • Platform-specific configuration guides
  • Detailed usage examples and workflows
  • Updated roadmap with completed features

New Dependencies

  • litellm>=1.0.0 - Multi-provider LLM support
  • python-dotenv>=1.0.0 - Environment variable management
  • prompt_toolkit>=3.0.0 - Advanced terminal UI with search

🔧 Technical Improvements

  • Removed dependency on google-genai (replaced with LiteLLM)
  • Better error handling for API failures
  • Improved message history management
  • Cleaner code structure with separated concerns
  • Python 3.10-3.13 support (dropped 3.9)
  • Development status: Alpha → Beta

📦 Installation

pip install nollama --upgrade

🚀 Quick Start

# Set up your API key
echo "GROQ_API_KEY=your_api_key" > ~/.nollama

# Run nollama
nollama

Full Changelog: v0.4...v0.5

Nollama 0.4 Release

04 Jun 06:53
639d54c

Choose a tag to compare

Nollama 0.4 enhances your terminal experience for interacting with Google's AI models directly from your command line. This release significantly expands model support with the addition of new Gemma models and updates to the existing Gemini lineup, alongside several UI and technical improvements.

What's New

Expanded Model Support: Added Gemma 3 27B and Gemma 3n E4B models to the selection
🚀 Updated Model Lineup: Now includes Gemini 2.0 Flash Lite and updated versions of other models
🏷️ Improved Model Display: Now shows user-friendly model names instead of technical model IDs
📦 Dependency Updates: All required modules updated to their latest versions for improved performance
🛠️ Enhanced Stability: Various bug fixes and code optimizations

Installation

pip install nollama

Get your free API key from Google AI Studio and start chatting with powerful language models through a lightweight terminal interface.

Usage

Run nollama in your terminal after installation. Select from our expanded model lineup and start your conversation!

NoLlama v0.3 Release

19 Apr 22:54
29b9d1b

Choose a tag to compare

NoLlama 0.3 brings a streamlined terminal experience for interacting with Google's Gemini AI models directly from your command line. This release features support for the latest Gemini models including Gemini 2.5 Pro Preview and Gemini 2.5 Flash Preview, with enhanced conversation context handling.

What's New

  • ✨ Added context window support for more coherent multi-turn conversations
  • 🚀 Support for latest Gemini 2.0 and 2.5 models
  • 💬 Real-time streaming of AI responses
  • 🎨 Rich markdown rendering with syntax highlighting
  • 🛠️ Improved error handling and stability

Installation
pip install nollama

Get your free API key from Google AI Studio and start chatting with powerful language models through a lightweight terminal interface.

Release v0.2.1

05 Nov 16:24
6d4e4ed

Choose a tag to compare

Updated g4f to the latest version, the old release wasn't working. This new release should work.

Release v0.1

23 Aug 06:39
8d455c4

Choose a tag to compare

Initial release of NoLlama, includes a Linux binary. A Windows binary will be added soon. For more details, check out the project README.