Vision: The multi-agent AI platform designed for quality conversations, not complexity.
Mission: Transform how people think through complex problems by enabling natural, intelligent conversations with multiple AI agents that learn, remember, and collaborate.
Previous Focus: Watch AI agents debate topics (passive observation)
New Direction: Participate in intelligent conversations with multiple AI agents (active participation)
Key Insight: Research shows active participation creates 3-5x higher engagement and produces more practical value than passive observation. Current market solutions either require technical expertise (AutoGen) or overwhelm users with complexity (Tess AI with 200+ models).
Our platform differentiates through three core pillars:
Unlike competitors where agents contradict each other or get stuck in loops, we ensure:
- Anti-contradiction detection and resolution
- Loop detection and automatic breaking
- Real-time conversation health scoring
- Evidence-grounded responses
The platform gets smarter with every conversation through:
- Three-tier memory system (short/medium/long-term)
- Semantic search across all past conversations
- Personalized agent behavior based on user history
- Cross-session learning and context retention
Opening the 100x larger non-technical market with:
- Zero-config start (just type and go)
- Template library for common scenarios
- Progressive disclosure of advanced features
- Natural language commands (no technical syntax)
- Next.js 15 + FastAPI architecture
- SSE streaming
- Multi-provider support (OpenAI, Anthropic, Google, Mistral)
- Basic UI components
- 2-4 agent sequential debates
- XState state machine
- Manual controls (pause/resume/stop)
- Cost tracking
- 34/34 backend tests passing
Timeline: 4-6 weeks Status: Planning β Implementation Focus: Top 3 competitive differentiators
Priority: CRITICAL Impact: Solves #1 user frustration across all platforms
Features:
-
Anti-Contradiction Engine
- Vector similarity detection between agent responses
- Automatic contradiction flagging
- Forced synthesis/reconciliation when agents disagree
- Confidence scoring for each agent claim
-
Loop Detection & Breaking
- Pattern recognition for repetitive exchanges
- Automatic intervention after 2-3 similar turns
- Context injection to break loops
- Cost protection (stop expensive loops)
-
Conversation Health Scoring
- Real-time quality metrics visible to users
- Progress indicators (are we getting somewhere?)
- Coherence scoring across turns
- Productivity assessment
-
AI Moderator Role
- Optional moderator agent to keep discussions on track
- Intervention when conversation derails
- Summary generation at key points
- Conflict resolution facilitation
-
Evidence Grounding
- Agents must cite sources for factual claims
- Citation tracking and validation
- Fact-checking integration (optional)
- Confidence intervals for uncertain statements
Technical Approach:
- Embedding-based similarity detection (OpenAI/Anthropic embeddings)
- Pattern matching for loop detection
- Structured output for citations
- Real-time scoring using lightweight ML models
Priority: CRITICAL Impact: Creates compound value and user lock-in
Features:
-
Three-Tier Memory System
- Short-term: Current conversation context (full detail)
- Medium-term: Recent conversations (summarized, 7-30 days)
- Long-term: Historical knowledge (indexed, >30 days)
-
Semantic Search
- Vector embeddings for all conversations
- Cross-conversation search ("what did we discuss about X?")
- Automatic context retrieval when relevant
- Privacy-respecting search scoping
-
Personalization Engine
- Learn user preferences (communication style, expertise level)
- Adapt agent personalities based on history
- Domain knowledge accumulation
- Custom terminology and context tracking
-
Privacy Controls
- User-controlled memory retention policies
- Selective forgetting
- Export/delete all data
- Encryption at rest
-
Memory Dashboard
- Visualize what agents remember
- Browse conversation history
- Edit/correct stored information
- Memory usage metrics
Technical Approach:
- PostgreSQL with pgvector extension
- Redis for short-term caching
- Automatic summarization using Claude/GPT
- Incremental embedding updates
- Tiered storage (hot/warm/cold)
Priority: CRITICAL Impact: Opens to 100x larger market
Features:
-
Zero-Config Start
- No setup required on first visit
- Intelligent agent selection based on query
- Default 3-agent panel (generalist, specialist, critic)
- One-click to start conversing
-
Template Library
- "Conversation Starters" for common scenarios:
- "Brainstorm business ideas"
- "Debug my thinking on [topic]"
- "Research [subject] from multiple angles"
- "Plan [project] step-by-step"
- Community-contributed templates
- Template marketplace (future)
- "Conversation Starters" for common scenarios:
-
Visual Agent Builder
- Drag-and-drop personality customization
- Slider controls (creativity, formality, expertise level)
- Personality presets ("The Devil's Advocate", "The Optimist", "The Analyst")
- No code or technical knowledge required
-
Progressive Disclosure
- Simple mode (default): Just chat
- Intermediate mode: Configure agents, set parameters
- Advanced mode: Custom system prompts, fine-grained controls
- Feature discovery through usage
-
Guided Onboarding
- Interactive tutorial (skip-able)
- Contextual tooltips
- Example conversations
- Best practices guide
-
Natural Language Commands
- "Add an expert in [domain]"
- "Make the responses more concise"
- "Show me what we discussed about X last week"
- No technical syntax required
Technical Approach:
- Smart defaults for all configurations
- A/B testing for onboarding flows
- Analytics to identify confusion points
- Gradual feature unlocking based on usage patterns
Timeline: 6-8 weeks Status: Planned
Priority: HIGH Impact: Creates "wow moment" differentiation
Features:
- WebSocket-based bidirectional streaming
- Sub-1.5s first token latency
- Smooth interruption handling
- Multiple agents streaming in parallel
- Typing indicators for each agent
- Token-by-token rendering with syntax highlighting
Technical Approach:
- Replace SSE with WebSockets
- Parallel LLM calls with streaming
- Client-side buffering and rendering
- Optimistic UI updates
Priority: HIGH Impact: Transforms debates from chaotic to productive
Features:
-
Multi-Dimensional Voting
- Ranked choice voting
- Weighted voting by confidence
- Quadratic voting for nuanced preferences
-
Argument Mapping
- Visual relationship graphs
- Claim β Evidence β Conclusion tracking
- Argument strength visualization
-
Automatic Synthesis
- Generate consensus summaries
- Highlight areas of agreement/disagreement
- Extract action items and decisions
-
Evidence Tracking
- Citation management
- Source credibility assessment
- Fact-checking integration
-
Structured Debate Protocols
- Oxford-style debates
- Lincoln-Douglas format
- Socratic method
- Custom protocol builder
Technical Approach:
- Graph database for argument structures
- NLP for claim extraction
- Structured output from LLMs
- Visual rendering with D3.js or similar
Priority: MEDIUM-HIGH Impact: Viral growth, enterprise value, network effects
Features:
-
Multi-User Conversations
- Multiple humans + agents in same conversation
- Real-time presence indicators
- User role management (host, participant, observer)
-
Conversation Forking
- Branch conversations to explore alternatives
- Merge branches back together
- Version control for discussions
-
Commenting & Annotation
- Comment on specific agent responses
- Highlight and annotate text
- Threaded discussions
-
Sharing & Permissions
- Shareable links with granular permissions
- Public/private/team conversations
- Read-only vs. interactive sharing
-
Export & Integration
- Export as Markdown, PDF, JSON
- API for programmatic access
- Webhook integrations
- Slack/Discord bots
Technical Approach:
- Operational transformation or CRDTs for real-time collaboration
- WebSocket room management
- Permission system with fine-grained controls
- Multiple export templates
Timeline: 3-6 months post-launch Status: Research & Planning
- Voice input/output for natural conversation
- Image analysis and discussion
- Document upload and analysis
- Screen sharing for debugging
- Pre-trained agents for specific domains:
- Legal analysis
- Medical research
- Financial planning
- Software architecture
- Creative writing
- Community marketplace for custom agents
- Agentic workflows (agents can take actions)
- Tool use (web search, calculator, code execution)
- Self-improving agents (RL from conversation quality)
- Multi-step planning and execution
- SSO and team management
- Admin dashboards and analytics
- Custom deployment (on-premise, VPC)
- SLA and support tiers
- Audit logs and compliance
- Native iOS and Android apps
- Offline mode with sync
- Push notifications for async conversations
- Voice-first mobile UX
- Conversation Quality: <5% contradiction rate, <2% loop rate
- Memory Effectiveness: 80%+ context recall accuracy
- UX Simplicity: 90%+ users complete first conversation without help
- Engagement: Average 3+ messages per user per session
- Retention: 40%+ week-over-week retention
- Performance: <1.5s first token latency
- Collaboration: 30%+ of conversations shared with others
- Growth: 20%+ month-over-month MAU growth
- Revenue: $50K MRR within 6 months of launch
- Quality: 4.5+ star average rating
Primary: Thoughtful professionals (25-45 years old)
- Strategists, researchers, writers, analysts
- Need to think through complex problems
- Frustrated by current tools' complexity or chaos
- Willing to pay for quality tools
Secondary: Technical enthusiasts
- Early adopters interested in AI capabilities
- Want to customize and experiment
- Potential contributors to open-source community
- vs. Tess AI: Curated quality over 200+ model chaos
- vs. ChatGPT Group Chats: Purpose-built for multi-agent from ground up
- vs. AutoGen: Zero-code, accessible to non-developers
- vs. Debate Platforms: Interactive participation, not passive watching
- Free Tier: Unlimited basic conversations (3 agents, standard models)
- Pro Tier ($20/month): Advanced models, 5 agents, memory features, export
- Team Tier ($50/user/month): Collaboration, admin, priority support
- Enterprise: Custom pricing, on-premise, SLA
- Month 1-2 (MVP Beta): Technical community via Product Hunt, HackerNews
- Month 3-4 (Public Launch): Content marketing, SEO, partnerships
- Month 5-6 (Scale): Paid acquisition, enterprise outreach
- LLM API costs: Implement aggressive caching, use cheaper models when appropriate
- Latency issues: WebSocket optimization, CDN for static assets, edge functions
- Scaling challenges: Serverless architecture, horizontal scaling, database optimization
- Competitor moves: Focus on quality and UX moats that are hard to replicate
- AI model changes: Abstract LLM provider to easily switch or multi-home
- Regulatory: Build privacy-first from day one, GDPR compliance
- Scope creep: Strict MVP definition, ruthless prioritization
- Quality issues: Comprehensive testing, staged rollout, feature flags
- Resource constraints: Focus on top 3 differentiators, outsource non-core
- Quality over quantity: Better conversations beat more features
- Simplicity over complexity: Accessible to everyone, powerful for experts
- Privacy over convenience: User data is sacred
- Community over control: Open-source core, extensible platform
- Speed over perfection: Ship fast, iterate based on feedback
- Modular architecture: Easy to extend and maintain
- Test-driven development: Comprehensive test coverage
- Performance by default: Optimize for speed from day one
- Security first: Threat modeling, regular audits
- Observability: Instrument everything, learn from data
This roadmap is a living document. We review and update quarterly based on:
- User feedback and feature requests
- Competitive landscape changes
- Technical feasibility and learnings
- Business metrics and goals
Last Updated: December 4, 2024 Next Review: March 1, 2025 Owner: Product Team