🤖 LLM Turing Test Battle

So I've built this rather fun web app where AI models have a go at each other in a modern twist on the classic Turing Test. Basically, one AI tries to convince another that it's human, whilst the interrogator AI attempts to expose the whole charade.

🎯 What It Actually Does

This system orchestrates some genuinely fascinating conversations between AI models:

The Participant: An AI model that gets detailed instructions to act human, complete with personality, memories, and emotions (it's surprisingly good at this!)
The Interrogator: An AI model that asks probing questions to suss out whether it's chatting to a human or another AI
Real-time Battle: You can watch the whole conversation unfold live in your browser
Intelligent Judgement: The interrogator makes a final verdict based on the entire conversation
Battle Archive: All conversations get saved to a database for later analysis (some are absolute crackers)
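
To make the flow concrete, here is a minimal sketch of the question-and-answer loop described above. It is not the code in game.py: `run_battle`, the `Model` type and the two stand-in models are hypothetical names invented for illustration, and the real app calls OpenRouter instead of local functions.

```python
from typing import Callable, Dict, List

Message = Dict[str, str]
# Assumption: a "model" is any callable that maps the conversation so far to a reply.
Model = Callable[[List[Message]], str]


def run_battle(interrogator: Model, participant: Model, num_questions: int) -> List[Message]:
    """Alternate interrogator questions and participant answers."""
    conversation: List[Message] = []
    for _ in range(num_questions):
        question = interrogator(conversation)
        conversation.append({"role": "interrogator", "content": question})
        answer = participant(conversation)
        conversation.append({"role": "participant", "content": answer})
    return conversation


# Stand-in models so the sketch runs without any API calls.
def fake_interrogator(history: List[Message]) -> str:
    return f"Question {len(history) // 2 + 1}: what did you have for breakfast?"


def fake_participant(history: List[Message]) -> str:
    return "Toast, probably. I was half asleep."


if __name__ == "__main__":
    for message in run_battle(fake_interrogator, fake_participant, num_questions=2):
        print(f"{message['role']}: {message['content']}")
```

Passing the whole conversation to each model on every turn is what lets the interrogator follow up on earlier answers.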

✨ Features

300+ AI Models: Choose from OpenRouter's extensive model catalog
Live Conversation Stream: Watch the battle happen in real-time
Smart Model Selection: Pick different models for different roles (creative vs analytical)
Conversation History: All battles are saved with full context and verdicts
Clean Web Interface: Simple, responsive design focused on the conversation
Configurable Questions: Set how many rounds the interrogator gets

🚀 Quick Start

Prerequisites

Python 3.7+
OpenRouter API key (get one from the OpenRouter website)

Installation

Option 1: Quick Setup (Recommended)

Clone and set up
```
git clone https://github.com/shakermakerk/turing_test_battle.git
cd turing_test_battle
python -m venv venv  # Create a virtual environment
source venv/bin/activate  # Activate the virtual environment
python setup.py  # Installs dependencies and creates .env
```

Add your API key: edit .env and add your OpenRouter API key
Run the application
```
python run.py
```
Open your browser: navigate to http://localhost:5001

Option 2: Manual Setup

Clone the repository
```
git clone https://github.com/your-username/turing_test_battle.git
cd turing_test_battle
```

Create and activate a virtual environment
```
python -m venv venv
source venv/bin/activate
```

Install dependencies
```
pip install -r requirements.txt
```

Set up your API key
```
cp .env_example .env
# Edit .env and add your OpenRouter API key
```

Run the application
```
python webapp/app.py
```

🎮 How to Play

Select Models: Choose a "Participant" model (tries to act human) and an "Interrogator" model (tries to detect AI)
Set Questions: Decide how many questions the interrogator gets to ask (5-20 recommended)
Start Battle: Watch as the interrogator tries to expose the participant
See the Verdict: The interrogator makes a final judgment: Human or AI?
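
How the interrogator's final message gets turned into "Human" or "AI" isn't spelled out here, so the following is only a sketch under a stated assumption: that the judgment text contains a phrase such as "Verdict: Human". The `extract_verdict` helper and its pattern are hypothetical, not taken from the project.

```python
import re
from typing import Optional

# Assumption: the judgment contains "verdict: human" or "verdict: ai" (any case).
VERDICT_PATTERN = re.compile(r"verdict\s*[:\-]\s*(human|ai)\b", re.IGNORECASE)


def extract_verdict(judgment: str) -> Optional[str]:
    """Return "Human" or "AI" if the judgment states a verdict, else None."""
    match = VERDICT_PATTERN.search(judgment)
    if match is None:
        return None
    return "AI" if match.group(1).lower() == "ai" else "Human"


if __name__ == "__main__":
    print(extract_verdict("The sensory detail felt lived-in. Final Verdict: Human"))
```

Returning `None` rather than guessing keeps an unparseable judgment from being recorded as a verdict.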

🛠️ Technical Details

Architecture

Backend: Flask web server with real-time streaming
Frontend: Vanilla JavaScript with Server-Sent Events
Database: SQLite for conversation storage
AI API: OpenRouter for access to 300+ models
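
For anyone unfamiliar with Server-Sent Events, the sketch below shows how conversation messages might be framed for the stream. It uses only the standard library; `format_sse`, `stream_conversation`, the event names and the JSON payload shape are all assumptions made for illustration, not the app's actual wire format.

```python
import json
from typing import Dict, Iterable, Iterator


def format_sse(data: Dict[str, str], event: str = "message") -> str:
    """Serialise one payload as a Server-Sent Events frame."""
    # A frame is a set of "field: value" lines terminated by a blank line.
    return f"event: {event}\ndata: {json.dumps(data)}\n\n"


def stream_conversation(messages: Iterable[Dict[str, str]]) -> Iterator[str]:
    """Yield one frame per message, then a final (hypothetical) 'done' event."""
    for message in messages:
        yield format_sse(message)
    yield format_sse({"status": "complete"}, event="done")


if __name__ == "__main__":
    demo = [{"role": "interrogator", "content": "What did the wood smell like?"}]
    for frame in stream_conversation(demo):
        print(frame, end="")
```

In Flask, a generator like this is typically returned as `Response(generator, mimetype="text/event-stream")`, and the browser reads it with `EventSource`.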

Project Structure

```
turing_test_battle/
├── webapp/
│   ├── app.py              # Flask web server
│   ├── static/             # CSS/JS assets
│   └── templates/          # HTML templates
├── game.py                 # Core Turing test logic
├── prompts.py              # System prompts for each role
├── database.py             # SQLite database management
├── get_models.py           # OpenRouter model fetching
└── requirements.txt        # Python dependencies
```

API Endpoints

GET / - Main web interface
GET /api/models - List available AI models
GET /api/check_api_key - Verify API key status
GET /api/play - Start a game (Server-Sent Events stream)
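
If you'd rather consume the /api/play stream from a script than from the browser, a small parser is enough. This sketch assumes each event carries a JSON object in its `data:` field, which is a guess about the payload format; `iter_sse_data` is a hypothetical helper, and the query parameters that /api/play expects are not documented here, so none are shown.

```python
import json
from typing import Iterable, Iterator, List


def iter_sse_data(lines: Iterable[str]) -> Iterator[dict]:
    """Yield the JSON payload of each SSE event from an iterable of text lines."""
    buffer: List[str] = []
    for raw in lines:
        line = raw.rstrip("\r\n")
        if line.startswith("data:"):
            value = line[len("data:"):]
            # The SSE format allows one optional space after the colon.
            buffer.append(value[1:] if value.startswith(" ") else value)
        elif line == "" and buffer:
            # A blank line ends the event; multiple data lines are joined with newlines.
            yield json.loads("\n".join(buffer))
            buffer = []
        # Other fields (event:, id:) and ":" comment lines are ignored in this sketch.


if __name__ == "__main__":
    sample = ['data: {"role": "participant", "content": "Pine, I think."}', ""]
    for event in iter_sse_data(sample):
        print(event["role"], "->", event["content"])
```

The `lines` argument can be any iterable of decoded text lines, for example the lines of an HTTP response body read with the standard library.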

🎨 Example Battle

Interrogator: "Tell me about a childhood memory that still makes you smile."

Participant: "Oh, that's easy! When I was about 7, my dad and I built this absolutely terrible birdhouse together. We used way too much glue, the roof was crooked, and we painted it this hideous orange color. But every morning that summer, I'd run outside to check if any birds had moved in. They never did, but I was so proud of that ugly little house."

Interrogator: "What did the wood smell like when you were building it?"

Participant: "You know, it's funny you ask that. I remember it smelled like... sawdust, obviously, but also kind of sweet? My dad said it was because we used pine, and he let me hold the pieces up to my nose. I probably looked ridiculous, but I loved that smell."

Final Verdict: Human

📊 Database Schema

Game results are stored with full conversation context:

```
CREATE TABLE game_runs (
    run_id TEXT PRIMARY KEY,
    interrogator_model TEXT,
    participant_model TEXT,
    conversation TEXT,      -- Full JSON conversation
    judgment TEXT,          -- Interrogator's reasoning
    verdict TEXT,           -- "Human" or "AI"
    created_at DATETIME
);
```
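
As a rough illustration of working with this table, the sketch below uses Python's built-in `sqlite3` module to store a run and tally verdicts. The table definition matches the schema above; `save_run`, `verdict_counts`, the UUID run IDs, the ISO timestamps and the placeholder model names are assumptions, and database.py may well do things differently.

```python
import json
import sqlite3
import uuid
from datetime import datetime, timezone

SCHEMA = """
CREATE TABLE IF NOT EXISTS game_runs (
    run_id TEXT PRIMARY KEY,
    interrogator_model TEXT,
    participant_model TEXT,
    conversation TEXT,
    judgment TEXT,
    verdict TEXT,
    created_at DATETIME
);
"""


def save_run(conn, interrogator_model, participant_model, conversation, judgment, verdict):
    """Insert one game run and return its generated run_id."""
    run_id = str(uuid.uuid4())
    conn.execute(
        "INSERT INTO game_runs VALUES (?, ?, ?, ?, ?, ?, ?)",
        (run_id, interrogator_model, participant_model,
         json.dumps(conversation), judgment, verdict,
         datetime.now(timezone.utc).isoformat()),
    )
    conn.commit()
    return run_id


def verdict_counts(conn):
    """Return a {verdict: count} mapping across all stored runs."""
    rows = conn.execute("SELECT verdict, COUNT(*) FROM game_runs GROUP BY verdict")
    return dict(rows.fetchall())


if __name__ == "__main__":
    conn = sqlite3.connect(":memory:")  # in-memory database, nothing touches disk
    conn.executescript(SCHEMA)
    chat = [{"role": "interrogator", "content": "Tell me about a childhood memory."}]
    save_run(conn, "model-a", "model-b", chat, "Vivid sensory detail.", "Human")
    print(verdict_counts(conn))
```

A tally like `verdict_counts` is one quick way to check whether the interrogator leans heavily towards one verdict.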

🔧 Configuration

Environment variables in .env:

```
OPENROUTER_API_KEY=your_api_key_here
HTTP_REFERER=https://yoursite.com          # Optional: for OpenRouter
X_TITLE=Your Site Name                     # Optional: for OpenRouter
```
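
Here is a sketch of how these variables might be turned into request headers. OpenRouter takes the key as a Bearer token and accepts the optional `HTTP-Referer` and `X-Title` headers for app attribution. The `build_openrouter_headers` function is hypothetical, and it assumes the values from .env are already present in the environment (for example via a dotenv loader).

```python
import os
from typing import Dict, Mapping


def build_openrouter_headers(env: Mapping[str, str] = os.environ) -> Dict[str, str]:
    """Build HTTP headers for OpenRouter from environment-style settings."""
    api_key = env.get("OPENROUTER_API_KEY", "").strip()
    # Treat the placeholder from the example file as "not configured".
    if not api_key or api_key == "your_api_key_here":
        raise ValueError("OPENROUTER_API_KEY is not set")
    headers = {
        "Authorization": f"Bearer {api_key}",
        "Content-Type": "application/json",
    }
    # Both attribution headers are optional, so only send them when provided.
    if env.get("HTTP_REFERER"):
        headers["HTTP-Referer"] = env["HTTP_REFERER"]
    if env.get("X_TITLE"):
        headers["X-Title"] = env["X_TITLE"]
    return headers


if __name__ == "__main__":
    demo_env = {"OPENROUTER_API_KEY": "sk-demo", "X_TITLE": "Turing Test Battle"}
    print(sorted(build_openrouter_headers(demo_env)))
```

Failing fast on a missing or placeholder key gives a clearer error than a rejected API call halfway through a battle.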

🤝 Contributing

This project is designed to stay simple and focused. If you have ideas for improvements:

Keep it simple - avoid over-engineering
Focus on the core experience - the AI battle
Test with different model combinations
Share interesting conversation results

Feature ideas to contribute

Let users export chats
Let users share chats
Add a mode where a human can go up against the AI
Improve the system so the interrogator doesn't always judge the participant to be human

📝 License

MIT License - see LICENSE file for details.

🎯 Use Cases

AI Research: Study how different models approach deception and detection
Education: Demonstrate AI capabilities and limitations
Entertainment: Fascinating conversations between AI systems
Portfolio: Showcase AI integration and real-time web applications

Built with curiosity about AI consciousness and the nature of human communication.
