A lightweight, efficient proxy service that provides free and unlimited access to DeepInfra's AI models through their OpenAI-compatible API.
- 🆓 Free & Unlimited - Access DeepInfra models without rate limits or costs
- 🔄 Auto-rotating proxies - Uses a pool of public proxies that automatically refreshes
- 🛡️ Optional API key authentication - Secure your instance when needed
- 📊 Interactive Swagger UI - Easy-to-use API documentation
- 🔍 Model availability checks - Only exposes models that are actually accessible
- ⚡ Streaming support - Full support for streaming responses
- 🔄 OpenAI-compatible API - Drop-in replacement for OpenAI API clients
- 📋 OpenAI-compatible /v1/models endpoint - Standard models listing endpoint
- 🏷️ Model metadata - Enhanced model information with type categorization
- Go 1.20 or higher
- Docker (optional, for containerized deployment)
```bash
# Pull the Docker image from GitHub Container Registry
docker pull ghcr.io/metimol/deepinfra-wrapper:latest

# Run the container
docker run -p 8080:8080 ghcr.io/metimol/deepinfra-wrapper:latest
```

```bash
# Build the Docker image
docker build -t deepinfra-proxy .
# Run the container
docker run -p 8080:8080 deepinfra-proxy
```

```bash
# Download dependencies
go mod download
# Build the application
go build -o deepinfra-proxy .
# Run the application
./deepinfra-proxy
```

You can enable API key authentication by setting the `API_KEY` environment variable:

```bash
# With Docker
docker run -p 8080:8080 -e API_KEY=your-secret-key deepinfra-proxy
# Without Docker
API_KEY=your-secret-key ./deepinfra-proxy
```

When API key authentication is enabled, clients must include the API key in the `Authorization` header:

```
Authorization: Bearer your-secret-key
```
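For example, with Go's standard `net/http` client (a sketch; the host, model name, and key below are placeholders for your own deployment):

```go
package main

import (
	"bytes"
	"fmt"
	"io"
	"net/http"
)

func main() {
	body := []byte(`{"model": "meta-llama/Llama-2-70b-chat-hf", "messages": [{"role": "user", "content": "Hello"}]}`)

	req, err := http.NewRequest("POST", "http://localhost:8080/v1/chat/completions", bytes.NewReader(body))
	if err != nil {
		panic(err)
	}
	req.Header.Set("Content-Type", "application/json")
	req.Header.Set("Authorization", "Bearer your-secret-key") // must match the API_KEY you configured

	resp, err := http.DefaultClient.Do(req)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	out, _ := io.ReadAll(resp.Body)
	fmt.Println(string(out))
}
```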
POST /v1/chat/completions
Example request:
```json
{
  "model": "meta-llama/Llama-2-70b-chat-hf",
  "messages": [
    {
      "role": "user",
      "content": "Tell me a joke about programming"
    }
  ],
  "temperature": 0.7,
  "max_tokens": 1000,
  "stream": false
}
```

GET /v1/models
Returns a list of all available models in OpenAI-compatible format. This endpoint follows the official OpenAI API specification and works with any OpenAI-compatible client.
Example response:
```json
{
  "object": "list",
  "data": [
    {
      "id": "meta-llama/Llama-2-70b-chat-hf",
      "object": "model",
      "created": 1677610602,
      "owned_by": "deepinfra"
    },
    {
      "id": "mistralai/Mixtral-8x7B-Instruct-v0.1",
      "object": "model",
      "created": 1677610602,
      "owned_by": "deepinfra"
    }
  ]
}
```

GET /models
Returns a simple array of model names. This endpoint is maintained for backward compatibility.
Example response:
```json
[
  "meta-llama/Llama-2-70b-chat-hf",
  "mistralai/Mixtral-8x7B-Instruct-v0.1"
]
```

GET /docs
Interactive Swagger UI documentation for exploring and testing the API endpoints.
GET /openapi.json
OpenAPI specification document that can be imported into API tools.
| Variable | Description | Default |
|---|---|---|
| `API_KEY` | Secret key for API authentication | None (authentication disabled) |
| `PORT` | Port to run the server on | `8080` |
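For reference, here is a minimal sketch of how this kind of configuration is typically read in Go. The `getenvDefault` helper is illustrative, not the project's actual code:

```go
package main

import (
	"fmt"
	"os"
)

// getenvDefault returns the value of an environment variable,
// or a fallback when it is unset. Illustrative helper only.
func getenvDefault(key, fallback string) string {
	if v := os.Getenv(key); v != "" {
		return v
	}
	return fallback
}

func main() {
	port := getenvDefault("PORT", "8080")
	apiKey := os.Getenv("API_KEY") // empty string means authentication is disabled

	fmt.Println("port:", port)
	fmt.Println("auth enabled:", apiKey != "")
}
```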
- The proxy fetches and maintains a list of working public proxies
- It regularly checks which DeepInfra models are accessible and caches this list
- When a request comes in, it routes the request through one of the working proxies to DeepInfra
- If a proxy fails, it's automatically removed from the rotation
- New proxies are regularly added to the pool to ensure reliability (a simplified sketch of this rotation logic appears below)
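The rotation logic can be pictured with a sketch like the following. The `ProxyPool` type and its methods are hypothetical, shown only to illustrate the pick/remove/add cycle described above:

```go
package proxypool

import (
	"errors"
	"math/rand"
	"sync"
)

// ProxyPool is a hypothetical rotating pool of proxy URLs.
type ProxyPool struct {
	mu      sync.Mutex
	proxies []string
}

// Pick returns a random working proxy from the pool.
func (p *ProxyPool) Pick() (string, error) {
	p.mu.Lock()
	defer p.mu.Unlock()
	if len(p.proxies) == 0 {
		return "", errors.New("no working proxies")
	}
	return p.proxies[rand.Intn(len(p.proxies))], nil
}

// Remove drops a proxy that failed a request, taking it out of rotation.
func (p *ProxyPool) Remove(bad string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	for i, proxy := range p.proxies {
		if proxy == bad {
			p.proxies = append(p.proxies[:i], p.proxies[i+1:]...)
			return
		}
	}
}

// Add appends freshly fetched proxies to the pool.
func (p *ProxyPool) Add(fresh ...string) {
	p.mu.Lock()
	defer p.mu.Unlock()
	p.proxies = append(p.proxies, fresh...)
}
```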
This service is fully compatible with the OpenAI API specification. You can use any OpenAI-compatible client library or tool by simply changing the base URL to point to your DeepInfra Wrapper instance.
- `POST /v1/chat/completions` - Chat completions (matches OpenAI API)
- `GET /v1/models` - List available models (matches OpenAI API format)
- ✅ Chat completions
- ✅ Streaming responses (see the sketch below this list)
- ✅ Model listing
- ✅ Temperature and max_tokens parameters
- ✅ Message history and conversation context
- ✅ System messages
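Since the endpoint follows the OpenAI convention, streamed responses arrive as server-sent `data:` lines terminated by `data: [DONE]`. A minimal consumption sketch with Go's standard library, assuming the server runs on localhost:8080 with no API key set:

```go
package main

import (
	"bufio"
	"bytes"
	"fmt"
	"net/http"
	"strings"
)

func main() {
	body := []byte(`{
		"model": "meta-llama/Llama-2-70b-chat-hf",
		"messages": [{"role": "user", "content": "Tell me a joke"}],
		"stream": true
	}`)

	resp, err := http.Post(
		"http://localhost:8080/v1/chat/completions",
		"application/json",
		bytes.NewReader(body),
	)
	if err != nil {
		panic(err)
	}
	defer resp.Body.Close()

	// Each event arrives as a "data: {...}" line; "data: [DONE]" ends the stream.
	scanner := bufio.NewScanner(resp.Body)
	for scanner.Scan() {
		line := strings.TrimSpace(scanner.Text())
		if !strings.HasPrefix(line, "data:") {
			continue
		}
		payload := strings.TrimSpace(strings.TrimPrefix(line, "data:"))
		if payload == "[DONE]" {
			break
		}
		fmt.Println(payload) // raw JSON chunk; parse "choices[0].delta.content" as needed
	}
}
```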
The service automatically categorizes models by type (a rough name-matching sketch follows this list):
- Text Generation: LLaMA, GPT, Claude, Mistral, DeepSeek, Qwen models
- Audio: Whisper models for speech recognition
- Image: Stable Diffusion, SDXL models for image generation
- Embedding: Text embedding models
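One plausible way to implement this kind of name-based categorization is a simple substring match on the model ID. This is purely illustrative; the service's actual matching rules may differ:

```go
package models

import "strings"

// TypeOf guesses a model's category from its identifier.
// Purely illustrative; the service's real rules may differ.
func TypeOf(id string) string {
	lower := strings.ToLower(id)
	switch {
	case strings.Contains(lower, "whisper"):
		return "audio"
	case strings.Contains(lower, "stable-diffusion"), strings.Contains(lower, "sdxl"):
		return "image"
	case strings.Contains(lower, "embed"):
		return "embedding"
	default:
		return "text-generation" // LLaMA, Mistral, DeepSeek, Qwen, etc.
	}
}
```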
```bash
curl -X POST "http://localhost:8080/v1/chat/completions" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "meta-llama/Llama-2-70b-chat-hf",
    "messages": [
      {
        "role": "user",
        "content": "Hello, how are you today?"
      }
    ]
  }'
```

```bash
# OpenAI-compatible format
curl "http://localhost:8080/v1/models"
# Legacy format
curl "http://localhost:8080/models"from openai import OpenAI
client = OpenAI(
base_url="http://localhost:8080/v1/",
api_key="your-api-key" # Only needed if API_KEY is set
)
# List available models
models = client.models.list()
print("Available models:")
for model in models.data:
print(f"- {model.id} (owned by {model.owned_by})")
# Chat completion
response = client.chat.completions.create(
model="meta-llama/Llama-2-70b-chat-hf",
messages=[
{"role": "user", "content": "What's the capital of France?"}
]
)
print(response.choices[0].message.content)import { OpenAI } from "openai";
const openai = new OpenAI({
  baseURL: "http://localhost:8080/v1/",
  apiKey: "your-api-key", // Only needed if API_KEY is set
});

async function main() {
  const response = await openai.chat.completions.create({
    model: "meta-llama/Llama-2-70b-chat-hf",
    messages: [
      { role: "user", content: "Explain quantum computing in simple terms" }
    ],
  });
  console.log(response.choices[0].message.content);
}

main();
```

- The service depends on the availability of public proxies
- Response times may vary based on proxy performance
- Some models might become temporarily unavailable (a simple client-side retry, sketched below, can smooth over transient failures)
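If transient failures matter for your use case, a small client-side retry loop helps. A hedged sketch, where the attempt count and linear backoff are arbitrary choices rather than project behavior:

```go
package client

import (
	"bytes"
	"fmt"
	"net/http"
	"time"
)

// postWithRetry retries a request a few times with a short backoff,
// which helps when an upstream proxy in the pool happens to fail.
func postWithRetry(url string, body []byte, attempts int) (*http.Response, error) {
	var lastErr error
	for i := 0; i < attempts; i++ {
		resp, err := http.Post(url, "application/json", bytes.NewReader(body))
		if err == nil && resp.StatusCode < 500 {
			return resp, nil
		}
		if err != nil {
			lastErr = err
		} else {
			resp.Body.Close()
			lastErr = fmt.Errorf("server returned %s", resp.Status)
		}
		time.Sleep(time.Duration(i+1) * time.Second) // linear backoff
	}
	return nil, lastErr
}
```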
Contributions are welcome! Please feel free to submit a Pull Request.