You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Copy file name to clipboardExpand all lines: README.md
+44-2Lines changed: 44 additions & 2 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -2,7 +2,7 @@
2
2
3
3
AIBackends is an API server that you can use to integrate AI into your applications. You can run it locally or self-host it.
4
4
5
-
The project supports running open models locally with Ollama, LM Studio or LlamaCpp. It also supports LLM Gateway, OpenRouter, OpenAI, Anthropic and Google AI Studio, Baseten providers.
5
+
The project supports running open models locally with Ollama, LM Studio or LlamaCpp. It also supports LLM Gateway, OpenRouter, OpenAI, Anthropic, Google AI Studio, Baseten and ZAI providers.
6
6
7
7
## Why AI Backends?
8
8
@@ -31,16 +31,51 @@ Since APIs are ready to use, you don't need to understand prompt engineering. Ju
31
31
|**/api/pdf-summarizer**| Extract and summarize content from PDF documents with AI |
32
32
|**/api/web-search**| Perform web searches and get AI-powered summaries of results |
33
33
34
+
#### Text Processing Examples
35
+
36
+
**Web Search** - Search the web and get AI-powered summaries:
|**/api/synthetic-data**| Generate realistic synthetic data based on prompts with optional JSON schema validation |
38
56
57
+
**Synthetic Data Generation** - Generate realistic test data with custom schemas:
58
+
59
+

60
+
39
61
### Image Processing
40
62
41
63
| Endpoint | Description |
42
64
|----------|-------------|
43
-
|**/api/describe-image**| Describe an image (work in progress) |
65
+
|**/api/vision**| Analyze images with vision AI - ask questions, detect objects, get coordinates (ZAI GLM-4.6v) |
66
+
|**/api/ocr**| Extract structured data from images using OCR with optional JSON schema output (ZAI GLM-4.6V) |
67
+
68
+
#### Vision AI Examples
69
+
70
+
AIBackends includes powerful vision capabilities powered by ZAI GLM-4.6v models. You can ask questions about images, detect objects, and extract structured data.
71
+
72
+
**Vision Q&A** - Ask questions about any image:
73
+
74
+

75
+
76
+
**OCR Extraction** - Extract structured data from documents, receipts, and invoices:
77
+
78
+

44
79
45
80
More to come...check swagger docs for updated endpoints.
46
81
@@ -63,6 +98,7 @@ More to come...check swagger docs for updated endpoints.
63
98
|[Vercel AI Gateway](https://vercel.com/ai-gateway)| Open source and private models | Available |
64
99
|[Google AI Studio](https://ai.google.dev/)| Gemini models via OpenAI-compatible interface | Available |
65
100
|[Baseten](https://baseten.co/)| Cloud-hosted ML models with OpenAI-compatible API | Available |
101
+
|[ZAI](https://z.ai/)| GLM models with vision/OCR capabilities | Available |
- LLM-Friendly API Docs: `http://localhost:3000/api/llms.txt`. Copy the contents of this file and paste it into AI builder tools like Bolt.new, v0, Lovable, or AI coding assistants to help them understand and use the AIBackends API endpoints.
0 commit comments