Skip to content
Open
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
33 changes: 23 additions & 10 deletions README.md
Original file line number Diff line number Diff line change
@@ -1,6 +1,7 @@
<!---
WARNING: DO NOT EDIT THIS FILE DIRECTLY. IT IS GENERATED BY src/pull_available_models.py
--->

Comment on lines 1 to +4

Copilot AI Apr 22, 2026

Copy link

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

README.md is explicitly marked as generated (see header). The PR edits it directly and introduces content/formatting (Longcat section, TOC entry, bullet style changes) that are not present in the generator, so the next run of src/pull_available_models.py will overwrite these changes. Please add Longcat (and any formatting adjustments like '-' vs '*') to the generator/template, re-generate README.md, and commit the generated output.

Copilot uses AI. Check for mistakes.

Copy link
Copy Markdown
Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

@copilot apply changes based on this feedback

# Free LLM API resources

This lists various services that provide free access or credits towards API-based LLM usage.
Expand All @@ -25,6 +26,7 @@ This lists various services that provide free access or credits towards API-base
- [Cohere](#cohere)
- [GitHub Models](#github-models)
- [Cloudflare Workers AI](#cloudflare-workers-ai)
- [Longcat AI](#longcat-ai)
- [Providers with trial credits](#providers-with-trial-credits)
- [Fireworks](#fireworks)
- [Baseten](#baseten)
Expand Down Expand Up @@ -105,18 +107,18 @@ Models tend to be context window limited.

### [Mistral (La Plateforme)](https://console.mistral.ai/)

* Free tier (Experiment plan) requires opting into data training
* Requires phone number verification.
- Free tier (Experiment plan) requires opting into data training
- Requires phone number verification.

**Limits (per-model):** 1 request/second, 500,000 tokens/minute, 1,000,000,000 tokens/month

- [Open and Proprietary Mistral models](https://docs.mistral.ai/getting-started/models/models_overview/)

### [Mistral (Codestral)](https://codestral.mistral.ai/)

* Currently free to use
* Monthly subscription based
* Requires phone number verification
- Currently free to use
- Monthly subscription based
- Requires phone number verification

**Limits:** 30 requests/minute, 2,000 requests/day

Expand All @@ -136,7 +138,6 @@ Routes to various supported providers.

**Limits:** [$5/month](https://vercel.com/docs/ai-gateway/pricing)


### [OpenCode Zen](https://opencode.ai/docs/zen/)

AI gateway with curated models.
Expand Down Expand Up @@ -303,9 +304,21 @@ Extremely restrictive input/output token limits.
- Una Cybertron 7B v2 (BF16)
- Zephyr 7B Beta (AWQ)

</tbody></table>
### [Longcat AI](https://longcat.chat/platform/docs/)

AI models from MeiTuan, with respectable performance, though lacking in benchmarks and recognition.

**Limits:** [500,000 tokens/day](https://longcat.chat/platform/docs/#supported-models)

**Models:**

- longcat-flash-chat
- longcat-flash-thinking
- longcat-flash-thinking-2601
- longcat-flash-lite — separate quota: 50,000,000 tokens/day
- longcat-flash-omni-2603
- longcat-flash-chat-2602-exp
- sphynx

## Providers with trial credits

Expand Down Expand Up @@ -376,6 +389,7 @@ Extremely restrictive input/output token limits.
**Credits:** $1

**Models:**

- DeepSeek V3 0324
- Llama 3.3 70B Instruct
- deepseek-ai/deepseek-r1-0528
Expand All @@ -386,9 +400,9 @@ Extremely restrictive input/output token limits.
**Credits:** $5 for 3 months

**Models:**

- E5-Mistral-7B-Instruct
- Llama 3.3 70B
- Llama 3.3 70B
- Llama-4-Maverick-17B-128E-Instruct
- Qwen/Qwen3-235B
- Qwen/Qwen3-32B
Expand All @@ -406,6 +420,7 @@ Extremely restrictive input/output token limits.
**Credits:** 1,000,000 free tokens

**Models:**

- BGE-Multilingual-Gemma2
- Gemma 3 27B Instruct
- Llama 3.3 70B Instruct
Expand All @@ -420,5 +435,3 @@ Extremely restrictive input/output token limits.
- qwen3-embedding-8b
- qwen3.5-397b-a17b
- voxtral-small-24b-2507