diff --git a/README.md b/README.md
index 08d59124..f45a9e73 100644
--- a/README.md
+++ b/README.md
@@ -1,6 +1,7 @@
+
 # Free LLM API resources
 
 This lists various services that provide free access or credits towards API-based LLM usage.
@@ -25,6 +26,7 @@ This lists various services that provide free access or credits towards API-base
   - [Cohere](#cohere)
   - [GitHub Models](#github-models)
   - [Cloudflare Workers AI](#cloudflare-workers-ai)
+  - [Longcat AI](#longcat-ai)
 - [Providers with trial credits](#providers-with-trial-credits)
   - [Fireworks](#fireworks)
   - [Baseten](#baseten)
@@ -105,8 +107,8 @@ Models tend to be context window limited.
 
 ### [Mistral (La Plateforme)](https://console.mistral.ai/)
 
-* Free tier (Experiment plan) requires opting into data training
-* Requires phone number verification.
+- Free tier (Experiment plan) requires opting into data training
+- Requires phone number verification.
 
 **Limits (per-model):** 1 request/second, 500,000 tokens/minute, 1,000,000,000 tokens/month
 
@@ -114,9 +116,9 @@ Models tend to be context window limited.
 
 ### [Mistral (Codestral)](https://codestral.mistral.ai/)
 
-* Currently free to use
-* Monthly subscription based
-* Requires phone number verification
+- Currently free to use
+- Monthly subscription based
+- Requires phone number verification
 
 **Limits:** 30 requests/minute, 2,000 requests/day
 
@@ -136,7 +138,6 @@ Routes to various supported providers.
 
 **Limits:** [$5/month](https://vercel.com/docs/ai-gateway/pricing)
 
-
 ### [OpenCode Zen](https://opencode.ai/docs/zen/)
 
 AI gateway with curated models.
@@ -303,9 +304,21 @@ Extremely restrictive input/output token limits.
 - Una Cybertron 7B v2 (BF16)
 - Zephyr 7B Beta (AWQ)
 
-
+### [Longcat AI](https://longcat.chat/platform/docs/)
+AI models from MeiTuan, with respectable performance, though lacking in benchmarks and recognition.
+
+**Limits:** [500,000 tokens/day](https://longcat.chat/platform/docs/#supported-models)
+
+**Models:**
+- longcat-flash-chat
+- longcat-flash-thinking
+- longcat-flash-thinking-2601
+- longcat-flash-lite — separate quota: 50,000,000 tokens/day
+- longcat-flash-omni-2603
+- longcat-flash-chat-2602-exp
+- sphynx
 
 ## Providers with trial credits
 
@@ -376,6 +389,7 @@ Extremely restrictive input/output token limits.
 **Credits:** $1
 
 **Models:**
+
 - DeepSeek V3 0324
 - Llama 3.3 70B Instruct
 - deepseek-ai/deepseek-r1-0528
@@ -386,9 +400,9 @@ Extremely restrictive input/output token limits.
 **Credits:** $5 for 3 months
 
 **Models:**
+
 - E5-Mistral-7B-Instruct
 - Llama 3.3 70B
-- Llama 3.3 70B
 - Llama-4-Maverick-17B-128E-Instruct
 - Qwen/Qwen3-235B
 - Qwen/Qwen3-32B
@@ -406,6 +420,7 @@ Extremely restrictive input/output token limits.
 **Credits:** 1,000,000 free tokens
 
 **Models:**
+
 - BGE-Multilingual-Gemma2
 - Gemma 3 27B Instruct
 - Llama 3.3 70B Instruct
@@ -420,5 +435,3 @@ Extremely restrictive input/output token limits.
 - qwen3-embedding-8b
 - qwen3.5-397b-a17b
 - voxtral-small-24b-2507
-
-