Add prima.cpp in Hardware Acceleration and Deployment Strategies #13

Lizonghang · 2025-04-22T17:18:23Z

prima.cpp is a distributed implementation of llama.cpp that lets you run 70B-level LLMs on your everyday devices—💻 laptops, 🖥️ desktops, 📱 phones, and tablets (GPU or no GPU, it’s all good). With it, you can run QwQ-32B, Qwen 2.5-72B, Llama 3-70B, or DeepSeek R1 70B right from your local home cluster!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add prima.cpp in Hardware Acceleration and Deployment Strategies #13

Add prima.cpp in Hardware Acceleration and Deployment Strategies #13

Lizonghang commented Apr 22, 2025

Add prima.cpp in Hardware Acceleration and Deployment Strategies #13

Are you sure you want to change the base?

Add prima.cpp in Hardware Acceleration and Deployment Strategies #13

Conversation

Lizonghang commented Apr 22, 2025