Skip to content

Popular repositories Loading

  1. rmbg-1.4 rmbg-1.4 Public template

    State-of-the-art background removal model, designed to effectively separate foreground from background. <metadata> gpu: T4 | collections: ["HF Transformers"] </metadata>

    Python 23 12

  2. triton-co-pilot triton-co-pilot Public

    Generate Glue Code in seconds to simplify your Nvidia Triton Inference Server Deployments

    Python 20 3

  3. smaug-72b smaug-72b Public

    Smaug-72B topped the Hugging Face LLM leaderboard and it’s the first model with an average score of 80, making it the world’s best open-source foundation model. <metadata> gpu: A100 | collections: …

    Python 17 5

  4. qwq-32b-preview qwq-32b-preview Public template

    A 32B experimental reasoning model for advanced text generation and robust instruction following. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 17 7

  5. whisper-large-v3 whisper-large-v3 Public template

    State‑of‑the‑art speech recognition model for English, delivering transcription accuracy across diverse audio scenarios. <metadata> gpu: T4 | collections: ["CTranslate2"] </metadata>

    Python 15 16

  6. deepseek-r1-distill-qwen-32b deepseek-r1-distill-qwen-32b Public template

    A distilled DeepSeek-R1 variant built on Qwen2.5-32B, fine-tuned with curated data for enhanced performance and efficiency. <metadata> gpu: A100 | collections: ["vLLM"] </metadata>

    Python 15 37

Repositories

Showing 10 of 174 repositories
  • jina-embeddings-v4 Public template

    A 3.8B multimodal-multilingual embedding that unifies text and image understanding in a single late-interaction space, delivers both dense and multi-vector outputs. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>

    Python 0 0 0 0 Updated Jul 13, 2025
  • flux-1-kontext-dev Public template

    12B model from Black Forest Labs that allows in‑context image editing with character and style consistency; supporting iterative, instruction-guided edits. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>

    Python 0 0 0 0 Updated Jul 13, 2025
  • gemma-3n-e4b-it Public template

    8B variant of the lightweight Gemma 3n series that operates with a 4B‑parameter memory footprint, enabling full multimodal inference (text, image, audio, video) on resource‑constrained hardware. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>

    Python 0 0 0 0 Updated Jul 13, 2025
  • qwen3-embedding-0.6b Public template

    600M parameter, 100 language embedding model that turns up to 32k token inputs into instruction-aware vectors. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>

    Python 0 0 0 0 Updated Jun 23, 2025
  • devstral-small Public template

    An agentic LLM for software engineering tasks, excels at using tools to explore codebases, editing multiple files and power software engineering agents. <metadata> gpu: A100 | collections: ["HF_Transformers"] </metadata>

    Python 0 0 0 0 Updated Jun 23, 2025
  • deepseek-r1-qwen3-8b Public template

    A distilled 8B parameter reasoning powerhouse, leveraging deep chain‑of‑thought from the DeepSeek R1‑0528—delivering SOTA open‑source performance. <metadata> gpu: A100 | collections: ["HF Transformers"] </metadata>

    Python 0 0 0 0 Updated Jun 23, 2025
  • nanonets-ocr-s Public template

    Nanonets-OCR-s that turns images or PDFs into structured Markdown capturing tables, LaTeX, captions and tags—for fast, powerful, human-readable OCR. <metadata> gpu: A10 | collections: ["HF_Transformers"] </metadata>

    Python 0 2 0 0 Updated Jun 23, 2025
  • Python 0 0 0 0 Updated Jun 11, 2025
  • Python 0 1 0 0 Updated May 20, 2025
  • kokoro Public template

    82M parameters lightweight text-to-speech (TTS) model that delivers high-quality voice synthesis. <metadata> gpu: T4 | collections: ["SSE Events"] </metadata>

    Python 1 1 0 0 Updated May 19, 2025

Most used topics

Loading…