Skip to content
@basetenlabs

Baseten

Machine learning infrastructure for developers

Welcome to Baseten

Baseten is an AI infrastructure platform. We combine applied performance research, distributed multi-cloud infrastructure, and developer tooling to run models of all modalities in production.

Get started:

  • Deploy an open-source model in two clicks from the model library.
  • Read our docs to package and serve a fine-tuned or custom model.

Pinned Loading

  1. truss truss Public

    The simplest way to serve AI/ML models in production

    Python 992 85

  2. truss-examples truss-examples Public

    Examples of models deployable with Truss

    Python 170 44

Repositories

Showing 10 of 58 repositories
  • dynamo Public Forked from ai-dynamo/dynamo

    A Datacenter Scale Distributed Inference Serving Framework

    basetenlabs/dynamo’s past year of commit activity
    Rust 0 Apache-2.0 382 0 5 Updated May 27, 2025
  • truss Public

    The simplest way to serve AI/ML models in production

    basetenlabs/truss’s past year of commit activity
    Python 992 MIT 85 63 (5 issues need help) 15 Updated May 27, 2025
  • TensorRT-LLM Public Forked from NVIDIA/TensorRT-LLM

    TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.

    basetenlabs/TensorRT-LLM’s past year of commit activity
    C++ 0 Apache-2.0 1,464 0 3 Updated May 27, 2025
  • truss-examples Public

    Examples of models deployable with Truss

    basetenlabs/truss-examples’s past year of commit activity
    Python 170 MIT 44 13 52 Updated May 24, 2025
  • action-junit-report Public Forked from mikepenz/action-junit-report

    Reports junit test results as GitHub Pull Request Check

    basetenlabs/action-junit-report’s past year of commit activity
    TypeScript 0 Apache-2.0 143 0 2 Updated May 15, 2025
  • nx-set-shas Public Forked from nrwl/nx-set-shas

    ✨ A Github Action which sets the base and head SHAs required for `nx affected` commands in CI

    basetenlabs/nx-set-shas’s past year of commit activity
    TypeScript 0 MIT 84 0 1 Updated May 15, 2025
  • changed-files Public Forked from tj-actions/changed-files

    :octocat: Github action to retrieve all (added, copied, modified, deleted, renamed, type changed, unmerged, unknown) files and directories.

    basetenlabs/changed-files’s past year of commit activity
    TypeScript 0 MIT 301 0 1 Updated May 15, 2025
  • basetenlabs/frontend-log-viewer-challenge’s past year of commit activity
    TypeScript 1 0 0 1 Updated May 15, 2025
  • create-pull-request Public Forked from peter-evans/create-pull-request

    A GitHub action to create a pull request for changes to your repository in the actions workspace

    basetenlabs/create-pull-request’s past year of commit activity
    TypeScript 0 MIT 520 0 1 Updated May 15, 2025
  • TensorRT-Model-Optimizer Public Forked from NVIDIA/TensorRT-Model-Optimizer

    A unified library of state-of-the-art model optimization techniques such as quantization, pruning, distillation, speculative decoding, etc. It compresses deep learning models for downstream deployment frameworks like TensorRT-LLM or TensorRT to optimize inference speed on NVIDIA GPUs.

    basetenlabs/TensorRT-Model-Optimizer’s past year of commit activity
    Python 0 71 0 2 Updated Apr 29, 2025

Most used topics

Loading…