Pinned

  1. llm-on-openshift Public

    Resources, demos, and recipes for working with LLMs on OpenShift with OpenShift AI or Open Data Hub.

    Python 146 139

  2. multi-gpu-llms Public

    Repository for deploying LLMs across multiple GPUs on distributed Kubernetes nodes

    Jupyter Notebook 28 13

  3. gpu-partitioning-guide Public

    Repository to demo GPU Sharing with Time Slicing, MPS, MIG and others

    Jupyter Notebook 56 12

  4. models-aas Public

    Models as a Service

    Liquid 73 28

  5. litemaas Public

    LiteMaaS is a proof-of-concept application for managing LLM subscriptions, API keys, and usage tracking. It seamlessly integrates with LiteLLM to provide a unified interface for accessing multiple …

    TypeScript 36 21

  6. sardeenz Public

    Sardeenz is a proof-of-concept application that allows you to load more than one model on a given GPU. It allows you to add more and more models onto a GPU, until it is fully utilized.

    TypeScript 18 2

Repositories

Showing 10 of 132 repositories
  • rh-aiservices-bu/fraud-detection
    Jupyter Notebook 30 Apache-2.0 89 6 9 Updated Jan 22, 2026
  • s4 Public

    Super Simple Storage Service

    0 Apache-2.0 0 0 0 Updated Jan 22, 2026
  • rh-aiservices-bu/llm-d-playbook
    Python 2 0 0 0 Updated Jan 22, 2026
  • ts-nvml Public

    TypeScript bindings for NVML library

    TypeScript 0 Apache-2.0 0 0 0 Updated Jan 21, 2026
  • sardeenz Public

    Sardeenz is a proof-of-concept application that allows you to load more than one model on a given GPU. It allows you to add more and more models onto a GPU, until it is fully utilized.

    TypeScript 18 Apache-2.0 2 2 0 Updated Jan 20, 2026
  • litemaas Public

    LiteMaaS is a proof-of-concept application for managing LLM subscriptions, API keys, and usage tracking. It seamlessly integrates with LiteLLM to provide a unified interface for accessing multiple LLMs with comprehensive budget management.

    TypeScript 36 MIT 21 8 0 Updated Jan 14, 2026
  • rh-kb-chat Public
    Jupyter Notebook 17 MIT 21 14 0 Updated Jan 13, 2026
  • qaroot Public
    TypeScript 0 1 2 0 Updated Jan 12, 2026
  • rh-aiservices-bu/rh1-llmd-lab-2026
    Smarty 1 ISC 0 0 0 Updated Jan 8, 2026
  • llm-on-openshift Public

    Resources, demos, and recipes for working with LLMs on OpenShift with OpenShift AI or Open Data Hub.

    Python 146 Apache-2.0 139 3 0 Updated Jan 7, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.