BudConnect is a cloud service that provides compatibility checking and update synchronization for Bud inference runtimes on customer infrastructure. It acts as …
[EMNLP 2024 Industry Track] This is the official PyTorch implementation of "LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression To…
CacheEval is a prompt-level caching accuracy evaluation system intended to check the accuracy and performance of a prompt caching system like GPTCache …
RAGFlow is a leading open-source Retrieval-Augmented Generation (RAG) engine that fuses cutting-edge RAG with Agent capabilities to create a superior context la…
Multi-source LLM model catalog with cost-accurate pricing. Fetches model metadata from LiteLLM and truefoundry/models, merges them with cost-accurate pricing, f…
LayerZero is a GenAI kernel orchestration and dispatch system that allows inference engine developers to easily develop cross-platform inference engines.
Generative AI Examples is a collection of GenAI examples, such as ChatQnA and Copilot, which illustrate the pipeline capabilities of the Open Platform for Enterpris…