liranschour

Follow

liranschour

Follow

Achievements

Achievements

Popular repositories Loading

LMCache LMCache Public

Forked from LMCache/LMCache

Redis for LLMs

Python 1
ovs ovs Public

C
vllm vllm Public

Forked from vllm-project/vllm

A high-throughput and memory-efficient inference and serving engine for LLMs

Python
nixl nixl Public

Forked from ai-dynamo/nixl

NVIDIA Inference Xfer Library (NIXL)

C++
llm-d llm-d Public

Forked from llm-d/llm-d

Achieve state of the art inference performance with modern accelerators on Kubernetes

Shell
uccl uccl Public

Forked from uccl-project/uccl

UCCL is an efficient communication library for GPUs, covering collectives, P2P (e.g., KV cache transfer, RL weight transfer), and EP (e.g., GPU-driven)

C++