Repositories list
135 repositories
- Efficient Triton Kernels for LLM Training
- Multi-hop declarative data pipelines
- venice (Public)
- Open Control Plane for Tables in Data Lakehouse
- ignite-3 (Public)
- iceberg (Public)
- gobblin-elr (Public)
- avro-util (Public)
- datahub-gma (Public)
- linkedin.github.com (Public)
- coral (Public)
- Burrow (Public)
- isolation-forest (Public): A distributed Spark/Scala implementation of the isolation forest algorithm for unsupervised outlier detection, featuring support for scalable training and ONNX export for easy cross-platform inference.
- An extensible distributed system for reliable nearline data streaming at scale
- cruise-control (Public): Cruise-control is the first of its kind to fully automate the dynamic workload rebalance and self-healing of a Kafka cluster. It provides great value to Kafka users by simplifying the operation of Kafka clusters.
- rest.li (Public)
- ghc25-ds-workshop (Public)
- transport (Public)
- fmchisel (Public)
- goavro (Public)