    Repositories list

    • snap

      Public
      Scalable Nucleotide Alignment Program -- a fast and accurate read aligner for high-throughput sequencing data
      C++
      Apache License 2.0
      63298309 Updated Sep 6, 2025
    • succinct

      Public
      Enabling queries on compressed data.
      Java
      Apache License 2.0
      6928253 Updated Dec 16, 2023
    • ernest

      Public
      Code for Ernest
      Python
      Apache License 2.0
      193322 Updated Jul 6, 2023
    • graphx

      Public
      Former GraphX development repository. GraphX has been merged into Apache Spark; please submit pull requests there.
      Scala
      Apache License 2.0
      1013591813 Updated Dec 5, 2022
    • zipg

      Public
      A Memory-efficient Graph Store for Interactive Queries
      Java
      41303 Updated Sep 1, 2021
    • smash

      Public
      Benchmarking toolkit for variant calling
      Python
      BSD 2-Clause "Simplified" License
      134841 Updated Oct 13, 2020
    • Succinct C++
      C++
      724131 Updated Sep 13, 2020
    • iolap

      Public
      Scala
      Apache License 2.0
      71101 Updated Jul 23, 2020
    • SparkNet

      Public
      Distributed Neural Networks for Spark
      Scala
      MIT License
      170611264 Updated Jul 23, 2020
    • sprint

      Public
      Sprint Transformations for RegEx queries
      C++
      3900 Updated Oct 1, 2019
    • Drizzle integration with Apache Spark
      Scala
      Apache License 2.0
      3412010 Updated Sep 11, 2018
    • cyclades

      Public
      Cyclades
      C++
      Apache License 2.0
      102802 Updated Apr 7, 2018
    • HTML
      0000 Updated Jan 15, 2018
    • spark-ec2

      Public archive
      Scripts used to set up a Spark cluster on EC2
      Python
      Apache License 2.0
      2923874516 Updated Nov 22, 2017
    • R Codebase for BISCUIT: Infinite Mixture Model to cluster and impute single cells.
      R
      32000 Updated Nov 3, 2017
    • ampcrowd

      Public
      A RESTful web service that runs microtasks across multiple crowds, provides quality control techniques, and is easily extensible.
      Python
      Apache License 2.0
      165271 Updated Jul 2, 2017
    • keystone

      Public
      Simplifying robust end-to-end machine learning on Apache Spark.
      Scala
      Apache License 2.0
      117475390 Updated Apr 18, 2017
    • Scala
      Apache License 2.0
      26110230 Updated Apr 17, 2017
    • Rust
      2300 Updated Mar 30, 2017
    • An efficient updatable key-value store for Apache Spark
      Scala
      Apache License 2.0
      77254174 Updated Mar 11, 2017
    • An example skeleton for an application built on top of KeystoneML
      Shell
      4810 Updated Mar 5, 2017
    • ml-matrix

      Public
      Distributed Matrix Library
      Scala
      Apache License 2.0
      347230 Updated Jan 28, 2017
    • HTML
      4900 Updated Nov 29, 2016
    • MLI

      Public
      An API for Distributed Machine Learning
      Scala
      5915612 Updated Sep 22, 2016
    • ray-core

      Public
      Experiments for the Ray backend
      C++
      3301 Updated Aug 6, 2016
    • Build artifacts for Ray Core
      C++
      Apache License 2.0
      1000 Updated Aug 6, 2016
    • numbuf

      Public
      Numerical Buffers
      C++
      Apache License 2.0
      0020 Updated Jul 28, 2016
    • mojo

      Public
      C++
      BSD 3-Clause "New" or "Revised" License
      57000 Updated Jun 17, 2016
    • caffe

      Public
      Caffe: a fast open framework for deep learning.
      C++
      Other
      19k300 Updated May 19, 2016
    • benchmark

      Public
      Large scale query engine benchmark
      Python
      649955 Updated Apr 5, 2016