Skip to content
Change the repository type filter

All

    Repositories list

    • Spark-Dashboard is a solution for monitoring Apache Spark jobs. This repository provides the tooling and configuration for deploying an Apache Spark Performance…
      Dockerfile
      Apache License 2.0
      2013410Updated Jan 5, 2026Jan 5, 2026
    • This is a mirror of https://github.com/LucaCanali/sparkMeasure - sparkMeasure is a tool for performance troubleshooting of Apache Spark workloads. It simplifies…
      Scala
      Apache License 2.0
      31600Updated Oct 3, 2025Oct 3, 2025
    • Python DataSource for Apache Spark 4 to read ROOT files (High Energy Physics) as DataFrames, powered by uproot, awkward, and PyArrow.
      Python
      Apache License 2.0
      0100Updated Oct 2, 2025Oct 2, 2025
    • Grafana Mimir dashboards used for cardinality exploration
      Apache License 2.0
      96710Updated Sep 17, 2025Sep 17, 2025
    • Material for the course "Introduction to Apache Spark APIs for Data Processing" https://sparktraining.web.cern.ch/
      Jupyter Notebook
      Creative Commons Attribution 4.0 International
      81800Updated May 13, 2025May 13, 2025
    • Code and examples of how to write and deploy Apache Spark Plugins. Spark plugins allow runnig custom code on the executors as they are initialized. This also al…
      Scala
      Apache License 2.0
      149430Updated May 9, 2025May 9, 2025
    • Spark Executor Plugins Examples for Spark 2.4
      Java
      Apache License 2.0
      2600Updated May 7, 2025May 7, 2025
    • Contrib repository for the OpenTelemetry Collector
      Go
      Apache License 2.0
      3.4k000Updated Apr 12, 2025Apr 12, 2025
    • Mirror of CERN db/hadoop-xrootd. Hadoop-XRootD Filesystem Connector
      Java
      Apache License 2.0
      3631Updated Sep 25, 2024Sep 25, 2024
    • Code and links to the data for the article "Machine Learning Pipelines with Modern Big DataTools for High Energy Physics"
      Jupyter Notebook
      Apache License 2.0
      143100Updated Jun 11, 2024Jun 11, 2024
    • argo-helm

      Public
      ArgoProj Helm Charts
      Mustache
      Apache License 2.0
      2.1k000Updated May 28, 2024May 28, 2024
    • This repository contains Jupyter notebook examples, intended to be linked with the SWAN Gallery
      Jupyter Notebook
      Apache License 2.0
      1100Updated May 16, 2024May 16, 2024
    • Aiven's JDBC Sink and Source Connectors for Apache Kafka®
      Java
      Apache License 2.0
      58000Updated Nov 8, 2023Nov 8, 2023
    • zkpolicy

      Public
      Zookeeper Policy Audit Tool (aka zkPolicy) for checking and enforcing ACLs on ZNodes.
      Java
      MIT License
      1710Updated Oct 25, 2023Oct 25, 2023
    • dbod-api

      Public
      DB On Demand API
      Python
      GNU General Public License v3.0
      3492Updated Aug 14, 2023Aug 14, 2023
    • TF-Spawner is an experimental tool for running TensorFlow distributed training on Kubernetes clusters.
      Python
      Apache License 2.0
      2800Updated Mar 22, 2023Mar 22, 2023
    • Unified RESTful interface for managing CERNs data storage back-ends
      Python
      GNU General Public License v3.0
      2712Updated Jan 31, 2022Jan 31, 2022
    • Python Re-implementation of the cern-get-sso-cookie functionality
      Python
      61110Updated Jan 11, 2022Jan 11, 2022
    • Analyzes network traffic of HBase RegionServers
      Clojure
      Apache License 2.0
      5100Updated Nov 5, 2021Nov 5, 2021
    • A re-implementation of (parts of) NetApp's ZAPI in idiomatic Python using Requests
      Python
      GNU General Public License v3.0
      1300Updated Sep 13, 2021Sep 13, 2021
    • binderhub

      Public
      Run your code in the cloud, with technology so advanced, it feels like magic!
      Python
      BSD 3-Clause "New" or "Revised" License
      402000Updated Aug 19, 2021Aug 19, 2021
    • Java
      GNU General Public License v3.0
      0350Updated Mar 12, 2021Mar 12, 2021
    • Set of valves classes that helps CERN applications with the integration in the CERN Authentication
      Java
      GNU General Public License v3.0
      0200Updated Oct 22, 2020Oct 22, 2020
    • This image generates configuration and war files for Oracle Rest DataServices based on data provided by dadEdit3 database.
      Python
      GNU General Public License v3.0
      0000Updated Sep 4, 2020Sep 4, 2020
    • dbod-web

      Public
      Future DB On Demand Web Interface implementation
      TypeScript
      MIT License
      35152Updated Aug 28, 2020Aug 28, 2020
    • dbod-core

      Public
      DB On Demand management infrastructure core library
      Perl
      GNU General Public License v3.0
      05200Updated Apr 1, 2020Apr 1, 2020
    • Java
      GNU General Public License v3.0
      0001Updated Mar 5, 2020Mar 5, 2020
    • TypeScript
      GNU General Public License v3.0
      1000Updated Feb 20, 2020Feb 20, 2020
    • HDFS Connector for Oracle Cloud Infrastructure
      Java
      Other
      26000Updated Jan 20, 2020Jan 20, 2020
    • Rundeck plugin running jobs on Nomad cluster.
      Java
      MIT License
      6000Updated Aug 9, 2019Aug 9, 2019