    Repositories list

    • the pipeline from DPA site link to structured database of reports
      Jupyter Notebook
      Creative Commons Zero v1.0 Universal
      1060 Updated Mar 5, 2026
    • TFC

      Public
      Python
      0210 Updated Mar 4, 2026
    • Public resources about investigating or reporting police misconduct, shared at a public event in Oakland CA in January 2026
      0320 Updated Feb 16, 2026
    • StanLCMCR

      Public
      R
      GNU General Public License v3.0
      0300 Updated Feb 14, 2026
    • decentralized storage layer for community archives
      Python
      2111 Updated Feb 10, 2026
    • verdata

      Public
      A tool for using and analyzing the data on the armed conflict in Colombia produced by the joint JEP-CEV-HRDAG project.
      R
      GNU General Public License v2.0
      61640 Updated Feb 4, 2026
    • This is intended to be a public repo that uses the HRDAG/US-II-MP missing persons database as an input to a brief series of Jupyter notebooks, covering various …
      Jupyter Notebook
      GNU General Public License v3.0
      0210 Updated Dec 1, 2025
    • Collecting, processing, and linking data from the U.S. carceral system.
      HTML
      GNU General Public License v3.0
      0010 Updated Oct 1, 2025
    • DGA

      Public
      code for capture-recapture estimation using decomposable graphical models
      R
      2100 Updated Sep 21, 2025
    • n2s

      Public
      This moves data from the NAS to S3
      Python
      0000 Updated Sep 6, 2025
    • dsg

      Public
      A simple data versioning system
      Python
      0110 Updated Jul 21, 2025
    • HTML
      0000 Updated Jun 23, 2025
    • A hub for our past, present, and future LLM explorations. Includes some general training-style examples and real-world, project-specific design examples.
      Python
      Creative Commons Zero v1.0 Universal
      01110 Updated May 13, 2025
    • This is a public repo for the data and analyses performed on emergency response data from the Chicago Office of Emergency Management & Communication ("OEMC") pr…
      Jupyter Notebook
      1410 Updated May 5, 2025
    • My experience moving a trove of docs into IPFS with postgres metadata
      Python
      0700 Updated Feb 28, 2025
    • HTML
      0220 Updated Sep 12, 2024
    • materials to study and learn about principled data processing
      Jupyter Notebook
      GNU General Public License v3.0
      514160 Updated Jan 16, 2024
    • libsnap

      Public
      library code for all the snap* tools
      Shell
      GNU General Public License v3.0
      2010 Updated Jul 24, 2023
    • This repo is going to hold the data and work related to an investigation into Chicago Police arrests of persons on a state registry, in particular, the sex offe…
      Python
      GNU General Public License v3.0
      0010 Updated Jun 24, 2023
    • snapback

      Public
      snapshot/hardlink based backups
      Shell
      GNU General Public License v3.0
      0000 Updated Jun 22, 2023
    • snapcrypt

      Public
      encrypted snapshot backups to external hard drives with LUKS
      Shell
      GNU General Public License v3.0
      0110 Updated May 4, 2023
    • snap

      Public
      utilities to manage data directories alongside git-managed code directories.
      Shell
      GNU General Public License v3.0
      2400 Updated Feb 11, 2023
    • workflows

      Public
      Shared workflow repository
      GNU General Public License v2.0
      0011 Updated Aug 7, 2022
    • mse-prox

      Public
      code that manages processing MSE scripts through Amazon SQS
      Python
      1120 Updated Mar 28, 2022
    • CSS
      0100 Updated Jul 14, 2021
    • Statistical analysis of the total number of people killed in drug-related killings
      TeX
      GNU General Public License v3.0
      2500 Updated Aug 20, 2019
    • jsweave

      Public
      code to weave data literals into LaTeX, with defaults
      Python
      GNU General Public License v3.0
      0030 Updated Mar 18, 2019
    • filr

      Public
      a package to manage file metadata in R scripts
      R
      GNU General Public License v3.0
      0110 Updated Mar 15, 2019
    • makr

      Public archive
      tools for recursive make in principled data processing projects
      Python
      0330 Updated Jan 2, 2019