Skip to content
Change the repository type filter

All

    Repositories list

    • Streamlit for FOIArchive search GUI
      Python
      3200Updated Nov 20, 2025Nov 20, 2025
    • specific chat agent for history lab
      TypeScript
      0170Updated Oct 15, 2025Oct 15, 2025
    • Searches vectorize for similar chunks. Exports a worker entrypoint for chat agent
      TypeScript
      0000Updated Aug 30, 2025Aug 30, 2025
    • foiarchive-api

      Public archive
      This API was officially retired in August 2025.
      Python
      03103Updated Aug 19, 2025Aug 19, 2025
    • HLSTM

      Public
      R
      0000Updated Jul 16, 2025Jul 16, 2025
    • Handles registration/login and form submission
      TypeScript
      0000Updated Jun 2, 2025Jun 2, 2025
    • R interface with History Lab's API
      R
      1000Updated Apr 28, 2025Apr 28, 2025
    • Stata package to access History Lab API
      Stata
      1000Updated Apr 28, 2025Apr 28, 2025
    • Scripts, configuration and examples for the PostgREST proof of concept
      PLpgSQL
      0100Updated Apr 21, 2025Apr 21, 2025
    • TypeScript
      0000Updated Apr 15, 2025Apr 15, 2025
    • workers for creating a collection on the Ramus Network
      TypeScript
      0000Updated Apr 12, 2025Apr 12, 2025
    • Workers that handle bulk file uploads
      TypeScript
      0010Updated Apr 10, 2025Apr 10, 2025
    • Generates embeddings and stores them in vectorize. RPCed by chunker
      TypeScript
      0000Updated Apr 10, 2025Apr 10, 2025
    • chunker

      Public
      gets triggered by R2 upload + add to queue, chunks up text and RPCs the embedder
      TypeScript
      0000Updated Mar 31, 2025Mar 31, 2025
    • TypeScript
      0010Updated Mar 21, 2025Mar 21, 2025
    • Search dashboard of History Lab's unified COVID-19 collection
      Python
      0000Updated Feb 5, 2025Feb 5, 2025
    • Utility that takes a FOIArchive database SQL query as input and produces the result set in a JSON file.
      Python
      0000Updated Jan 9, 2025Jan 9, 2025
    • History Lab COVID-19 Archive Prototype
      Python
      0000Updated Nov 20, 2024Nov 20, 2024
    • pplc

      Public
      pandemic program link checker
      Python
      0000Updated Oct 31, 2024Oct 31, 2024
    • Database views and scripts that support the Mosaic LLM project
      Shell
      0000Updated Oct 31, 2024Oct 31, 2024
    • Corpus-specific schema objects for UN Archives metadata and text
      0000Updated Oct 31, 2024Oct 31, 2024
    • Course materials from the 2023 & 2024 Archiving Digital Records track of the Archives as Data Summer Institute.
      1300Updated Oct 30, 2024Oct 30, 2024
    • Scripts for updating corpus-specific topic models in the FOIArchive database.
      Shell
      0000Updated Oct 21, 2024Oct 21, 2024
    • SQL scripts for dumping FOIArchive data to CSV
      0000Updated Sep 27, 2024Sep 27, 2024
    • Jupyter Notebook
      1000Updated May 20, 2024May 20, 2024
    • Example of querying the FOIArchive REST API via a Python program
      Jupyter Notebook
      0000Updated Feb 27, 2024Feb 27, 2024
    • Research project investigating OCR evaluation mechanisms at Columbia's History Lab.
      Python
      1000Updated Feb 13, 2024Feb 13, 2024
    • Downloads PDFs and stores the text in the FOIArchive database and a copy in an s3 bucket
      Python
      0000Updated Jan 28, 2024Jan 28, 2024
    • Scripts for preprocessing and loading of metadata and text for the History Lab-Muckrock COVID-19 Collection
      Python
      0000Updated Oct 16, 2023Oct 16, 2023
    • piir-eval

      Public
      Framework for PII redaction evaluation
      PLpgSQL
      0100Updated Apr 28, 2023Apr 28, 2023