Skip to content

Annotation and Analysis

HaileyPunis edited this page Apr 25, 2026 · 10 revisions

This document provides an overview of DAILP’s annotation and analysis processes. The pages under Annotation and Analysis contain details of the methods and practices for these processes on our website that include:

  • Language Model: This page describes our annotation and analysis processes after 2024 that include our community-based design, our different levels of analysis, and our language audio. The categories of Edited Collections, Document-Level Analysis, and Word-Level Analysis fall under our different levels of analysis and language annotation.

    • Edited Collections: This page summarizes how DAILP’s edited collections are scoped, created, organized, and annotated after 2024.
    • Document-Level Analysis: This page provides an outline of how document-level information is organized, annotated, and analyzed by our translation team after 2024.
    • Word-Level Analysis:This page outlines how DAILP represents word-level language data since 2024.
  • User Contributed Audio: This page describes the current process of how Editors or Contributors can upload and publish audio to the DAILP site.

  • Audio Data Process: This page describes DAILP’s past audio data process from before 2024 and our current audio data process.

  • Manuscript Annotation and Analysis: This page explains DAILP's internal annotation process. This includes the different layers of our annotation and translation processes as well as our internal goals and annotation workflows.

  • Language Specific Limitations: This page describes the language specific limitations of our current project design as well as what we resolved and hope to resolve in the future as we expand to help support other languages.

  • Annotation and Analysis (Before 2024): This page provides a description of DAILP’s data annotation and analysis processes prior to 2024.

Clone this wiki locally