Analyze metadata field usage across CAFE collection #405

@Saixel

Summary
Produce a report showing how often metadata fields are populated across CAFE datasets.
Results will inform which fields to prioritize, keep visible, or hide.

Decisions needed

  • Target scope: which collections/subcollections should the report cover?
  • Metadata scope: which blocks/fields should be included (Citation, Geospatial, custom blocks, etc.)?

Approach

  • Prepare read-only SQL that counts non-empty values per field and computes coverage, i.e. the percentage of in-scope datasets that have the field populated.
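
A minimal sketch of the coverage calculation described above, run against an in-memory SQLite database so it can be tried without touching any real system. The table and column names (`dataset`, `field`, `field_value`) and the helper names are hypothetical and deliberately simplified; they are not the production schema, and the real query would need to be adapted to it. A value is treated as "non-empty" when it is neither NULL nor whitespace-only, and multi-valued fields count once per dataset.

```python
import csv
import sqlite3

# Hypothetical, simplified schema for illustration only (NOT the production schema).
DEMO_SCHEMA = """
CREATE TABLE dataset (id INTEGER PRIMARY KEY);
CREATE TABLE field (id INTEGER PRIMARY KEY, name TEXT NOT NULL);
CREATE TABLE field_value (
    dataset_id INTEGER NOT NULL REFERENCES dataset(id),
    field_id   INTEGER NOT NULL REFERENCES field(id),
    value      TEXT
);
"""

# SELECT-only query: one row per field with the number of datasets that have
# at least one non-empty value, the total dataset count, and % coverage.
COVERAGE_SQL = """
SELECT f.name AS field_name,
       COUNT(DISTINCT CASE
                 WHEN v.value IS NOT NULL AND TRIM(v.value) <> ''
                 THEN v.dataset_id END) AS populated,
       t.total AS total_datasets,
       ROUND(100.0 * COUNT(DISTINCT CASE
                 WHEN v.value IS NOT NULL AND TRIM(v.value) <> ''
                 THEN v.dataset_id END) / t.total, 1) AS pct_coverage
FROM field AS f
CROSS JOIN (SELECT COUNT(*) AS total FROM dataset) AS t
LEFT JOIN field_value AS v ON v.field_id = f.id
GROUP BY f.id, f.name, t.total
ORDER BY pct_coverage DESC, f.name
"""


def build_demo_db():
    """Create an in-memory database with made-up sample rows."""
    conn = sqlite3.connect(":memory:")
    conn.executescript(DEMO_SCHEMA)
    conn.executemany("INSERT INTO dataset (id) VALUES (?)", [(1,), (2,), (3,)])
    conn.executemany(
        "INSERT INTO field (id, name) VALUES (?, ?)",
        [(1, "title"), (2, "keyword"), (3, "geographicCoverage"), (4, "depositor")],
    )
    conn.executemany(
        "INSERT INTO field_value (dataset_id, field_id, value) VALUES (?, ?, ?)",
        [
            (1, 1, "Heat exposure"), (2, 1, "Air quality"), (3, 1, "Flood risk"),
            (1, 2, "climate"), (1, 2, "health"),  # multi-valued: counts once
            (3, 2, "flooding"),
            (1, 3, "United States"),
            (2, 3, ""),                            # empty string: not populated
            (3, 3, "   "),                         # whitespace only: not populated
        ],
    )
    return conn


def field_coverage(conn):
    """Return (field_name, populated, total_datasets, pct_coverage) tuples."""
    return conn.execute(COVERAGE_SQL).fetchall()


def write_coverage_csv(rows, out_file):
    """Write the coverage rows, with a header line, to an open text file."""
    writer = csv.writer(out_file)
    writer.writerow(["field_name", "populated", "total_datasets", "pct_coverage"])
    writer.writerows(rows)


if __name__ == "__main__":
    for row in field_coverage(build_demo_db()):
        print(row)
```

Using `LEFT JOIN` from the field list keeps fields that are never populated in the report with 0% coverage, which matters when the goal is deciding what to hide. Restricting the report to particular collections or blocks would be an extra `WHERE`/join on top of this, once the scope is confirmed.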

Next Steps

  • Confirm scope (collections + blocks/fields).
  • Adapt/validate SQL against a non-prod snapshot.
  • Coordinate a low-impact window to run the queries in production (SELECT statements only).
  • Share CSVs + brief findings.

Metadata

Labels

  • FY26 Sprint 8 (2025-10-08 - 2025-10-22)
  • FY26 Sprint 9 (2025-10-22 - 2025-11-05)
  • FY26 Sprint 10 (2025-11-05 - 2025-11-19)
  • FY26 Sprint 11 (2025-11-20 - 2025-12-03)
  • FY26 Sprint 12 (2025-12-03 - 2025-12-17)
  • NIH CAFE: Issues associated with the NIH CAFE project
  • Size: 33 (a percentage of a sprint)

Projects

Status

In Progress 💻
