Features/alignment files indexing on ingestion json #574

noctillion · 2025-04-07T14:47:12Z

Index cram and bam files if the index is not available on ingestion in the experiments_json_with_files workflow.

…stion_json

codecov · 2025-04-07T19:04:36Z

Codecov Report

All modified and coverable lines are covered by tests ✅

Project coverage is 92.91%. Comparing base (1ddf753) to head (e46fa39).

Additional details and impacted files

@@           Coverage Diff            @@
##           develop     #574   +/-   ##
========================================
  Coverage    92.91%   92.91%           
========================================
  Files          124      124           
  Lines         4486     4486           
  Branches       391      391           
========================================
  Hits          4168     4168           
  Misses         228      228           
  Partials        90       90

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

davidlougheed · 2025-04-16T19:15:01Z

chord_metadata_service/chord/workflows/wdls/experiments_json_with_files.wdl

+            "~{drs_url}/ingest")
+        echo "$resp_main" | jq -c
+
+        # If it's BAM or CRAM, ingest the index as well


this part of the code still isn't matching desired behaviour exactly - indices should be ingested if they're listed in the JSON; or, if no index is specified in the JSON, one should be generated (maybe in the python script using subprocess?) and put into the JSON. I think also prepare_for_drs should extract index files as well as file paths, and this task should stay naive to the type of file it is ingesting.

I think for now the best way to write this workflow might unfortunately just be as a single big Python task which uses the requests library to ingest into DRS and, if needed, uses subprocess+samtools to generate indices for files missing the indices, rewriting the experiments JSON as it goes and then POSTing it to Katsu at the end.

noctillion added 3 commits April 7, 2025 10:35

rf: task write drs responses to file

4b8927d

implement cram bam indexing on ingestion

040d6a4

Merge branch 'develop' into features/alignment_files_indexing_on_inge…

3b632c6

…stion_json

noctillion requested a review from davidlougheed April 7, 2025 19:08

update experiment_json with cram bam index information

e46fa39

davidlougheed requested changes Apr 16, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Features/alignment files indexing on ingestion json #574

Features/alignment files indexing on ingestion json #574

noctillion commented Apr 7, 2025

codecov bot commented Apr 7, 2025 •

edited

Loading

davidlougheed Apr 16, 2025

davidlougheed Apr 16, 2025

Features/alignment files indexing on ingestion json #574

Are you sure you want to change the base?

Features/alignment files indexing on ingestion json #574

Conversation

noctillion commented Apr 7, 2025

codecov bot commented Apr 7, 2025 • edited Loading

Codecov Report

davidlougheed Apr 16, 2025

Choose a reason for hiding this comment

davidlougheed Apr 16, 2025

Choose a reason for hiding this comment

codecov bot commented Apr 7, 2025 •

edited

Loading