You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Add Terra performance analysis best practices to CLAUDE.md
Document lessons learned from regression analysis of filter_bam_to_taxa:
- Use GCS file timestamps (not Python logs) for accurate task timing
- Use wildcards in gcloud storage ls for efficient batch queries
- Handle attempt-* directories for preempted tasks
- Match samples to workflows by scanning stderr content
Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>
Copy file name to clipboardExpand all lines: CLAUDE.md
+47Lines changed: 47 additions & 0 deletions
Display the source diff
Display the rich diff
Original file line number
Diff line number
Diff line change
@@ -188,3 +188,50 @@ Image versions are pinned in `requirements-modules.txt` and must be kept in sync
188
188
## Dockstore Integration
189
189
190
190
Workflows are registered on Dockstore for easy import to Terra, DNAnexus, and other platforms. The `.dockstore.yml` file defines all published workflows and their test parameter files.
191
+
192
+
## Terra Performance Analysis
193
+
194
+
When analyzing workflow performance from Terra submissions, use the Terra MCP tools for structure/status queries and direct GCS access for log analysis.
195
+
196
+
### Timing Methodology for WDL Tasks
197
+
198
+
When measuring task execution time from Terra logs:
199
+
200
+
1.**Start time**: Use first Python log timestamp in stderr
0 commit comments