
Commit 8737643

Remove Lf metrics from Metrics page on hud (#6413)
Now that the migration to support autoscaled runners in LF is finished and the setup has been stable for a few months, the following graphs on the HUD metrics page are no longer relevant:

<img width="619" alt="Screenshot 2025-03-14 at 18 47 13" src="https://github.com/user-attachments/assets/4662540e-1fc5-4ce0-b631-72110ab91644" />

This change removes them from that page, along with the `lf_rollover_health` query that backs them. The dashboard that displays the percentage of usage in each fleet is still very relevant, so we are keeping it.

# Release and rollback

CI in this repository triggers a release every time changes are merged to main. So, for both release and rollback, merge to main and wait for the changes to be reflected in hud.pytorch.org.
1 parent de00dac commit 8737643

3 files changed: +0 -154 lines changed

torchci/clickhouse_queries/lf_rollover_health/params.json

-6
This file was deleted.

torchci/clickhouse_queries/lf_rollover_health/query.sql

-107
This file was deleted.

torchci/pages/metrics.tsx

-41
@@ -901,47 +901,6 @@ export default function Page() {
           </p>
         </Grid2>
 
-        <Grid2 size={{ xs: 12 }} height={ROW_HEIGHT}>
-          <TimeSeriesPanel
-            title={"LF vs Meta: Success rate delta"}
-            queryName={"lf_rollover_health"}
-            queryParams={{ ...timeParams, days_ago: timeRange }}
-            granularity={"day"}
-            timeFieldName={"bucket"}
-            yAxisLabel={"rate delta"}
-            yAxisFieldName={"success_rate_delta"}
-            yAxisRenderer={(value) => value}
-            groupByFieldName={"job_name"}
-          />
-        </Grid2>
-
-        <Grid2 size={{ xs: 12 }} height={ROW_HEIGHT}>
-          <TimeSeriesPanel
-            title={"LF vs Meta: Cancelled rate delta"}
-            queryName={"lf_rollover_health"}
-            queryParams={{ ...timeParams, days_ago: timeRange }}
-            granularity={"day"}
-            timeFieldName={"bucket"}
-            yAxisLabel={"rate delta"}
-            yAxisFieldName={"cancelled_rate_delta"}
-            yAxisRenderer={(value) => value}
-            groupByFieldName={"job_name"}
-          />
-        </Grid2>
-
-        <Grid2 size={{ xs: 12 }} height={ROW_HEIGHT}>
-          <TimeSeriesPanel
-            title={"LF vs Meta: Duration increase ratio"}
-            queryName={"lf_rollover_health"}
-            queryParams={{ ...timeParams, days_ago: timeRange }}
-            granularity={"day"}
-            timeFieldName={"bucket"}
-            yAxisLabel="increase ratio"
-            yAxisFieldName={"success_duration_increase_ratio"}
-            yAxisRenderer={(value) => value}
-            groupByFieldName={"job_name"}
-          />
-        </Grid2>
         <Grid2 size={{ xs: 12 }} height={ROW_HEIGHT}>
           <TimeSeriesPanel
             title={"Percentage of jobs rolled over to Linux Foundation"}
