Teerapat-Vatpitak
diff --git a/‎CHANGELOG.md‎
Lines changed: 1 addition & 1 deletion b/‎CHANGELOG.md‎
Lines changed: 1 addition & 1 deletion
diff --git a/‎POST_PUBLISH_ISSUES.md‎
Lines changed: 12 additions & 27 deletions b/‎POST_PUBLISH_ISSUES.md‎
Lines changed: 12 additions & 27 deletions
diff --git a/‎crates/mcp-loadtest-cli/src/cmd_cross.rs‎
Lines changed: 3 additions & 149 deletions b/‎crates/mcp-loadtest-cli/src/cmd_cross.rs‎
Lines changed: 3 additions & 149 deletions
diff --git a/‎crates/mcp-loadtest-cli/src/cmd_cross/render.rs‎
Lines changed: 162 additions & 0 deletions b/‎crates/mcp-loadtest-cli/src/cmd_cross/render.rs‎
Lines changed: 162 additions & 0 deletions
@@ -125,7 +125,7 @@ and this project adheres to [Semantic Versioning](https://semver.org/spec/v2.0.0
 - `DEFAULT_LEAK_THRESHOLD_MB_PER_SEC` → use `DEFAULT_LATENCY_DRIFT_MS_PER_SEC`. The old constant remains as an alias for one release and will be removed in v0.2.0.
 
 ### Notes
-- 11 source files have production code (excluding `#[cfg(test)] mod tests`) over the 300-line convention. They are split into "split candidates" (genuine refactor opportunities) and "borderline" (just over). See `POST_PUBLISH_ISSUES.md` for the per-file plan; tracked under M8.
+- ✅ The M8 file-split pass completed in the pre-publish review. All source files have production code (excluding `#[cfg(test)] mod tests`) under the 300-line convention. See `POST_PUBLISH_ISSUES.md` for the per-wave summary of what split where.
 - `serve` and `tui` modules will move behind cargo feature flags in a future release to keep the default build slim.
 - HTTP / SSE transport host-allowlist for SSRF defense is deferred. Currently mitigated by `Policy::none()` on redirects; the allowlist is operator-facing config and will land alongside the broader transport-hardening pass.
 
 
@@ -76,34 +76,19 @@ Each block below is in `gh issue create` shape — copy-paste-ready once the rep
 
 Files where production code (excluding `#[cfg(test)] mod tests`) exceeds the 300-line convention. The bracketed numbers are production LoC / total LoC.
 
-### `chore(refactor): split files > 300 production LoC (M8)`
+### ~~`chore(refactor): split files > 300 production LoC (M8)`~~ **COMPLETED in v0.1.0 pre-publish**
 
-**Body:**
-> Splits to land in v0.2. Each line is a separate sub-task; some are easy extractions, some are genuine module restructures. Per-file plan:
->
-> **Wave 1 — extract a helper (low risk):**
-> - [ ] `src/scenario/soak.rs` (358 prod LoC) — extract `detect_leak` + `LinReg` into `scenario/soak/leak_detect.rs`
-> - [ ] `src/report/html.rs` (336) — pull CSS constant + SVG chart renderer into `report/html/{css,chart}.rs`
-> - [ ] `src/scenario/spike.rs` (331) — barely over; trim or leave
->
-> **Wave 2 — per-component split (medium):**
-> - [ ] `crates/mcp-loadtest-cli/src/cmd_compare.rs` (503) — split into `cmd_compare/{types,diff,render,classify}.rs`
-> - [ ] `src/scenario/fuzzer.rs` (488) — split into `fuzzer/{payloads,driver,classify}.rs`
-> - [ ] `src/serve/tools.rs` (509) — split into `serve/tools/{deadlock_probe,sustained_load,compare_runs}.rs` + `mod.rs`
->
-> **Wave 3 — orchestrator / heavy types:**
-> - [ ] `src/run.rs` (447) — split into `run/{orchestrator,thresholds}.rs`
-> - [ ] `src/metrics/mod.rs` (426) — split into `metrics/{types,recorder,outcomes}.rs`
-> - [ ] `src/config.rs` (376) — split into `config/{schema,validate,example}.rs`
-> - [ ] `crates/mcp-loadtest-cli/src/main.rs` (434) — extract subcommand handlers into `main/cmd_*.rs`
-> - [ ] `src/serve/mod.rs` (304) — barely over; leave or trim
->
-> Each split must:
-> - preserve the public API path via `pub use` re-export from `mod.rs`
-> - not break existing tests
-> - keep `cargo doc` warning-free
-
-**Labels:** `tech-debt`, `refactor`, `v0.2`
+All 11 originally-flagged files now have production code under 300 lines. Splits landed across four commits in the pre-publish review:
+
+- Wave 1: `scenario/soak.rs` (358→196 via `soak/leak_detect.rs`), `report/html.rs` (336→212 via `html/{css,chart}.rs`), `scenario/spike.rs` (331→217 via `spike/phase.rs`).
+- Wave 2: `cmd_compare.rs` (503→100 via `cmd_compare/{types,diff,render}.rs`), `scenario/fuzzer.rs` (488→277 via `fuzzer/{payloads,classify}.rs`), `serve/tools.rs` (509→130 via `serve/tools/{deadlock_probe,sustained_load,compare_runs}.rs`).
+- Wave 3: `run.rs` (447→293 via `run/thresholds.rs`), `metrics/mod.rs` (426→<300 via `metrics/{types,per_tool}.rs` + module-doc trim), `config.rs` (376→217 via `config/{validate,example}.rs`), `main.rs` (434→208 via `cmd_run.rs` + `cmd_deadlock.rs` + `emit.rs`).
+- Wave 4 (trim): `metrics/mod.rs`, `scenario/soak.rs`, `serve/mod.rs` — module-doc trim brought them under the 300 cap.
+- `sse.rs` (311→263 via `sse/reader.rs`), `cmd_cross.rs` (322→189 via `cmd_cross/render.rs`).
+
+Public API paths preserved via `pub use` re-exports throughout. 264 tests pass, `cargo fmt --check` + `cargo clippy -D warnings` clean.
+
+**Labels:** ~~`tech-debt`~~ — done.
 
 ---
 
 
@@ -15,7 +15,6 @@ use std::time::Duration;
 
 use anyhow::{Context, Result, anyhow};
 use futures::StreamExt;
-use mcp_loadtest::analysis::grading::{GradingProfile, grade};
 use mcp_loadtest::config::{
     Config, OutputConfig, ScenarioConfig, ServerConfig, ThresholdsConfig, split_server_command,
 };
@@ -26,6 +25,9 @@ use mcp_loadtest::scenario::deadlock_probe::DeadlockProbe;
 use mcp_loadtest::scenario::sustained::Sustained;
 use serde_json::{Value, json};
 
+mod render;
+use render::render_markdown;
+
 /// Cap on how many servers we spawn in parallel from `cross`. Each spawn
 /// invokes `tokio::process::Command::spawn`; on Windows hitting the
 /// JobObject limit when N is large causes spawns to fail. 8 chosen so a
@@ -185,140 +187,6 @@ async fn run_one(server: &str, args: &CrossArgs, args_value: &Value) -> Result<R
     Ok(report)
 }
 
-// ---- rendering ----------------------------------------------------------
-
-/// Render the cross-comparison as a Markdown report.
-fn render_markdown(rows: &[ServerRow], args: &CrossArgs) -> String {
-    let scenario_label = match args.scenario {
-        CrossScenario::Sustained => "sustained",
-        CrossScenario::DeadlockProbe => "deadlock_probe",
-    };
-    let mut out = String::new();
-    out.push_str("# Cross-server comparison\n\n");
-    out.push_str(&format!(
-        "- Scenario: `{}`\n- Tool: `{}`\n- Duration: {}s per server\n- Servers: {}\n\n",
-        scenario_label,
-        args.tool,
-        args.duration.as_secs_f64(),
-        rows.len(),
-    ));
-
-    // Servers list — separate from the metrics table so failed servers still
-    // show up clearly above and below.
-    out.push_str("## Servers\n\n");
-    for (idx, row) in rows.iter().enumerate() {
-        let label = column_label(idx);
-        let status = match &row.result {
-            Ok(_) => "ok",
-            Err(_) => "FAILED",
-        };
-        out.push_str(&format!("- **{label}**: `{}` — {status}\n", row.command));
-    }
-    out.push('\n');
-
-    // Metrics table — one column per server, one row per metric.
-    out.push_str("## Metrics\n\n");
-
-    // Header row.
-    out.push_str("| Metric |");
-    for (idx, _) in rows.iter().enumerate() {
-        out.push_str(&format!(" {} |", column_label(idx)));
-    }
-    out.push('\n');
-    out.push_str("|---|");
-    for _ in rows {
-        out.push_str("---:|");
-    }
-    out.push('\n');
-
-    let profile = GradingProfile::default_general();
-
-    // Per-row formatters keep the table easy to scan; each function plucks
-    // a different field out of a Report (or returns "n/a" on a failed run).
-    push_metric_row(&mut out, "p50 latency", rows, |r| {
-        format_duration_ms(r.metrics.latency.p50)
-    });
-    push_metric_row(&mut out, "p95 latency", rows, |r| {
-        format_duration_ms(r.metrics.latency.p95)
-    });
-    push_metric_row(&mut out, "p99 latency", rows, |r| {
-        format_duration_ms(r.metrics.latency.p99)
-    });
-    push_metric_row(&mut out, "max latency", rows, |r| {
-        format_duration_ms(r.metrics.latency.max)
-    });
-    push_metric_row(&mut out, "RPS", rows, |r| {
-        format!("{:.2}", r.metrics.throughput.requests_per_sec)
-    });
-    push_metric_row(&mut out, "error rate", rows, |r| {
-        let total = r.metrics.throughput.total_requests;
-        let success = r.metrics.throughput.successful_requests;
-        if total == 0 {
-            "0.00%".to_string()
-        } else {
-            let errors = total.saturating_sub(success);
-            format!("{:.2}%", errors as f64 / total as f64 * 100.0)
-        }
-    });
-    push_metric_row(&mut out, "deadlocks", rows, |r| {
-        r.scenario_outcome.deadlock_count.to_string()
-    });
-    push_metric_row(&mut out, "Grade", rows, |r| {
-        let g = grade(r, &profile);
-        g.overall.name().to_string()
-    });
-
-    // Errors section — list any per-server failures with their full message
-    // so the user can debug without spelunking through stderr.
-    let failures: Vec<&ServerRow> = rows.iter().filter(|r| r.result.is_err()).collect();
-    if !failures.is_empty() {
-        out.push_str("\n## Errors\n\n");
-        for row in failures {
-            if let Err(e) = &row.result {
-                out.push_str(&format!("- `{}`: {e:#}\n", row.command));
-            }
-        }
-    }
-
-    out
-}
-
-/// Push one row of the metrics table. `extract` runs only on successful
-/// reports; failed servers get `"n/a"`.
-fn push_metric_row<F>(out: &mut String, label: &str, rows: &[ServerRow], extract: F)
-where
-    F: Fn(&Report) -> String,
-{
-    out.push_str(&format!("| {label} |"));
-    for row in rows {
-        let cell = match &row.result {
-            Ok(report) => extract(report),
-            Err(_) => "n/a".to_string(),
-        };
-        out.push_str(&format!(" {cell} |"));
-    }
-    out.push('\n');
-}
-
-/// Letter label for a column: `A`, `B`, ..., then `S1`, `S2`, ... once we
-/// run out of single letters. Cross-comparing more than 26 servers in one
-/// table is unlikely but the fallback keeps headers unambiguous.
-fn column_label(idx: usize) -> String {
-    if idx < 26 {
-        let c = (b'A' + idx as u8) as char;
-        c.to_string()
-    } else {
-        format!("S{}", idx + 1)
-    }
-}
-
-/// Format a `Duration` as millisecond-precision text. Mirrors the helper in
-/// `run.rs` — small enough to inline here rather than expose pub from the lib.
-fn format_duration_ms(d: Duration) -> String {
-    let total_ms = d.as_secs_f64() * 1000.0;
-    format!("{total_ms:.2}ms")
-}
-
 #[cfg(test)]
 mod tests {
     use super::*;
@@ -363,18 +231,4 @@ mod tests {
     fn split_server_command_empty_errors() {
         assert!(split_server_command("").is_err());
     }
-
-    #[test]
-    fn column_label_first_letters() {
-        assert_eq!(column_label(0), "A");
-        assert_eq!(column_label(1), "B");
-        assert_eq!(column_label(25), "Z");
-        assert_eq!(column_label(26), "S27");
-    }
-
-    #[test]
-    fn format_duration_ms_two_decimals() {
-        assert_eq!(format_duration_ms(Duration::from_millis(1)), "1.00ms");
-        assert_eq!(format_duration_ms(Duration::from_micros(1234)), "1.23ms");
-    }
 }
@@ -0,0 +1,162 @@
+//! Markdown rendering for the `cross` subcommand.
+//!
+//! Split out of `cmd_cross.rs` to keep that file under the 300-line
+//! production-code convention. Pure formatting — no I/O, no async.
+
+use std::time::Duration;
+
+use mcp_loadtest::analysis::grading::{GradingProfile, grade};
+use mcp_loadtest::report::Report;
+
+use super::{CrossArgs, CrossScenario, ServerRow};
+
+/// Render the cross-comparison as a Markdown report.
+pub(super) fn render_markdown(rows: &[ServerRow], args: &CrossArgs) -> String {
+    let scenario_label = match args.scenario {
+        CrossScenario::Sustained => "sustained",
+        CrossScenario::DeadlockProbe => "deadlock_probe",
+    };
+    let mut out = String::new();
+    out.push_str("# Cross-server comparison\n\n");
+    out.push_str(&format!(
+        "- Scenario: `{}`\n- Tool: `{}`\n- Duration: {}s per server\n- Servers: {}\n\n",
+        scenario_label,
+        args.tool,
+        args.duration.as_secs_f64(),
+        rows.len(),
+    ));
+
+    // Servers list — separate from the metrics table so failed servers still
+    // show up clearly above and below.
+    out.push_str("## Servers\n\n");
+    for (idx, row) in rows.iter().enumerate() {
+        let label = column_label(idx);
+        let status = match &row.result {
+            Ok(_) => "ok",
+            Err(_) => "FAILED",
+        };
+        out.push_str(&format!("- **{label}**: `{}` — {status}\n", row.command));
+    }
+    out.push('\n');
+
+    // Metrics table — one column per server, one row per metric.
+    out.push_str("## Metrics\n\n");
+
+    // Header row.
+    out.push_str("| Metric |");
+    for (idx, _) in rows.iter().enumerate() {
+        out.push_str(&format!(" {} |", column_label(idx)));
+    }
+    out.push('\n');
+    out.push_str("|---|");
+    for _ in rows {
+        out.push_str("---:|");
+    }
+    out.push('\n');
+
+    let profile = GradingProfile::default_general();
+
+    // Per-row formatters keep the table easy to scan; each function plucks
+    // a different field out of a Report (or returns "n/a" on a failed run).
+    push_metric_row(&mut out, "p50 latency", rows, |r| {
+        format_duration_ms(r.metrics.latency.p50)
+    });
+    push_metric_row(&mut out, "p95 latency", rows, |r| {
+        format_duration_ms(r.metrics.latency.p95)
+    });
+    push_metric_row(&mut out, "p99 latency", rows, |r| {
+        format_duration_ms(r.metrics.latency.p99)
+    });
+    push_metric_row(&mut out, "max latency", rows, |r| {
+        format_duration_ms(r.metrics.latency.max)
+    });
+    push_metric_row(&mut out, "RPS", rows, |r| {
+        format!("{:.2}", r.metrics.throughput.requests_per_sec)
+    });
+    push_metric_row(&mut out, "error rate", rows, |r| {
+        let total = r.metrics.throughput.total_requests;
+        let success = r.metrics.throughput.successful_requests;
+        if total == 0 {
+            "0.00%".to_string()
+        } else {
+            let errors = total.saturating_sub(success);
+            format!("{:.2}%", errors as f64 / total as f64 * 100.0)
+        }
+    });
+    push_metric_row(&mut out, "deadlocks", rows, |r| {
+        r.scenario_outcome.deadlock_count.to_string()
+    });
+    push_metric_row(&mut out, "Grade", rows, |r| {
+        let g = grade(r, &profile);
+        g.overall.name().to_string()
+    });
+
+    // Errors section — list any per-server failures with their full message
+    // so the user can debug without spelunking through stderr.
+    let failures: Vec<&ServerRow> = rows.iter().filter(|r| r.result.is_err()).collect();
+    if !failures.is_empty() {
+        out.push_str("\n## Errors\n\n");
+        for row in failures {
+            if let Err(e) = &row.result {
+                out.push_str(&format!("- `{}`: {e:#}\n", row.command));
+            }
+        }
+    }
+
+    out
+}
+
+/// Push one row of the metrics table. `extract` runs only on successful
+/// reports; failed servers get `"n/a"`.
+fn push_metric_row<F>(out: &mut String, label: &str, rows: &[ServerRow], extract: F)
+where
+    F: Fn(&Report) -> String,
+{
+    out.push_str(&format!("| {label} |"));
+    for row in rows {
+        let cell = match &row.result {
+            Ok(report) => extract(report),
+            Err(_) => "n/a".to_string(),
+        };
+        out.push_str(&format!(" {cell} |"));
+    }
+    out.push('\n');
+}
+
+/// Letter label for a column: `A`, `B`, ..., then `S1`, `S2`, ... once we
+/// run out of single letters. Cross-comparing more than 26 servers in one
+/// table is unlikely but the fallback keeps headers unambiguous.
+pub(super) fn column_label(idx: usize) -> String {
+    if idx < 26 {
+        let c = (b'A' + idx as u8) as char;
+        c.to_string()
+    } else {
+        format!("S{}", idx + 1)
+    }
+}
+
+/// Format a `Duration` as millisecond-precision text. Mirrors the helper in
+/// `run.rs` — small enough to inline here rather than expose pub from the lib.
+fn format_duration_ms(d: Duration) -> String {
+    let total_ms = d.as_secs_f64() * 1000.0;
+    format!("{total_ms:.2}ms")
+}
+
+#[cfg(test)]
+mod tests {
+    use super::*;
+
+    #[test]
+    fn column_label_first_letters() {
+        assert_eq!(column_label(0), "A");
+        assert_eq!(column_label(1), "B");
+        assert_eq!(column_label(25), "Z");
+        assert_eq!(column_label(26), "S27");
+    }
+
+    #[test]
+    fn format_duration_ms_two_decimals() {
+        assert_eq!(format_duration_ms(Duration::from_millis(1)), "1.00ms");
+        assert_eq!(format_duration_ms(Duration::from_micros(1234)), "1.23ms");
+    }
+}