WW-shan
diff --git a/‎reports/maker-simulation-tradetape-2026-05-13.md‎
Lines changed: 3 additions & 1 deletion b/‎reports/maker-simulation-tradetape-2026-05-13.md‎
Lines changed: 3 additions & 1 deletion
diff --git a/‎reports/research-summary-2026-05-13.md‎
Lines changed: 14 additions & 10 deletions b/‎reports/research-summary-2026-05-13.md‎
Lines changed: 14 additions & 10 deletions
diff --git a/‎scripts/research_simulation_utils.py‎
Lines changed: 168 additions & 0 deletions b/‎scripts/research_simulation_utils.py‎
Lines changed: 168 additions & 0 deletions
diff --git a/‎scripts/simulate_maker_basket.py‎
Lines changed: 12 additions & 5 deletions b/‎scripts/simulate_maker_basket.py‎
Lines changed: 12 additions & 5 deletions
@@ -1,5 +1,7 @@
 # Maker Simulation v2 — Trade Tape (2026-05-13T03:50:03.951692+00:00)
 
+> **Post-review correction (2026-05-13)**: this report was generated before the simulator capped PnL by the thinnest at-or-below-target leg trade size and before maker quotes were forced to stay strictly below bestAsk. Treat the dollar figures below as a stale upper bound. Re-run `scripts/simulate_maker_basket_v2.py` with the corrected code before making any trade/no-trade decision.
+
 **Method**: real Polymarket trade tape. For each (group, day, markup), check if any SELL Yes trade at price <= target occurred on each leg that day. If ALL legs had a qualifying trade, basket fills.
 
 **Window**: 14 days (2026-04-29 -> 2026-05-13)
@@ -64,4 +66,4 @@ v1 mid-touch results from earlier today (see `maker-simulation-2026-05-13.md`):
 - v2 vs v1 mismatch: v2 < v1 means mid-touch over-counts (less real trade activity at target); v2 > v1 means mid-touch under-counts (trades happened that mid-snapshot didn't capture).
 
 ---
-*Snapshot: 2026-05-13T03:50:03.951692+00:00*
+*Snapshot: 2026-05-13T03:50:03.951692+00:00*
@@ -4,6 +4,8 @@
 **面向读者**：同学 WW
 **与昨日报告 (`research-summary-2026-05-12.md`) 的关系**：昨日是一次 06:13 UTC snapshot 的普查结果（"有 1 个 strict 候选 James Bond, edge +8.93%"）。今日把数据量扩到 14 天 × 15 分钟 granularity，并且对 strict 候选做了真实 orderbook 深度检查。**结论从「thesis 待验证」推到「thesis 在当前深度下商业死亡」**。
 
+> **Post-review correction（2026-05-13）**：§3.9 的 maker v2 dollar 结论是旧 simulator 产物；旧公式按目标 basket size 计收益，没有按每条腿真实 at-or-below-target SELL-Yes 成交量封顶，也没有强制 maker quote 严格低于 bestAsk。代码已修正，下面涉及 `$918/yr`、`$200-500/yr`、`$2-5k/yr` 的数字只能视为 stale upper bound，必须重跑 `scripts/simulate_maker_basket_v2.py` 后再决策。
+
 ---
 
 ## TL;DR —— 30 秒结论（vs 昨日）
@@ -20,10 +22,10 @@
 1. mid-price 给出的"持续 edge"和 bestAsk 实际可成交 edge 是两个东西（§3.4）
 2. TAKER 一次性吃光 bestAsk 在 2 个测试组（James Bond + SC Gov）下死亡（§2 + §3.7）
 3. 我据此说"thesis 死了" —— 用户当场质疑（§3.9）
-4. 补做 MAKER 模拟：v1 mid-touch 给 $15k/yr 假象，v2 trade tape 给 $918/yr 真值
-5. 修正后预期：**$200-500/yr @ $100 basket，或 $2-5k/yr @ $1000 basket**（capital 占用 $144k）
+4. 补做 MAKER 模拟：v1 mid-touch 给 $15k/yr 假象，v2 trade tape 旧公式给 $918/yr 上界
+5. Post-review 修正：maker v2 需要按真实成交量封顶后重跑；**旧 `$200-500/yr @ $100 basket` 不再作为最终结论**
 
-**今天 Claude 的判决（修正后）**：**Taker 死，Maker 活在 hobby 规模。最严重的教训是 §3.9：我用 1 个角度的测试做了全局结论，错了。用户的"不信邪" 把这份报告从错误里拉了回来**。
+**Post-review 后的判决**：**Taker 基本死；Maker 不能再按旧数字下结论，必须用成交量封顶版本重跑。最严重的教训是 §3.9：我用 1 个角度的测试做了全局结论，错了；但旧 maker v2 又犯了 size 上限错误。**
 
 ---
 
@@ -241,14 +243,16 @@ scripts/verify_group_book.py --group-id 0xa8574c0caacc --basket-sizes "50,200,50
 
 **警告写在脚本里：mid-touch 不等于 trade-at-target。真实 fill 率会低很多**。
 
-#### v2 (trade tape) 修正
+#### v2 (trade tape) 旧公式结果（post-review 后需重跑）
 
 `scripts/simulate_maker_basket_v2.py`：从 `data-api.polymarket.com/trades` 拉了真实成交记录。48,030 raw trades → 1,602 个 SELL Yes 在窗口内（只有 **3.3%** 的成交是 SELL-Yes，即"会触发我们 maker bid 的那种"）。
 
-| Metric | v1 (mid-touch) | v2 (trade tape) |
+Post-review 发现旧公式仍把每次 fill 乘以目标 basket size，没有按最薄腿真实 at-or-below-target 成交量封顶；因此本节数字只能作为旧版上界。
+
+| Metric | v1 (mid-touch) | v2 (trade tape, pre-fix upper bound) |
 |---|---:|---:|
 | 总日 $ | $42.59 | **$2.51** |
-| 年化 | $15,546 | **$918** |
+| 年化 | $15,546 | **$918（旧上界）** |
 | 正期望组 | 49/72 | **17/72** |
 | 平均 fill rate | 23-69% | **5-6%** |
 
@@ -262,21 +266,21 @@ scripts/verify_group_book.py --group-id 0xa8574c0caacc --basket-sizes "50,200,50
 | D/R 相关动（联合 fill 比独立 fill 难） | ×0.7 |
 | Partial fill 风险（一腿成 一腿没成 → 持仓不对冲） | -10% |
 | Polygon gas / 多笔交易成本 | -20% |
-| **现实估计** | **$200-500/yr @ $100 basket** |
+| **旧现实估计** | **无效，需按成交量封顶后重跑** |
 
-如果放大到 $1000 basket：~$2-5k/yr，但资金占用 $144k（72 组 × 2 腿 × $1000）。
+旧版 "$1000 basket → ~$2-5k/yr" 线性外推同样无效，因为真实成交量通常远低于目标 basket size。
 
 #### 修正后的两层 verdict
 
 | 策略 | 现实预期 \$/yr | 备注 |
 |---|---:|---|
 | Taker basket arb | \$0-200 | 被深度杀死，verified |
 | Maker basket arb（mid-sim 错估） | \$15k 假象 | 方法错 |
-| **Maker basket arb（trade tape）** | **\$200-500 @ \$100 / \$2-5k @ \$1000** | 方法学上可辩护 |
+| **Maker basket arb（trade tape）** | **需重跑** | 代码已改成成交量封顶 + 非 crossing maker quote |
 
 #### 我学到的最严肃的教训
 
-我前两天说 "thesis 已死" 是**过度推论**。我只测了 1 个视角（TAKER 一次性吃光 bestAsk），用 2 个组的单次 snapshot 就下了"整条 thesis 死亡"的判决。**用户当面质疑后做的真测试（trade tape v2）证明 thesis 活着，只是商业规模上接近 hobby**。
+我前两天说 "thesis 已死" 是**过度推论**。我只测了 1 个视角（TAKER 一次性吃光 bestAsk），用 2 个组的单次 snapshot 就下了"整条 thesis 死亡"的判决。**用户当面质疑后补做的 trade tape v2 提示 maker 方向仍值得验证，但 post-review 后必须用成交量封顶版本重跑，不能再把旧收益数当结论**。
 
 更广义的教训：**"测一个角度 → 推全局"** 是科研里最廉价的错误之一。Robust 测试需要至少：
 - 多种策略视角（taker / maker / hold-to-resolution）
 
@@ -0,0 +1,168 @@
+"""Shared helpers for research-only simulation scripts."""
+from __future__ import annotations
+
+import statistics
+from typing import Any
+
+DEFAULT_TICK_SIZE = 0.001
+EPSILON = 1e-9
+
+
+def simulate_buy_cost(asks: list[tuple[float, float]], target_units: float) -> tuple[float, float, float]:
+    """Walk an ask ladder and return (units_filled, total_cost, avg_price_paid)."""
+    filled = 0.0
+    cost = 0.0
+    for price, size in asks:
+        if filled >= target_units:
+            break
+        take = min(float(size), target_units - filled)
+        if take <= 0:
+            continue
+        cost += take * float(price)
+        filled += take
+    avg_price = cost / filled if filled > 0 else 0.0
+    return filled, cost, avg_price
+
+
+def fee_rate_from_book(book: dict[str, Any]) -> float:
+    member = book.get("member") or {}
+    return float(book.get("fee_rate", member.get("fee_rate", 0.0)) or 0.0)
+
+
+def simulate_basket_fill(book_data: list[dict[str, Any]], requested_size: float) -> dict[str, Any]:
+    """Simulate a mutually-exclusive YES basket at the common executable size.
+
+    A basket only pays out for the minimum size completed across all legs. If
+    one leg has 2 units of ask depth and the requested basket is 10 units, the
+    executable basket is 2 units, not 10.
+    """
+    requested_size = float(requested_size)
+    requested_fills: list[float] = []
+    for book in book_data:
+        filled, _, _ = simulate_buy_cost(book.get("asks") or [], requested_size)
+        requested_fills.append(filled)
+
+    effective_size = min(requested_fills) if requested_fills else 0.0
+    effective_size = max(0.0, min(effective_size, requested_size))
+
+    total_cost = 0.0
+    total_fee = 0.0
+    per_member: list[dict[str, Any]] = []
+    if effective_size > EPSILON:
+        for book in book_data:
+            filled, cost, avg_px = simulate_buy_cost(book.get("asks") or [], effective_size)
+            fee_rate = fee_rate_from_book(book)
+            fee = fee_rate * avg_px * (1.0 - avg_px) * filled
+            total_cost += cost
+            total_fee += fee
+            member = book.get("member") or {}
+            per_member.append(
+                {
+                    "member": str(member.get("question") or "")[:40],
+                    "filled": filled,
+                    "avg_px": avg_px,
+                    "cost": cost,
+                    "fee": fee,
+                }
+            )
+
+    edge_dollars = effective_size - total_cost - total_fee
+    edge_pct = edge_dollars / effective_size if effective_size > EPSILON else 0.0
+    return {
+        "size": requested_size,
+        "requested_size": requested_size,
+        "effective_size": effective_size,
+        "max_fillable_units": effective_size,
+        "is_full_size_fillable": effective_size + EPSILON >= requested_size,
+        "total_cost": total_cost,
+        "total_fee": total_fee,
+        "edge_dollars": edge_dollars if effective_size > EPSILON else 0.0,
+        "edge_pct": edge_pct,
+        "per_member": per_member,
+    }
+
+
+def maker_target_price(
+    best_bid: float,
+    best_ask: float,
+    markup: float,
+    tick_size: float = DEFAULT_TICK_SIZE,
+) -> float | None:
+    """Return a non-crossing maker bid target or None if the spread is too tight."""
+    best_bid = float(best_bid)
+    best_ask = float(best_ask)
+    markup = float(markup)
+    tick_size = float(tick_size)
+    if best_ask <= 0 or best_bid < 0 or tick_size <= 0:
+        return None
+    lower = best_bid + tick_size
+    upper = best_ask - tick_size
+    if upper + EPSILON < lower:
+        return None
+    target = max(best_ask - markup, lower)
+    target = min(target, upper)
+    if target <= 0 or target + EPSILON >= best_ask:
+        return None
+    return round(target, 6)
+
+
+def zero_maker_stats(n_total_days: int, reason: str) -> dict[str, Any]:
+    return {
+        "targets": [],
+        "n_filled_days": 0,
+        "n_total_days": n_total_days,
+        "fill_rate": 0.0,
+        "avg_edge_given_fill": 0.0,
+        "median_edge_given_fill": 0.0,
+        "expected_daily_edge_dollars": 0.0,
+        "avg_min_leg_sell_size": 0.0,
+        "avg_effective_basket_size": 0.0,
+        "max_effective_basket_size": 0.0,
+        "n_positive_edge_days": 0,
+        "n_negative_edge_days": 0,
+        "skipped_reason": reason,
+    }
+
+
+def capped_expected_daily_edge(
+    filled_days: list[dict[str, Any]],
+    n_total_days: int,
+    basket_size: float,
+) -> dict[str, float]:
+    """Compute daily maker PnL capped by observed trade size on the thinnest leg."""
+    if n_total_days <= 0 or not filled_days:
+        return {
+            "expected_daily_edge_dollars": 0.0,
+            "avg_effective_basket_size": 0.0,
+            "max_effective_basket_size": 0.0,
+        }
+
+    basket_size = float(basket_size)
+    effective_sizes = [
+        min(basket_size, max(0.0, float(day.get("min_leg_sell_size") or 0.0)))
+        for day in filled_days
+    ]
+    pnl = [
+        float(day.get("edge") or 0.0) * size
+        for day, size in zip(filled_days, effective_sizes)
+    ]
+    return {
+        "expected_daily_edge_dollars": sum(pnl) / n_total_days,
+        "avg_effective_basket_size": statistics.mean(effective_sizes) if effective_sizes else 0.0,
+        "max_effective_basket_size": max(effective_sizes) if effective_sizes else 0.0,
+    }
+
+
+def qualifying_trade_size(trades: list[dict[str, Any]], target_price: float) -> float:
+    """Return total sell size that could have hit a resting bid at target_price."""
+    target_price = float(target_price)
+    total = 0.0
+    for trade in trades:
+        try:
+            price = float(trade.get("price") or 0.0)
+            size = float(trade.get("size") or 0.0)
+        except (TypeError, ValueError):
+            continue
+        if price <= target_price and size > 0:
+            total += size
+    return total
@@ -53,6 +53,8 @@
 from urllib.parse import urlencode
 from urllib.request import Request, urlopen
 
+from research_simulation_utils import maker_target_price, zero_maker_stats
+
 REPO_ROOT = Path(__file__).resolve().parent.parent
 GAMMA_MARKETS_URL = "https://gamma-api.polymarket.com/markets"
 PRICES_HISTORY_URL = "https://clob.polymarket.com/prices-history"
@@ -232,13 +234,17 @@ def main() -> int:
         # For each markup level, compute basket fill rate + avg basket edge
         markup_stats: dict[float, dict] = {}
         for markup in markups:
-            # Maker target price per leg = today's bestAsk - markup (clamped >= bestBid)
+            # Maker target price per leg must stay inside the spread and below bestAsk.
             targets: list[float] = []
             for m in members:
-                t = m["best_ask"] - markup
-                # Don't go below today's bestBid (would never realistically fill)
-                t = max(t, m["best_bid"] + 0.001)
+                t = maker_target_price(m["best_bid"], m["best_ask"], markup)
+                if t is None:
+                    targets = []
+                    break
                 targets.append(t)
+            if not targets:
+                markup_stats[markup] = zero_maker_stats(len(all_days), "no_non_crossing_maker_quote")
+                continue
 
             filled_days: list[dict] = []
             for d in all_days:
@@ -314,7 +320,8 @@ def best_markup_income(r: dict) -> float:
         f"# Maker-strategy Basket Simulation ({iso})",
         "",
         f"**Method**: for each (group, UTC-day, markup-level), check if every leg's "
-        f"mid-price touched (today's bestAsk - markup) at some point during the day. "
+        f"mid-price touched a non-crossing maker target derived from today's bestAsk - markup "
+        f"at some point during the day. "
         f"If ALL legs filled, compute basket cost at maker target prices + fee. "
         f"Aggregate fill_rate * avg_edge as proxy for expected daily $income.",
         "",