Skip to content

Commit 68fb5bd

Browse files
Update README.md
1 parent 2336df4 commit 68fb5bd

File tree

1 file changed

+32
-40
lines changed

1 file changed

+32
-40
lines changed

README.md

Lines changed: 32 additions & 40 deletions
Original file line numberDiff line numberDiff line change
@@ -1,84 +1,76 @@
1-
Penguin Distortion Tester (v560L • CKMM • Disclose)
1+
# Penguin Distortion Tester (v560L • CKMM • Disclose)
22

3-
[![CI](https://github.com/stuck-inadream/penguin-tester/actions/workflows/ci.yml/badge.svg)](https://github.com/stuck-inadream/penguin-tester/actions)
4-
[![HF Space](https://img.shields.io/badge/%F0%9F%A4%97%20Spaces-Penguin%20Distortion%20Tester-blue)](https://huggingface.co/spaces/stuck-inadream/penguin-distortion-tester)
3+
[![CI](https://github.com/stuck-inadream/penguin-tester/actions/workflows/ci.yml/badge.svg)](https://github.com/stuck-inadream/penguin-tester/actions)
4+
[![HF Space](https://img.shields.io/badge/%F0%9F%A4%97%20Spaces-Penguin%20Distortion%20Tester-blue)](https://huggingface.co/spaces/stuck-inadream/penguin-distortion-tester)
55
[![License: MIT](https://img.shields.io/badge/License-MIT-yellow.svg)](LICENSE)
66

7+
**What this is.** A minimal, public-safe harness to *audit symbolic distortion* on a toy “Penguin” task, show SAT/UNSAT behavior (via **Z3**), and report **CKMM-L** context metrics:
78

8-
Overview
9-
The Penguin Distortion Tester is a minimal, public-safe harness to audit symbolic distortion on a toy “Penguin” task.
10-
It demonstrates sat/unsat behavior (via Z3) and reports CKMM-L context metrics across five layers:
11-
Fuel (Energy–Resource): wall time, peak KiB (tracemalloc), and human-attention ticks
9+
- **Fuel (Energy–Resource):** wall time + peak KiB (tracemalloc) + human-attention ticks
10+
- **Temporal Coherence:** replay stability ratio
11+
- **Relational Field:** council agreement vs. drift (majority vote placeholder)
12+
- **Ethical Constraints:** regex penalties + recovery flag
13+
- **Embodiment:** device/substrate capture with validated hint
1214

15+
This repo is a **disclose build**: it lists exactly what a prior reviewer saw — **no more, no less** — and keeps proprietary weighting/heuristics private.
1316

14-
Temporal Coherence: replay stability ratio
15-
Relational Field: council agreement vs. drift (majority vote placeholder)
16-
Ethical Constraints: regex penalties + recovery flag
17-
Embodiment: device/substrate capture with validated hint
17+
---
1818

19-
This repo is a disclose build: it lists exactly what a prior reviewer saw — no more, no less — and keeps proprietary weighting/heuristics private.
19+
## Quickstart
2020

21-
Quickstart
21+
```bash
2222
python3.11 -m venv .venv
2323
source .venv/bin/activate
2424
python -m pip install --upgrade pip
2525
pip install -r requirements.txt
2626

27-
# Run: summarize UI, 100k iterations, choose a port
28-
env PYTHONUNBUFFERED=1 python gradio_demo.py --summary --ckmm-iterations 100000 --server_port 7860
27+
# Run: summary UI, 100k iterations, explicit port
28+
PYTHONUNBUFFERED=1 python gradio_demo.py --summary --ckmm-iterations 100000 --server_port 7860
29+
# Open http://127.0.0.1:7860 (use --share for a temporary public link)
2930

30-
Open http://127.0.0.1:7860 in your browser.
31-
Use --share for a temporary public link.
3231
⚠️ Heads-up: --share exposes a public URL. Treat it as untrusted. Do not upload sensitive data.
3332

34-
Features (Public-Safe)
33+
Features (public-safe)
3534
“Penguin” symbolic check (baseline sat, forced unsat via contradiction trigger)
36-
Distortion score toy formula: modal × diversity (with toy numbers disclosed below)
35+
Distortion score toy formula: modal × diversity (toy numbers disclosed below)
3736
CKMM-L interfaces only (no proprietary internals)
3837
Configurable workload and thresholds:
39-
--ckmm-iterations N (CPU loop, default documented)
38+
--ckmm-iterations N (CPU loop; default documented below)
4039
--ckmm-thresholds '{"fuel":{"max_time_s":2.0}}' (JSON Schema-validated)
41-
--ckmm-verbose (pretty JSON output)
42-
43-
44-
Logging: rotating errors.log + console log level
40+
--ckmm-verbose (pretty JSON)
41+
Logging: rotating errors.log + console level via --log-level
4542
CI: pytest via GitHub Actions
4643

47-
48-
Public Disclosure Parity Appendix
49-
Score formula: modal × diversity; diversity = avg edit distance; toy values 0.33 × 2 = 0.66
50-
Load fractions: 1.0 / 0.6 / 0.7, divergence = 0.3
44+
Public Disclosure Parity (Seen-by-Reviewer Appendix)
45+
What’s explicitly disclosed here was previously visible to a reviewer:
46+
Score formula modal × diversity; diversity = avg edit distance; toy values 0.33 × 2 = 0.66
47+
Load fractions 1.0 / 0.6 / 0.7, divergence = 0.3
5148
HSIL policy names: prefer_specific_over_general, safety_first, audit_bias
5249
Z3 baseline sat; forced contradiction → unsat; heatmap.png modal plot
5350
Gradio flags (server name/port), log level, rotating logs
5451
CI + tests, CKMM-L interfaces (fuel / temporal / council / ethics / embodiment)
55-
5652
Not disclosed: proprietary weighting/aggregation schemes, true diversity/council math beyond placeholders, production energy models, or internal heuristics.
5753

5854
FAQ
5955
Is this just a toy?
60-
Yes — by design. It’s a teaching harness: easy to run, safe to share, useful for audits.
56+
Yes — by design. It’s a teaching harness that’s easy to run, safe to share, and useful for audits.
6157
Can I tune workloads?
62-
Yes — use --ckmm-iterations (e.g., 5_000_000 for measurable fuel) and JSON thresholds.
58+
Yes — use --ckmm-iterations (e.g., 5_000_000 for measurable fuel) and thresholds via --ckmm-thresholds JSON.
6359
Does --share upload my data?
64-
No uploads unless you explicitly make them. --share only exposes a temporary Gradio URL. Treat it as public.
60+
No uploads unless you explicitly make them; --share only exposes a temporary Gradio URL. Treat it as public.
6561

6662
Development & Tests
6763
pip install -r requirements.txt
6864
pytest -q
6965

70-
pip install -r requirements.txt
71-
pytest -q
72-
73-
MIT © stuck-inadream
74-
7566
Roadmap & Contributions
7667
Add screenshot(s) of the UI (docs/screenshot.png)
77-
Pin dependencies in requirements.txt for CI stability
78-
CI improvements: multi-stage (lint then test) with pip cache
68+
Pin dependencies in requirements.txt for CI stability (already pinned)
69+
CI improvements: multi-stage (lint then test) with pip cache (already caching pip)
7970
Optional: package-ready entry point (pyproject.toml) for pipx run penguin-tester
71+
Contributions welcome! See CONTRIBUTING.md.
8072

81-
Contributions welcome! See CONTRIBUTING.md (to be added).
73+
MIT © stuck-inadream
8274

8375
Screenshot
8476
<img width="1234" height="750" alt="Screenshot 2025-09-21 at 9 28 02 AM" src="https://github.com/user-attachments/assets/3d0d0b21-539d-4f63-90b1-67b719130949" />

0 commit comments

Comments
 (0)