Status: early-stage MVP lab for Tension Universe.
This directory hosts small, reproducible experiments that live strictly at the effective layer.
No experiment here should be cited as solving any BlackHole question or S-problem.
This is the central entry point for all Tension Universe / WFGY 3.0 experiments.
Each experiment lives in its own subfolder with:
- code or Colab-style script that you can run yourself, and
- a dedicated `README.md` inside that folder explaining what the experiment is about,
  which tension axis it probes, and how to interpret the numbers.
If you do not want to spend tokens, you can simply read the text in each experiment’s README.
If you want to reproduce results, open the corresponding script or notebook and follow its instructions.
This directory is:
- A lab shelf for many small, focused MVP experiments.
- A place where each experiment:
  - is small enough to understand in one sitting,
  - is reproducible by anyone with an API key or compatible runtime,
  - makes its assumptions explicit at the effective layer.
This directory is not:
- A proof library.
- A claim that any BlackHole question has been resolved.
- A replacement for the formal charters or S-problem statements.
Think of it as an evolving gallery of tension probes and anti-explosion modules,
each one tied to a specific slice of the Tension Universe.
Typical usage pattern:
- Browse the index below and choose an experiment folder that looks relevant.
- Open that folder’s `README.md` to understand:
  - which question or failure mode it targets,
  - what is being measured,
  - what counts as a good or bad pattern.
- If you want to reproduce the numbers:
  - open the script or notebook referenced in that folder,
  - provide an OpenAI-compatible API key when asked (if required),
  - run the code and compare your run with the example described in the README.
You do not have to execute anything to understand the ideas.
Running the code is only needed if you want to verify the behavior on your own account,
or try different models and settings.
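For readers who do want to run an experiment with their own key, the sketch below shows one common way a script might pick up credentials and settings without hard-coding them. It is an illustration only, not how any particular experiment here is implemented: the helper name `resolve_api_settings`, the `TU_MODEL` variable, and the placeholder model name are assumptions. Each experiment's own README remains the authority on what it actually expects.

```python
import os


def resolve_api_settings(env=None):
    """Return (api_key, base_url, model) read from environment variables.

    Assumptions for illustration: the variable names and defaults below
    are hypothetical conventions, not something the experiments require.
    api_key is None when no key is set, which a script could treat as
    "skip the API-backed part" if the experiment supports an offline mode.
    """
    env = os.environ if env is None else env
    api_key = env.get("OPENAI_API_KEY") or None
    # Any OpenAI-compatible endpoint could be substituted here.
    base_url = env.get("OPENAI_BASE_URL", "https://api.openai.com/v1")
    # Placeholder model name; replace with a model your account can access.
    model = env.get("TU_MODEL", "your-model-name")
    return api_key, base_url, model


if __name__ == "__main__":
    key, url, model = resolve_api_settings()
    mode = "API mode" if key else "no key found, offline mode only"
    print(f"endpoint={url} model={model} ({mode})")
```

Keeping the key in the environment rather than in the notebook makes it easier to share a run without leaking credentials, and switching `TU_MODEL` is one simple way to try different models against the same script.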
This experiment hub is still growing.
If you want to help expand the public TU experiment layer, the preferred contribution path is simple:
- choose one valid TU problem from the 131-problem field
- build one small MVP experiment page under `TensionUniverse/Experiments/`
- keep the scope narrow, clear, and auditable
- submit a focused pull request
The ideal contribution is usually:
- one experiment folder
- one structured `README.md`
- optional small notebook, Colab, or screenshot support when it directly helps the MVP
If you want the expected structure, current open items, and contributor credit rules, start here:
This page is the experiment index only.
Each individual experiment should still explain itself inside its own directory.
This list will grow over time as more MVPs are added.
| Folder | Status | Short note |
|---|---|---|
| `Q091_MVP/` | MVP ready | TU Q091 equilibrium climate sensitivity ranges. Synthetic ECS items and range-style tension gauge T_ECS_range that compares plausible climate narratives. |
| `Q098_MVP/` | MVP ready | TU Q098 Anthropocene toy trajectories. Three variable human–Earth model with physical tension gauge T_anthro and narrative consistency tension T_story around a simple safe operating region. |
| `Q101_MVP/` | MVP ready | TU Q101 equity premium tension. Tiny consumption based asset pricing worlds and scalar T_premium that shows when matching a 6% premium requires extreme risk aversion or unrealistic parameters. |
| `Q105_MVP/` | MVP ready | TU Q105 systemic crash warnings. Core–periphery network contagion toy model with early-warning tension gauge T_warning over simple indicator schemes such as global mean load and core stress. |
| `Q106_MVP/` | MVP ready | TU Q106 tiny two-layer infrastructure world. Compares robust vs fragile multiplex designs under random and targeted attacks with scalar robustness tension T_robust. Fully offline, one-cell Colab. |
| `Q108_MVP/` | MVP ready | TU Q108 toy political polarization. Bounded-confidence opinion dynamics on small graphs with scalar polarization tension T_polar over cluster separation and weight in extremes. Fully offline, one-cell Colab. |
| `Q121_MVP/` | MVP ready | TU Q121 alignment slice. Literal helper and aligned helper personas on the same base model with alignment tension observable T_align over SAFE and UNSAFE scenarios. |
| `Q124_MVP/` | MVP ready | TU Q124 scalable oversight ladders. Baseline and guided evaluators on synthetic oversight cases with oversight tension observable T_oversight. One-cell Colab with optional API key. |
| `Q127_MVP/` | MVP ready | TU Q127 synthetic worlds and entropy. Three tiny Gaussian worlds, cross-world evaluations, and world-detection style tension gauge T_entropy(train → test) with simple heatmaps. |
| `Q130_MVP/` | MVP ready | TU Q130 early out-of-distribution and social-pressure probes. Hollywood vs physics reasoning and social-pressure style experiments with tension metrics explained inside `Q130_MVP/README.md`. |
Each folder owns its own story.
The high-level rule is simple:
- This page only indexes experiments.
- Each experiment explains itself inside its own directory.
All experiments in this lab obey the following principles:
- Effective-layer only
  - They operate on observable behavior and declared encodings.
  - They do not introduce hidden assumptions about deep mechanisms.
- Reproducible in minutes
  - Each experiment is small enough to be rerun from a fresh environment.
  - External requirements are kept minimal and written down in the experiment’s own README.
- Limited claims
  - An experiment may show that a specific tension gauge or persona setting works well
    on a narrow slice of a larger problem.
  - It does not claim that the full S-problem is solved, nor that the behavior will generalise
    to arbitrary traffic or models beyond the ones actually tested.
If you find a mismatch between an experiment’s README and what the code actually does,
please open an issue or pull request in the main WFGY repository so the lab remains audit-friendly.
This hub is expected to accumulate, for example:
- additional out-of-distribution reasoning experiments tied to other BlackHole questions,
- social pressure and persona stability tests for different domains,
- retrieval and vector-store tension probes,
- governance and arbitration demos that combine several tension axes.
New experiments will be added as new MVPs are ready.
When a new experiment appears, it will get its own folder, its own README, and an entry in the table above.
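The rule that every experiment has its own folder, its own README, and an index entry is easy to check mechanically. The following is a minimal sketch of such a check, assuming experiment folders end in `_MVP` (as the current ones do) and that the index lives in a `README.md` at the root of this directory. The function name `check_index` and both of those conventions are assumptions made for illustration, not an existing tool in the repository.

```python
from pathlib import Path


def check_index(root, index_name="README.md", suffix="_MVP"):
    """Return a list of problems found under the experiments directory.

    For every subfolder whose name ends with `suffix`, verify that:
      1. it contains its own README.md, and
      2. its name (with trailing slash) appears in the index file.
    An empty list means no problems were found.
    """
    root = Path(root)
    index_path = root / index_name
    index_text = index_path.read_text(encoding="utf-8") if index_path.is_file() else ""

    problems = []
    folders = sorted(p for p in root.iterdir() if p.is_dir() and p.name.endswith(suffix))
    for folder in folders:
        if not (folder / "README.md").is_file():
            problems.append(f"{folder.name}: missing README.md")
        if f"{folder.name}/" not in index_text:
            problems.append(f"{folder.name}: not listed in {index_name}")
    return problems


if __name__ == "__main__":
    # Run from inside the experiments directory.
    for line in check_index("."):
        print(line)
```

A check like this only confirms that the structure is in place. It cannot tell whether a README accurately describes what the code does, so that part still needs a human read.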
If you want to help move this lab forward, you are welcome to build one focused MVP experiment page and open a PR.
The preferred contribution path, open items, and contributor credit rules are documented here: