Commit dc532db

📝 Add concise dashboard guidance, define actionable ML eval metrics, and note lossy compression in group opinion aggregation
1 parent cba6546 commit dc532db

3 files changed: +8 -0 lines changed

Dashboards.md

Lines changed: 1 addition & 0 deletions
@@ -23,6 +23,7 @@
 - Metadata (owner, related OKRs, TTL, …).
 - Make them so it's easy to go one layer down (X went down in Y location, or for Z new users, etc).
 - Recreate dashboard from first principles periodically.
+- Concise and to the point dashboards! Stuffing a dashboard with a bunch of random metrics is a guaranteed way to waste everyone's time.
 - When plotting a rate, add the top of funnel and bottom of funnel numbers to make sure things are as expected.
 - A large change is not necessarily worth investigating, and a small change is not necessarily benign. What you want to know is if the change is exceptional.
 - Be clear with your stakeholder about whether this is a one-off vs. something that should be referenced more than once.
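
The Dashboards.md point that what matters is whether a change is exceptional, not whether it is large, can be made concrete by comparing the latest value against the metric's own history. The following is a minimal sketch, assuming a robust z-score (median and median absolute deviation). The function name, the 3.5 threshold, and the sample series are illustrative assumptions, not part of the commit.

```python
from statistics import median


def is_exceptional(history, latest, threshold=3.5):
    """Return True if `latest` is unusual relative to `history`.

    history: past values of the metric (for example, daily deltas).
    threshold: hypothetical cut-off on the robust z-score; tune per metric.
    """
    center = median(history)
    # Median absolute deviation: a spread estimate that outliers barely move.
    mad = median(abs(x - center) for x in history)
    if mad == 0:
        # History has no spread, so any deviation at all is unusual.
        return latest != center
    # 0.6745 rescales MAD so the score is comparable to a standard z-score.
    robust_z = 0.6745 * (latest - center) / mad
    return abs(robust_z) > threshold


# A noisy metric: a swing of -14 is large in absolute terms but routine here.
noisy_metric = [-12.0, 9.0, -15.0, 11.0, 14.0, -10.0, 13.0]
# A stable metric: a move of 1.0 is small in absolute terms but far outside its usual range.
stable_metric = [0.1, -0.1, 0.2, 0.0, -0.2, 0.1, -0.1]

print(is_exceptional(noisy_metric, -14.0))
print(is_exceptional(stable_metric, 1.0))
```

With these made-up series, the large swing on the noisy metric is not flagged while the small move on the stable metric is, which is the distinction the note draws.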

Machine Learning.md

Lines changed: 6 additions & 0 deletions
@@ -26,6 +26,12 @@ These points are expanded with more details in courses like [Made With ML](https
 - Collecting good evals will make you understand the problem better.
 - Working with probabilistic systems requires new kinds of measurement and deeper consideration of trade-offs.
 - Don't work if you cannot define what "great" means for your use case.
+- Good eval metrics:
+  - Measure an error you've observed.
+  - Relate to a non-trivial issue you will iterate on.
+  - Are scoped to a specific failure.
+  - Have a binary outcome (not a 1–5 score).
+  - Are verifiable (e.g. human labels for LLM-as-a-Judge).

 ### The [Eval Loop](https://openai.com/index/evals-drive-next-chapter-of-ai/)
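
The Machine Learning.md criteria that an eval metric should have a binary outcome and be verifiable against human labels could be checked along these lines. This is a minimal sketch, assuming one failure mode, boolean verdicts, and a small hand-labeled sample. The function name and the label lists are hypothetical and do not come from the commit.

```python
def judge_agreement(human, judge):
    """Compare binary judge verdicts with human labels for one failure mode.

    human, judge: equal-length lists of booleans, True meaning "failure present".
    Returns (true_positive_rate, true_negative_rate) of the judge.
    """
    if len(human) != len(judge):
        raise ValueError("human and judge must have the same length")
    # Cases where both the human and the judge saw the failure.
    tp = sum(h and j for h, j in zip(human, judge))
    # Cases where both agreed the failure was absent.
    tn = sum((not h) and (not j) for h, j in zip(human, judge))
    positives = sum(human)
    negatives = len(human) - positives
    tpr = tp / positives if positives else float("nan")
    tnr = tn / negatives if negatives else float("nan")
    return tpr, tnr


# Hypothetical labels for ten traces.
human_labels = [True, True, True, True, False, False, False, False, False, False]
judge_labels = [True, True, True, False, False, False, False, False, True, False]

tpr, tnr = judge_agreement(human_labels, judge_labels)
print(f"TPR={tpr:.2f} TNR={tnr:.2f}")  # prints "TPR=0.75 TNR=0.83"
```

Reporting the two rates separately, rather than a single accuracy figure, keeps a judge that rarely flags anything from looking good on a dataset where failures are rare.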

Politics.md

Lines changed: 1 addition & 0 deletions
@@ -76,6 +76,7 @@
 - Architect coordination as a fractal "pace-layering" of ever more specific algorithms, each handling half the remaining complexity with the simplest tool.
 - [A key technical component making democracy work is the secret ballot. No one knows who you voted for, and furthermore, you do not have the ability to prove to anyone else who you voted for, even if you really want to](https://vitalik.eth.limo/general/2025/04/14/privacy.html).
 - Voting is a preference aggregation method. Preferences must be aggregated across multiple individuals to determine a collective decision or ranking. This process is central to social choice theory, which provides a mathematical foundation for preference aggregation.
+- Every time you derive an opinion from a group, you are doing a lossy compression of each individual opinion. How you do it (preference and meta-preference aggregation) is in itself an opinion / choice.

 ## Interesting Ideas
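
The Politics.md claim that the aggregation method is itself a choice can be illustrated by running two standard rules on the same ballots. This is a minimal sketch, assuming complete ranked ballots with no ties. The ballot profile is invented for illustration, and plurality and Borda count are only two of many possible rules.

```python
from collections import Counter


def plurality_winner(ballots):
    """Pick the candidate ranked first on the most ballots (ignores lower ranks)."""
    first_choices = Counter(ballot[0] for ballot in ballots)
    return first_choices.most_common(1)[0][0]


def borda_winner(ballots):
    """Pick the candidate with the highest Borda score.

    On a ballot with n candidates, first place earns n - 1 points,
    second place n - 2, and so on down to 0 for last place.
    """
    scores = Counter()
    for ballot in ballots:
        n = len(ballot)
        for position, candidate in enumerate(ballot):
            scores[candidate] += n - 1 - position
    return scores.most_common(1)[0][0]


# Hypothetical profile of nine voters, each ballot ordered from most to least preferred.
ballots = (
    [("A", "B", "C")] * 4
    + [("B", "C", "A")] * 3
    + [("C", "B", "A")] * 2
)

print(plurality_winner(ballots))  # prints "A" (4 first-place votes)
print(borda_winner(ballots))      # prints "B" (Borda scores: A=8, B=12, C=7)
```

The individual opinions are identical in both runs. Only the compression step differs, and it changes the group's "opinion" from A to B.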
