Skip to content

Latest commit

 

History

History
141 lines (93 loc) · 5.74 KB

File metadata and controls

141 lines (93 loc) · 5.74 KB

Related work: efficiency + agentic coding

This is a curated reading list for the problems Happy Paths targets:

  • repeated wrong-turns in agentic coding loops
  • cross-session / cross-developer reuse (memory)
  • correctness vs cost/time ("thinking budget") tradeoffs
  • realistic evaluation of software engineering agents
  • developer productivity measurement

Happy Paths is not trying to be a research project, but we do want our claims (and our measurement choices) to be legible to people who have followed this literature.

Software engineering agents + benchmarks

More agentic coding benchmarks

Context engineering + memory for agents

Inference-time compute / “thinking budget” efficiency

These are relevant because “agentic coding” often fails not for lack of capability, but because the loop burns too much time/tokens getting to the right state.

Developer productivity studies (humans + assistants)

Human factors: comprehension, review, and mental models

A recurring theme in community discussion is that assistants shift the bottleneck from writing to reviewing / understanding (especially for large diffs). We want our metrics to reflect that reality.

Community discussion (Hacker News + Reddit)

We also track practitioner discussion to understand what actually bottlenecks engineering teams when they adopt these tools.

Hacker News threads

Reddit threads


If you think a paper/thread belongs here (especially anything that quantifies cost/time or failure modes), please open a PR adding it.