- Key entry points:
posthog/api/__init__.py(URL routing),posthog/settings/web.py(Django settings, INSTALLED_APPS),products/(product apps) - Monorepo layout - high-level directory structure (products, services, common, tools)
- Products README - how to create and structure products
- Products architecture - DTOs, facades, isolated testing
- Environment:
- Use flox when available — prefer
flox activate -- bash -c "<command>"if commands fail- Never use
flox activatein interactive sessions (it hangs if you try)
- Never use
- Use flox when available — prefer
- Tests:
- Universal:
hogli test <file_or_directory>— auto-detects test type (Python, Jest, Playwright, Rust, Go) - Single test:
hogli test path/to/test.py::TestClass::test_method - Watch mode:
hogli test path/to/test.py --watch - Changed files only:
hogli test --changed
- Universal:
- Lint:
- Python:
ruff check . --fixandruff format .
- Frontend:
pnpm --filter=@posthog/frontend format - TypeScript check:
pnpm --filter=@posthog/frontend typescript:check
- Python:
- Build:
- Frontend:
pnpm --filter=@posthog/frontend build - Start dev:
./bin/startorhogli start(interactive TUI). Detached mode:hogli up -dpaired withhogli wait/hogli down
- Frontend:
- OpenAPI/types:
hogli build:openapi(regenerate after changing serializers/viewsets) - New product:
bin/hogli product:bootstrap <name> - LSP: Pyright is configured against the flox venv. Prefer LSP (
goToDefinition,findReferences,hover) over grep when navigating or refactoring Python code.
- Use conventional commits for all commit messages and PR titles.
- Check docs for any content that may need updating, you can find these at
docs/
feat: New feature or functionality (touches production code)fix: Bug fix (touches production code)chore: Non-production changes (docs, tests, config, CI, refactoring agents instructions, etc.)- Scope convention: use
llmafor LLM analytics changes (for example,feat(llma): ...)
<type>(<scope>): <description>
Examples:
feat(insights): add retention graph exportfix(cohorts): handle empty cohort in query builderchore(ci): update GitHub Actions workflowchore: update AGENTS.md instructions
Required: Before creating any PR, read .github/pull_request_template.md and use its exact section structure.
Do not invent a different format.
Always fill the ## 🤖 Agent context section when creating PRs.
Keep descriptions high-level, focusing on rationale and architecture for the human reviewer.
NEVER share sensitive information in a PR description. Users may share sensitive data in an agent session, but those should never surface to a PR description, or comments.
- Scope is optional but encouraged when the change is specific to a feature area
- Description should be lowercase and not end with a period
- Keep the first line under 72 characters
Pushes trigger CI, which burns runner credits. Refrain from pushing unless explicitly instructed or until the task is complete — batch local commits and push once at the end rather than after every change. If you're mid-task or iterating, keep work local.
This repository is public and all commit messages, pull request titles, and pull request descriptions must be safe for public readers.
- Never mention internal-only systems, private incidents, customer data, private Slack threads, unreleased roadmap details, or security-sensitive implementation details.
- Use product-facing and code-facing context that a public OSS contributor could understand from this repository alone.
- If context is sensitive, summarize it at a high level without naming internal tools, accounts, or people.
- Avoid citing private operational scale or incident metrics (for example, exact affected team counts, internal row-volume anecdotes, or customer-specific performance numbers) unless that data is already public and linkable.
Examples:
- ✅
fix(insights): handle missing series color in trend export - ❌
fix: patch issue found in acme-co prod workspace after sales escalation— references internal customer - ❌
fix: will run fine on our 12 million rows there now— leaks private operational scale
.nvmrccontrols the Node.js version for all CI workflows (viaactions/setup-node) — changing it affects every CI job that runs Node- Every job in
.github/workflows/must declaretimeout-minutes— prevents stuck runners from burning credits indefinitely
See .agents/security.md for SQL, HogQL, and semgrep security guidelines.
- API views should declare request/response schemas — prefer
@validated_requestfromposthog.api.mixinsor@extend_schemafrom drf-spectacular. PlainViewSetmethods that validate manually need@extend_schema(request=YourSerializer)— without it, drf-spectacular can't discover the request body and generated code gets empty schemas - Django serializers are the source of truth for frontend API types —
hogli build:openapigenerates TypeScript via drf-spectacular + Orval. Generated files (api.schemas.ts,api.ts,api.zod.ts) live infrontend/src/generated/core/andproducts/{product}/frontend/generated/— don't edit them manually, change serializers and rerun. See type system guide for the full pipeline - MCP tools are generated from the same OpenAPI spec — see implementing MCP tools for the YAML config and codegen workflow
- MCP UI apps (interactive visualizations for tool results) are defined in
products/*/mcp/tools.yamlunderui_appsand auto-generated — see services/mcp/CONTRIBUTING.md or use theimplementing-mcp-ui-appsskill - When touching a viewset or serializer, ensure schema annotations are present (
@extend_schemaor@validated_requeston viewset methods,help_texton serializer fields) — these flow into generated frontend types and MCP tool schemas - New features should live in
products/— read products/README.md for layout and setup. When creating a new product, follow products/architecture.md (DTOs, facades, isolation) - Every tenant-data model must have
team_id— either as a FK (models.ForeignKey("posthog.Team", ...)) or a plainBigIntegerField(for multi-DB products). This is the primary tenant isolation boundary. Models withoutteam_idmust be org-scoped, user-scoped, or instance-global — never silently unscoped. New models should inherit fromTeamScopedRootMixin(main DB) orProductTeamModel(separate DB) so they start fail-closed — seeposthog/models/scoping/README.md. CI enforces this viaposthog/models/scoping/baseline_unmigrated.txt: any new team-scoped model not on a fail-closed manager fails the IDOR coverage check. In serializers, access the team viaself.context["get_team"]() - Do not add domain-specific fields to the
Teammodel. Use a Team Extension model instead — seeposthog/models/team/README.mdfor the pattern and helpers - PostHog event capture in Celery tasks: Do not use
posthoganalytics.capture()in Celery tasks — events are silently lost. Useph_scoped_capturefromposthog.ph_clientinstead (see its docstring for why and usage). - Django admin
ForeignKeyfields need explicit widget config. When adding aForeignKey/OneToOneFieldto a model that's exposed in Django admin (including via inlines attached to a related admin), list the new field inautocomplete_fields,raw_id_fields, orreadonly_fieldson every admin class that renders the model — otherwise the default<select>widget loads the entire target table per row on each change-page render. Prefer declaring the config on a shared base inline so per-parent variants (e.g., subclasses differentiated byfk_name) inherit it automatically. - Use personhog client for all person/group data access — do not query persons DB tables via the Django ORM or raw SQL. The
posthog/personhog_client/gRPC client is the required interface for reading and writing person-related data. This applies to the following tables:posthog_person,posthog_persondistinctid,posthog_cohortpeople,posthog_group,posthog_grouptypemapping, and related override tables (posthog_personoverride,posthog_pendingpersonoverride,posthog_flatpersonoverride,posthog_featureflaghashkeyoverride,posthog_personlessdistinctid,posthog_personoverridemapping). Use the helpers inposthog/models/person/util.py(e.g.get_person_by_uuid,get_persons_by_distinct_ids,get_person_by_distinct_id) andposthog/models/group_type_mapping.py(get_group_types_for_project) — these already route through personhog with ORM fallback via_personhog_routed(). When adding new person/group data access, follow the same_personhog_routed()pattern: provide apersonhog_fnusingget_personhog_client()and anorm_fnfallback. Never add new direct ORM queries likePerson.objects.filter(...)orPersonDistinctId.objects.filter(...)— use the existing routed helpers or create new ones following the established pattern. Seeposthog/personhog_client/README.mdfor client details andposthog/personhog_client/client.pyfor the full RPC interface. - PostHog does not enable
ATOMIC_REQUESTS— there is no implicit per-request transaction. Each database operation runs in autocommit mode unless explicitly wrapped. Usewith transaction.atomic():around the specific writes that must succeed or fail together. Do not wrap an entire view method atomically — keep the block as narrow as possible around the related writes. Avoid performing irreversible side effects (sending emails, calling external APIs, enqueuing Celery tasks) inside an atomic block: if the transaction rolls back, those side effects have already happened. Schedule such side effects after the commit, or usetransaction.on_commit()for Celery task dispatch. - Temporal activity payloads have a ~2 MiB hard limit — pass large data by reference, not by value. Activity inputs and outputs are serialized across a gRPC boundary that Temporal caps at ~2 MiB per payload (the server rejects larger payloads via
blobSizeLimitError). As a conservative field-level rule, if a field could exceed ~256 KB once serialized (serialized query results, exported file contents, LLM context, rendered HTML, image bytes, unboundedlist[dict[str, Any]]), write it to Postgres / S3 / object storage from inside the activity and return only the reference (row ID, S3 key). The workflow already has access to any row ID created earlier in the same run; it does not need the content to flow back through. Shuttling large data through the workflow on the way to persistence is a foreseeable failure mode that producesPayloadSizeError(TMPRL1103) the moment the underlying data crosses the limit.
- Python: Write as if mypy
--strictis enabled — annotate all function signatures (arguments + return types), avoidAny, useTYPE_CHECKINGimports for type-only references. Do not run mypy locally (too slow); CI runs it on every PR. The config isn't fully strict yet, but new code should be - Python imports: Place all imports at the top of the file (module level). Do not put imports inside functions, methods, or conditionals. Inline imports hide dependencies from static analysis, slow down hot paths with repeated lookups, and obscure circular-import problems instead of fixing them. The only acceptable reasons to defer an import are (1) breaking a true unavoidable circular import — fix the structure first if you can, and (2)
TYPE_CHECKINGblocks for type-only references. If you reach for an inline import to dodge an import-time side effect or startup cost, fix the offending module instead - Frontend: TypeScript required, explicit return types
- Frontend: If there is a kea logic file, write all business logic there, avoid React hooks at all costs.
- Frontend: Any button or form submit that triggers a network request must guard against double-submission — disable the button and show a loading state (
loading/disabledReasononLemonButton, or equivalent) while the request is in flight. Never leave a submit button clickable during an active mutation; reset the state in both success and error paths. This applies to<form onSubmit>handlers,onClickhandlers that callapi.*, and any kealistenerthat issues a request — wire the in-flight state (loader*Loadingselectors, localuseState, or a reducer) into the trigger's disabled/loading props. - Imports: Use oxfmt import sorting (automatically runs on format), avoid direct dayjs imports (use lib/dayjs)
- CSS: Use tailwind utility classes instead of inline styles
- Error handling: Prefer explicit error handling with typed errors
- Naming: Use descriptive names, camelCase for JS/TS, snake_case for Python
- Comments: explain why, not what — if the reason isn't important, skip the comment
- Comments: when refactoring or moving code, preserve existing comments unless they are explicitly made obsolete by the change
- Python tests: do not add doc comments
- Python: do not create empty
__init__.pyfiles - jest tests: when writing jest tests, prefer a single top-level describe block in a file
- Tests: prefer parameterized tests (use the
parameterizedlibrary in Python) — if you're writing multiple assertions for variations of the same logic, it should be parameterized - Reduce nesting: Use early returns, guard clauses, and helper methods to avoid deeply nested code
- Markdown: prefer semantic line breaks; no hard wrapping
- Use American English spelling
- When mentioning PostHog products, the product names should use Sentence casing, not Title Casing. For example, 'Product analytics', not 'Product Analytics'. Any other buttons, tab text, tooltips, etc should also all use Sentence casing. For example, 'Save as view' instead of 'Save As View'.
When automating a convention, try these in order — only fall back to the next if the previous isn't suitable:
- Linters (ruff, oxlint, semgrep) — code pattern enforcement, always paired with CI
- lint-staged / husky — file-level validation or warnings at commit time
- Skills (
.agents/skills/) — scaffold withhogli init:skill - AGENTS.md / CLAUDE.md instructions — when automated enforcement isn't suitable
Claude Code hooks are reserved for environment bootstrapping (SessionStart only) — do not add PreToolUse, PostToolUse, or Notification hooks as they add latency and are fragile. Changes to .claude/hooks/ trigger a lint-staged warning; changes to .claude/settings.json are blocked outright.
ALWAYS invoke the matching skill before writing or reviewing code in these areas — do not skip, do not attempt the work without loading the skill first.
Always invoke:
/improving-drf-endpoints— any DRF viewset or serializer change/django-migrations— any Django migration/clickhouse-migrations— any ClickHouse migration/adopting-generated-api-types— any frontend file usinglib/api,api.get<,api.create<, or handwritten API types
Invoke when in the area:
/implementing-mcp-tools— adding/modifying endpoints ortools.yaml/modifying-taxonomic-filter— any TaxonomicFilter change/sending-notifications— adding notification support/writing-skills— creating or updating skills in.agents/skills/