Releases · Codesteward/codesteward

15 Apr 18:29

github-actions

v0.5.2

0cb6c1a

v0.5.2 Latest


Fixed — codesteward-graph

dependency query returned empty results despite edges being written.
PyProjectParser and PackageJsonParser emitted depends_on edges whose
source_id pointed at a pyproject.toml / package.json file node, but
that node was never written to the graph. On GraphQLite the edge
write was silently dropped (the source MATCH returned nothing); on Neo4j the
source had to be merged separately. Both parsers now return a tuple
(source_file_nodes, edges) and GraphBuilder persists the file nodes
alongside the edges, giving the dependency query a proper source to
match against.
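
The new return shape can be illustrated with a minimal sketch. The Node and Edge dataclasses, the parse_package_json function, and the missing_sources helper are hypothetical names chosen for this example and are not taken from the codesteward source; only the (source_file_nodes, edges) tuple shape comes from the note above.

```python
import json
from dataclasses import dataclass


@dataclass(frozen=True)
class Node:
    node_id: str
    node_type: str
    name: str


@dataclass(frozen=True)
class Edge:
    source_id: str
    target_name: str
    edge_type: str = "depends_on"


def parse_package_json(path: str, text: str) -> tuple[list[Node], list[Edge]]:
    """Return the manifest's file node together with its depends_on edges."""
    manifest = json.loads(text)
    # The file node is created here so that every edge has a source to match.
    file_node = Node(node_id=f"file:{path}", node_type="file", name=path)
    edges = [
        Edge(source_id=file_node.node_id, target_name=package)
        for section in ("dependencies", "devDependencies")
        for package in manifest.get(section, {})
    ]
    return [file_node], edges


def missing_sources(nodes: list[Node], edges: list[Edge]) -> set[str]:
    """Edge sources with no matching node: the condition behind the bug."""
    known = {node.node_id for node in nodes}
    return {edge.source_id for edge in edges} - known
```

Persisting only the edges (an empty node list) leaves missing_sources non-empty, which is the state in which a source MATCH finds nothing.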

Improved — codesteward-graph

referential query now returns both outgoing and incoming edges for a
filter. Previously the filter matched only the edge source name, so
asking for referential filter=foo returned what foo calls but not
what calls foo — the most common "who depends on this?" question.
Templates now also match r.target_name (and tgt.name on Neo4j), so a
single filter surfaces both directions. Direction is always visible in
the result rows via from_name vs to_name.
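
As a rough in-memory illustration of the two-sided match (not the actual Cypher template), the sketch below filters result rows on either from_name or to_name. The function names and the exact-match comparison are assumptions made for the example; the real templates may match differently.

```python
def referential_matches(rows: list[dict], name: str) -> list[dict]:
    """Keep rows where `name` is either the edge source or the edge target."""
    return [row for row in rows if name in (row["from_name"], row["to_name"])]


def direction(row: dict, name: str) -> str:
    """Classify a matched row relative to `name`."""
    return "outgoing" if row["from_name"] == name else "incoming"


rows = [
    {"from_name": "foo", "to_name": "bar"},   # foo calls bar
    {"from_name": "baz", "to_name": "foo"},   # baz calls foo
    {"from_name": "baz", "to_name": "qux"},   # unrelated to foo
]

matched = referential_matches(rows, "foo")
directions = [direction(row, "foo") for row in matched]
```

Matching only on from_name would return the first row alone; the second row is the "who depends on this?" answer that the change surfaces.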

Container image

docker pull ghcr.io/codesteward/codesteward:0.5.2

Image is signed with cosign keyless via GitHub OIDC.
Verify with:

cosign verify ghcr.io/codesteward/codesteward:0.5.2 \
  --certificate-identity-regexp 'https://github.com/Codesteward/codesteward/.*' \
  --certificate-oidc-issuer https://token.actions.githubusercontent.com


15 Apr 17:58

github-actions

v0.5.1

2f3f146

v0.5.1

Fixed — codesteward-mcp

stdio transport: server crashed on first tool call — structlog.configure() was
called without an explicit logger_factory, so structlog defaulted to
PrintLoggerFactory(file=sys.stdout). Any log.warning / log.error on a tool path
(e.g. _make_backend when a backend was missing) wrote structlog output to stdout,
which is the JSON-RPC channel in stdio mode. The MCP client received non-JSON and
dropped the connection with JSON Parse error: Unable to parse JSON string. Fix:
route structlog to sys.stderr via PrintLoggerFactory(file=sys.stderr).
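
The failure mode can be reproduced without structlog. The sketch below uses in-memory streams as stand-ins for stdout and stderr and a hypothetical read_messages helper that parses a newline-delimited JSON stream the way a stdio client might; none of these names come from the codesteward source.

```python
import io
import json


def read_messages(channel: io.StringIO) -> list[dict]:
    """Parse a newline-delimited JSON stream, one message per line."""
    return [json.loads(line) for line in channel.getvalue().splitlines() if line]


response = json.dumps({"jsonrpc": "2.0", "id": 1, "result": {}})

# Before the fix: log output and protocol messages share one stream.
polluted = io.StringIO()
print("[warning] backend missing", file=polluted)
print(response, file=polluted)

# After the fix: logs go to a separate stream (the role sys.stderr plays).
protocol, logs = io.StringIO(), io.StringIO()
print("[warning] backend missing", file=logs)
print(response, file=protocol)

try:
    read_messages(polluted)
    polluted_ok = True
except json.JSONDecodeError:
    polluted_ok = False  # the log line is not valid JSON

clean = read_messages(protocol)
```

With the log line moved to its own stream, the protocol channel carries only JSON and parses cleanly.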

Container image

docker pull ghcr.io/codesteward/codesteward:0.5.1

Image is signed with cosign keyless via GitHub OIDC.
Verify with:

cosign verify ghcr.io/codesteward/codesteward:0.5.1 \
  --certificate-identity-regexp 'https://github.com/Codesteward/codesteward/.*' \
  --certificate-oidc-issuer https://token.actions.githubusercontent.com


15 Apr 17:13

github-actions

v0.5.0

969bd61

v0.5.0

Added — codesteward-graph

PyProjectParser — extracts depends_on edges from pyproject.toml files, supporting
PEP 621 [project.dependencies], [project.optional-dependencies], and Poetry
[tool.poetry.dependencies]. Scans all pyproject.toml files in the repo tree (handles
uv workspaces and monorepos). Wired into GraphBuilder.build_graph() alongside the
existing PackageJsonParser. The dependency query type now returns results for Python
projects.

Fixed — codesteward-graph

- GraphQLite: dependency query returned null package names. The query template read
  pkg.name from the target node via a traversal, but GraphQLite does not resolve target
  node properties through traversal patterns (the same limitation already worked around in
  the referential query). Rewrote the template to read target_name from edge properties.
  The same fix was applied to the semantic query's sink_name/sink_file fields.
- GraphQLite: delete_file_nodes did not filter correctly with $param in MATCH patterns.
  $param interpolation into MATCH property patterns is unreliable in GraphQLite. The
  method was inconsistent with count_nodes/delete_repo_data, which already use literal
  values via _cypher_escape. Rewrote delete_file_nodes to match. This was silently
  breaking incremental rebuilds.
- GraphQLite: named query templates used $param in MATCH patterns. Tenant/repo/filter
  isolation was unreliable in lexical, referential, semantic, and dependency
  queries. Rewrote each template as a builder function that constructs Cypher with escaped
  literal values and moves filters to WHERE clauses.
- GraphQLite: write_augment_edge created duplicate edges on re-invocation. It used
  CREATE instead of MERGE and did not dedup by edge_id. Added a delete-before-create
  pattern so re-writing an augment edge with the same edge_id is idempotent (matches the
  upsert behavior of Neo4j and JanusGraph).
- GraphQLite: semantic query used NOT r.sanitized. SQLite stores booleans as
  integers and NOT <int> semantics were not reliable through GraphQLite's Cypher
  translation. Changed to explicit r.sanitized = 0.
- GraphQLite: full rebuild duplicated all edges. write_edges uses CREATE (not MERGE)
  for relationship creation, so consecutive full rebuilds doubled the edge count with each run.
  Added delete_repo_data(tenant_id, repo_id) to the GraphBackend ABC and all three
  implementations (Neo4j, JanusGraph, GraphQLite). GraphBuilder.build_graph() now clears
  existing repo data before every full (non-incremental) rebuild.
- Cross-file CALLS resolution was 0% on codebases with shared method names.
  _resolve_call_targets() marked all ambiguous names (e.g. parse, defined in 13+ parsers)
  as unresolvable. Added same-file disambiguation: when a callee name is globally ambiguous,
  resolve to the definition in the caller's own file. This recovers intra-file method calls
  that were previously left as unresolved external nodes.
- GUARDED_BY edges emitted for non-auth Python decorators. @property,
  @staticmethod, @abstractmethod, @dataclass, @pytest.fixture, and ~30 other standard
  Python decorators were incorrectly producing guarded_by edges. Added a _NON_AUTH_DECORATORS
  blocklist to the Python parser; only actual auth decorators (@login_required,
  Depends(...), etc.) now emit guard edges.
- build_graph summary reported language: typescript for all codebases. The language
  parameter defaulted to "typescript" and was echoed into the summary unchanged. Added
  _detect_dominant_language(), which counts file-node languages and returns the most common
  one. The summary now reflects the actual codebase language.
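
The same-file disambiguation rule for CALLS targets described above can be sketched as below. The resolve_call_targets function, the tuple layouts, and the node-ID strings are hypothetical and chosen for the example; the real _resolve_call_targets() operates on the project's own node and edge types.

```python
from collections import defaultdict


def resolve_call_targets(definitions, calls):
    """Resolve callee names to node IDs.

    definitions: iterable of (node_id, name, file)
    calls:       iterable of (caller_file, callee_name)
    Returns one entry per call: a node ID, or None if unresolved.
    """
    by_name = defaultdict(list)
    for node_id, name, file in definitions:
        by_name[name].append((node_id, file))

    resolved = []
    for caller_file, callee in calls:
        candidates = by_name.get(callee, [])
        if len(candidates) == 1:
            # Globally unique name: resolve directly.
            resolved.append(candidates[0][0])
            continue
        # Ambiguous (or unknown) name: fall back to the caller's own file.
        same_file = [nid for nid, file in candidates if file == caller_file]
        resolved.append(same_file[0] if len(same_file) == 1 else None)
    return resolved
```

A call to parse from a.py resolves to a.py's own parse even when other files define the same name, while a call from a file with no local definition stays unresolved.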

Security / CI

CI/CD security hardening — introduced a comprehensive CI pipeline based on the
OpenSSF / SLSA guidance:
- Every job now runs behind step-security/harden-runner (audit mode) and uses
  persist-credentials: false with scoped permissions.
- New checks: Semgrep (p/python, p/security-audit, p/owasp-top-ten, p/docker),
  Hadolint, zizmor (GitHub Actions static analysis, with ref-pin policy in
  .github/zizmor.yml), CodeQL security-extended (push-to-main), pip-audit
  against uv export --frozen, Trivy container scan that gates the release,
  dependency-review on PRs, conventional commits, license headers
  (skywalking-eyes, check-only), markdown-lint, OpenSSF Scorecard, and a
  weekly scheduled scan workflow (CodeQL, pip-audit, Trivy image, gitleaks).
- Release workflow now builds linux/amd64 locally, scans with Trivy (HIGH/CRITICAL
  gate), pushes multi-arch with SLSA provenance (provenance: mode=max) and SBOM,
  signs with cosign keyless via GitHub OIDC, and attaches trivy-report.json +
  sbom.cdx.json to the GitHub Release.
- Added .github/CODEOWNERS, .github/dco.yml (probot/dco), SECURITY.md,
  .licenserc.yaml, renovate.json (with pinDigests: true), and
  docs/ci-security-hardening.md.
Container base patched against Debian openssl CVE-2026-28390 — final stage of
Dockerfile.mcp now runs apt-get upgrade -y on top of python:3.12-slim to pick up
Debian security updates that lag the upstream image rebuild cadence.
Narrowly-scoped .trivyignore for bundled codesteward-taint binary — the upstream
v0.1.0 binary was built with Go 1.22.12 and inherits nine Go stdlib CVEs that cannot be
fixed from this repo. Documented each suppression with a TODO to drop once upstream ships
a Go ≥ 1.26.2 rebuild. Operators who do not need taint analysis can
--build-arg TAINT_VERSION=none to remove the binary and skip the suppressions.
Python runtime CVEs resolved — bumped locked cryptography 46.0.5 → 46.0.7
(CVE-2026-34073, CVE-2026-39892), pygments 2.19.2 → 2.20.0 (CVE-2026-4539), and
pytest 9.0.2 → 9.0.3 (CVE-2025-71176) via uv lock --upgrade-package.

Container image

docker pull ghcr.io/codesteward/codesteward:0.5.0

Image is signed with cosign keyless via GitHub OIDC.
Verify with:

cosign verify ghcr.io/codesteward/codesteward:0.5.0 \
  --certificate-identity-regexp 'https://github.com/Codesteward/codesteward/.*' \
  --certificate-oidc-issuer https://token.actions.githubusercontent.com


12 Apr 20:52

github-actions

v0.4.2

6ebdd73

v0.4.2

Fixed — codesteward-graph

- .venv and Python tool directories not excluded from graph builds. _IGNORED_DIRS
  now includes .venv, venv, .env, env, .tox, .nox, .mypy_cache,
  .ruff_cache, .pytest_cache, site-packages, and .eggs. Previously these
  directories were parsed, causing recursion errors on large vendored files and polluting
  the graph with thousands of library symbols.
- Cross-file CALLS targets unresolved. Added a _resolve_call_targets() post-parse pass
  to GraphBuilder.build_graph(). After all files are parsed, a fn_name → node_id map
  is built and CALLS edge target_id values are rewritten from bare callee names to proper
  node IDs. Ambiguous names (multiple definitions) are left unresolved. Typically resolves
  ~30% of all CALLS edges in a codebase.
- GraphQLite: referential query returned NULL target properties. GraphQLite's
  relationship traversal (src)-[r]->(tgt) does not resolve target node properties.
  Worked around by storing target_name and target_id as edge properties during
  write_edges, and reading them from the edge in the referential query template.
- GraphQLite: UNWIND ON CREATE SET did not persist target node properties. Replaced
  batch UNWIND target-node creation with per-node literal MERGE, consistent with the
  existing per-edge literal approach.
- GraphQLite: dependency query SQL error. MATCH with mixed $param and literal
  values in property patterns triggers a GraphQLite SQL translation bug
  (no such column: _prop__gql_default_alias_0.value). Moved node_type = 'file'
  from the MATCH pattern to the WHERE clause.
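
The directory exclusion from the first fix above can be approximated with os.walk and in-place pruning. The IGNORED_DIRS set mirrors the names listed in the note, but iter_source_files is a hypothetical helper written for this example and not the project's actual traversal code.

```python
import os

IGNORED_DIRS = {
    ".venv", "venv", ".env", "env", ".tox", ".nox", ".mypy_cache",
    ".ruff_cache", ".pytest_cache", "site-packages", ".eggs",
}


def iter_source_files(root: str, suffix: str = ".py"):
    """Yield paths (relative to root) of source files outside ignored dirs."""
    for dirpath, dirnames, filenames in os.walk(root):
        # Prune in place so os.walk never descends into ignored directories.
        dirnames[:] = sorted(d for d in dirnames if d not in IGNORED_DIRS)
        for filename in sorted(filenames):
            if filename.endswith(suffix):
                yield os.path.relpath(os.path.join(dirpath, filename), root)
```

Pruning dirnames, rather than filtering results afterwards, avoids reading vendored files at all.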

Fixed — codesteward-mcp

codesteward-mcp setup wrote Claude Code MCP config to wrong file — was writing to
~/.claude/settings.json but Claude Code reads MCP servers from ~/.claude.json.
Also added the required "type": "stdio" field to the server config for Claude Code.
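
A sketch of an idempotent config merge that adds the "type": "stdio" field is shown below. The register_server function is hypothetical, and the top-level "mcpServers" key is an assumption about the layout of ~/.claude.json rather than something stated in these notes; the sketch works on the file's text so it does not touch a real config.

```python
import json


def register_server(config_text: str, name: str, command: str, args: list[str]) -> str:
    """Return config JSON with the server entry added or replaced."""
    config = json.loads(config_text) if config_text.strip() else {}
    # Assumed layout: servers live under a top-level "mcpServers" object.
    servers = config.setdefault("mcpServers", {})
    servers[name] = {"type": "stdio", "command": command, "args": list(args)}
    return json.dumps(config, indent=2)
```

Because the entry is keyed by server name, applying the function twice yields the same text, and unrelated keys in the file are preserved.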

Changed — Known issues — GraphQLite backend

- The referential query NULL target issue from 0.4.1 is now resolved via edge-stored
  target metadata.
- The dependency query SQL error from 0.4.1 is now resolved via a WHERE clause workaround.
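
The WHERE clause workaround, combined with literal values instead of $param bindings, might be sketched as follows. The cypher_escape and build_dependency_query functions, the tenant_id/repo_id property names, and the exact query text are assumptions made for illustration; they are not the project's actual templates, and the escaping shown covers only backslashes and single quotes.

```python
def cypher_escape(value: str) -> str:
    """Quote a string as a Cypher literal, escaping backslashes and quotes."""
    return "'" + value.replace("\\", "\\\\").replace("'", "\\'") + "'"


def build_dependency_query(tenant_id: str, repo_id: str) -> str:
    """Build a query with all filters as escaped literals in WHERE."""
    return (
        "MATCH (src)-[r:DEPENDS_ON]->(tgt) "
        f"WHERE src.tenant_id = {cypher_escape(tenant_id)} "
        f"AND src.repo_id = {cypher_escape(repo_id)} "
        "AND src.node_type = 'file' "
        # Target name is read from the edge, not from the target node.
        "RETURN DISTINCT src.name AS from_name, r.target_name AS to_name"
    )
```

The MATCH pattern carries no property filters and the returned package name comes from the edge, which matches the two workarounds listed above.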

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.4.2

Full setup guide: AGENT_SETUP.md


12 Apr 19:45

github-actions

v0.4.1

95ca9ad

v0.4.1

Fixed — codesteward-graph

- GraphQLite backend: SQLite threading error. graphqlite.connect() now passes
  check_same_thread=False so that asyncio.to_thread() can execute queries in worker
  threads without raising sqlite3.ProgrammingError.
- GraphQLite backend: node properties not persisted. write_nodes replaced
  SET node += n (map-merge syntax unsupported by GraphQLite) with explicit
  ON CREATE SET / ON MATCH SET for each field.
- GraphQLite backend: edges not persisted. write_edges rewritten to work around
  two GraphQLite Cypher-to-SQL translation bugs: (1) UNWIND variable references in MATCH
  property patterns match all nodes instead of filtering, and (2) $param references in
  relationship properties are silently discarded. Edges are now written individually with
  Cypher literal values; target nodes are batch-MERGEd in a separate step.
- GraphQLite backend: count_nodes always returned 0. Replaced the MATCH inline
  property filter with a WHERE clause using literal values, avoiding the parameter
  binding issue.
- GraphQLite backend: write_augment_edge not persisting edge properties. Same
  workaround as write_edges: target node MERGE separated from edge CREATE, relationship
  properties written as literals.
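
The threading fix in the first item above can be demonstrated with the standard-library sqlite3 module, used here as a stand-in for graphqlite.connect(). The count_rows coroutine and the nodes table are made up for the example.

```python
import asyncio
import sqlite3


async def count_rows() -> int:
    # check_same_thread=False lets a connection opened on the event-loop
    # thread be used from the worker thread chosen by asyncio.to_thread.
    conn = sqlite3.connect(":memory:", check_same_thread=False)
    try:
        conn.execute("CREATE TABLE nodes (name TEXT)")
        conn.execute("INSERT INTO nodes VALUES ('parse')")

        def query() -> int:
            return conn.execute("SELECT COUNT(*) FROM nodes").fetchone()[0]

        return await asyncio.to_thread(query)
    finally:
        conn.close()


result = asyncio.run(count_rows())
```

With the default check_same_thread=True, the query inside the worker thread raises sqlite3.ProgrammingError. Disabling the check shifts responsibility for serialising access to the caller.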

Known issues — GraphQLite backend

These are upstream bugs in the graphqlite package (≤ 0.4.3) that remain unresolved:

- Target node properties (to_name, to_file, to_node_type) return NULL in
  referential query results. The (src)-[r]->(tgt) pattern match finds edges but
  tgt.* properties are inaccessible.
- dependency named query fails with SQL prepare failed: no such column. The
  RETURN DISTINCT + DEPENDS_ON edge type triggers a Cypher-to-SQL translation error.
- Node count mismatch between graph_rebuild reported total and count_nodes result.
  External reference nodes created during edge writing inflate the DB count beyond the
  parser-reported total.

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.4.1

Full setup guide: AGENT_SETUP.md


12 Apr 18:34

github-actions

v0.4.0

d67b070

v0.4.0

Added — codesteward-graph

- Graph backend abstraction layer (engine/backends/): new GraphBackend ABC with a
  unified async interface for node/edge writes, named queries, and raw query passthrough.
  All tool functions are now backend-agnostic.
- JanusGraph backend (backends/janusgraph.py): Apache 2.0 licensed alternative to Neo4j.
  Connects via Gremlin (Apache TinkerPop gremlinpython>=3.7). Named query templates
  (lexical, referential, semantic, dependency) reimplemented in Gremlin. Raw query
  passthrough uses Gremlin instead of Cypher.
- GraphQLite backend (backends/graphqlite.py): embedded SQLite-based graph database
  (graphqlite>=0.4). No server needed, ideal for local dev via uvx. Speaks Cypher
  (same templates as Neo4j). Database defaults to ~/.codesteward/graph.db; override with
  GRAPHQLITE_DB_PATH.
- Neo4j backend extracted into backends/neo4j.py (same Cypher queries, now behind the
  GraphBackend interface).
- get_backend() factory in backends/__init__.py dispatches by GRAPH_BACKEND value.
- New optional dependency extras: janusgraph (gremlinpython) and graphqlite (graphqlite).
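
The shape of such an abstraction can be sketched with abc and a name-keyed factory. The method names write_nodes and query_named appear in these notes, but their signatures here, the InMemoryBackend class, and the "memory" registry key are assumptions made for the example.

```python
import abc
import asyncio


class GraphBackend(abc.ABC):
    """Async interface that every backend implements (signatures assumed)."""

    @abc.abstractmethod
    async def write_nodes(self, nodes: list[dict]) -> int: ...

    @abc.abstractmethod
    async def query_named(self, query_type: str, **params) -> list[dict]: ...


class InMemoryBackend(GraphBackend):
    """Toy backend used only to exercise the interface."""

    def __init__(self) -> None:
        self._nodes: list[dict] = []

    async def write_nodes(self, nodes: list[dict]) -> int:
        self._nodes.extend(nodes)
        return len(nodes)

    async def query_named(self, query_type: str, **params) -> list[dict]:
        if query_type != "lexical":
            raise ValueError(f"unsupported query type: {query_type!r}")
        needle = params.get("filter", "")
        return [node for node in self._nodes if needle in node["name"]]


_BACKENDS = {"memory": InMemoryBackend}


def get_backend(name: str) -> GraphBackend:
    """Dispatch on the backend name, as a GRAPH_BACKEND value would."""
    try:
        return _BACKENDS[name]()
    except KeyError:
        raise ValueError(f"unknown GRAPH_BACKEND: {name!r}") from None
```

Tool code written against GraphBackend never needs to know which concrete class the factory returned.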

Added — codesteward-mcp

- GRAPH_BACKEND environment variable to select the graph backend: neo4j (default),
  janusgraph, or graphqlite.
- JANUSGRAPH_URL environment variable for the Gremlin Server WebSocket URL.
- GRAPHQLITE_DB_PATH environment variable for the SQLite database file path.
- gremlin raw query type in codebase_graph_query for JanusGraph raw Gremlin passthrough.
  Cypher/Gremlin mismatch is rejected with a clear error.
- docker-compose.janusgraph.yml: drop-in JanusGraph stack (BerkeleyDB JE + Lucene,
  single-node, no external Cassandra/HBase required).
- docker-compose.neo4j.yml: renamed from the previous docker-compose.yml for clarity.
- Docker image now installs the janusgraph extra by default.
- New optional dependency extras on codesteward-mcp: janusgraph and graphqlite
  (re-exported from codesteward-graph).
- Global setup templates: Claude Code (templates/global-claude-code/) and OpenAI Codex
  (templates/global-codex/) with CLAUDE.md, skill file, settings snippet, and AGENTS.md.
- codesteward-mcp setup subcommand: one-time global setup that auto-detects installed
  AI tools (Claude Code, Cursor, Cline, Codex CLI, Gemini CLI), registers the MCP server in
  each tool's global config, and merges workflow instructions into CLAUDE.md / AGENTS.md /
  GEMINI.md. Idempotent and safe to re-run. --uninstall reverses all changes cleanly.
  --backend flag accepts graphqlite (default), neo4j, or janusgraph.
- Cline support: .clinerules template, Cline detection in setup command (cross-platform
  globalStorage path resolution), and Cline section in AGENT_SETUP.md with marketplace install
  instructions via llms-install.md.
- docs/setup/: per-tool setup guides (Claude Code, Cursor & Cline, Codex CLI, Gemini CLI,
  Windsurf / VS Code / Claude Desktop / Continue.dev, Docker + Neo4j / JanusGraph). Referenced
  from README.md Quick Start.

Changed — codesteward-mcp

- GRAPH_BACKEND default changed from neo4j to auto, which auto-detects the appropriate
  backend at startup: Neo4j if NEO4J_PASSWORD is set, JanusGraph if JANUSGRAPH_URL is
  non-default, otherwise GraphQLite. Existing deployments with explicit env vars are unaffected.
- Tool response fields renamed: neo4j_connected → backend_connected; new
  graph_backend field in graph_rebuild and graph_status responses.
- _make_async_driver() replaced by _make_backend(), which returns a GraphBackend instance
  (or None for stub mode) instead of a raw Neo4j driver.
- GraphBuilder now accepts a backend parameter (the GraphBackend instance) instead of
  neo4j_driver.
- Cypher query templates moved from inline constants in tools/graph.py into each backend's
  query_named() implementation.
- Server instructions updated to describe all three backends and the gremlin query type.
- README.md Quick Start rewritten: leads with uvx codesteward-mcp setup for zero-config
  global setup; manual setup simplified with GraphQLite as default.
- llms-install.md rewritten for GraphQLite default and Cline compatibility.
- All uvx args in templates and docs fixed to use the --from pattern
  (uvx --from "codesteward-mcp[graph-all,graphqlite]" codesteward-mcp). The previous
  pattern failed on macOS where uvx cannot parse extras as a command name.
- Global setup templates (templates/global-claude-code/, templates/global-codex/) updated
  to use GraphQLite as default backend.
- License changed from BSD 3-Clause to Apache 2.0.
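
The auto-detection order for GRAPH_BACKEND described at the top of this list can be sketched as a pure function over an environment mapping. The resolve_backend name and the DEFAULT_JANUSGRAPH_URL value are assumptions made for illustration; these notes do not state what the default URL is.

```python
from collections.abc import Mapping

# Assumed default; the real default URL is not given in these notes.
DEFAULT_JANUSGRAPH_URL = "ws://localhost:8182/gremlin"


def resolve_backend(env: Mapping[str, str]) -> str:
    """Pick a backend name following the documented precedence."""
    explicit = env.get("GRAPH_BACKEND", "auto")
    if explicit != "auto":
        return explicit  # explicit settings always win
    if env.get("NEO4J_PASSWORD"):
        return "neo4j"
    if env.get("JANUSGRAPH_URL", DEFAULT_JANUSGRAPH_URL) != DEFAULT_JANUSGRAPH_URL:
        return "janusgraph"
    return "graphqlite"
```

Taking the environment as a parameter, rather than reading os.environ directly, keeps the precedence rules easy to test.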

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.4.0

Full setup guide: AGENT_SETUP.md


20 Mar 17:59

github-actions

v0.3.0

1512a96

v0.3.0

Added — codesteward-graph

Taint-source node and edge emission across all 12 parsers, enabling L1 taint analysis by the
codesteward-taint binary without requiring a separate source-annotation pass:
- Python — Flask/Django/FastAPI request.*, WSGI environ, Starlette Request
- TypeScript/JavaScript — Express req.body/req.query/req.params/req.headers/req.cookies;
  NestJS parameter decorators (@Body, @Param, @Query, @Headers, etc.)
- Java — Spring MVC @RequestParam, @PathVariable, @RequestBody, @RequestHeader,
  @CookieValue; Jakarta EE @QueryParam, @PathParam, @FormParam, @HeaderParam
- Go — net/http r.URL.Query(), r.FormValue(), r.Header.Get(), r.Body;
  Gin c.Query(), c.Param(), c.PostForm(), c.GetHeader()
- Rust — Actix-web/Axum typed extractors: web::Path<T>, web::Query<T>, web::Json<T>,
  web::Form<T>, web::Bytes, web::Multipart, extract::Path, extract::Json, etc.
- PHP — superglobals ($_GET, $_POST, $_REQUEST, $_FILES, $_COOKIE, $_SERVER);
  Laravel $request->input()/query()/file()/etc.; Symfony property bags ($request->query,
  $request->headers, …); PSR-7 getQueryParams()/getParsedBody()/etc.;
  CodeIgniter4 getGet()/getPost()/getJSON()/etc.
- C# — ASP.NET Core parameter attributes ([FromQuery], [FromRoute], [FromBody],
  [FromForm], [FromHeader]); HttpRequest property access (Request.Query,
  Request.Form, Request.Headers, Request.Cookies)
- Kotlin — Spring Boot @RequestParam, @PathVariable, @RequestBody, @RequestHeader,
  @CookieValue; Ktor call.receive*(), call.parameters, call.request.queryParameters;
  Http4k request.query(), request.path(), request.bodyString()
- Scala — Play Framework request.body.*, request.queryString, request.headers;
  Akka HTTP directives (parameters, entity, formField, headerValueByName, cookie, path)
- C — CGI getenv() for HTTP env vars (QUERY_STRING, HTTP_COOKIE, etc.), stdin reads
  (fread/fgets/read); Mongoose mg_http_get_var/mg_http_get_header;
  libmicrohttpd MHD_lookup_connection_value
- C++ — all C patterns reused; Crow req.body/req.url_params/req.headers;
  Drogon req->getBody()/req->getParameter()/req->getHeader()/req->getCookie();
  Pistache request.query()/request.resource(); Oat++ getPathVariable()/getQueryParameter()
- COBOL — no applicable web taint patterns; no change
tests/test_engine/test_taint_sources.py — new test module with 50+ tests covering taint-source
detection for C, C++, C#, Rust, PHP, Kotlin, Scala, and NestJS (TypeScript)

Added — codesteward-mcp

- taint_analysis MCP tool: invokes the codesteward-taint Go binary as an async subprocess
  and returns YAML with unsafe/sanitized path counts and a findings list. The tool is registered
  only when the binary is present on PATH (shutil.which); the server starts normally without it.
- TAINT_FLOW edges are now writable via graph_augment (added taint_flow to
  _ALLOWED_EDGE_TYPES).
- Docker image: new taint-fetcher build stage bundles the codesteward-taint binary by
  default (latest GitHub Release). Pin with --build-arg TAINT_VERSION=<version> or omit
  entirely with --build-arg TAINT_VERSION=none.

Changed — codesteward-mcp

codebase_graph_query semantic template updated from DATA_FLOW to TAINT_FLOW: results
now return source_name, source_file, sink_name, sink_file, cwe, hops, level,
framework instead of function_name, file, line, flow_description. Returns empty
until taint_analysis has been run.

Removed — codesteward-graph

- DATA_FLOW edges are no longer emitted by any parser. Use TAINT_FLOW edges written by the
  codesteward-taint binary for data-flow analysis.
- _extract_semantic_edges() removed from TreeSitterBase (and all callers in python.py,
  typescript.py, java.py).


15 Mar 23:42

github-actions

v0.2.2

1280b9b

v0.2.2

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.2.2

Full setup guide: AGENT_SETUP.md

Full Changelog: v0.2.1...v0.2.2


15 Mar 22:54

github-actions

v0.2.1

9ac9c07

v0.2.1

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.2.1

Full setup guide: AGENT_SETUP.md

Full Changelog: v0.2.0...v0.2.1


15 Mar 22:33

github-actions

v0.2.0

ccbe4e9

v0.2.0

Docker image

docker pull ghcr.io/bitkaio/codesteward:v0.2.0

Full setup guide: AGENT_SETUP.md

Full Changelog: v0.1.0...v0.2.0

