Stats Output Guide

precursor --stats emits a run summary JSON object to stderr. This is designed for automation and dashboards while keeping payload records on stdout.

Quick Example

cat payloads.b64 \
  | precursor -p patterns/new -m base64 -t -d --similarity-mode lzjd --stats \
  1>/tmp/records.ndjson 2>/tmp/stats.json

Inspect:

jq '.' /tmp/stats.json

Top-Level Schema

---PRECURSOR_STATISTICS---: marker string.
Input: input volume and size metrics.
Match: pattern and hash generation metrics.
Compare: distance summary when enough pairwise comparisons exist.
Environment: run-time settings snapshot.

Field Notes

`Input`

Count: total payload candidates processed.
Unique: unique payloads by xxh3_64_sum.
AvgSize, MinSize, MaxSize, P95Size, TotalSize: size distribution.

`Match`

Patterns: number of compiled pattern expressions.
TotalMatches: total named-capture hits.
Matches: per-tag hit counts.
HashesGenerated: similarity hashes generated for matched payloads.
Size fields summarize only matched payloads.

`Compare`

Similarities, AvgDistance, MinDistance, MaxDistance, P95Distance.
May be null/empty when insufficient pairwise distances are available.
- Practical rule: provide at least 3 matched payloads to reliably populate this section.

`Environment`

Includes version and run-time selections:
- SimilarityMode
- RegexEngine
- InputMode
- HashFunction
- DistanceThreshold
- protocol inference options and Sigma count.

Compatibility Notes

Historical field names such as tlsh_similarities in record output remain for compatibility, even when running lzjd or fbhash.
HashFunction reflects TLSH algorithm selection argument and is retained for compatibility; non-TLSH modes still report the selected similarity mode explicitly via SimilarityMode.

Useful Queries

Total input and throughput:

jq '{count: .Input.Count, total: .Input.TotalSize, rate: .Environment.ProcessingRate}' /tmp/stats.json

Most frequent tags:

jq '.Match.Matches | sort_by(.Matches) | reverse | .[:10]' /tmp/stats.json

Distance snapshot:

jq '.Compare' /tmp/stats.json

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Stats Output Guide

Quick Example

Top-Level Schema

Field Notes

`Input`

`Match`

`Compare`

`Environment`

Compatibility Notes

Useful Queries

FilesExpand file tree

STATS.md

Latest commit

History

STATS.md

File metadata and controls

Stats Output Guide

Quick Example

Top-Level Schema

Field Notes

Input

Match

Compare

Environment

Compatibility Notes

Useful Queries

`Input`

`Match`

`Compare`

`Environment`