Add Spam Confidence Score to Detections #14881

@ssejal1508

Problem / Gap
SmokeDetector currently reports a post as either spam or not spam, with no indication of how strong the detection is. Moderators and metasmoke reviewers sometimes have to judge that strength by hand (e.g., was the post caught by one weak rule or by five strong ones?).

Proposed Solution

  • Introduce a confidence score (0–100) alongside spam detections.

  • The score can be derived from:

    • Number of detection rules triggered
    • Strength/weight of the rules (e.g., hard blacklist = high weight, fuzzy match = low weight)
    • Presence of multiple spam signals (links, repetition, Unicode obfuscation, etc.)
  • Display this score in chat reports and in the metasmoke interface.
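
One possible way to combine the factors above is sketched below. It is a rough illustration, not an existing SmokeDetector API: the rule names, the weights, the `DEFAULT_WEIGHT` fallback, and the `confidence_score` function are all hypothetical, and the "noisy-OR" combination is just one reasonable choice. Each triggered rule is treated as independent evidence, so the score rises with both the number and the strength of the rules but never exceeds 100.

```python
from math import prod

# Hypothetical rule weights in [0, 1]. The names and values are illustrative
# assumptions, not SmokeDetector's actual rule names or tuning.
RULE_WEIGHTS = {
    "blacklisted_website": 0.9,   # hard blacklist: high weight
    "bad_keyword": 0.6,
    "repeated_text": 0.4,
    "unicode_obfuscation": 0.4,
    "fuzzy_match": 0.2,           # fuzzy match: low weight
}
DEFAULT_WEIGHT = 0.3  # assumed fallback for rules without an assigned weight


def confidence_score(triggered_rules):
    """Combine the weights of the triggered rules into a 0-100 score.

    Uses a noisy-OR: score = 100 * (1 - product of (1 - weight)).
    Duplicate rule names are counted once.
    """
    unique_rules = set(triggered_rules)
    if not unique_rules:
        return 0
    # Probability-like chance that every triggered rule is a false alarm.
    miss = prod(1.0 - RULE_WEIGHTS.get(rule, DEFAULT_WEIGHT) for rule in unique_rules)
    return round(100 * (1.0 - miss))


if __name__ == "__main__":
    for rules in (["fuzzy_match"], ["blacklisted_website"],
                  ["blacklisted_website", "bad_keyword"]):
        print(rules, confidence_score(rules))
```

With these assumed weights, a single fuzzy match scores low, a single hard-blacklist hit scores high, and adding a second rule raises the score further. A plain weighted sum capped at 100 would also work; the noisy-OR form just avoids needing the cap.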

Benefits

  • Helps moderators quickly decide which spam needs urgent review.
  • Reduces noise by highlighting high-confidence spam.
  • Provides a foundation for future automation (e.g., auto-deletion if score > 90).
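
As a rough sketch of how a score could drive triage, the snippet below maps a 0-100 score to a review bucket. The `triage` function, the bucket names, and the lower threshold of 60 are assumptions made for illustration; only the "score > 90" cutoff comes from the example above.

```python
def triage(score, urgent_above=90, likely_from=60):
    """Map a 0-100 confidence score to a hypothetical review bucket."""
    if not 0 <= score <= 100:
        raise ValueError("score must be between 0 and 100")
    if score > urgent_above:
        # Candidates for urgent review (or, in future, automated handling).
        return "urgent"
    if score >= likely_from:
        return "likely"
    # Weak detections still need a human look.
    return "needs-review"


if __name__ == "__main__":
    for score in (96, 90, 20):
        print(score, triage(score))
```

Keeping the thresholds as parameters would let them be tuned against metasmoke feedback before any automation is enabled.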

Difficulty: Medium
Labels: enhancement, feature-request

Checklist:

  • Define rule weights and scoring system
  • Implement score calculation in detection pipeline
  • Display score in chat reports
  • Add support in metasmoke for showing scores
  • Document scoring system in README/wiki
