Skip to content

feat(cdn-logs-report): add daily agentic traffic mapping for allowlisted site#1978

Draft
akshaymagapu wants to merge 3 commits intomainfrom
cdn-logs-report-db
Draft

feat(cdn-logs-report): add daily agentic traffic mapping for allowlisted site#1978
akshaymagapu wants to merge 3 commits intomainfrom
cdn-logs-report-db

Conversation

@akshaymagapu
Copy link
Contributor

Summary

Add a site-specific daily agentic_traffic export path for cdn-logs-report while keeping existing weekly report generation unchanged.

This implements daily processing only for allowlisted site ID:

  • 9ae8877a-bbf3-407d-9adb-d6a72ce3c5e3

What Changed

  • Added daily SQL query template:

    • src/cdn-logs-report/sql/agentic-traffic-daily-report.sql
    • Uses full URL granularity (no <10 hits => Other bucketing)
    • Simplified to a single aggregation layer
  • Added daily query builder support:

    • src/cdn-logs-report/utils/query-builder.js
    • createAgenticDailyReportQuery() now filters exact single day (year/month/day)
  • Added mapper for target agentic_traffic payload:

    • src/cdn-logs-report/utils/agentic-traffic-mapper.js
    • Maps Athena rows to schema-like objects
    • Includes citability enrichment in dimensions
    • Simplified content_type classification to XML | PDF | HTML | TXT | OTHER
  • Added shared provider inference helper:

    • src/common/user-agent-classification.js
    • inferProviderFromUserAgent() reused by mapper for platform
  • Extended handler with daily branch:

    • src/cdn-logs-report/handler.js
    • Runs daily export only for allowlisted site ID
    • Keeps weekly flow unchanged
    • Includes TODO delivery placeholders for S3 and SQS
    • Daily branch failures are isolated (do not fail weekly reporting)

Tests

  • Added new mapper tests:

    • test/audits/cdn-logs-report/agentic-traffic-mapper.test.js
  • Updated existing tests:

    • test/audits/cdn-logs-report/query-builder.test.js
    • test/audits/cdn-logs-report/handler.test.js
  • Verified locally (targeted):

    • npx mocha -i -g 'Post-Deploy' --spec=test/audits/cdn-logs-report/agentic-traffic-mapper.test.js --spec=test/audits/cdn-logs-report/query-builder.test.js --spec=test/audits/cdn-logs-report/handler.test.js
    • Result: all passing

Follow-ups (TODO)

  • Implement actual S3 upload for mapped daily payload
  • Implement SQS message dispatch with S3 reference/payload metadata
  • Fill category_id once category API is available

@github-actions
Copy link

This PR will trigger a minor release when merged.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant