AI Product Analytics Event Schema Specification #6476
gitcommitshow
started this conversation in
General
Replies: 2 comments
-
|
Read this article for more details such as - implementation examples with code, a guide to protect user privacy using LLM-powered intent classification, and SQL queries & dashboard for an analytics system following this spec. |
Beta Was this translation helpful? Give feedback.
0 replies
-
|
Requesting your feedback on this specification. For a structured feedback discussion:
|
Beta Was this translation helpful? Give feedback.
0 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Uh oh!
There was an error while loading. Please reload this page.
Uh oh!
There was an error while loading. Please reload this page.
-
Version: 0.1.0-draft
Status: Request for Comments
Authors: Sumanth Puram
Created: October 2025
Comment Period: Oct 29, 2025 - Nov 29, 2025
Abstract
This specification defines a standardized event schema for tracking and analyzing AI-powered features including chatbots, assistants, and copilots. The schema provides three core events for capturing AI interactions and integration patterns for existing analytics infrastructure. This specification aims to establish industry-wide standards for AI product analytics that enable consistent measurement and cross-product comparison.
1. Introduction
1.1 Purpose
This specification standardizes the collection and structure of analytics events for AI-powered features to:
1.2 Scope
This specification covers:
This specification does NOT cover:
1.3 Terminology
2. Conformance
The key words "MUST", "MUST NOT", "REQUIRED", "SHALL", "SHALL NOT", "SHOULD", "SHOULD NOT", "RECOMMENDED", "MAY", and "OPTIONAL" in this document are to be interpreted as described in RFC 2119.
An implementation is conformant if it:
3. Core Event Schemas
3.1 Event:
ai_user_prompt_createdPurpose: Tracks when a user submits a prompt to an AI system.
Trigger: MUST be fired immediately after user submits their query.
3.1.1 Required Properties
conversation_id3.1.2 Core Properties
prompt_textinput_methodtext,voice,button3.2 Event:
ai_llm_response_receivedPurpose: Captures AI system responses and performance metrics.
Trigger: MUST be fired after receiving response from AI system.
3.2.1 Required Properties
conversation_idresponse_statussuccess,error,timeout3.2.2 Core Properties
response_textlatency_msmodel_usedcost3.2.3 Token Count Properties
token_counttoken_count.prompt_tokenstoken_count.completion_tokenstoken_count.total_tokens3.3 Event:
ai_user_actionPurpose: Tracks user interactions with AI responses.
Trigger: MUST be fired when user performs an action on an AI response.
3.3.1 Required Properties
conversation_idaction_type3.3.2 Action Types
feedback_givensharedreportedcopied_responseregenerated3.3.3 Action Details
action_detailsFor
action_type: feedback_given:action_details.feedback_typerating,thumbsaction_details.feedback_valueaction_details.feedback_text4. Conversation Management
4.1 Conversation ID Generation
conversation_idMUST be unique within the implementing system4.2 Conversation Lifecycle
4.3 Session vs Conversation
conversation_id5. Data Types and Formats
5.1 String Encoding
All string values MUST use UTF-8 encoding.
5.2 Numeric Values
6. Privacy Considerations
6.1 Personal Data
6.2 Data Retention
7. Backwards Compatibility
Future versions of this specification will:
8. Acknowledgments
This specification was developed with input from Sumanth Puram, Dileep, Pradeep, and multiple RudderStack customers (will be listing their names soon).
9. References
Appendix A: Change Log
Version 0.1.0-draft (Current)
Appendix B: Frequently Asked Questions
B.1 How does this relate to OpenTelemetry?
This specification focuses on business-level analytics events while OpenTelemetry handles operational telemetry. They are complementary and can be used together.
B.2 Can I extend the schema with custom properties?
Yes, implementations MAY add custom properties beyond the required and core fields. Custom properties SHOULD use a naming convention that avoids conflicts with future specification versions.
B.3 What about real-time streaming use cases?
The schema is designed to work with both batch and streaming systems. The same event structures can be published to streaming platforms like Kafka or Kinesis.
How to Provide Feedback
We welcome feedback on this specification. Please submit comments via:
When providing feedback, please reference the specific section number and include concrete suggestions for improvement.
Review Checklist
When reviewing this specification, please consider:
Disclaimer
This specification is provided "as is" without warranty of any kind.
Beta Was this translation helpful? Give feedback.
All reactions