Skip to content

Conversation

@munna-shaik-s1
Copy link
Collaborator

Jira Link: https://sentinelone.atlassian.net/browse/MAPPS-4702

🥅 Goal

Need to fix the data loss which is happening on the Dataset query input.

Issue

After performing some debugging on the application in my Splunk box, I discovered that data loss was occurring due to improper handling of the checkpoint, start time, and end time in the app. In the previous application code, there was a validation checking if the splunk_dt was greater than the checkpoint_date. However, some events had the same timestamp, and since the previous design treated the timestamp as the primary key, this caused data loss when events shared the same timestamp.

🛠️ Solution

To address the issue and prevent data loss, I updated the checkpointing logic. The application now performs proper validations and checkpointing, ensuring that events with the same timestamp are handled correctly without causing data loss.

🏫 Testing

Tested the changes on my Splunk box by validating the payload being sent with the API call to ensure accuracy and proper handling. Additionally, I verified that the event count matches correctly.
image
image
image
image

Copy link

@mcgrail-s1 mcgrail-s1 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@munna-shaik-s1 munna-shaik-s1 merged commit 88002ca into main Feb 12, 2025
3 checks passed
@munna-shaik-s1 munna-shaik-s1 deleted the MAPPS-4702-Dataset_query_fix branch February 12, 2025 11:38
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants