-
Notifications
You must be signed in to change notification settings - Fork 4.5k
feat(source-mixpanel): update to latest cdk, set up concurrency #55189
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. Weβll occasionally send you account related emails.
Already on GitHub? Sign in to your account
feat(source-mixpanel): update to latest cdk, set up concurrency #55189
Conversation
The latest updates on your projects. Learn more about Vercel for Git βοΈ
|
/format-fix
|
Regression tests:test_catalog_are_the_same [failed] Record count TestDataIntegrity.test_record_schema_match_with_state |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
If I misunderstood the solution, please ping me. Else, I'm good with this change! Thanks for doing the last mile on this connector β€οΈ
|
||
request_params["from_date"] = from_date_value.format("YYYY-MM-DD") | ||
request_params["to_date"] = to_date_value.format("YYYY-MM-DD") | ||
request_params["where"] = f'properties["$time"]>=datetime({time_value})' |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Just to make sure I understand the current solution: We don't have slices of 30 days anymore. This is fine because it was slow and the very low rate limiting anyway, we can't be concurrent and there is risk is doing one query per 30 days instead of one query per page
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
yes, we don't have slices of 30 days, previous query with slices and initial state can't be implemented using concurrent cdk. now we have one slice, it should be faster (we don't need to wait 1 min between requests) and uses less requests(better rate limiting).
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
defined export as StateDelegatingStream
, so for the full refresh read we have 30 days slices, which is more safe for the first read.
/format-fix |
/format-fix
|
/format-fix
|
What
resolved: https://github.com/airbytehq/airbyte-internal-issues/issues/11795
How
Review guide
User Impact
Can this PR be safely reverted and rolled back?