Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(low-code cdk): add streams limit to full resolve manifest command #455

Merged

Conversation

lazebnyi
Copy link
Contributor

@lazebnyi lazebnyi commented Apr 1, 2025

Details

Fixed: https://github.com/airbytehq/airbyte-internal-issues/issues/12275

Summary by CodeRabbit

  • New Features

    • Introduced a configurable limit for dynamic stream generation (default value: 100), ensuring that only the allowed number of streams is generated during processing.
  • Refactor

    • Updated the stream resolution logic to produce a cleaner manifest output by removing superfluous entries and aligning the stream handling with the new limit parameter.
    • Renamed the TestReadLimits class to TestLimits for consistency across the codebase.

@github-actions github-actions bot added the enhancement New feature or request label Apr 1, 2025
@lazebnyi
Copy link
Contributor Author

lazebnyi commented Apr 1, 2025

/autofix

Auto-Fix Job Info

This job attempts to auto-fix any linting or formating issues. If any fixes are made,
those changes will be automatically committed and pushed back to the PR.

Note: This job can only be run by maintainers. On PRs from forks, this command requires
that the PR author has enabled the Allow edits from maintainers option.

PR auto-fix job started... Check job output.

✅ Changes applied successfully.

Copy link
Contributor

coderabbitai bot commented Apr 1, 2025

📝 Walkthrough

Walkthrough

The pull request renames the TestReadLimits class to TestLimits and introduces a new attribute max_streams with a default value of 100. Function signatures in connector_builder_handler.py (such as get_limits, create_source, read_stream, and full_resolve_manifest) now accept the updated type and enforce stream limits accordingly. The main file updates import statements and function parameters to accommodate TestLimits, while unit tests adjust object instantiation and manifest configurations by removing unnecessary entries and including the new limits setting.

Changes

File(s) Change Summary
airbyte_cdk/connector_builder/connector_builder_handler.py Renamed TestReadLimits to TestLimits and added a new attribute max_streams (default=100). Updated functions (get_limits, create_source, read_stream, full_resolve_manifest) to use TestLimits and apply stream limits.
airbyte_cdk/connector_builder/main.py Updated import and function signature in handle_connector_builder_request to use TestLimits instead of TestReadLimits. Modified the call to full_resolve_manifest to pass the new limits parameter.
unit_tests/connector_builder/test_connector_builder_handler.py Replaced references to TestReadLimits with TestLimits. Updated tests to initialize using TestLimits, added __test_read_config: {"max_streams": 2} to configurations, and removed extraneous dynamic stream entries.

Sequence Diagram(s)

sequenceDiagram
    participant C as ConnectorBuilderHandler
    participant L as TestLimits
    participant S as Source
    participant M as ManifestResolver

    C->>L: get_limits(config)
    L-->>C: return TestLimits(max_streams)
    C->>S: create_source(config, limits)
    S->>S: read_stream(..., limits)
    C->>M: full_resolve_manifest(source, limits)
    M->>L: Validate stream count using max_streams
    M-->>C: return AirbyteMessage with resolved manifest
Loading

Possibly related PRs

Suggested reviewers

  • maxi297
  • artem1205

How does this look to you? Any adjustments you'd like to make?


📜 Recent review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 90ca21f and 6a96feb.

📒 Files selected for processing (1)
  • airbyte_cdk/connector_builder/connector_builder_handler.py (4 hunks)
🚧 Files skipped from review as they are similar to previous changes (1)
  • airbyte_cdk/connector_builder/connector_builder_handler.py
⏰ Context from checks skipped due to timeout of 90000ms (9)
  • GitHub Check: Check: 'source-pokeapi' (skip=false)
  • GitHub Check: Check: 'source-amplitude' (skip=false)
  • GitHub Check: Check: 'source-shopify' (skip=false)
  • GitHub Check: Check: 'source-hardcoded-records' (skip=false)
  • GitHub Check: SDM Docker Image Build
  • GitHub Check: Pytest (All, Python 3.11, Ubuntu)
  • GitHub Check: Pytest (Fast)
  • GitHub Check: Pytest (All, Python 3.10, Ubuntu)
  • GitHub Check: Analyze (python)

🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

‼️ IMPORTANT
Auto-reply has been disabled for this repository in the CodeRabbit settings. The CodeRabbit bot will not respond to your replies unless it is explicitly tagged.

  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai plan to trigger planning for file edits and PR creation.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 1

🔭 Outside diff range comments (1)
airbyte_cdk/connector_builder/connector_builder_handler.py (1)

1-162: ⚠️ Potential issue

Fix formatting issue flagged by the pipeline

The pipeline failure indicates a formatting issue with this file that needs to be fixed.

According to the error message, you need to run 'poetry run ruff format' to fix the code style issues in this file.

🧰 Tools
🪛 GitHub Actions: Linters

[error] 1-1: Ruff formatting check failed. 1 file would be reformatted. Please run 'poetry run ruff format' to fix code style issues in this file.

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between b9f84d5 and 7963144.

📒 Files selected for processing (3)
  • airbyte_cdk/connector_builder/connector_builder_handler.py (4 hunks)
  • airbyte_cdk/connector_builder/main.py (3 hunks)
  • unit_tests/connector_builder/test_connector_builder_handler.py (14 hunks)
🧰 Additional context used
🧬 Code Definitions (2)
airbyte_cdk/connector_builder/connector_builder_handler.py (4)
airbyte_cdk/sources/declarative/manifest_declarative_source.py (4)
  • ManifestDeclarativeSource (60-436)
  • resolved_manifest (113-114)
  • streams (144-169)
  • dynamic_streams (121-124)
unit_tests/connector_builder/test_connector_builder_handler.py (2)
  • resolved_manifest (678-679)
  • streams (810-811)
airbyte_cdk/sources/declarative/concurrent_declarative_source.py (1)
  • streams (179-188)
unit_tests/sources/mock_server_tests/mock_source_fixture.py (1)
  • streams (406-413)
unit_tests/connector_builder/test_connector_builder_handler.py (1)
airbyte_cdk/connector_builder/connector_builder_handler.py (1)
  • TestLimits (39-43)
🪛 GitHub Actions: Linters
airbyte_cdk/connector_builder/connector_builder_handler.py

[error] 1-1: Ruff formatting check failed. 1 file would be reformatted. Please run 'poetry run ruff format' to fix code style issues in this file.

⏰ Context from checks skipped due to timeout of 90000ms (9)
  • GitHub Check: Check: 'source-pokeapi' (skip=false)
  • GitHub Check: Check: 'source-amplitude' (skip=false)
  • GitHub Check: Check: 'source-shopify' (skip=false)
  • GitHub Check: Check: 'source-hardcoded-records' (skip=false)
  • GitHub Check: Pytest (All, Python 3.11, Ubuntu)
  • GitHub Check: Pytest (All, Python 3.10, Ubuntu)
  • GitHub Check: SDM Docker Image Build
  • GitHub Check: Pytest (Fast)
  • GitHub Check: Analyze (python)
🔇 Additional comments (12)
airbyte_cdk/connector_builder/main.py (3)

13-13: Updated import to use TestLimits instead of TestReadLimits

The import has been updated to use the new TestLimits class instead of the previous TestReadLimits. This aligns with the renaming of the class in the connector_builder_handler.py file.


76-76: Updated parameter type to TestLimits

The parameter type in the function signature has been updated to use TestLimits, which is consistent with the renamed class.


86-86: Added limits parameter to full_resolve_manifest call

The function call has been updated to pass the limits parameter to the full_resolve_manifest function, which enables the stream limit functionality.

unit_tests/connector_builder/test_connector_builder_handler.py (4)

23-23: Updated import to use TestLimits instead of TestReadLimits

The import statement has been updated to reference the renamed class.


387-387: Added max_streams configuration to test config

The test configuration now includes the max_streams parameter with a value of 2, which is used to test the new stream limiting functionality.


528-528: Updated all instances of TestReadLimits to TestLimits

All object instantiations have been updated to use the renamed class. This ensures that the tests properly exercise the new functionality.

Also applies to: 732-732, 793-793, 829-829, 869-869, 986-986, 1068-1068, 1115-1115, 1199-1199, 1285-1285, 1343-1343


1371-1371: Using TestLimits with max_streams in full_resolve_manifest test

The test for full_resolve_manifest now explicitly sets max_streams to 2, ensuring that the new stream limiting functionality is tested properly.

airbyte_cdk/connector_builder/connector_builder_handler.py (5)

7-7: Added Dict to imports

Added Dict to the imports to properly type-hint the mapped_streams dictionary in the full_resolve_manifest function.


30-30: Added constants for maximum streams

Added constants to define the default maximum number of streams and the corresponding configuration key. This provides consistent values throughout the codebase.

Also applies to: 35-35


39-43: Renamed class to TestLimits and added max_streams field

The TestReadLimits class has been renamed to TestLimits and now includes a max_streams field with a default value of 100. This is a core part of the PR changes.


57-57: Updated function signatures to use TestLimits

The function signatures for create_source and read_stream have been updated to use the TestLimits type, which is consistent with the class rename.

Also applies to: 78-78


124-142: Implemented stream limiting in full_resolve_manifest

The full_resolve_manifest function now includes logic to limit the number of streams generated for each dynamic stream according to the max_streams setting. This is the core functionality being added in this PR.

The implementation:

  1. Creates a dictionary to track generated streams by dynamic stream name
  2. Restricts the number of generated streams to the configured limit
  3. Extends the streams list with the limited number of generated streams

@lazebnyi lazebnyi requested a review from maxi297 April 1, 2025 12:54
Copy link
Contributor

@maxi297 maxi297 left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Just one question but if this is what we want to achieve, I'm approving this PR

@lazebnyi lazebnyi merged commit 81fd920 into main Apr 1, 2025
26 checks passed
@lazebnyi lazebnyi deleted the lazebnyi/add-limit-streams-to-full-resolve-manifest-command branch April 1, 2025 14:16
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants