Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

feat(low-code cdk): add dynamic stream name to stream model #441

Closed

Conversation

lazebnyi
Copy link
Contributor

@lazebnyi lazebnyi commented Mar 21, 2025

Fixed: https://github.com/airbytehq/airbyte-internal-issues/issues/12171

Summary by CodeRabbit

  • New Features

    • Added dynamic stream naming support to enhance configuration flexibility and enable distinct stream identification.
    • Updated schema definitions to include new optional properties for assigning dynamic stream names.
  • Tests

    • Enhanced test coverage to verify proper dynamic stream name assignment and validate default behaviors.

@Copilot Copilot bot review requested due to automatic review settings March 21, 2025 23:26
@github-actions github-actions bot added the enhancement New feature or request label Mar 21, 2025
@lazebnyi
Copy link
Contributor Author

lazebnyi commented Mar 21, 2025

/autofix

Auto-Fix Job Info

This job attempts to auto-fix any linting or formating issues. If any fixes are made,
those changes will be automatically committed and pushed back to the PR.

Note: This job can only be run by maintainers. On PRs from forks, this command requires
that the PR author has enabled the Allow edits from maintainers option.

PR auto-fix job started... Check job output.

✅ Changes applied successfully.

Copy link
Contributor

coderabbitai bot commented Mar 21, 2025

📝 Walkthrough

Walkthrough

This pull request introduces new properties to support dynamic stream naming. Changes were made to the declarative schema definitions in both the YAML and Python model files by adding a string property (dynamic_stream_name in DeclarativeStream and name in DynamicDeclarativeStream). The manifest source file was enhanced to include a new key in its stream configurations, using an index-based default in dynamic streams via enumerate. Unit tests were updated to assert the presence and correctness of these dynamic stream names.

Changes

File(s) Change Summary
airbyte_cdk/sources/declarative/declarative_component_schema.yaml
airbyte_cdk/sources/declarative/models/declarative_component_schema.py
Added new properties for dynamic stream naming: dynamic_stream_name (in DeclarativeStream) and name (in DynamicDeclarativeStream) with default values and descriptive metadata.
airbyte_cdk/sources/declarative/manifest_declarative_source.py Updated stream configuration logic to include dynamic_stream_name; modified _dynamic_stream_configs to use enumerate for assigning index-based defaults.
unit_tests/sources/declarative/resolvers/test_http_components_resolver.py
unit_tests/sources/declarative/test_concurrent_declarative_source.py
Enhanced unit tests by adding assertions that check for correct dynamic stream names in the configurations.

Sequence Diagram(s)

sequenceDiagram
  participant DS as DeclarativeSource
  participant SC as Stream Configs
  participant DD as Dynamic Definitions

  DS->>SC: Invoke _stream_configs
  SC-->>DS: Return stream configs with dynamic_stream_name = None
  
  DS->>DD: Invoke _dynamic_stream_configs (using enumerate)
  Note right of DD: For each dynamic definition,<br>assign dynamic_stream_name (either provided or default: "dynamic_stream_i")
  DD-->>DS: Return updated dynamic stream configs
Loading

Possibly related PRs

Suggested reviewers

  • maxi297: Would you mind reviewing the dynamic stream naming changes and updated control flow in the manifest configuration? wdyt?
  • darynaishchenko: Could you please take a look at the new schema properties and their impact on stream processing? wdyt?
✨ Finishing Touches
  • 📝 Generate Docstrings

Thanks for using CodeRabbit! It's free for OSS, and your support helps us grow. If you like it, consider giving us a shout-out.

❤️ Share
🪧 Tips

Chat

There are 3 ways to chat with CodeRabbit:

‼️ IMPORTANT
Auto-reply has been disabled for this repository in the CodeRabbit settings. The CodeRabbit bot will not respond to your replies unless it is explicitly tagged.

  • Files and specific lines of code (under the "Files changed" tab): Tag @coderabbitai in a new review comment at the desired location with your query. Examples:
    • @coderabbitai generate unit testing code for this file.
    • @coderabbitai modularize this function.
  • PR comments: Tag @coderabbitai in a new PR comment to ask questions about the PR branch. For the best results, please provide a very specific query, as very limited context is provided in this mode. Examples:
    • @coderabbitai gather interesting stats about this repository and render them as a table. Additionally, render a pie chart showing the language distribution in the codebase.
    • @coderabbitai read src/utils.ts and generate unit testing code.
    • @coderabbitai read the files in the src/scheduler package and generate a class diagram using mermaid and a README in the markdown format.
    • @coderabbitai help me debug CodeRabbit configuration file.

Note: Be mindful of the bot's finite context window. It's strongly recommended to break down tasks such as reading entire modules into smaller chunks. For a focused discussion, use review comments to chat about specific files and their changes, instead of using the PR comments.

CodeRabbit Commands (Invoked using PR comments)

  • @coderabbitai pause to pause the reviews on a PR.
  • @coderabbitai resume to resume the paused reviews.
  • @coderabbitai review to trigger an incremental review. This is useful when automatic reviews are disabled for the repository.
  • @coderabbitai full review to do a full review from scratch and review all the files again.
  • @coderabbitai summary to regenerate the summary of the PR.
  • @coderabbitai generate docstrings to generate docstrings for this PR.
  • @coderabbitai resolve resolve all the CodeRabbit review comments.
  • @coderabbitai configuration to show the current CodeRabbit configuration for the repository.
  • @coderabbitai help to get help.

Other keywords and placeholders

  • Add @coderabbitai ignore anywhere in the PR description to prevent this PR from being reviewed.
  • Add @coderabbitai summary to generate the high-level summary at a specific location in the PR description.
  • Add @coderabbitai anywhere in the PR title to generate the title automatically.

CodeRabbit Configuration File (.coderabbit.yaml)

  • You can programmatically configure CodeRabbit by adding a .coderabbit.yaml file to the root of your repository.
  • Please see the configuration documentation for more information.
  • If your editor has YAML language server enabled, you can add the path at the top of this file to enable auto-completion and validation: # yaml-language-server: $schema=https://coderabbit.ai/integrations/schema.v2.json

Documentation and Community

  • Visit our Documentation for detailed information on how to use CodeRabbit.
  • Join our Discord Community to get help, request features, and share feedback.
  • Follow us on X/Twitter for updates and announcements.

Copy link
Contributor

@coderabbitai coderabbitai bot left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Actionable comments posted: 0

🧹 Nitpick comments (5)
airbyte_cdk/sources/declarative/declarative_component_schema.yaml (1)

1430-1437: Review Comment: New dynamic_stream_name Property in DeclarativeStream

You've added the dynamic_stream_name property with a string type, a default of None, and an example value of "Tables". This addition fits the PR objective of supporting dynamic stream naming nicely. Would you consider reviewing if a default of None (versus, say, an explicit empty string or YAML null) is consistent with how similar properties (like name in DynamicDeclarativeStream) are handled in our schema? wdyt?

airbyte_cdk/sources/declarative/models/declarative_component_schema.py (3)

2052-2057: Is "None" the right default as a string?

The default value for dynamic_stream_name is set as the string "None" rather than the Python None value. This might cause unexpected behavior if code elsewhere checks for this field being None using equality (is None or == None). Would changing this to actual None make more sense, wdyt?

dynamic_stream_name: Optional[str] = Field(
-    "None",
+    None,
    description="The dynamic stream name that create current stream, if None is static stream.",
    example=["Tables"],
    title="Dynamic Stream Name",
)

2052-2057: Consider improving the description grammar

The description has a minor grammatical issue. Would you mind reviewing it for clarity?

dynamic_stream_name: Optional[str] = Field(
    "None",
    description="The dynamic stream name that create current stream, if None is static stream.",
-    example=["Tables"],
+    examples=["Tables"],
    title="Dynamic Stream Name",
)

I noticed example is used instead of examples (which is used in other similar fields throughout the file). Also, the description could be clearer about what this field represents.


2498-2500: Consistency in examples vs example

For consistency with the rest of the file, should this be examples instead of example?

name: Optional[str] = Field(
    "", description="The dynamic stream name.", 
-    example=["Tables"], 
+    examples=["Tables"],
    title="Name"
)
airbyte_cdk/sources/declarative/manifest_declarative_source.py (1)

397-399: Consider consistent naming patterns

The fallback pattern dynamic_stream_{dynamic_definition_index} works, but would a more descriptive pattern be better? Perhaps including some identifier from the stream_template if available?

For example: {template_name}_{dynamic_definition_index} might give more context about what kind of stream this is. Just a thought for consideration!

📜 Review details

Configuration used: CodeRabbit UI
Review profile: CHILL
Plan: Pro

📥 Commits

Reviewing files that changed from the base of the PR and between 837913f and 5dd84b6.

📒 Files selected for processing (5)
  • airbyte_cdk/sources/declarative/declarative_component_schema.yaml (2 hunks)
  • airbyte_cdk/sources/declarative/manifest_declarative_source.py (3 hunks)
  • airbyte_cdk/sources/declarative/models/declarative_component_schema.py (2 hunks)
  • unit_tests/sources/declarative/resolvers/test_http_components_resolver.py (3 hunks)
  • unit_tests/sources/declarative/test_concurrent_declarative_source.py (1 hunks)
⏰ Context from checks skipped due to timeout of 90000ms (5)
  • GitHub Check: Check: 'source-amplitude' (skip=false)
  • GitHub Check: Check: 'source-shopify' (skip=false)
  • GitHub Check: Check: 'source-hardcoded-records' (skip=false)
  • GitHub Check: Pytest (All, Python 3.11, Ubuntu)
  • GitHub Check: Pytest (All, Python 3.10, Ubuntu)
🔇 Additional comments (8)
unit_tests/sources/declarative/test_concurrent_declarative_source.py (1)

712-714: The new assertion verifies that dynamic_stream_name is None by default

Good addition! This test ensures that every stream in the configuration has the dynamic_stream_name property initialized to None by default, which establishes a clear baseline behavior for the property.

unit_tests/sources/declarative/resolvers/test_http_components_resolver.py (3)

60-60: Nice addition of the name property for dynamic streams

Adding the name property to the DynamicDeclarativeStream configuration in the test manifest helps establish the expected behavior for dynamic stream naming. This is essential for testing the new functionality.


491-493: Good test coverage for the dynamic stream name

This assertion verifies that the dynamic stream name is correctly propagated from the manifest to the stream configuration. I like how you're validating that the value matches "TestDynamicStream" exactly as defined in the manifest.


579-581: Nice verification for index-based dynamic stream names

This assertion checks that dynamic stream names follow the expected naming convention when not explicitly provided in the manifest. The assertion for "dynamic_stream_0" suggests an index-based default naming strategy, which provides good backwards compatibility.

airbyte_cdk/sources/declarative/declarative_component_schema.yaml (1)

3776-3782: Review Comment: Added name Property in DynamicDeclarativeStream

The new name property for DynamicDeclarativeStream has been introduced with type string, a default of an empty string, and an example "Tables". This change aligns well with the dynamic stream naming feature outlined in the PR. Do you feel that the use of an empty string here (versus using a similar default as in DeclarativeStream) provides the clear distinction you intended between static and dynamic streams? wdyt?

airbyte_cdk/sources/declarative/manifest_declarative_source.py (3)

346-346: Good addition of dynamic_stream_name to static streams!

Setting dynamic_stream_name to None for static streams is a clean approach to differentiate them from dynamic streams. This will help with stream identification and make the codebase more maintainable.


358-358: Good use of enumerate for index tracking

Using enumerate() is a clean way to get both the index and value in Python. This change allows you to create default names for dynamic streams based on their position in the configuration.


397-399: Nice fallback mechanism for dynamic stream naming

I like how you've implemented a fallback mechanism that uses the explicit name if provided, or generates a default using the index. This ensures every dynamic stream has a unique identifier even if names aren't explicitly provided.

@lazebnyi
Copy link
Contributor Author

will be added in #442

@lazebnyi lazebnyi closed this Mar 26, 2025
@lazebnyi lazebnyi reopened this Apr 1, 2025
@lazebnyi lazebnyi closed this Apr 1, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

Successfully merging this pull request may close these issues.

1 participant