[Issue #185] Support mapping-based transformations #193
Conversation
Adds methods to serialize and deserialize schemas using a mapping config
Switches to using `Self` so that IntelliSense type hints display the correct model
```yaml
push:
  paths:
    - 'lib/python-sdk/**'
    - '.github/workflows/ci-python-sdk.yml'
pull_request:
  paths:
    - 'lib/python-sdk/**'
    - '.github/workflows/ci-python-sdk.yml'
```
Removes the `push` trigger because it was causing duplicate runs of the CI checks on each PR. Not a big deal, but HHS admins recently asked us to try to reduce GitHub Actions usage wherever we can.
```json
{
  "python.analysis.typeCheckingMode": "basic"
}
```
Enables Python type checking when using VSCode
```diff
 @classmethod
-def from_json(cls, json_str: str) -> "CommonGrantsBaseModel":
+def from_json(cls, json_str: str) -> Self:
```
This improves the intellisense output, so that the class method displays the name of the sub-class being instantiated (if it inherits from this base model) instead of always displaying "CommonGrantsBaseModel".
```python
DEFAULT_HANDLERS: dict[str, handle_func] = {
    "field": handle_field,
    "const": handle_const,
    "match": handle_match,
}
```
Right now we just define a base set of reserved words and their transformation functions, but this pattern makes it really easy to extend the mapping with additional keywords.
```python
mapping: dict,
depth: int = 0,
max_depth: int = 500,
handlers: dict[str, handle_func] = DEFAULT_HANDLERS,
```
Allows library users to provide their own custom handlers if they'd like to extend the base set.
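A hypothetical sketch of that extension point. The `handle_func` alias and the `field`/`const` handlers mirror the diff above, but their bodies and the custom `concat` handler are invented for illustration:

```python
from typing import Any, Callable

# A handler receives the full source data plus the current mapping node.
handle_func = Callable[[dict, dict], Any]


def handle_const(data: dict, node: dict) -> Any:
    # Emit the literal value stored under "const".
    return node["const"]


def handle_field(data: dict, node: dict) -> Any:
    # Look up a dotted path like "opportunity.title" in the source data.
    value: Any = data
    for part in node["field"].split("."):
        value = value[part]
    return value


DEFAULT_HANDLERS: dict[str, handle_func] = {
    "field": handle_field,
    "const": handle_const,
}


def handle_concat(data: dict, node: dict) -> str:
    # Custom handler (not in the PR): join several "field" lookups.
    return " ".join(handle_field(data, {"field": f}) for f in node["concat"])


# Library users can merge their own keywords into the defaults:
custom_handlers = {**DEFAULT_HANDLERS, "concat": handle_concat}
```

Passing `custom_handlers` as the `handlers` argument would then make `concat` a recognized reserved word alongside the built-in ones.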
Overall LGTM. I left a small number of suggestions and questions.
Consider adding a README to explain usage of `transform_from_mapping()`.
```python
    Returns:
        A new dictionary containing the transformed data according to the mapping
    """
    if depth > max_depth:
```
What is the purpose of enforcing a max depth? I agree 500 is an absurd depth, but curious what the intent is. Fear of runaway recursion?
I added a note explaining max depth here: docs(py-sdk): Add note explaining max_depth
It's mainly a check to avoid exceeding Python's default recursion limit (1000), since this transformation function might be running on third-party (untrusted) mapping inputs.
In a future iteration I might consider refactoring this so we're using a loop instead of recursion and stress testing how depth is incremented.
```python
for k in node:
    if k in handlers:
        return handlers[k](data, node)
return {k: transform_node(v, depth + 1) for k, v in node.items()}
```
There's a lot going on in these 4 lines! Maybe add a brief comment to explain what's happening.
Good call, I added some comments and example to show what happens at each step in this commit: refactor(py-sdk): Adds docs to transformation functions
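An annotated sketch of the dispatch logic in those four lines; the parameter list and handler signature are assumptions based on the surrounding diff:

```python
def transform_node(node, data: dict, handlers: dict, depth: int = 0):
    if isinstance(node, dict):
        # Reserved-word check: if any key of this node is a handler keyword
        # ("field", "const", "match"), hand the entire node to that handler
        # and stop descending.
        for k in node:
            if k in handlers:
                return handlers[k](data, node)
        # Plain mapping: rebuild it key by key, transforming each value
        # one level deeper.
        return {k: transform_node(v, data, handlers, depth + 1) for k, v in node.items()}
    # Non-dict leaves pass through unchanged.
    return node


# Toy handlers standing in for handle_field / handle_const:
toy_handlers = {
    "field": lambda data, node: data[node["field"]],
    "const": lambda data, node: node["const"],
}
```

For example, `{"name": {"field": "title"}}` recurses into the outer dict, then dispatches the inner node to the `field` handler.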
- Uses more descriptive names for each handler function
- Changes how the known keys for handler functions are fetched
- Adds an example to the docstrings
- Adds more code comments to the recursive `transform_node()` step
The transformation function already handles literal values, so an extra reserved word just adds confusion and makes the mapping more verbose.
Looks good. Nice work!
Summary
Adds support for mapping-based transformations to the Python SDK.
Changes proposed
- Adds a `transform_from_mapping()` function that transforms arbitrary JSON from one format to another using a mapping format that serves as a kind of lightweight DSL.
- Adds `validate_with_mapping()` to the `CommonGrantsBaseModel` to deserialize a dictionary after transforming it with a mapping.
- Adds `dump_with_mapping()` to `CommonGrantsBaseModel` to serialize and transform a model to a Python dictionary.
- Removes the `push` trigger from the `ci-python-sdk.yaml` to avoid duplicate runs of the CI checks on each PR.

Context for reviewers
`python-sdk`: `cd lib/python-sdk/`
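The two new model methods listed under "Changes proposed" can be sketched end to end. This is a hypothetical, dependency-free stand-in (the real SDK model and `transform_from_mapping()` signature may differ); it shows the intended round trip of transform-then-validate and its reverse:

```python
def transform_from_mapping(data: dict, mapping: dict) -> dict:
    # Minimal stand-in for the real function: "field" copies a top-level
    # key from the source, "const" emits a literal value.
    out = {}
    for key, node in mapping.items():
        if "field" in node:
            out[key] = data[node["field"]]
        elif "const" in node:
            out[key] = node["const"]
    return out


class CommonGrantsBaseModel:
    def __init__(self, **data):
        self.__dict__.update(data)

    @classmethod
    def validate_with_mapping(cls, data: dict, mapping: dict):
        # Transform the raw dictionary first, then deserialize the result.
        return cls(**transform_from_mapping(data, mapping))

    def dump_with_mapping(self, mapping: dict) -> dict:
        # Serialize the model to a dict, then transform that dict.
        return transform_from_mapping(self.__dict__, mapping)
```

For instance, `Opportunity.validate_with_mapping(raw, mapping)` would map a third-party payload into the model's shape before validation, and `dump_with_mapping()` would do the opposite on the way out.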
Additional information