[FEATURE] Extend `make_job` to run `SparkPythonTask` #60
Conversation
Needs #58 to let CI pass.

This PR breaks backwards compatibility for databrickslabs/blueprint downstream. See build logs for more details. Running from downstreams #61.

This PR breaks backwards compatibility for databrickslabs/lsql downstream. See build logs for more details. Running from downstreams #61.

✅ 37/37 passed, 3 skipped, 8m38s total. Running from acceptance #102.
lgtm
* Documentation: fix `make_query()` parameter name ([#61](#61)). The `make_query()` fixture's documentation has been corrected to rename the `query` parameter to `sql_query`. The `sql_query` parameter specifies the SQL query stored in the fixture, with the default value "SELECT \* FROM <newly created random table>". Correcting the parameter name makes the fixture's documentation clearer and more consistent for its users.
* Removed references to UCX ([#56](#56)). This release removes UCX references from fixture names and descriptions in the testing process. The `create` function in `catalog.py` now appends a random string to `dummy_t`, `dummy_s`, or `dummy_c` for table, schema, and catalog names respectively, instead of using `ucx_t`, `ucx_s`, and `ucx_c`. The `test_catalog_fixture` function has been updated to replace `dummy` with `dummy_c` and `dummy_s` for catalogs and schemas, the description of a test query in `redash.py` no longer mentions UCX, and fixture names in the catalog unit tests use `dummy` instead of `ucx`. These changes decouple the testing process from technology-specific references without affecting functionality.
* Store watchdog tags in storage credentials comment ([#57](#57)). The watchdog previously removed all storage credentials without discrimination; it now retains properly tagged credentials when deleting. The `make_storage_credential` fixture gains a new `watchdog_remove_after` parameter, which specifies the time at which the storage credential should be removed by the watchdog; the `create` function accepts this parameter and adds it as a comment on the storage credential, while the `remove` function is unchanged. The `watchdog_remove_after` fixture has been documented in the README's related-fixtures section. This change was co-authored by Eric Vergnaud, but note that it has not been tested yet.
* [FEATURE] Extend `make_job` to run `SparkPythonTask` ([#60](#60)). The `make_job` fixture now supports running `SparkPythonTask` in addition to notebook tasks, and a new `make_workspace_file` fixture creates and manages Python files in the workspace. `make_job` also supports SQL notebooks and files, and gains a `task_type` parameter to specify the type of task to run and an `instance_pool_id` parameter to reuse an instance pool for faster job execution during integration tests. The `make_notebook` fixture now accepts a `content` parameter for creating notebooks with custom content, and the `Language` enum from `databricks.sdk.service.workspace` specifies the language of a notebook or workspace file. Unit and integration tests cover the new and modified fixtures. A usage sketch follows this list.
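For illustration, a minimal sketch of how two of the fixtures above might be used in a test. The parameter names (`sql_query`, `content`, `language`) come from the release notes; the return values and asserted attributes are assumptions, not verified API:

```python
# Sketch only: parameter names come from the release notes above; return
# types and attributes are assumptions.
from databricks.sdk.service.workspace import Language


def test_query_uses_renamed_parameter(make_query):
    # `sql_query` replaces the previously documented `query` parameter name.
    query = make_query(sql_query="SELECT 1 AS one")
    assert query.id is not None  # assumed attribute on the created query


def test_notebook_with_custom_content(make_notebook):
    # `make_notebook` now accepts `content`; the `Language` enum is assumed
    # to select the notebook language, per the notes above.
    notebook_path = make_notebook(content=b"print(1)", language=Language.PYTHON)
    assert notebook_path is not None
```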
…nt` workflow (#2815)

## Changes

Scope the resources that the assessment workflow runs over in integration tests, to shorten the time the assessment takes to run:

[Screenshot 2024-10-04 at 17 45 52: assessment workflow run times]

- Move the populate-for-linting logic to the context
- Set `include_job_ids` to the job ids created by the populate-for-linting logic
- Set `include_dashboard_ids` to the dashboard ids created by the populate-for-linting logic
- Move the create-job-from-Python-file fixture to [pytester](databrickslabs/pytester#60)

### Linked issues

Resolves #2637
Resolves #2849

### Tests

- [x] modified integration tests:
  - `test_running_real_assessment_job`
  - `test_running_real_assessment_job_ext_hms`
  - `test_running_real_migration_progress_job`
## Changes

This PR updates the minimum required version of `pytester` from 0.2.1 to 0.3.0. As of #2852 our integration tests depend on changes introduced in databrickslabs/pytester#60 (released with 0.3.0).
Changes

Extend `make_job` to run `SparkPythonTask` by:

- adding a `make_workspace_file` fixture to create the Python file to run;
- adding `task_type` to `make_job` to signal what type of task to run (notebook or Python file);
- adding `instance_pool_id` to `make_job` to speed up a job in integration tests by reusing an instance pool.

Tests
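Below is a hedged sketch of tests exercising the new parameters. `SparkPythonTask` is imported from the Databricks SDK as the release notes suggest; the pool id and the asserted attributes are illustrative assumptions, not values from this PR:

```python
# Sketch only: `task_type` and `instance_pool_id` are the parameters added by
# this PR; concrete values and asserted attributes are assumptions.
from databricks.sdk.service.jobs import SparkPythonTask


def test_make_workspace_file(make_workspace_file):
    # New fixture: creates (and cleans up) a Python file in the workspace;
    # the returned object is assumed to be path-like.
    workspace_file = make_workspace_file(content="print('Hello, world!')")
    assert workspace_file is not None


def test_make_job_with_spark_python_task(make_job):
    # `task_type=SparkPythonTask` selects a Python-file task instead of the
    # default notebook task; `instance_pool_id` reuses an existing instance
    # pool so the job's cluster starts faster (the id below is made up).
    job = make_job(
        task_type=SparkPythonTask,
        instance_pool_id="0923-164208-pool-abc123",
    )
    assert job.job_id is not None
```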