
Commit 86d8f2c

Add medium, large job types to GX job (#154)
* upgrade to v2 of ecr login action
* Revert "upgrade to v2 of ecr login action"
  This reverts commit bcfef0e.
* Add medium, large job types to GX job
1 parent 8901de2 commit 86d8f2c

1 file changed, 45 additions and 3 deletions

templates/glue-job-run-great-expectations-on-parquet.j2

Lines changed: 45 additions & 3 deletions
@@ -43,16 +43,45 @@ Parameters:
   DefaultWorkerType:
     Type: String
     Description: >-
-      Which worker type to use for this job.
+      Which worker type to use for most data types
     Default: 'Standard'

+  MediumJobWorkerType:
+    Type: String
+    Description: >-
+      Which worker type to use for this job.
+      Medium data types include: HealthKitV2Samples, HealthKitV2Electrocardiogram,
+      FitbitDailyData, FitbitSleepLogs
+    Default: 'G.4X'
+
+  LargeJobWorkerType:
+    Type: String
+    Description: >-
+      Which worker type to use for this job.
+      Large data types include: FitbitIntradayCombined
+    Default: 'G.8X'
+
   DefaultNumberOfWorkers:
     Type: Number
     Description: >-
-      How many DPUs to allot to this job. This parameter is not used for types
-      FitbitIntradayCombined and HealthKitV2Samples.
+      How many DPUs to allot for most data types.
     Default: 1

+  MediumJobNumberOfWorkers:
+    Type: Number
+    Description: >-
+      How many DPUs to allot to this job.
+      Medium data types include: HealthKitV2Samples, HealthKitV2Electrocardiogram,
+      FitbitDailyData, FitbitSleepLogs
+    Default: 4
+
+  LargeJobNumberOfWorkers:
+    Type: Number
+    Description: >-
+      How many DPUs to allot to this job.
+      Large data types include: FitbitIntradayCombined
+    Default: 8
+
   ExpectationSuiteKey:
     Type: String
     Description: The S3 key of the GX expectation file.
@@ -115,8 +144,21 @@ Resources:
       GlueVersion: !Ref GlueVersion
       MaxRetries: !Ref MaxRetries
       Name: !Sub "${Namespace}-{{ dataset["stackname_prefix"] }}-GreatExpectationsParquetJob"
+      {% if dataset["type"] == "FitbitIntradayCombined" -%}
+      WorkerType: !Ref LargeJobWorkerType
+      NumberOfWorkers: !Ref LargeJobNumberOfWorkers
+      {% elif (
+        dataset["type"] == "HealthKitV2Samples"
+        or dataset["type"] == "HealthKitV2Electrocardiogram"
+        or dataset["type"] == "FitbitDailyData"
+        or dataset["type"] == "FitbitSleepLogs"
+      ) -%}
+      WorkerType: !Ref MediumJobWorkerType
+      NumberOfWorkers: !Ref MediumJobNumberOfWorkers
+      {% else -%}
       WorkerType: !Ref DefaultWorkerType
       NumberOfWorkers: !Ref DefaultNumberOfWorkers
+      {%- endif %}
       Role: !Ref JobRole
       Timeout: !Ref TimeoutInMinutes
   {% endfor %}
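
The branching added in the second hunk amounts to a three-tier lookup on `dataset["type"]`. The following is a minimal Python sketch of that selection, meant only as a reading aid. The names `select_job_size` and `default_capacity` are hypothetical and do not appear in the template. The worker types and counts are the parameter defaults from the first hunk, which a deployment may override.

```python
# Hypothetical mirror of the Jinja branching in the template; not part of the commit.
LARGE_TYPES = frozenset({"FitbitIntradayCombined"})
MEDIUM_TYPES = frozenset({
    "HealthKitV2Samples",
    "HealthKitV2Electrocardiogram",
    "FitbitDailyData",
    "FitbitSleepLogs",
})

# Parameter defaults copied from the diff: tier -> (WorkerType, NumberOfWorkers).
# Actual values can differ if the stack overrides these parameters.
PARAMETER_DEFAULTS = {
    "Large": ("G.8X", 8),
    "Medium": ("G.4X", 4),
    "Default": ("Standard", 1),
}


def select_job_size(dataset_type: str) -> str:
    """Return the tier the template would pick for a dataset type."""
    if dataset_type in LARGE_TYPES:
        return "Large"
    if dataset_type in MEDIUM_TYPES:
        return "Medium"
    # Every other dataset type falls through to the default parameters.
    return "Default"


def default_capacity(dataset_type: str) -> tuple:
    """Return (WorkerType, NumberOfWorkers) under the parameter defaults."""
    return PARAMETER_DEFAULTS[select_job_size(dataset_type)]
```

Any dataset type not listed in either set, including types added later, lands in the default tier, which matches the `{% else -%}` branch of the template.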
