Skip to content

Create configuration for where current_elections_parquet is written to #22

@GeoWill

Description

@GeoWill

At the moment the final product of the current_elections stack is only defined as input to the outcodes lambda (

"dest_bucket_name": pollingstations_private_data.bucket_name,
"dest_path": f"addressbase/{self.dc_environment}/current_elections_parquet",
)

This means if we wanted another step which checked the data we have written (eg the single source of addressbase, or #21 ) then we have to copy paste the output. I have a feeling glue can't handle the level of partitioning involved, so I'm not sure it makes sense to have it as a GlueTable. So probably want to define a new model and create it based on whatever that is.

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions