[Feature/Addition] Adding Avro support to Google BQ/GCS Connectors #1370

@matthewkrausse

Wanted to start a discussion here...

According to the BigQuery documentation, Avro is the preferred format for loading data into BigQuery from GCS. It seems like we could improve load times by using this format instead of .csv files.

A toavro() function already exists in the petl library, so we could just expose that as well.

Looking at the code, it seems like we could add a flag to the bq.copy() method for the preferred file format, defaulting to the current behavior so that this isn't a breaking change. I'd like to hear whether folks have any thoughts on this and whether it is a feasible project.
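
To make the flag idea concrete, here is a rough sketch of how a format option with a CSV default might be resolved before staging a file in GCS. Everything here is hypothetical: `plan_load`, `LoadPlan`, and the `file_format` parameter are made-up names for discussion, not the existing Parsons API. The only outside fact assumed is that BigQuery load jobs identify these formats as `CSV` and `AVRO`.

```python
from dataclasses import dataclass

# Hypothetical mapping from a user-facing flag value to the source format
# name a BigQuery load job expects. Not part of Parsons today.
SOURCE_FORMATS = {"csv": "CSV", "avro": "AVRO"}


@dataclass
class LoadPlan:
    """How a table would be staged in GCS and loaded (illustrative only)."""

    extension: str       # suffix for the temporary file uploaded to GCS
    source_format: str   # source format name passed to the load job


def plan_load(file_format: str = "csv") -> LoadPlan:
    """Resolve the hypothetical file_format flag.

    The default stays "csv", so callers that never pass the flag
    would keep the current behavior.
    """
    key = file_format.lower()
    if key not in SOURCE_FORMATS:
        supported = ", ".join(sorted(SOURCE_FORMATS))
        raise ValueError(
            f"Unsupported file_format {file_format!r}; expected one of: {supported}"
        )
    return LoadPlan(extension=f".{key}", source_format=SOURCE_FORMATS[key])
```

With something like this, `plan_load()` keeps today's CSV path and `plan_load("avro")` opts in to Avro, so existing calls to the copy method would not need to change.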

Labels

enhancement: Impact - something should be added to or changed about Parsons that isn't causing a current breakage
