This repository was archived by the owner on Sep 8, 2025. It is now read-only.

@ambrussimon
Collaborator

Extend features to also cover TUM needs in a generic way, and update with current gear best practices.

Breaks

  • input and config items changed

New features

  • encoding configurable (auto-detect by default)
  • delimiter configurable (auto-detect by default)
  • match column can be auto-detected if missing (first header matching re.search(r'id|subject'))
  • CircleCI config (could someone please either advise me on further steps or enable it for the repo?)
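
For readers without access to the diff, the three auto-detection features listed above could look roughly like the sketch below. This is not the code from this pull request: the function names, the candidate encodings, the candidate delimiters, and the case-insensitive header match are all assumptions made for illustration. Only the `id|subject` pattern comes from the description.

```python
import csv
import re


def decode_csv_bytes(raw, encodings=("utf-8-sig", "latin-1")):
    """Return (text, encoding) using the first candidate that decodes cleanly.

    Hypothetical helper: latin-1 maps every byte, so it acts as a last resort.
    """
    for encoding in encodings:
        try:
            return raw.decode(encoding), encoding
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings could decode the input")


def detect_delimiter(text, candidates=",;\t|", default=","):
    """Guess the delimiter with csv.Sniffer, falling back to a default."""
    try:
        return csv.Sniffer().sniff(text, delimiters=candidates).delimiter
    except csv.Error:
        return default


def detect_match_column(headers, pattern=r"id|subject"):
    """Return the first header matching the pattern, or None if none does.

    Case-insensitive matching is an assumption, not stated in the PR.
    """
    for header in headers:
        if re.search(pattern, header, re.IGNORECASE):
            return header
    return None


# Example input: semicolon-delimited, latin-1 encoded bytes.
raw = "subject_id;age;city\n001;34;München\n002;29;Garching\n".encode("latin-1")
text, encoding = decode_csv_bytes(raw)
delimiter = detect_delimiter(text)
headers = next(csv.reader(text.splitlines(), delimiter=delimiter))
match_column = detect_match_column(headers)
```

Note that a bare `id` pattern also matches headers such as `width`, so an explicit match column in the config is probably still the safer choice when the CSV has many columns.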

@ambrussimon ambrussimon requested review from ehlertjd and hkethi002 May 23, 2019 13:58
@hkethi002 hkethi002 left a comment

It may be better to log an explicit message in the job log saying that the input is an unsupported file type when someone supplies a non-CSV file.

# Read CSV bytes
filepath = context.get_input_path('subject_csv')
filename = os.path.basename(filepath)
with open(filepath, mode='rb') as subject_csv:

Do we want a check here to make sure the file extension is .csv?
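
One possible shape for such a check, combined with the explicit job-log message suggested earlier, is sketched below. The helper name `require_csv_extension`, the `allowed` parameter, and the message wording are hypothetical and not part of the gear; the sketch only relies on the standard `os.path` and `logging` modules.

```python
import logging
import os

log = logging.getLogger(__name__)


def require_csv_extension(filepath, allowed=(".csv",)):
    """Return the file name, or log and raise if the extension is not allowed.

    Hypothetical helper: the comparison is case-insensitive, so 'data.CSV' passes.
    """
    filename = os.path.basename(filepath)
    extension = os.path.splitext(filename)[1].lower()
    if extension not in allowed:
        message = (
            f"Unsupported input file type {extension!r} for {filename!r}; "
            f"expected one of {allowed}"
        )
        log.error(message)
        raise ValueError(message)
    return filename
```

Called right after `context.get_input_path('subject_csv')`, this would fail fast with a readable message before any bytes are read. Whether to reject by extension at all is a design choice, since a correctly formatted CSV can carry a `.txt` or `.tsv` name.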

3 participants