Skip to content
Discussion options

You must be logged in to vote

You basically have two options

  1. Running the project locally: You can modify config and code as you wish and run the project in your machine.
    • 1_acquire.py will update raw data in data/raw in your local
    • 2_prepare.py will re-create prepared csv in data/prep
  2. Contribute you changes to the main project: Create a PR and have your changes merged to the main project. Acquiring and preparation scripts run on schedule every week and commit any new data to S3, data.world and Kaggle.
    • This the option I explained in my previous comment.

In general, I'd recommend option 2, because then you can benefit from the automation that updates the data weekly and simply consume the updated data from data.wo…

Replies: 2 comments 2 replies

Comment options

You must be logged in to vote
1 reply
@srdjov18
Comment options

Comment options

You must be logged in to vote
1 reply
@srdjov18
Comment options

Answer selected by srdjov18
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
Q&A
2 participants