Skip to content
Discussion options

You must be logged in to vote

Limits on history

Although in theory it would be possible to extract the data for all historical seasons (as long as it is accessible in Transfermakt), the scraper scope is limited to a few recent seasons (see #12). The two main reasons are

  • Scraping a full new season is a significant effort, particularly the data validation step. Historical data tends to be less accurate, and it's harder to keep the dataset consistent as we add in older seasons
  • Most of the value of the data tends to be on the most recent seasons anyways, so it's also more "economic" to focus on those

Also, although less importantly, data.world does not support datasets larger than 100MB in its free tier, and with the cu…

Replies: 1 comment

Comment options

dcaribou
Jun 14, 2023
Maintainer Author

You must be logged in to vote
0 replies
Answer selected by dcaribou
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Category
FAQ
Labels
1 participant