Description
Hi,
I'm learning FinRL and i'm on the second notebook of the roadmap. I'm really enjoying it but my issue started with the data download of the 30 tickers.
At first there was a api rate limit coming from yfinance, guess Yahoo Finance detects when you're downloading a lot of data (from 2008 to 2025), the thing is i had to apply a change in that library, a ratelimiter applying 1 second. The issue? It's been executing for 18 hours and still going (probably will reach 60 hours, hopefully not) just for downloading the data. If i didn't apply that change to stop that, i would be getting a realy low number of rows in the df, like 3k or at best 30k, way less than nearly 100k in the tutorial.
Also had to apply a %%capture to the cell because it was lagging the tab browser a lot and ended up crashing a couple of times from all the printing progress bar for each day.
I'm having a blast but damn, am i doing something wrong? The tutorial seemed pretty straightforward and looks like there has been some changes there since the image shows way less printing than what i was getting.
A warning telling this notebook takes some time to execute or adding a ratelimiter as a parameter for the user would be great inside of FinRL, even though is not a direct issue, it can affect users.
Tell me if this is a bad approach, does the api rate limit make the data download go to a bad shape in the df or just pauses every x seconds? At this point i don't want to stop the download.