What happened:
Running the bag.ipynb example several times on the same machine eventually fails at the first Transform and Save code cell. The .to_textfiles() call causes a JSONDecodeError due to an unterminated string. The line/column varies, but the error is consistent.
What you expected to happen:
The example should run without error.
Minimal Complete Verifiable Example:
bag.ipynb example. I need to run it top-to-bottom 3-4 times before the error appears. The error persists after restarting the kernel, I suspect it's related to the buildup of files in the data directory.
Anything else we need to know?:
Environment:
- Dask version: 2022.1.1
- Python version: 3.9
- Operating System: Ubuntu
- Install method (conda, pip, source): Conda