I am using AWS Glue for processing data coming from Kinesis.
We currently use the library for executing jobs that, in other words, have an end where the library deletes the shuffles, but jobs such as streaming have no end and the library does not proceed to clean up these shuffle files that have come to occupy 1TB of S3 storage.
Is there any way to force the shuffle files cleaning in a point in the script or automatically clean the shuffles for each batch?