Assess how we ended up with text file document records for the Comoros

I have two text files in both my local database and my S3 storage. I suspect these records were created before the extraction workflow reached maturity, but I am not sure. To find out, I should re-run the workflow for just this publication and see whether the problem recurs. Either way, the workflow should refuse to download non-PDF files.
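One possible guard is to inspect the first bytes of each file before storing it, rather than trusting the URL or file extension. The sketch below is illustrative only: `looks_like_pdf` and `PDF_MAGIC` are hypothetical names, not part of the existing workflow, and it assumes the workflow can read the start of the response body before committing to a download. It searches the first 1024 bytes for the `%PDF-` signature instead of requiring it at offset 0, since some PDFs carry a few leading bytes; tighten this to `startswith` if that leniency is unwanted.

```python
# Hypothetical helper, not taken from the existing workflow.
PDF_MAGIC = b"%PDF-"


def looks_like_pdf(head: bytes, window: int = 1024) -> bool:
    """Return True if the PDF signature appears within the first `window` bytes."""
    return PDF_MAGIC in head[:window]


if __name__ == "__main__":
    # Example inputs are made up for illustration.
    samples = {
        "report.pdf": b"%PDF-1.7\n%\xe2\xe3\xcf\xd3\n",
        "notes.txt": b"Plain text report\n",
    }
    for name, head in samples.items():
        action = "store" if looks_like_pdf(head) else "skip"
        print(f"{name}: {action}")
```

The same check could also be run over the existing records to confirm that the two text files are the only non-PDF documents.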