Would it be possible to reduce the size of the image?
The current image size causes several issues for us:
- We can't use unstructured-api in GitHub Actions because the virtual machine's disk fills up.
- The attack surface for this image is very large.
- Deployments are slow.
The main cause seems to be that the image includes the whole of LibreOffice.
Here are the results from dive (https://github.com/wagoodman/dive):
