Is your feature request related to a problem? Please describe.
We have an ImageQueryTool that gives an agent the ability to describe an image. As more models get multi-modal, it'd be nice to have a general purpose FileQueryTool that understands how to handle more artifact types - text, image, audio, video. Then we could simply use that tool and it would choose the appropriate loader