Prediction mode

### Search before asking

- [X] I have searched the Multimodal Maestro [issues](https://github.com/roboflow/multimodal-maestro/issues) and found no similar feature requests.


### Description

Currently, it is only possible to finetune models. But it would be amazing to have **predict** mode where user can utilize base model functionality without much efforts. Something like florence-2 supports different basic tasks such as image caption, object detection, classification, etc. 

### Use case

- Image caption mode can easily be used to finetune flux for text2image.


### Additional

_No response_

### Are you willing to submit a PR?

- [X] Yes I'd like to help by submitting a PR!

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Prediction mode #77

Search before asking

Description

Use case

Additional

Are you willing to submit a PR?

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Prediction mode #77

Description

Search before asking

Description

Use case

Additional

Are you willing to submit a PR?

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions