🚀 The feature
Add a method to the VisionDataset which make it possible to use each dataset also for recognition task.
Todo would be crop each box as image with there plain label and all exsisting chars as vocab.
A must have would be to merge multible datasets
@fg-mindee
wdyt ?
Motivation, pitch
Provide datasets also for recognition would be a big benefit for training part and i think a very good way to validate the results
Alternatives
No response
Additional context
No response
🚀 The feature
Add a method to the VisionDataset which make it possible to use each dataset also for recognition task.
Todo would be crop each box as image with there plain label and all exsisting chars as vocab.
A must have would be to merge multible datasets
@fg-mindee
wdyt ?
Motivation, pitch
Provide datasets also for recognition would be a big benefit for training part and i think a very good way to validate the results
Alternatives
No response
Additional context
No response