Change the repository type filter
All
Repositories list
28 repositories
- One-for-All Multimodal Evaluation Toolkit Across Text, Image, Video, and Audio Tasks
- A simple, unified multimodal models training engine. Lean, flexible, and built for hacking at scale.
- Holistic Evaluation of Multimodal LLMs on Spatial Intelligence
NEO
Publicsglang
Publicmultimodal-search-r1
PublicDiffSynth-Studio
Publiclean-runner
PublicMGPO
PublicDeepEyes
PublicAero-1
PublicEgoLife
PublicLongVA
PublicOtter
Public🦦 Otter, a multi-modal model based on OpenFlamingo (open-sourced version of DeepMind's Flamingo), trained on MIMIC-IT and showcasing improved instruction-following and in-context learning ability.RelateAnything
Public