Change the repository type filter
All
Repositories list
56 repositories
VLM2Vec
PublicThis repo contains the code for "VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks" [ICLR 2025]ImagenWorld
PublicStress-Testing Image Generation Models with Explainable Human Evaluation on Open-ended Real-World Tasks [ICLR 2026]verl-tool
PublicOpenResearcher
PublicOpenResearcher: A Fully Open Pipeline for Long-Horizon Deep Research Trajectory SynthesisVisPhyWorld
PublicMMLU-Pro
PublicThe code and data for "MMLU-Pro: A More Robust and Challenging Multi-Task Language Understanding Benchmark" [NeurIPS 2024]EditReward
PublicEditReward: A Human-Aligned Reward Model for Instruction-Guided Image Editing [ICLR 2026]VisualWebInstruct
Public- The official code of "VisCoder2: Building Multi-Language Visualization Coding Agents" [ICLR26]
BrowserAgent
PublicStructEval
PublicEvaluating LLMs' abilities to generate structural output [TMLR2025]Mantis
PublicVideoScore2
PublicVideoScore
Publicofficial repo for "VideoScore: Building Automatic Metrics to Simulate Fine-grained Human Feedback for Video Generation" [EMNLP2024]ImagenHub
PublicA one-stop library to standardize the inference and evaluation of all the conditional image generation models. [ICLR 2024]General-Reasoner
PublicPixel-Reasoner
PublicPixel-Level Reasoning Model trained with RL [NeuIPS25]QuickCodec
PublicQuickVideo
PublicQuick Long Video Understanding [TMLR2025]Hierarchical-Reasoner
PublicCritique-Coder
PublicVideoEval-Pro
PublicMore reliable Video Understanding EvaluationVisCoder
PublicThe official code of "VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation" [EMNLP25]PixelWorld
PublicOne-Shot-CFT
PublicABC
PublicABC: Achieving Better Control of Multimodal Embeddings using VLMs [TMLR2025]Vamba
PublicTheoremExplainAgent
PublicOfficial Repo for "TheoremExplainAgent: Towards Video-based Multimodal Explanations for LLM Theorem Understanding" [ACL 2025 oral]CritiqueFineTuning
PublicCode for "Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate" [COLM 2025]