Skip to content
Change the repository type filter

All

    Repositories list

    • EarthWhere

      Public
      Python
      01400Updated Nov 15, 2025Nov 15, 2025
    • [ML4H'25] MedVLThinker: Simple Baselines for Multimodal Medical Reasoning
      Jupyter Notebook
      23800Updated Nov 1, 2025Nov 1, 2025
    • MedVLSynther

      Public
      MedVLSynther: Synthesizing High-Quality Visual Question Answering from Medical Documents with Generator-Verifier LMMs
      Python
      0700Updated Nov 1, 2025Nov 1, 2025
    • JavaScript
      0000Updated Oct 29, 2025Oct 29, 2025
    • MeDiM

      Public
      Python
      02010Updated Oct 23, 2025Oct 23, 2025
    • [TMLR 25] SFT or RL? An Early Investigation into Training R1-Like Reasoning Large Vision-Language Models
      Python
      114230Updated Oct 10, 2025Oct 10, 2025
    • [ICCV 2025] OpenVision: A Fully-Open, Cost-Effective Family of Advanced Vision Encoders for Multimodal Learning
      Python
      2040650Updated Sep 14, 2025Sep 14, 2025
    • [ICLR 2025] This is the official repository of our paper "MedTrinity-25M: A Large-scale Multimodal Dataset with Multigranular Annotations for Medicine“
      Python
      27385100Updated Jul 11, 2025Jul 11, 2025
    • MedReason

      Public
      MedReason: Eliciting Factual Medical Reasoning Steps in LLMs via Knowledge Graphs
      Python
      1923710Updated Jun 19, 2025Jun 19, 2025
    • AttnGCG-attack

      Public
      Python
      42050Updated Jun 17, 2025Jun 17, 2025
    • Official repo of Knowledge or Reasoning? A Close Look at How LLMs Think Across Domains.
      Python
      64100Updated Jun 6, 2025Jun 6, 2025
    • Complex-Edit

      Public
      Complex-Edit: CoT-Like Instruction Generation for Complexity-Controllable Image Editing Benchmark
      Python
      12610Updated Apr 22, 2025Apr 22, 2025
    • CLIPS

      Public
      An Enhanced CLIP Framework for Learning with Synthetic Captions
      Python
      13730Updated Apr 18, 2025Apr 18, 2025
    • m1

      Public
      [ML4H'25] m1: Unleash the Potential of Test-Time Scaling for Medical Reasoning in Large Language Models
      Jupyter Notebook
      34620Updated Apr 14, 2025Apr 14, 2025
    • STAR-1

      Public
      Python
      13200Updated Apr 7, 2025Apr 7, 2025
    • Python
      24810Updated Feb 26, 2025Feb 26, 2025
    • EpiFoundation

      Public
      Pytorch implementation of EpiFoundation
      Python
      02410Updated Feb 25, 2025Feb 25, 2025
    • A Training-free Iterative Framework for Long Story Visualization
      Python
      13293000Updated Jan 18, 2025Jan 18, 2025
    • JavaScript
      0000Updated Sep 24, 2024Sep 24, 2024
    • Python
      0700Updated Sep 4, 2024Sep 4, 2024
    • [ICML 2025] This is the official repository of our paper "What If We Recaption Billions of Web Images with LLaMA-3 ?"
      114390Updated Jun 13, 2024Jun 13, 2024
    • AQA-Bench

      Public
      Algorithmic-Q&A-Bench: An Interactive Benchmark for Evaluating LLMs’ Sequential Reasoning Ability
      Python
      0410Updated Jun 13, 2024Jun 13, 2024
    • CLIPA

      Public
      [NeurIPS 2023] This repository includes the official implementation of our paper "An Inverse Scaling Law for CLIP Training"
      Python
      1432010Updated Jun 3, 2024Jun 3, 2024
    • This repository includes the official implementation our paper "Scaling White-Box Transformers for Vision"
      Python
      14810Updated Jun 3, 2024Jun 3, 2024
    • [CVPR 2024] This repository includes the official implementation our paper "MicroDiffusion: Implicit Representation-Guided Diffusion for 3D Reconstruction from Limited 2D Microscopy Projections"
      Python
      15410Updated May 13, 2024May 13, 2024
    • FedConv

      Public
      [TMLR'24] This repository includes the official implementation our paper "FedConv: Enhancing Convolutional Neural Networks for Handling Data Heterogeneity in Federated Learning"
      Python
      02500Updated Apr 30, 2024Apr 30, 2024
    • EVP

      Public
      [TMLR'24] This repository includes the official implementation our paper "Unleashing the Power of Visual Prompting At the Pixel Level"
      Python
      54200Updated Apr 30, 2024Apr 30, 2024
    • AdvXL

      Public
      [CVPR 2024] This repository includes the official implementation our paper "Revisiting Adversarial Training at Scale"
      Python
      12030Updated Apr 21, 2024Apr 21, 2024
    • MixCon3D

      Public
      [CVPR 2024] The official implementation of paper "Sculpting Holistic 3D Representation in Contrastive Language-Image-3D Pre-training"
      Python
      33640Updated Apr 21, 2024Apr 21, 2024
    • HQ-Edit

      Public
      [ICLR 2025] HQ-Edit: A High-Quality and High-Coverage Dataset for General Image Editing
      Python
      411170Updated Apr 18, 2024Apr 18, 2024