Skip to content
View mihirp1998's full-sized avatar
  • Pittsburgh

Block or report mihirp1998

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Please don't include any personal information such as legal names or email addresses. Maximum 100 characters, markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse

Pinned Loading

  1. AlignProp AlignProp Public

    AlignProp uses direct reward backpropogation for the alignment of large-scale text-to-image diffusion models. Our method is 25x more sample and compute efficient than reinforcement learning methods…

    Python 282 10

  2. VADER VADER Public

    Video Diffusion Alignment via Reward Gradients. We improve a variety of video diffusion models such as VideoCrafter, OpenSora, ModelScope and StableVideoDiffusion by finetuning them using various r…

    Python 274 15

  3. alexanderswerdlow/unidisc alexanderswerdlow/unidisc Public

    UniDisc: A discrete diffusion model for joint multimodal generation, enabling controllable and efficient text-image synthesis, editing, and inpainting.

    Python 95 4

  4. Diffusion-TTA Diffusion-TTA Public

    Diffusion-TTA improves pre-trained discriminative models such as image classifiers or segmentors using pre-trained generative models.

    Python 72 5

  5. Slot-TTA Slot-TTA Public

    Slot-TTA shows that test-time adaptation using slot-centric models can improve image segmentation on out-of-distribution examples.

    Python 26 3

  6. huggingface/trl huggingface/trl Public

    Train transformer language models with reinforcement learning.

    Python 13.6k 1.9k