Skip to content

yczhou001/Awesome-Diffusion-LLM

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

43 Commits
 
 
 
 

Repository files navigation

Awesome Diffusion Large Language Models

Awesome License: MIT

This repository contains:

  • A regularly updated paper list for Diffusion Large Language Models.
  • A tutorial for Diffusion Large Language Models.
  • A nano code snippet for Diffusion Large Language Models.

Nano code

We have released a simple implementation in code folder. It contains two types of diffusion language models, presented as Jupyter notebooks:

  • continuous_diff.ipynb: This notebook demonstrates a Continuous Diffusion Language Model, illustrating how diffusion modeling and sampling operate in a continuous space, such as word vector space. It could be a start point for grasping the fundamental principles of continuous diffusion language models.
  • masked_diff.ipynb: This notebook implements a Mask-based Discrete Diffusion Language Model, which works in the discrete token space and supports text generation tasks. We draw inspiration from GUIDELINES.md of LLaDA for parts of our implementation. Thanks for their valuable contributions!

Playground

Product/ Blog / Technical Report

  • Gemini Diffusion blog
  • Mercury: Ultra-Fast Language Models Based on Diffusion tech report
  • Dream7B blog

Diffusion Language Model for Multimodal Understanding

Discrete Diffusion Model

Continuous-Time Diffusion Model

Application