Skip to content

Latest commit

 

History

History
116 lines (93 loc) · 11.4 KB

File metadata and controls

116 lines (93 loc) · 11.4 KB

Awesome Diffusion Large Language Models

Awesome License: MIT

This repository contains:

  • A regularly updated paper list for Diffusion Large Language Models.
  • A tutorial for Diffusion Large Language Models.
  • A nano code snippet for Diffusion Large Language Models.

Nano code

We have released a simple implementation in code folder. It contains two types of diffusion language models, presented as Jupyter notebooks:

  • continuous_diff.ipynb: This notebook demonstrates a Continuous Diffusion Language Model, illustrating how diffusion modeling and sampling operate in a continuous space, such as word vector space. It could be a start point for grasping the fundamental principles of continuous diffusion language models.
  • masked_diff.ipynb: This notebook implements a Mask-based Discrete Diffusion Language Model, which works in the discrete token space and supports text generation tasks. We draw inspiration from GUIDELINES.md of LLaDA for parts of our implementation. Thanks for their valuable contributions!

Playground

Product/ Blog / Technical Report

  • Gemini Diffusion blog
  • Mercury: Ultra-Fast Language Models Based on Diffusion tech report
  • Dream7B blog

Diffusion Language Model for Multimodal Understanding

Discrete Diffusion Model

Continuous-Time Diffusion Model

Application