Diffusion language models

Diffusion models have completely taken over generative modelling of perceptual signals — why is autoregression still the name of the game for language modelling? Can we do anything about that?

GPT-4 says:
Diffusion models have revolutionized generative modeling for perceptual signals like images, audio, and video. However, autoregression remains dominant in language modeling. This article explores the potential of diffusion models in language modeling, discussing the challenges and advantages of iterative refinement techniques. It also examines the use of continuous Gaussian diffusion for discrete data and the possibility of learning higher-level continuous representations for language modeling. While autoregression remains a tough baseline to beat, further exploration of diffusion models in language modeling could yield significant benefits.
Read more at Sander Dieleman…

