Google released DiffusionGemma, a 26B open model that generates entire text blocks in parallel using diffusion, not token-by-token prediction. It's 4x faster and worse. That's the interesting part.