DiffusionGemma hits 1,000 tokens per second by ditching word-by-word generation entirely. It just doesn't run on most people's machines yet.

Source: [Decrypt](https://decrypt.co/370706/google-new-open-model-generates-text-diffusiongemma)

Sponsored