Skip to content

ABC Tool

  • Home
  • About / Contect
    • PRIVACY POLICY
Google’s latest DiffusionGemma open AI model comes with a 4x speed boost

Google’s latest DiffusionGemma open AI model comes with a 4x speed boost

Posted on June 10, 2026 By safdargal12 No Comments on Google’s latest DiffusionGemma open AI model comes with a 4x speed boost
Blog


Another day, another AI model from Google. This time, Google DeepMind has released a new member of the Gemma 4 open model family, but it’s fundamentally different from the rest of the lineup. DiffusionGemma doesn’t generate outputs linearly like most AI models. Instead, it can produce an entire block of text in parallel. Google says this makes it faster and more efficient when running on local hardware like an Nvidia DGX or a humble gaming GPU.

Most AI models are designed to be autoregressive—they generate text left to right one token at a time. DiffusionGemma has more in common with image generation models, which start with static and then denoise it to create the desired content. This model takes a field of placeholder tokens running over the canvas multiple times to generate likely tokens and using those to improve estimation of others. At the end of the process, the model finalizes its token outputs in one large block—the “denoised” text canvas.

DiffusionGemma is fairly large in the realm of Google’s open models. It’s a Mixture of Experts (MoE) model with a total of 26 billion parameters, but only 3.8 billion are activated during inference. That means it should fit in the 18GB ram allotment of a high-end GPU. In testing with an RTX 5090, DiffusionGemma spits out around 700 tokens per second. With a single Nvidia H100 AI accelerator, DiffusionGemma can produce 1,000+ tokens per second. That’s about four times the output of the similarly sized autoregressive Gemma models.

This approach to text generation shifts the bottleneck from memory bandwidth to compute, generating up to 256 tokens in parallel. Google says this offers a measurable boost in non-linear tasks like in-line editing, molecular sequencing, and mathematical graphing. The animation above shows how DiffusionGemma was tuned to solve Sudoku puzzles, which is a notoriously challenging task for standard autoregressive AI models because each token depends on future tokens. DiffusionGemma’s ability to continuously self-correct large sets of tokens makes that easier.



Source link

Post Views: 3

Post navigation

❮ Previous Post: North Koreans behind nearly half of US tech industry hacks, says CrowdStrike
Next Post: My Eyes Love Logitech’s New Mobi Fold Mouse, My Hand a Little Less So ❯

You may also like

As Grok flounders, SpaceX bets future on beating Big Tech at AI
Blog
As Grok flounders, SpaceX bets future on beating Big Tech at AI
May 22, 2026
American Airlines Signs Up for Starlink Wi-Fi Service on Its Flights
Blog
American Airlines Signs Up for Starlink Wi-Fi Service on Its Flights
May 27, 2026
Microsoft reports expose AI’s cost problem: The tech is more expensive than paying human employees
Blog
Microsoft reports expose AI’s cost problem: The tech is more expensive than paying human employees
May 23, 2026
Prices of smartphones and mobile infrastructure rise due to heavy demand for memory chips
Blog
Prices of smartphones and mobile infrastructure rise due to heavy demand for memory chips
June 6, 2026

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Recent Posts

  • Today’s NYT Strands Hints, Answer and Help for June 11 #830- CNET
  • Today’s NYT Wordle Hints, Answer and Help for June 11 #1818
  • Xbox warns of a ‘reset’ as it prepares for layoffs
  • Raspberry Pi 5 – 16 GB RAM : Adafruit Industries, Unique & fun DIY electronics and kits
  • Apple Wallet Set to Get a Suite of New Features With iOS 27

Recent Comments

  1. Last Chance for Big Savings on TechCrunch Disrupt 2026 Tickets – Artiverse on 5 days left: Save up to $410 on Disrupt 2026 passes

Archives

  • June 2026
  • May 2026
  • April 2026

Categories

  • Blog

Copyright © 2026 ABC Tool.

Theme: Oceanly News by ScriptsTown