潜龙 QianLong · 中文 AI 内容与工具平台

Remember when generating a single high-quality AI image meant staring at a progress bar for what felt like an eternity, listening to your computer's cooling fans spin up like a jet engine? The race for speed and accessibility in generative AI has just hit a significant new milestone, courtesy of a quiet but powerful release from Google.

Google has officially open-sourced DiffusionGemma, a heavyweight 26-billion parameter image generation model. Released under the permissive Apache 2.0 license, this isn't just a restricted research preview—it's a fully open tool that developers can freely integrate, modify, and build upon.

For close observers of the AI space, the DNA of this new model might look familiar. It represents the triumphant return of research that began with an experimental "Gemini Diffusion" model briefly showcased last May. Even in its early experimental phase, testers noted its remarkable speed. Now, fully realized as DiffusionGemma, velocity remains its defining characteristic.

In practical terms, the performance is blistering. Early benchmarks from developers testing the model reveal that it can process well over 500 tokens per second. To put that into perspective, one developer tasked the AI with generating a highly specific and whimsical prompt: a "pelican riding a bicycle." The model delivered the completed image in a mere 4.4 seconds. In an industry where high-fidelity image generation can often take tens of seconds or even minutes on standard hardware, sub-five-second generation feels like a paradigm shift.

But a powerful, lightning-fast open-source model is only half the equation. The traditional bottleneck for everyday users and independent developers has always been hardware. Running a 26-billion parameter model locally requires serious, expensive GPU power.

This is where hardware giant NVIDIA has stepped in to complete the puzzle. NVIDIA is currently hosting DiffusionGemma for free on its NIM cloud API platform. By providing the heavy computational lifting on their end, NVIDIA has effectively removed the barrier to entry. Creators and developers don't need a massive server rack in their basement to experiment with Google's latest technology; they just need an internet connection.

The release of DiffusionGemma highlights a vital trend in the artificial intelligence landscape. While massive tech companies continue to build proprietary, walled-garden AI ecosystems, there is a parallel push to democratize access to foundational tools. By pairing Google's open-source ethos with NVIDIA's robust cloud infrastructure, the industry is turning cutting-edge generative AI from a luxury resource into a basic utility. When the friction of time and hardware costs is removed, the only limit left is the user's imagination.

Key Points

Google has released DiffusionGemma, a 26-billion parameter open-weight image generation model.
Licensed under Apache 2.0, the model is built on last year's Gemini Diffusion research.
It operates at blistering speeds, generating complex images in under five seconds.
NVIDIA is currently offering free access to the model via its NIM cloud API.

Why It Matters

By pairing a highly capable, open-source Google model with free NVIDIA cloud infrastructure, the AI industry is making top-tier creative tools accessible to anyone without requiring expensive hardware.

Sources:

DiffusionGemma — Simon Willison's Weblog

潛

本文完

潜龙编辑部 · 2026/6/13

Speed Meets Open Source: Inside Google's DiffusionGemma

Key Points

Why It Matters

更多专栏

Decoding the Beautiful Game: Soccer's AI Renaissance

The Playlist Police: Deezer Goes Rogue on AI Music

The Thirsty Cloud: Amazon's AI Data Centers Used 2.5 Billion Gallons of Water