Google’s new Gemma 4 12B model is designed to run on any laptop with 16GB of RAM

Posted on June 4, 2026 by safdargal12


Gemma 4 12B is almost as capable as the version with 26 billion parameters.

Credit: Google

Google says the new model is capable of complex multistep reasoning and agentic workflows that previously required the larger Gemma variants. Despite the smaller parameter count, Gemma 4 12B comes with the newly devised Multi-Token Prediction (MTP) drafters, which take advantage of unused processing cycles to calculate possible future tokens. The result is greater speed and efficiency. Google has released optional MTP versions of the other Gemma 4 models, but this is the first one to have MTP out of the box.
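To make the drafting idea concrete, here is a minimal sketch of greedy speculative decoding, the general technique that draft-and-verify schemes build on. It is not Google's MTP implementation: `target_next` and `draft_next` are hypothetical toy functions standing in for the full model and a cheap drafter, and a real system would verify all drafted tokens in one batched pass rather than in a loop.

```python
from typing import Callable, List

Token = int
NextToken = Callable[[List[Token]], Token]


def target_next(context: List[Token]) -> Token:
    """Toy stand-in for the full model (hypothetical rule, not a real LLM)."""
    return (context[-1] * 3 + 1) % 10


def draft_next(context: List[Token]) -> Token:
    """Toy stand-in for a cheap drafter: matches the target except after a 3."""
    if context[-1] == 3:
        return 5  # deliberate wrong guess so the rejection path is exercised
    return (context[-1] * 3 + 1) % 10


def greedy_generate(prompt: List[Token], target: NextToken, new_tokens: int) -> List[Token]:
    """Baseline: ask the target for one token at a time."""
    tokens = list(prompt)
    for _ in range(new_tokens):
        tokens.append(target(tokens))
    return tokens


def speculative_generate(
    prompt: List[Token],
    target: NextToken,
    drafter: NextToken,
    new_tokens: int,
    draft_len: int = 4,
) -> List[Token]:
    """Draft several tokens cheaply, then keep only what the target agrees with."""
    tokens = list(prompt)
    goal = len(prompt) + new_tokens
    while len(tokens) < goal:
        # Step 1: the drafter guesses draft_len future tokens.
        proposal: List[Token] = []
        for _ in range(draft_len):
            proposal.append(drafter(tokens + proposal))

        # Step 2: the target checks each guess in order. A wrong guess is
        # replaced by the target's own token and the rest of the draft is dropped.
        accepted: List[Token] = []
        for guess in proposal:
            truth = target(tokens + accepted)
            accepted.append(truth)
            if truth != guess:
                break
        tokens.extend(accepted)
    return tokens[:goal]


slow = greedy_generate([1], target_next, 12)
fast = speculative_generate([1], target_next, draft_next, 12)
print(fast == slow)
```

Because every kept token is one the target itself would have chosen, the output matches plain greedy decoding. Any speedup comes from how often the drafter guesses right, which this toy does not measure.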

Gemma 4 12B is also more efficient thanks to a new approach to multimodality. The Gemma 4 family is natively multimodal, accepting text, audio, or images as inputs. Most gen AI models—including the other Gemma 4 variants—use dedicated encoders to process non-text inputs and pass that data to the LLM. This works well enough, but it increases latency and memory usage.

With the new mid-weight model, Google has implemented a streamlined embedding module for vision that uses a single matrix multiplication plus a positional embedding, so the data reaches the LLM with proper spatial awareness. This eliminates the need for a bulky middleman encoder. For audio, there’s no encoding at all. The developers worked out a method of projecting the raw audio signal into the same vector space used for text tokens.
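As a rough illustration of what "one matrix multiplication plus a positional embedding" can look like, the sketch below cuts a tiny grayscale image into patches, projects each patch with a single matrix, and adds a per-patch position vector. The patch size, the embedding width, and every value in `projection` and `positions` are made up for the example. The article does not describe Gemma's actual shapes or weights.

```python
from typing import List

Matrix = List[List[float]]


def split_into_patches(image: Matrix, patch: int) -> Matrix:
    """Cut a 2D image into flattened patch x patch tiles, in row-major order.

    Assumes the image height and width are multiples of `patch`.
    """
    height, width = len(image), len(image[0])
    patches: Matrix = []
    for top in range(0, height, patch):
        for left in range(0, width, patch):
            tile = [image[top + r][left + c] for r in range(patch) for c in range(patch)]
            patches.append(tile)
    return patches


def embed_patches(patches: Matrix, projection: Matrix, positions: Matrix) -> Matrix:
    """Compute patches @ projection, then add one positional vector per patch."""
    width = len(projection[0])
    embedded: Matrix = []
    for index, tile in enumerate(patches):
        vector = [
            sum(pixel * projection[i][j] for i, pixel in enumerate(tile)) + positions[index][j]
            for j in range(width)
        ]
        embedded.append(vector)
    return embedded


# A 4x4 image with pixel values 0..15 (illustrative data only).
image = [[float(r * 4 + c) for c in range(4)] for r in range(4)]

# Hypothetical 4-pixel -> 3-dimensional projection and positional table.
projection = [
    [1.0, 0.0, 0.0],
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 1.0, 1.0],
]
positions = [[float(index)] * 3 for index in range(4)]

image_tokens = embed_patches(split_into_patches(image, 2), projection, positions)
print(len(image_tokens), len(image_tokens[0]))
```

Each row of `image_tokens` has the same width as a text-token embedding would in this toy setup, which is what lets the language model consume image patches without a separate encoder in between.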

If you want to check out the new Gemma 4 model, it’s accessible without a download via tools like LM Studio, Google AI Edge Gallery, and more. But the whole idea with Gemma 4 12B is that you can run it locally and on your own terms. If you’ve got the RAM, the model weights are available for download immediately on Kaggle and Hugging Face. It’s just shy of 18GB.
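For a sense of scale, the back-of-the-envelope sketch below estimates how much memory 12 billion weights occupy at a few common precisions. The precisions listed are generic examples, not a statement about how the downloadable Gemma files are stored, and the figures cover weights only, ignoring activations and any cache the runtime needs.

```python
PARAMETERS = 12e9  # "12B" taken from the model name

# Generic storage precisions, chosen for illustration.
BYTES_PER_WEIGHT = {"16-bit": 2.0, "8-bit": 1.0, "4-bit": 0.5}


def weight_size_gb(parameters: float, bytes_per_weight: float) -> float:
    """Return the raw weight footprint in gigabytes (1 GB = 1e9 bytes)."""
    return parameters * bytes_per_weight / 1e9


sizes = {name: weight_size_gb(PARAMETERS, width) for name, width in BYTES_PER_WEIGHT.items()}
for name, size in sizes.items():
    print(f"{name}: {size:.0f} GB")
```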


