Gemma 3n

Our powerful and efficient open model designed to run locally on phones, tablets, and laptops.

Gemma 3n was created in close collaboration with leading mobile hardware manufacturers. It shares architecture with the next generation of Gemini Nano to empower a new wave of intelligent, on-device applications.

Capabilities

Optimized on-device performance

Engineered for speed and quality, with a significantly reduced memory footprint.

Privacy-first, offline-ready

Enables developers to build intelligent, interactive features that respect user privacy and work reliably offline.

Multimodal understanding

Understands and processes audio, text, images, and videos, and is capable of both transcription and translation.

Dynamic resource usage

Features a 4B active memory footprint with a nested 2B active memory submodel, plus the ability to create additional submodels that trade quality against latency.

Build new on-the-go experiences

Live interactive applications

Create apps that understand and respond to real-time visual and audio cues from the user's environment.

Applications based on deep understanding

Build on combined audio, image, video, and text inputs, all processed privately on-device.

Advanced audio-centric applications

These include real-time speech transcription, translation, and rich voice-driven interactions.

Start building with Google’s APIs

Gemini API

Run Gemma with the Gemini API 

Google AI Edge

Run large language models (LLMs) completely on-device