Search

4 results

Clear filters
  • MAY 20, 2025 / AI Edge

    On-device small language models with multimodality, RAG, and Function Calling

    Google AI Edge advancements, include new Gemma 3 models, broader model support, and features like on-device RAG and Function Calling to enhance on-device generative AI capabilities.

    Google AI Edge: Small Language Models with Multimodality, RAG, and Function Calling
  • APRIL 18, 2025 / Gemma

    Gemma 3 QAT Models: Bringing state-of-the-Art AI to consumer GPUs

    The release of int4 quantized versions of Gemma 3 models, optimized with Quantization Aware Training (QAT) brings significantly reduced memory requirements, allowing users to run powerful models like Gemma 3 27B on consumer-grade GPUs such as the NVIDIA RTX 3090.

    Gemma 3 Quantization Aware - meta
  • MARCH 12, 2025 / Gemma

    Gemma 3 on mobile and web with Google AI Edge

    Gemma 3 1B, a new small language model for mobile and web applications via Google AI Edge, is now available, with increased efficiency, improved performance, and offline availability.

    Gemma 3 - Google AI Edge
  • MARCH 12, 2025 / Gemma

    Introducing Gemma 3: The Developer Guide

    Gemma 3 is a new, advanced version of the Gemma open-model family featuring multimodality, longer context windows, and improved language capabilities, with various sizes and deployment options for developers to experiment.

    Gemma 3