Google LiteRT AI runs on your phone and watch

Google's new LiteRT AI framework is now running on devices like the Pixel Watch and Chromebook Plus. This means AI features can work without an internet connection, making them faster and more private.

Google is intensifying its efforts to embed generative artificial intelligence (GenAI) directly onto consumer devices. The company is championing LiteRT, a framework billed as a "universal" solution for on-device machine learning. Its specific iteration for language models, LiteRT-LM, is being highlighted for enabling rapid, local processing of AI tasks, moving away from cloud dependency.

Blazing fast on-device GenAI with LiteRT-LM - Google Developers Blog - 1

LiteRT-LM is now powering GenAI features within Google's own Chrome browser, Chromebook Plus devices, and the Pixel Watch. This marks a significant step in making advanced AI capabilities accessible without constant network connection. The framework promises "blazing fast" performance, with claims of outperforming established alternatives like llama.cpp on various hardware configurations.

Blazing fast on-device GenAI with LiteRT-LM - Google Developers Blog - 2

FRAMEWORK UNDERPINNINGS

The LiteRT framework itself is presented as an evolution from Google's earlier TFLite, aiming for "high-performance deployment." It offers acceleration across multiple hardware types – including CPUs, GPUs, and dedicated Neural Processing Units (NPUs) found in modern chipsets. Companies like Qualcomm and MediaTek are explicitly mentioned as partners in this endeavor, integrating LiteRT to harness the power of their NPUs for on-device AI.

Blazing fast on-device GenAI with LiteRT-LM - Google Developers Blog - 3

LiteRT is designed to be cross-platform, supporting operating systems such as Android and iOS, alongside desktop and even Internet of Things (IoT) devices like the Raspberry Pi. The framework facilitates the deployment of popular open-weight GenAI models, including Google's own Gemma family, which offers various sizes suitable for different device capabilities.

Blazing fast on-device GenAI with LiteRT-LM - Google Developers Blog - 4

DEVELOPMENT AND DEPLOYMENT

For developers, Google provides tools and resources to convert existing PyTorch GenAI models into formats compatible with LiteRT. This includes a Python module called the Torch Generative API and a dedicated LiteRT GenAI Model Zoo hosted on platforms like Hugging Face. A command-line interface (CLI) is also available, allowing for more direct interaction and deployment.

The company emphasizes the production-readiness of LiteRT-LM, citing its open-source nature and active development. Recent updates have introduced capabilities such as Swift APIs for iOS integration, support for running models on Windows, and NPU acceleration specifically for Gemma models.

BACKGROUND

The push for on-device AI reflects a broader industry trend seeking to enhance user privacy, reduce latency, and improve the reliability of AI-powered features. By processing data locally, devices can potentially offer more immediate responses and keep sensitive information from leaving the user's hardware. The development of frameworks like LiteRT aims to democratize access to these sophisticated AI capabilities, making them a standard component of everyday technology.

Frequently Asked Questions

Q: What is Google's new LiteRT AI framework?

LiteRT is a new system from Google that lets AI run directly on devices like phones and watches. It helps AI features work faster without needing the internet.

Q: Which Google devices are using LiteRT AI now?

LiteRT is already being used in Google's Chrome browser, Chromebook Plus laptops, and the Pixel Watch. This means these devices can use AI features locally.

Q: Why is Google making AI run on devices instead of the internet?

Running AI on devices makes features faster and more private because your data doesn't have to be sent to the internet. It also means AI can work even when you don't have a signal.

Q: Can developers use LiteRT for their own apps?

Yes, Google is providing tools for developers to use LiteRT. They can convert AI models and use resources like the LiteRT GenAI Model Zoo to build AI features that run on many different devices.

Q: What kind of AI models can run with LiteRT?

LiteRT can run popular open AI models, including Google's own Gemma models. It is designed to work with different hardware like CPUs, GPUs, and special AI chips (NPUs) found in modern devices.

Google LiteRT AI runs on your phone and watch

FRAMEWORK UNDERPINNINGS

DEVELOPMENT AND DEPLOYMENT

BACKGROUND

Frequently Asked Questions

NewsRadar

The Present

Search Records

Explore

Google LiteRT AI runs on your phone and watch

FRAMEWORK UNDERPINNINGS

DEVELOPMENT AND DEPLOYMENT

BACKGROUND

Frequently Asked Questions

Know What Changed

Apple Device Activation Lock: Why Disabling Find My Matters

Andrej Karpathy joins Anthropic AI to build LLM models

Claude Desktop App for macOS and Windows Launched

Google I/O 2026: New AI Agents Will Do Tasks For You

Google I/O 2026: New AI Tools for Developers

NewsRadar

The Present

Search Records

Explore