Privacy-preserving domain adaptation with LLMs for mobile applications

By Emirates Insight, September 23, 2025

The recent success of machine learning models relies not only on large-scale data but also on high-quality data. The paradigm of pre-training on massive data collected from the web and then post-training on smaller, high-quality data is used to train both large and small language models (LMs). For large models, post-training has proven vital for aligning models to user intent. For small models, post-training to adapt to the user domain has yielded significant results, for example, 3%–13% improvements in key production metrics for mobile typing applications.

However, complex LM training systems carry potential privacy risks, such as the memorization of sensitive user instruction data. Privacy-preserving synthetic data provides one way to use user interaction data to improve models while systematically minimizing those risks. With the generation capabilities of large LMs (LLMs), synthetic data can be created to mimic user data without risk of memorization. This synthetic data can then be used in model training just as public data is, which simplifies privacy-preserving model training.

Gboard uses both small LMs and LLMs to improve the typing experience of billions of users. Small LMs support core features like slide to type, next word prediction (NWP), smart compose, and smart completion and suggestion; LLMs support advanced features like proofread. In this blog post, we share our exploration over the past few years of generating and using synthetic data to improve LMs for mobile typing applications. We focus on approaches that adhere to the privacy principles of both data minimization and data anonymization, and show how they are making a real-world impact on small and large models in Gboard. In particular, our recent paper, “Synthesizing and Adapting Error Correction Data for Mobile Large Language Model Applications”, discusses the advances in privacy-preserving synthetic data for LLMs in production, building upon our continuous research efforts discussed below [1, 2, 3, 4, 5].
