Google CEO Sundar Pichai said that he is impressed with the work done by Sarvam AI. Speaking at the ongoing India AI Impact Summit 2026, Pichai said, “The developer energy I find in India each time I travel, it’s bar none, second to none,” adding that the entrepreneurship ecosystem in the country is “thriving”.
Pichai specifically highlighted Sarvam AI for developing local AI models tailored to Indian languages and contexts, saying, "The work Sarvam has done developing local AI models ....I just don't see any impediments to that, and I think it is very, very well positioned". The AI startup has recently taken the internet by storm, with the company claiming that its AI model has outperformed some of the biggest names in AI, including Google’s Gemini and OpenAI’s ChatGPT. “Sarvam Vision achieves state-of-the-art accuracy of 84.3% on the olmOCR-Bench (English only subset) outperforming frontier models like Gemini 3 Pro and new OCR models like DeepSeek OCR 2,” wrote Pratyush Kumar, CEO, Sarvam AI.
What is India’s Sarvam AI that Sundar Pichai praised
Sarvam was founded by Vivek Raghavan and Pratyush Kumar in August 2023. In a blog post, the company explained that its Sarvam AI model is capable of a range of visual understanding tasks, including image captioning, scene text recognition, chart interpretation, and complex table parsing.
One of the company's aims is to unlock India's knowledge that remains embedded in physical documents, scanned archives, and historical collections.
Another key problem the company is working on is bringing AI functionality to Indian users. “Most global models treat Indian languages as secondary, often resulting in lower accuracy for regional scripts. Along with pushing the frontiers of accuracy, our VLM is an inference-efficient 3B state-space model,” the company said.
The Sarvam AI model, the company says, is trained on high-quality datasets covering 22 official Indian languages, including varied financial documents, literature, newspapers, historical texts, and more.
Sarvam AI’s speech recognition model supports 10 Indian languages within a single 74-million-parameter model that occupies about 294 MB on a device. It can automatically identify the language being spoken, without requiring the user to select it.
The model can process speech at about 8.5x real-time and provides a time-to-first-token of less than 300 milliseconds on a Qualcomm Snapdragon 8 Gen 3 chipset.
Its speech synthesis model has a device footprint of about 60 MB and 24 million parameters. The model achieves a mean character error rate of 0.0173 on a standard benchmark, indicating that synthesised speech closely matches the intended text across languages.
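For readers unfamiliar with the metric, character error rate is conventionally the character-level edit distance between a reference text and a hypothesis, divided by the length of the reference. The sketch below assumes that standard definition. The article does not say how the benchmark computes its figure, and the function names and sample strings here are illustrative only.

```python
def edit_distance(ref: str, hyp: str) -> int:
    """Levenshtein distance: minimum insertions, deletions, substitutions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # delete a reference character
                            curr[j - 1] + 1,      # insert a hypothesis character
                            prev[j - 1] + cost))  # substitute or match
        prev = curr
    return prev[-1]


def character_error_rate(ref: str, hyp: str) -> float:
    """Edit distance normalised by reference length (assumed definition)."""
    if not ref:
        raise ValueError("reference text must be non-empty")
    return edit_distance(ref, hyp) / len(ref)


# One wrong character out of four reference characters.
print(character_error_rate("abcd", "abed"))  # 0.25
```

Under this definition, a mean rate of 0.0173 would correspond to roughly 1.7 character errors per 100 reference characters.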
Custom voice cloning is also supported, which means a new voice can be added using about 1 hour of audio data and deployed within the same 60 MB model file.
The translation model, on the other hand, has 150 million parameters and an on-device footprint of about 334 MB. It handles bidirectional translation across 110 language pairs, including 10 Indian languages and English, without routing through an intermediate language.
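The 110 figure is consistent with counting ordered (source, target) pairs over 11 languages, since 11 × 10 = 110. A small sketch of that count follows, using placeholder labels because the article does not name the 10 Indian languages.

```python
from itertools import permutations

# Placeholder labels: the article does not list the 10 Indian languages.
languages = ["English"] + [f"Indian language {i}" for i in range(1, 11)]

# Each ordered (source, target) pair is one direct translation direction.
direct_pairs = list(permutations(languages, 2))

print(len(languages))     # 11
print(len(direct_pairs))  # 110
```

By contrast, a system that routes through a single pivot language would need two translation steps for every pair that does not include the pivot.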
How Sarvam AI differs from Gemini and ChatGPT
One of the key differentiators between India’s Sarvam AI and Gemini and ChatGPT is the former’s focus on Indian languages, whereas global models prioritise English and treat the rest as secondary. Since it is trained on 22 Indian languages, it can give higher accuracy for regional scripts. While other models are only capable of extracting text from documents or images, Sarvam AI can also interpret visual elements for better understanding and further knowledge.
This supports better performance and a deeper level of understanding on a variety of complex documents, backed by a large-scale Indic OCR benchmark for Indian languages.
Sarvam AI model availability
The Document Intelligence API is free for February 2026, allowing users to explore and build with Sarvam Vision at scale and to get started today at no cost.
India’s Sarvam AI: Key features
Here’s a brief summary of the major features of India’s Sarvam AI model:
- Multimodal vision-language: It understands images and text together, which makes image captioning and chart or table interpretation easier.
- Document understanding (Indian languages focused): It has high-accuracy OCR and knowledge extraction for 22 Indian languages, including historical texts and scanned documents.
- Charts and data interpretation: Sarvam AI is capable of understanding more than text, including the charts, data, illustrations, and visual analysis of documents.
- Multilingual visual: The AI model understands and interprets visual elements across multiple languages in the same document.
- Leading performance: Sarvam AI excels in global English benchmarks and introduces the Sarvam Indic OCR Bench for Indian languages.
- Accessible API: Its document intelligence APIs are production-ready and free to use for experimentation in February 2026.
