Sarvam AI Releases India’s First Hindi LLM ‘OpenHathi’, a Week After Raising $41 million
In Brief
Sarvam AI’s OpenHathi is the first Hindi LLM aimed at development of open models and datasets, to promote AI innovation for Indian languages.
Indian Generative AI startup Sarvam AI released OpenHathi-Hi-v0.1, the first Hindi large language model (LLM) in the OpenHathi series, just a week after raising $41 million in its Series A funding round led by Lightspeed Ventures.
Hindi is India’s most widely spoken language, with over 43% of the population declaring it as their mother tongue. The model is built on Meta AI’s Llama2-7B architecture and delivers performance on par with GPT-3.5 for Indic languages.
“We show that our model works as well as, if not better than GPT-3.5 on various Hindi tasks while maintaining its English performance,” the company said on social media platform X (formerly Twitter).
We show that our model works as well as, if not better than GPT-3.5 on various Hindi tasks while maintaining its English performance. Along with standard NLG tasks, we also evaluate on a bunch of non-academic, real-world tasks. 3/5
— Sarvam AI (@SarvamAI) December 12, 2023
According to Sarvam AI, the goal of the OpenHathi series is to contribute to the development of open models and datasets to promote innovation in Indian language AI and is collaborating with academic partners at AI4Bharat, who have provided language resources and benchmarks for this initiative.
AI4Bharat is a research lab at the Indian Institute of Technology (IIT) Madras – a public technical university in India, which works on developing open-source datasets, tools, models and applications for Indian languages.
OpenHathi uses a 48,000-token extension of Llama2-7B’s tokenizer and undergoes a two-step training process. In the first phase, the model concentrates on embedding alignment, aligning randomly initialized Hindi embeddings. The second phase involves bilingual language modeling, where the model learns cross-lingual attention across tokens.
Full-Stack Generative AI Platform to be Launched Soo
Further, the company is encouraging people to be creative and come up with new ideas by using and improving upon this release. The company invites developers to build specialized models for various purposes using the OpenHathi-Hi-v0.1 model as a starting point.
Sarvam AI also plans to release advanced models for businesses on its comprehensive generative AI platform, which according to the startup will be launched soon.
While raising its Series A funding, the company announced its plan to develop the “full-stack” for Generative AI ranging from research-led innovations in training custom AI models to an enterprise-grade platform for authoring and deployment.
The company added that this full-stack approach will accelerate the adoption of generative AI in India, especially given that enterprises see the potential but are grappling with how to leverage it for their business.
Established in July 2023 by Vivek Raghavan and Pratyush Kumar, have previously worked at AI4Bharat – backed by Infosys co-founder Nandan Nilekani.
Recently, India launched BharatGPT, an LLM-based solution developed in collaboration with Indian conversational AI startup CoRover.ai, which is positioned as the answer to existing generative AI or LLMs. It intends to fill a critical gap in the market, providing an indigenous solution available in over 12 Indian languages, covering video, voice and text interactions.
Disclaimer
In line with the Trust Project guidelines, please note that the information provided on this page is not intended to be and should not be interpreted as legal, tax, investment, financial, or any other form of advice. It is important to only invest what you can afford to lose and to seek independent financial advice if you have any doubts. For further information, we suggest referring to the terms and conditions as well as the help and support pages provided by the issuer or advertiser. MetaversePost is committed to accurate, unbiased reporting, but market conditions are subject to change without notice.About The Author
Kumar is an experienced Tech Journalist with a specialization in the dynamic intersections of AI/ML, marketing technology, and emerging fields such as crypto, blockchain, and NFTs. With over 3 years of experience in the industry, Kumar has established a proven track record in crafting compelling narratives, conducting insightful interviews, and delivering comprehensive insights. Kumar's expertise lies in producing high-impact content, including articles, reports, and research publications for prominent industry platforms. With a unique skill set that combines technical knowledge and storytelling, Kumar excels at communicating complex technological concepts to diverse audiences in a clear and engaging manner.
More articlesKumar is an experienced Tech Journalist with a specialization in the dynamic intersections of AI/ML, marketing technology, and emerging fields such as crypto, blockchain, and NFTs. With over 3 years of experience in the industry, Kumar has established a proven track record in crafting compelling narratives, conducting insightful interviews, and delivering comprehensive insights. Kumar's expertise lies in producing high-impact content, including articles, reports, and research publications for prominent industry platforms. With a unique skill set that combines technical knowledge and storytelling, Kumar excels at communicating complex technological concepts to diverse audiences in a clear and engaging manner.