Say it, buy its platform enables seamless AI-powered purchases.
In a significant move, Razorpay has partnered with Sarvam AI to introduce a voice-first conversational commerce platform in India, signaling a shift away from traditional app-based transactions toward more natural, speech-driven interactions.
With this initiative, users can discover products, place orders, and complete payments simply by speaking, combining artificial intelligence with an integrated payment infrastructure.
Making Commerce More Accessible
More importantly, the collaboration aims to address a structural gap in India’s digital ecosystem. While the country has over 950 million internet users and more than 450 million digital payment users, only around 270 million actively participate in online shopping.
At the core of this gap is usability. Many users, particularly those unfamiliar with English or complex interfaces, struggle to navigate conventional e-commerce platforms. In contrast, voice-based systems, especially those supporting regional languages, offer a more intuitive alternative by reducing reliance on text and interface literacy.
Launch With Swiggy Integration
The first rollout will take place on Swiggy through Sarvam’s chat platform, Indus App. Users will be able to complete an entire transaction, from browsing to payment, within a single voice interaction with an AI assistant.
This model removes the need to switch between multiple screens or applications, consolidating the entire commerce journey into a conversational flow.
Three-Layer Deployment Strategy
To scale the initiative, the companies plan to expand the system across three key layers.
First, they will integrate conversational commerce directly into Sarvam AI’s platform, beginning with Swiggy.
Second, they will enable businesses to deploy voice-enabled commerce on their own platforms. For instance, early implementation has already been tested with The Derma Co, where users can browse and purchase products using voice commands.
Finally, the third layer will focus on developers, with Sarvam’s AI models being integrated into Razorpay’s Agent Studio. This will allow developers to build multilingual AI agents capable of handling customer interactions and executing transactions autonomously.
From Interaction to Execution
At its core, the shift moves beyond conversational interfaces as simple informational tools and turns them into fully transactional systems.
Instead of merely assisting with navigation, these systems are designed to complete actions end-to-end. In practice, a user can express intent in natural language, and the system translates that into a completed purchase, including payment processing.
As a result, this marks a transition from interface-driven commerce to intent-driven commerce.
Broader Implications for India’s Digital Economy
More broadly, voice-first commerce aligns with the realities of India’s diverse linguistic and demographic landscape. By enabling transactions in local languages, the model expands access to digital services for users who have been underserved by traditional platforms.
For businesses, it creates a new channel for customer engagement that reduces friction and simplifies conversion. At the ecosystem level, it signals a shift toward more inclusive and scalable digital infrastructure.
The Next Layer of Digital Adoption
More broadly, the partnership reflects a larger trend in technology, where interfaces are becoming less visible and interactions more natural.
If execution matches intent, voice-based commerce could redefine how millions of users engage with digital platforms, particularly in markets where usability, language, and accessibility remain key constraints.



