IBM and Deepgram have announced a partnership to bring Deepgram’s speech-to-text and text-to-speech technologies into IBM’s watsonx Orchestrate generative AI platform. The integration aims to meet the growing demand for advanced voice recognition and transcription in enterprise settings.
Through this collaboration, IBM will embed Deepgram’s capabilities into its digital agent-building tools, allowing users to interact with AI systems using natural speech. The solution is designed to address challenges such as background noise, diverse accents, and real-life dialog by supporting a broad range of languages and dialects, including many Arabic and Indian variants. It also offers custom tuning options, real-time captioning, and natural-sounding voice synthesis.
The companies expect these features to improve automated customer care, call analysis, and voice-driven data entry in sectors like healthcare and finance.
Scott Stephenson, CEO and Co-Founder of Deepgram, said: “Voice is rapidly becoming the default interface between humans and technology, and enterprise deployments require a real-time platform that is accurate, low latency, and reliable at scale. By embedding Deepgram inside watsonx Orchestrate Agent Builder, IBM clients can build voice agents and voice-enabled workflows on top of a real-time foundation that has been developed and refined over more than a decade.”
Nick Holda, Vice President of AI Technology Partnerships at IBM added: “Our watsonx Orchestrate integration powered by Deepgram APIs introduces new speech recognition and transcription capabilities to IBM clients, refining and modernizing their operations. This collaboration aims to help enterprise organizations accelerate their AI initiatives and reinforces IBM’s open ecosystem, bringing choice and cutting-edge voice technology to partners and customers.”
IBM stated that the move will strengthen its offerings for enterprise AI solutions while giving Deepgram access to new customers through an established partner.
Deepgram provides API-based platforms for developers building voice applications or integrating voice technologies into products. Its platform supports both cloud-based APIs as well as self-hosted or on-premises deployment options.
IBM delivers hybrid cloud services along with AI expertise for organizations across various industries worldwide. Its solutions are used by thousands of government agencies and corporations in critical sectors such as financial services, telecommunications, and healthcare.



