List of Vapi AI Integrations
This is a list of platforms and tools that integrate with Vapi AI. This list is updated as of May 2026.
-
1
Speechmatics
Speechmatics
Transform your voice data into insights with unmatched accuracy.Leading the industry, Speechmatics offers exceptional Speech-to-Text and Voice AI solutions tailored for enterprises seeking top-tier accuracy, security, and versatility. Our robust enterprise-grade APIs enable both real-time and batch transcription with remarkable precision, accommodating a wide array of languages, dialects, and accents. Leveraging advanced Foundational Speech Technology, Speechmatics is designed to support essential voice applications across various sectors, including media, contact centers, finance, and healthcare. Businesses benefit from the flexibility of on-premises, cloud, and hybrid deployment options, allowing them to maintain complete control over their data security while gaining valuable voice insights. Recognized and trusted by global industry leaders, Speechmatics stands out as the preferred provider for premier transcription and voice intelligence solutions. 🔹 Unmatched Accuracy – Exceptional transcription capabilities for diverse languages and accents 🔹 Flexible Deployment – Options for cloud, on-premises, and hybrid environments 🔹 Enterprise-Grade Security – Ensuring comprehensive data management 🔹 Real-Time & Batch Processing – Scalable solutions for varied transcription needs Elevate your Speech-to-Text and Voice AI capabilities with Speechmatics today, and experience the difference that cutting-edge technology can make! -
2
Paygent
Paygent
Track every AI agent cost. Bill for real value. Run a profitable business.Paygent stands out as an advanced profitability and monetization platform tailored specifically for businesses harnessing AI technologies. In contrast to conventional billing systems that simply track revenue, Paygent emphasizes essential metrics that are vital for AI enterprises, such as the profit margin from each agent, the actual gross profit for every customer, and the immediate expenses associated with each LLM interaction, API call, and computational activity. Key features of Paygent include: - Instantaneous cost allocation for LLM utilization, categorized by agent, customer, and workflow - Predictive pricing simulation tools that enable businesses to devise pricing strategies before going live - Streamlined billing automation for diverse pricing structures, such as usage-based, outcome-oriented, hybrid, and digital employee models - Automated invoicing along with alerts for cost monitoring to prevent excessive agent loops that could jeopardize profitability With smooth integration options available through Node.js, Python, and Go SDKs, Paygent ensures that agent operations remain unaffected by added latency. Remove any ambiguity surrounding your profit margins and elevate your AI agents into a successful business enterprise. By implementing Paygent, organizations can achieve a deeper insight into their financial dynamics, paving the way for strategic decisions that enhance profitability while nurturing sustainable growth. Additionally, the platform’s robust tools empower businesses to adapt quickly to market changes and optimize their operations effectively. -
3
Cloudonix
Cloudonix
Multi-Award Winning AI Voice Agent Orchestration Platform - Humans and AI Speak as One.Cloudonix serves as a CPaaS (Communications Platform as a Service) provider with a focus on delivering voice and text APIs/SDKs, targeting developers, agencies, telecom firms/MSPs, and enterprises in need of programmable voice communication tools, AI-powered voice agents, and streamlined SIP trunking services. Their offerings include agentic voice trunking, which allows users to seamlessly connect voice-agent platforms to any telephone system, whether it is hosted in the cloud or located on-premises, utilizing a straightforward plug-in method; they also deliver adaptable SIP trunking and integrated SBC features, such as transcoding and TLS/TCP/UDP negotiation, ensuring smooth connections with any SIP carrier or PBX. For developers engaged in building voice applications, Cloudonix provides an extensive range of programmable voice APIs, mobile/web voice SDKs, audio streaming capabilities, and call control features like call transfers and IVR management, all enhanced by a scripting language designed for crafting call flow. Moreover, the platform includes low-code tools, enabling non-technical users to design IVR menus, automate call flows, implement outbound dialing systems, and develop advanced AI-driven voice receptionists, thus increasing accessibility for a diverse array of stakeholders in the communications sector. This unique blend of powerful functionalities and intuitive design positions Cloudonix as a robust solution for enterprises looking to upgrade their communication capabilities and drive innovation in their operations. With its commitment to simplifying complex communication processes, Cloudonix truly stands out in the competitive CPaaS marketplace. -
4
Gladia
Gladia
Gladia is a production-ready Speech-to-Text API for real-world voice productsGladia presents an advanced audio transcription and intelligence platform that features a unified API capable of handling both asynchronous transcription for pre-recorded audio and real-time streaming, empowering developers to convert spoken language into text in over 100 languages. The platform is equipped with a variety of functionalities, including precise word-level timestamps, automatic language detection, support for code-switching, speaker recognition, translation, summarization, a customizable lexicon, and the ability to extract relevant entities. With its impressive real-time processing engine, Gladia achieves latencies under 300 milliseconds while maintaining exceptional accuracy, and it provides "partials" or interim transcripts to facilitate quicker responses during live sessions. Gladia is not only a powerful solution for audio transcription but also an intelligent resource that can adapt to various user needs and environments. Overall, Gladia distinguishes itself as an essential asset for developers seeking to embed comprehensive audio transcription features seamlessly into their software applications. -
5
Mercury Edit 2
Inception
Revolutionize your workflow with ultra-fast AI editing efficiency.Mercury Edit 2 is an advanced AI model developed by Inception Labs, forming part of the Mercury suite, and is designed for efficient reasoning, coding, and editing through a unique architecture that diverges from standard large language models. This model improves upon the capabilities of Mercury 2, a diffusion-based system that can produce and enhance entire outputs at once, as opposed to the traditional approach of generating text token by token, resulting in significantly faster processing and more flexible editing. Rather than serving as a straightforward "typewriter," it functions as a responsive editor, starting with an initial draft and progressively refining it across multiple tokens in tandem, which allows for immediate interaction and rapid iterations in various areas, including code refinement, content generation, and agent-oriented tasks. With a remarkable throughput of nearly 1,000 tokens per second, this framework greatly exceeds the performance of conventional models while maintaining strong reasoning capabilities across a variety of benchmarks. Its innovative structure not only changes how users engage with AI but also establishes a new benchmark for excellence within the realm of artificial intelligence, pushing the boundaries of what is possible in this rapidly evolving field. As a result, it opens up new avenues for creativity and productivity that were previously unattainable. -
6
Inworld TTS
Inworld
Revolutionary speech synthesis: realistic voices for every application.Inworld TTS emerges as a state-of-the-art text-to-speech technology that delivers remarkably lifelike and context-sensitive speech synthesis, complete with sophisticated voice-cloning capabilities, all at a highly competitive price point. Its flagship model, TTS-1, is designed for real-time applications, featuring low-latency streaming that provides the initial audio output in approximately 200 milliseconds and encompasses a broad spectrum of languages, including English, Spanish, French, Korean, and Chinese, among others. Developers can choose between instant zero-shot voice cloning, which requires merely 5 to 15 seconds of audio input, or more comprehensive fine-tuned cloning, which allows for the incorporation of voice-tags to express emotion, style, and non-verbal signals, while also facilitating seamless language transitions without compromising the distinct voice identity. Additionally, for users desiring enhanced expressiveness and multilingual support, the TTS-1-Max model is currently available in preview, showcasing improved functionalities. The platform supports multiple access methods, such as APIs and portal options, and can function in streaming or batch processing modes, making it adaptable for a wide array of uses, including interactive voice assistants, gaming avatars, and custom audio branding projects. With its innovative features and flexibility, Inworld TTS is set to transform the landscape of synthetic voice interactions and enhance user experiences across various domains. As users continue to explore the possibilities, the technology promises to pave the way for more engaging and personalized audio experiences. -
7
Operata
Operata
Elevate customer experience with real-time insights and action.Operata is an innovative platform tailored for cloud contact centers, utilizing artificial intelligence to improve the observability of customer experiences by continuously collecting and examining real-time data from various interaction facets, such as calls, agent environments, networks, CCaaS, and AI engagements; this all-encompassing method provides teams with a thorough understanding of both customer and agent experiences, allowing them to not only recognize the events that transpired but also uncover the root causes and respond swiftly. Its notable features include a unified CX Insights Graph that correlates different technical, operational, and experiential signals, along with CX Copilot and Agent Copilot—intelligent assistants powered by Tenor AI that support natural language inquiries and deliver immediate recommendations. Furthermore, the platform offers Customer Journey Trace for mapping complete interaction sequences across multiple channels, pre-configured playbooks and dynamic dashboards for obtaining timely insights, performance benchmarking tools for readiness testing and assurance, compatibility with over 50 CX and voice systems, and an MCP Server that incorporates observability data into wider enterprise AI frameworks. By providing such a comprehensive array of tools, Operata significantly empowers organizations to refine their customer service strategies and elevate overall satisfaction levels. Ultimately, this multifaceted solution not only streamlines operations but also fosters a deeper connection between customers and agents. -
8
Hamming
Hamming
Revolutionize voice testing with unparalleled speed and efficiency.Experience automated voice testing and monitoring like never before. Quickly evaluate your AI voice agent with thousands of simulated users in just minutes, simplifying a process that typically requires extensive effort. Achieving optimal performance from AI voice agents can be challenging, as even minor adjustments to prompts, function calls, or model providers can significantly impact results. Our platform stands out by supporting you throughout the entire journey, from development to production. Hamming empowers you to store, manage, and synchronize your prompts with your voice infrastructure provider, achieving speeds that are 1000 times faster than conventional voice agent testing methods. Utilize our prompt playground to assess LLM outputs against a comprehensive dataset of inputs, where our system evaluates the quality of generated responses. By automating this process, you can reduce manual prompt engineering efforts by up to 80%. Additionally, our monitoring capabilities offer multiple ways to keep an eye on your application’s performance, as we continuously track, score, and flag important cases that require your attention. Furthermore, you can transform calls and traces into actionable test cases, integrating them seamlessly into your golden dataset for ongoing refinement. -
9
AI Agents Directory
AI Agents Directory
Discover and deploy tailored AI agents for efficiency.The AI Agents Directory is the world's most extensive marketplace and database for AI agents, featuring over 1,300 options tailored for enterprise use across more than 64 distinct categories. This comprehensive platform empowers users to explore, compare, and deploy AI agents designed to fulfill a variety of business needs. Users can navigate through a wide range of categories, including productivity, sales, customer service, coding, and voice, with each category housing specialized agents that enhance automation and improve operational efficiency. Moreover, the directory provides detailed insights on each agent, allowing users to make educated decisions based on their features, pricing structures, and user feedback. The platform also facilitates the creation of custom AI agents and the submission of new entries, fostering a dynamic environment for both businesses and developers eager to leverage advanced AI technologies. Additionally, the continuous expansion of its offerings ensures that the AI Agents Directory remains an indispensable tool in the rapidly changing realm of artificial intelligence, catering to an ever-growing audience. -
10
VerbaFlo
VerbaFlo
Streamline communications effortlessly with AI-driven conversational automation.VerbaFlo is an AI-driven conversational platform that streamlines and automates communication across multiple channels such as voice, chat, email, SMS, WhatsApp, and the web, thereby improving real-time interactions with prospects, clients, and residents, particularly within the real estate industry. Utilizing advanced natural language processing and machine learning, it guarantees intelligent engagement in conversations at any hour, simplifying tasks like lead qualification, appointment scheduling, follow-ups, and providing customized responses without the drawbacks of script fatigue. Furthermore, it integrates effortlessly with current systems like CRM or property management software to unify workflows and data analysis. The platform also supports multilingual conversations and features real-time dashboards, conversational memory, and automated outreach initiatives, including renewals, rent notifications, and maintenance updates. In addition, it offers critical insights into occupancy patterns and client behaviors, while monitoring performance metrics that enable property teams to act more quickly, enhance conversion and retention rates, and reduce overall operational expenses. By leveraging these extensive capabilities, VerbaFlo not only improves communication efficiency in the real estate sector but also empowers teams to make informed decisions that positively impact their operations. This comprehensive approach allows for a more cohesive and responsive customer experience, ultimately leading to greater satisfaction among clients and residents alike.
- Previous
- You're on page 1
- Next