List of Best Speech Recognition Software in 2026

SoapBox

Soapbox Labs

Empowering children's learning through safe, innovative voice technology.

View Product

SoapBox was designed specifically for children, aiming to revolutionize their learning and play experiences globally through the use of voice technology. Our platform, which is low-code and scalable, has gained worldwide recognition, being licensed by various educational and consumer enterprises to deliver exceptional voice-driven experiences in areas such as literacy, English language learning, smart toys, games, apps, robots, and more. The unique technology we developed is both independent and trustworthy, catering to children aged 2 to 12, and is capable of recognizing a variety of dialects and accents from different regions, having undergone independent verification to ensure it is free from any racial bias. We prioritize a privacy-by-design framework in the development of our SoapBox platform, firmly believing in the importance of safeguarding children's essential right to privacy. Our commitment to these principles not only enhances the user experience but also fosters a safe and nurturing environment for young learners.

INVOX Medical

VA cali

Transform speech into precise medical text effortlessly today!

View Product

Today’s leading voice dictation software provides an intuitive and instantaneous audio-to-text conversion experience. With its user-friendly interface, it guarantees efficient, rapid, and precise functionality. INVOX Medical stands out with specialized dictionaries that cater to various medical disciplines, enabling it to accurately interpret a wide range of medical terminology. Countless healthcare professionals around the globe already depend on this software for its dependability and simplicity. You can start dictating your medical documentation with impressive accuracy in mere minutes. Additionally, it offers remarkable value for its capabilities. By leveraging advanced artificial intelligence technology, INVOX Medical significantly boosts your ability to generate medical reports with exceptional precision, allowing for productivity increases of up to three times. The program’s customization options empower users to tailor the dictionary, modify word substitutions, and adjust pronunciations as needed, ensuring a tailored dictation experience. In a rapidly changing healthcare environment, having such an effective tool can dramatically enhance your workflow efficiency. Such advancements not only save time but also improve the quality of patient care through more accurate documentation.

e-Speaking

Experience hands-free control and seamless interaction with technology.

View Product

An intuitive software application allows you to oversee your computer, dictate various messages and documents, and have text read back to you. This tool enables you to control your Windows device using only your voice, streamlining navigation with minimal need for keyboard or mouse input. For example, simply saying "Down One" moves the cursor down a line, while "Open Email" grants you access to your inbox. It provides a seamless way to issue commands for managing any Windows application or document. Throughout history, human beings have relied on verbal communication, which has led to the evolution of our brain's ability to process sound. We interpret auditory signals and transform them into coherent ideas and actions, resulting in commands, interactions, and various forms of entertainment. This illustrates the significant impact of speech recognition technology in improving our engagement with computers. As users embrace such innovative solutions, they can enjoy a more streamlined and hands-free experience while interacting with technology in their everyday routines. Moreover, this transformational approach not only enhances productivity but also fosters a deeper connection between humans and their devices.

Alibaba Cloud Intelligent Speech Interaction

Alibaba Cloud

Revolutionizing communication through intelligent, multilingual speech interactions.

View Product

Intelligent Speech Interaction employs advanced technologies such as speech recognition, speech synthesis, and natural language understanding to provide a fluid user experience. By integrating this technology into their services, companies can allow their products to have significant dialogue with users, thus improving human-computer interaction. Currently, this system accommodates a variety of languages, including Mandarin Chinese, Cantonese, English, Japanese, Korean, French, and Indonesian, with aspirations to expand to more languages in the future. This groundbreaking solution is adaptable and can be applied in numerous contexts, such as intelligent Q&A systems, quality assurance procedures, real-time speech subtitling, and audio file transcription. Its successful deployment in various industries, including finance, insurance, eCommerce, and smart home technologies, showcases its flexibility and efficacy in boosting user engagement. As the need for more interactive and intelligent systems continues to rise, the importance of Intelligent Speech Interaction in facilitating communication between humans and machines is set to increase significantly. This evolution indicates a future where users can expect even more personalized and dynamic interactions with technology.

FirstLanguage

Unlock powerful NLP solutions for effortless app development.

View Product

Our suite of Natural Language Processing (NLP) APIs delivers outstanding precision at affordable rates, integrating all aspects of NLP into a single, unified platform. By using our services, you can conserve significant time that would typically be allocated to training and building language models. Take advantage of our premium APIs to accelerate your application development with ease. We provide vital tools necessary for successful app development, including chatbots and sentiment analysis features. Our text classification services cover a wide array of sectors and support more than 100 languages. Moreover, performing accurate sentiment analysis is straightforward with our tools. As your business grows, our adaptable support is designed to grow with you, featuring simple pricing structures that facilitate easy scaling in response to your evolving requirements. This solution is particularly beneficial for individual developers engaged in creating applications or developing proof of concepts. To get started, simply head to the Dashboard to retrieve your API Key and include it in the header of every API request you make. You can also utilize our SDK in any programming language of your choice to begin coding immediately or refer to the auto-generated code snippets in 18 different languages for additional guidance. With our extensive resources available, embarking on the journey to develop groundbreaking applications has never been so straightforward, making it easier than ever to bring your innovative ideas to life.

Picovoice

Empowering developers with versatile, transparent voice AI solutions.

View Product

Picovoice is a voice AI platform designed with developers in mind, aiming to promote the widespread use of voice AI technology. By recognizing the challenges posed by cloud dependence and a lack of transparency, Picovoice sets itself apart through on-device processing, the release of open-source benchmarks, and accessibility of its technology to all users. The range of Picovoice’s capabilities includes speech-to-text, voice search, wake word detection, intent recognition, and voice activity detection, all of which can operate on devices as compact as microcontrollers up to full web browsers, creating a rich and engaging user experience. This versatility ensures that developers can implement advanced voice features across a variety of platforms and devices.

Work by Speech

Mikołaj Magowski

Transform your computer experience with seamless voice control.

View Product

Work by Speech is a unique application that enables users to operate their computer entirely through voice commands, eliminating the need for a keyboard and mouse. Key features of the application include: - The ability to effectively navigate and control your computer using only your voice - Support for quiet speaking, allowing for discreet operation - The capability to switch applications and open programs through voice commands - A comprehensive set of built-in voice commands designed for common tasks - Advanced management options for custom voice commands - Macro recording functionality to streamline repetitive actions - A dedicated dictation mode for efficient text input - Full support for all mouse functions, which can be executed quickly and easily by voice - A customizable mouse grid that can also be manipulated through speech commands - Automatic optimization of the mouse grid based on the program being used - Minimal usage of system resources, ensuring smooth performance - Compatibility with any microphone on Windows 10 and 11 - Currently available only in English - Free updates to enhance the user experience over time. This application truly transforms how users interact with their computers, making it a valuable tool for those looking to increase their efficiency.

SpeechPulse

AV BEAM

Effortless speech recognition, offline support, endless possibilities await!

View Product

SpeechPulse leverages your computer's microphone to provide instantaneous speech recognition capabilities. This innovative tool can seamlessly input text into various applications, such as text editors, web browsers, and office software. One of the standout features of SpeechPulse is its ability to operate entirely offline, eliminating the need for an internet connection. It offers support for speech recognition across a diverse range of languages, encompassing a total of 100 languages, including English, French, Spanish, Italian, German, Japanese, Chinese, and Russian. In addition to these functionalities, SpeechPulse is capable of generating accurate subtitles for both audio and video files, complete with precise timestamps. With a straightforward one-time payment model, users can purchase SpeechPulse once and enjoy its benefits indefinitely, making it a cost-effective solution for speech-to-text needs. This means there are no recurring fees, providing users with peace of mind and an enduring resource for their transcription tasks.

Yandex SpeechKit

Yandex

Unlock precise voice technology for tailored customer experiences today!

View Product

Technologies driven by machine learning for speech recognition have led to the creation of innovative voice assistants, improved efficiency in call center workflows, and better monitoring of service quality, among other uses. Your organization can now leverage the advanced technology behind the award-winning Alice voice assistant. With SpeechKit, you can achieve accurate speech interpretation within moments, allowing for quick and effective communication for your clients' voice assistants. You have the choice between two versions: the comprehensive option, which develops an intelligent voice assistant, and the adaptive version, which grants your brand a unique voice in just a month. This service is designed for clients who demand meticulous control over speech processing and synthesis within their ecosystems. SpeechKit’s machine learning models are primed for deployment in your infrastructure, with flexible options that range from hybrid configurations to fully on-premise setups that are ideal for handling sensitive information. Additionally, the service supports various audio formats, including MP3, LPCM, and OggOpus, providing a high degree of versatility in audio management. This extensive selection empowers businesses to customize their speech technology solutions according to their unique operational requirements, resulting in increased satisfaction and efficiency. Ultimately, integrating such tailored solutions can lead to significant enhancements in customer experience and operational effectiveness.

Gladia

Gladia is a production-ready Speech-to-Text API for real-world voice products

View Product

Gladia presents an advanced audio transcription and intelligence platform that features a unified API capable of handling both asynchronous transcription for pre-recorded audio and real-time streaming, empowering developers to convert spoken language into text in over 100 languages. The platform is equipped with a variety of functionalities, including precise word-level timestamps, automatic language detection, support for code-switching, speaker recognition, translation, summarization, a customizable lexicon, and the ability to extract relevant entities. With its impressive real-time processing engine, Gladia achieves latencies under 300 milliseconds while maintaining exceptional accuracy, and it provides "partials" or interim transcripts to facilitate quicker responses during live sessions. Gladia is not only a powerful solution for audio transcription but also an intelligent resource that can adapt to various user needs and environments. Overall, Gladia distinguishes itself as an essential asset for developers seeking to embed comprehensive audio transcription features seamlessly into their software applications.

MAI-Transcribe-1

Microsoft AI

Experience seamless, accurate transcription for diverse audio needs.

View Product

MAI-Transcribe-1 is a cutting-edge speech-to-text technology developed by Microsoft, available through Azure AI Foundry, designed to deliver accurate transcriptions from a range of audio inputs for both enterprise and developer use cases. It supports 25 widely spoken languages and effectively handles various accents, dialects, and speech patterns, ensuring dependable performance even in challenging conditions such as background noise, low audio quality, or overlapping speech. Created by the AI Superintelligence team at Microsoft, this solution prioritizes both precision and speed, enabling quick batch processing and straightforward scalability for production environments. This robust tool is vital for a multitude of applications, including meeting transcriptions, live caption generation, accessibility improvements, call center analytics, and the functioning of voice-activated systems, establishing itself as a key component in voice-driven innovations. Furthermore, its adaptability makes it an indispensable asset for enhancing communication and improving accessibility across a wide range of platforms, thus promoting inclusivity and efficiency in various sectors.

Gemini Audio

Google

Transform conversations with seamless, expressive real-time audio interactions.

View Product

Gemini Audio is an advanced collection of real-time audio models built upon the cutting-edge Gemini architecture, designed to enable natural and seamless voice interactions along with dynamic audio generation through simple language prompts. This technology creates engaging conversational experiences, allowing users to speak, listen, and interact with AI continuously, while effectively combining comprehension, reasoning, and audio response generation. With the ability to both analyze and produce audio, it supports a wide array of applications such as speech-to-text transcription, translation, speaker recognition, emotion detection, and comprehensive audio content analysis. These models are particularly optimized for low-latency, real-time environments, making them ideal for live assistants, voice agents, and interactive systems that require ongoing, multi-turn conversations. In addition, Gemini Audio features enhanced capabilities such as function calling, which allows the model to trigger external tools and integrate real-time data into its responses, thus broadening its applicability and efficiency. This innovative framework not only simplifies user interaction but also significantly elevates the overall experience with AI-powered audio technology, ensuring users are consistently engaged and satisfied. Ultimately, Gemini Audio represents a leap forward in the convergence of voice interaction and intelligent audio processing, paving the way for future advancements in this space.

Go Transcribe

Enhance your content effortlessly with engaging subtitles today!

View Product

Sign up for a complimentary account to easily upload your audio and video files to our online transcription service. Studies show that videos featuring subtitles tend to capture more attention and engage viewers effectively. Given that over 80% of content consumed on social media is watched without sound, adding subtitles can greatly improve viewer engagement! When you provide subtitles, you make it easier for your audience to understand your message. For example, if you're urging support for a noble cause, subtitles can increase the chances of receiving donations by ensuring your message is conveyed clearly; the same principle applies when promoting products or services! Additionally, subtitles are especially useful for those with hearing disabilities. These reasons illustrate the significant advantages subtitles can bring to your business. If you're unfamiliar with the process, creating subtitles can be both labor-intensive and expensive. Fortunately, we offer solutions that will streamline this process, making it much simpler for you to enhance your content. With our assistance, you can focus on what truly matters—connecting with your audience and driving engagement.

Calldrip

Boost engagement instantly with powerful sales automation tools.

View Product

Calldrip is a platform designed to enhance how businesses tackle new inquiries, leveraging over a decade of experience to develop a comprehensive suite of sales automation tools that serve thousands of clients globally. By facilitating immediate calls while prospects are still browsing a website, Calldrip significantly boosts the likelihood of conversations between sales representatives and potential customers, yielding increases of up to 900% in engagement rates. This privately-owned and rapidly expanding company is based in Salt Lake City, Utah, and understands that in today's fast-paced digital environment, known as Google Micro Moments, businesses must connect with leads promptly. In addition, Calldrip not only fosters quick engagement but also helps identify and address potential shortcomings within sales processes, ensuring teams can optimize their performance effectively.

BigHand Dictation and Speech Recognition

BigHand

Transform your workflow: speed, efficiency, and productivity redefined.

View Product

Boost productivity and increase profitability by enabling your teams to reduce the time dedicated to transcription, allowing them to concentrate on more critical responsibilities. Implement efficient dictation methods that are not only rapid but also manageable through tailored workflows. With the ability to utilize their voices on desktop, mobile, or tablet devices, employees can easily record, share, organize, and track their files, which significantly enhances their workflow. This streamlined process not only saves time but also optimizes resource allocation, ultimately nurturing a more effective and dynamic work atmosphere. By prioritizing these improvements, organizations can experience a transformative shift in their operational efficiency.

LumenVox Automatic Speech Recognition (ASR)

LumenVox

Revolutionize customer engagement with adaptable, innovative voice solutions.

View Product

Voice recognition and authentication technologies powered by AI have the potential to revolutionize how customers engage with services. With adaptable voice-enabled solutions, you can cater to the diverse needs of your clientele in a timely and cost-effective manner. Our primary focus is on voice enablement for applications, ensuring that you receive exceptional voice automation and interaction experiences. The LumenVox ASR and TTS systems offer both precision and affordability, enhancing efficiency for both customers and service providers alike. You will find that every interaction can be unique, catering to the individual needs of each caller. Furthermore, our technology supports the recognition of various dialects through a unified global language model, providing unparalleled versatility in features, implementation, and revenue generation. With LumenVox, your only limit is your imagination, as we empower you to conceptualize and construct innovative solutions tailored to your requirements.

Phonexia Speech Platform

Phonexia

Revolutionizing voice technology for secure, efficient solutions.

View Product

Phonexia offers an extensive array of innovative voice recognition and voice biometrics technologies designed to fulfill the requirements of both commercial enterprises and government entities. Their products leverage the latest breakthroughs in artificial intelligence, voice biometrics research, acoustics, and phonetics, resulting in solutions that are exceptionally accurate, rapid, and scalable. With Phonexia's AI-driven offerings, users can create voicebots and authenticate speaker identities through voice biometrics. Additionally, the platform enables the transcription of spoken words into written text and allows for the identification of speakers within large audio datasets. This advanced voice biometric authentication simplifies the process of accessing client information while also providing robust fraud detection capabilities. As a result, organizations can enhance their security measures and streamline operations effectively.

TranscribeMe

Transforming data management with innovation, security, and quality.

View Product

The way we view data is changing, and currently, companies are increasingly depending on reliable and accurate transcription and data annotation services. We have created an innovative platform for task distribution and workforce management that upholds the highest standards of information security, ensuring your data is securely encrypted and meticulously managed. Our workflows are designed to meet HIPAA and GDPR regulations, and we offer flexible services, including the option to geofence our workforce within specific locations. The technology and methodologies we've put in place enable us to consistently provide exceptional data at competitive rates. For artificial intelligence and machine learning models to perform effectively, they require data tailored to particular applications. Leveraging our capability to assemble large teams of professionals, we can deliver high-quality data for a wide range of uses, including generating interactions for contact centers, producing images, and gathering review and survey data. This dedication to providing superior service establishes us as a frontrunner in the data services sector, equipped to fulfill the diverse requirements of our clients while adapting to the evolving landscape of data needs. Ultimately, our focus on innovation and quality ensures that we not only meet but exceed industry standards.

WebsiteVoice

Effortlessly convert text to engaging audio, enhancing accessibility.

View Product

Transform your website’s written content into top-notch audio effortlessly within five minutes, and at no cost to you. Our cutting-edge text-to-speech technology allows your visitors to listen to your articles while multitasking, which can significantly increase the time they spend on your site. Accessibility, often underestimated, plays a vital part in effective web design; our service enables those with visual impairments and reading difficulties to fully access your content without the challenges of conventional reading methods. The rise of podcasts and audiobooks showcases a notable shift in audience preference towards auditory formats instead of traditional reading. By implementing this feature, you can successfully engage a wider audience that enjoys listening as opposed to reading. Our Automatic Content Recognition technology requires only a brief code addition to your site, triggering the text-to-speech functionality for relevant content effortlessly. Our system is designed for a smooth user experience, ensuring that your visitors can navigate without interruptions. Furthermore, we incorporate advanced Artificial Intelligence and Machine Learning techniques to continually refine our voice algorithms, striving to make the text-to-speech experience on your platform as natural as possible, thereby enhancing user interaction. This revolutionary feature not only meets the needs of a diverse audience but also boosts the overall accessibility and quality of your website. Embracing such innovations can set your site apart and contribute to a more inclusive online environment.

Symbl

Symbl.ai

Transform conversations into actionable insights with effortless integration.

View Product

Symbl is an API platform aimed at enabling developers and companies to effortlessly integrate conversational intelligence into a variety of communication channels. Our comprehensive set of APIs utilizes advanced machine learning algorithms that can analyze any form of conversation data to derive meaningful insights contextually, encompassing several domains and platforms including voice, email, chat, and social media, all without the need for initial training data, wake words, or specialized classifiers. By democratizing access to conversational technology, Symbl facilitates large-scale collaboration, empowering organizations to implement our targeted workplace productivity API, which assists brands in optimizing crucial workflows for knowledge workers while enhancing customer interactions. Whether you are a seasoned developer or a novice looking to harness employee collaboration within your business, our API provides customizable options designed to address your unique use cases, ensuring it effectively satisfies your requirements. In addition, Symbl is dedicated to transforming the dynamics of team communication and collaboration by offering cutting-edge tools that enable businesses to thrive in a rapidly evolving landscape. Ultimately, our goal is to support organizations in unlocking their full potential through improved interaction and engagement strategies.

Azure Speaker Recognition

Microsoft

Enhancing interactions through secure, personalized voice authentication technology.

View Product

The Speech service includes a functionality that authenticates and recognizes individual speakers, significantly improving customer interactions. By streamlining the verification process, it promotes seamless and secure experiences for users across multiple platforms, such as web applications and customer support call centers. This voice-based authentication can be achieved through designated passphrases or unrestricted voice inputs. Moreover, it enables the identification of speakers from a pool of registered users, which helps in associating conversations with particular individuals, thus enhancing personalized interactions and catering to scenarios involving multiple voice recognitions. Consequently, this innovative technology equips businesses to deliver customized experiences that align with the distinct identities of each customer, ultimately fostering stronger connections. In an increasingly digital world, such capabilities are crucial for meeting the evolving expectations of clients.

Voice Pro

LinguaTec

Transform your workplace with secure, efficient voice recognition.

View Product

Voice Pro Enterprise is tailored for corporate settings, enabling voice recognition directly on the organization’s server, which can be utilized from various devices such as PCs, Macs, smartphones, and tablets. This configuration ensures that all confidential internal data stays protected within the company. The system features speaker-independent recognition technology, eliminating the necessity for extensive speaker training; users can simply speak into their devices and obtain instant transcriptions. This groundbreaking tool offers businesses a highly secure and sophisticated speech recognition solution. Whether drafting reports at a desk, sending emails on the move, or dictating sales presentations in an outdoor setting, Voice Pro Enterprise greatly boosts employee efficiency and productivity. Users can dictate text at nearly three times the speed of traditional typing, and the system’s exceptional accuracy minimizes the need for editing. Consequently, organizations can look forward to significant enhancements in overall workforce effectiveness and streamlined workflows, leading to a more productive work environment. Additionally, the convenience of using Voice Pro Enterprise fosters a more responsive and adaptable company culture.

Deepgram

Transforming speech recognition for rapid, scalable business success.

View Product

Accurate speech recognition can be effectively utilized on a large scale, allowing for continuous enhancement of model performance through data labeling and training from a single interface. Our advanced speech recognition and understanding technology operates efficiently at an extensive level, facilitated by our innovative model training, data labeling, and versatile deployment solutions. The platform supports various languages and accents, ensuring it can adapt in real-time to the specific requirements of your business with each training cycle. We offer enterprise-level speech transcription tools that are not only quick and precise but also dependable and scalable. Reinventing automatic speech recognition with a focus on 100% deep learning empowers organizations to boost their accuracy significantly. Instead of relying on large tech firms to enhance their software, businesses can encourage their developers to actively improve accuracy by incorporating keywords in every API interaction. Start training your speech model today and enjoy the advantages within weeks rather than waiting for months or even years to see results, making your operations more efficient and effective. This proactive approach allows companies to stay ahead in a fast-evolving technological landscape.

Azure AI Speech

Microsoft

Transform your applications with advanced, customizable voice technology.

View Product

Accelerate the creation of voice-enabled applications confidently by leveraging the Speech SDK. This powerful tool enables accurate speech-to-text transcription, produces lifelike text-to-speech results, facilitates spoken language translation, and provides speaker recognition capabilities within conversations. You can customize your applications by employing tailored models through Speech Studio. Experience state-of-the-art speech recognition, realistic text-to-speech synthesis, and award-winning speaker identification technology, all while ensuring your data privacy, as no speech input is recorded during processing. Additionally, you can personalize voices, add specific terms to your vocabulary, or craft your own distinctive models. The Speech SDK is versatile enough to be used in various settings, such as cloud platforms and edge containers. With impressive accuracy, you can transcribe audio in more than 92 languages and dialects. This technology enhances customer comprehension via call center transcriptions, improves user experiences with voice-activated assistants, and captures important discussions in meetings, among other applications. Utilize the text-to-speech features to create applications and services that communicate in a natural manner, offering a selection of over 215 voices across 60 languages, which greatly enhances the engagement and versatility of your projects. The combination of these extensive capabilities empowers developers to innovate effortlessly while significantly enhancing user interactions and satisfaction.

Dragon Legal

Nuance Communications

Revolutionize legal workflows with precision dictation and efficiency.

View Product

Dragon Legal is an innovative speech recognition application tailored specifically for the legal profession, featuring a language model built from an impressive collection of over 400 million words sourced from legal documents. This cutting-edge software empowers attorneys and legal professionals to dictate a variety of documents, including contracts, briefs, and citations, achieving remarkable accuracy rates of up to 99% and operating at a speed three times faster than traditional typing. Additionally, users have the capability to create custom voice commands to simplify repetitive tasks and can transcribe previously recorded audio, which significantly enhances overall productivity. The latest version, Dragon Legal v16, is optimized for Windows 11 and maintains compatibility with Windows 10, offering accessibility features such as playback of dictated content and advanced macro commands for users with physical or cognitive difficulties. Moreover, it integrates effortlessly with Dragon Anywhere Mobile, a cloud-based dictation solution available on both iOS and Android platforms, ensuring that legal professionals can stay productive even when they are away from their desks. The array of features provided by Dragon Legal makes it an essential tool for optimizing workflow in the demanding legal environment. Ultimately, this software not only streamlines the drafting process but also supports the unique needs of legal practitioners, allowing them to focus on their core responsibilities more effectively.

List of the Top Speech Recognition Software in 2026 - Page 2

Reviews and comparisons of the top Speech Recognition software currently available

SoapBox

INVOX Medical

e-Speaking

Alibaba Cloud Intelligent Speech Interaction

FirstLanguage

Picovoice

Work by Speech

SpeechPulse

Yandex SpeechKit

Gladia

MAI-Transcribe-1

Gemini Audio

Go Transcribe

Calldrip

BigHand Dictation and Speech Recognition

LumenVox Automatic Speech Recognition (ASR)

Phonexia Speech Platform

TranscribeMe

WebsiteVoice

Symbl

Azure Speaker Recognition

Voice Pro

Deepgram

Azure AI Speech

Dragon Legal

List of the Top Speech Recognition Software in 2026 - Page 2

Reviews and comparisons of the top Speech Recognition software currently available

SoapBox

INVOX Medical

e-Speaking

Alibaba Cloud Intelligent Speech Interaction

FirstLanguage

Picovoice

Work by Speech

SpeechPulse

Yandex SpeechKit

Gladia

MAI-Transcribe-1

Gemini Audio

Go Transcribe

Calldrip

BigHand Dictation and Speech Recognition

LumenVox Automatic Speech Recognition (ASR)

Phonexia Speech Platform

TranscribeMe

WebsiteVoice

Symbl

Azure Speaker Recognition

Voice Pro

Deepgram

Azure AI Speech

Dragon Legal

Categories Related to Speech Recognition Software