Voice synthesis, cloning, transcription, and AI-powered communication tools.
ElevenLabs is an AI voice synthesis platform that creates highly realistic speech from text using advanced neural networks. It offers voice cloning, multilingual support, and real-time generation for various audio content needs.
AssemblyAI is a speech-to-text and audio intelligence API platform offering high-accuracy transcription alongside advanced NLP features like sentiment analysis, topic detection, and LLM-powered audio understanding. It targets developers and enterprises building voice-enabled or audio-processing applications.
AI-powered cloud call center with smart routing, real-time transcription, and 35+ CRM integrations. Trusted by 2,500+ companies for inbound and outbound calls.
Wispr Flow is an AI-powered voice dictation tool for macOS that enables system-wide speech-to-text across any application, using on-device and cloud AI to transcribe, clean, and format spoken input in real time. It differentiates itself by removing filler words, adapting to user vocabulary, and integrating directly into existing workflows without requiring a dedicated interface.
AI-powered cloud phone system with noise cancellation, call transcription, and smart IVR. Built for remote sales and support teams.
Speechify is a leading text-to-speech platform that converts documents, PDFs, web pages, and ebooks into high-quality AI-narrated audio. It offers 200+ natural voices across 30+ languages with speed controls up to 4.5x, available via iOS, Android, Chrome extension, and desktop apps.
Descript is an all-in-one audio and video editor that transcribes recordings and allows editing through the transcript itself. It features AI-powered tools including Overdub voice cloning, Studio Sound noise removal, and automatic filler-word detection.
Play.ht is an AI-powered text-to-speech platform offering 900+ voices across 140+ languages, real-time voice cloning, and a developer-friendly API. It serves content creators and businesses needing high-quality synthetic audio at scale.
Murf AI is a cloud-based AI voice generator offering 120+ lifelike voices across 20+ languages, built-in voice customization, and a media editor for syncing audio to video or images. It targets professional content production use cases from e-learning to marketing narration.
Resemble AI is a voice synthesis and cloning platform that enables users to create custom AI voices, clone existing voices, and generate speech programmatically via API. It offers features including real-time voice generation, emotion control, neural audio editing, and enterprise-grade integration capabilities.
LOVO AI is a text-to-speech and AI voice generation platform offering 500+ voices across 100+ languages, along with Genny, its integrated AI video and script editor. It targets professionals needing broadcast-quality voiceovers without recording infrastructure.
CallHippo is a cloud-based business phone system that offers VoIP calling, call recording, and basic voice analytics. It focuses primarily on traditional telephony features with some AI-powered call insights and analytics capabilities.