AssemblyAI
Transcription API with built-in AI features: speaker labels, sentiment analysis, topic detection, PII redaction, and LeMUR (LLM over audio). Best for extracting structured insight from audio.
Deepgram
Real-time and batch speech recognition API with <300ms latency. Supports 30+ languages, speaker diarization, and custom vocabulary. Nova-3 model is best-in-class for English accuracy.