Tag: speech recognition

Blog
>
Tag: speech-recognition

What is Automatic Speech Recognition?

How many times a day do you talk to a computer? We’re not referring to the exasperated exclamation you direct at your laptop when it overheats and crashes. We want you to think about the moments you speak to a device and it actually listens.

ASR speech recognition speech-to-text

Enhanced speech recognition model is now available

62% Word Error Rate (WER) improvement for US English

ASR ivr speech recognition

High quality Speech Recognition is now available

We are happy to announce the high quality speech recognition for both audio call records transcription and real-time recognition scenarios.

Grok Voice Agent API now available in Voximplant

Voximplant now includes a native Grok module that connects any Voximplant call to xAI’s Grok Voice Agent API for real-time, speech-to-speech conversations. With a single VoxEngine scenario, you can interact via audio with Grok over phone numbers, SIP trunks and infrastructure, WhatsApp Business, or WebRTC into Grok — all without building custom media gateways or WebSocket streaming infrastructure.

Deepgram Voice Agent now available in Voximplant

Voximplant now includes a native Deepgram module that connects any Voximplant call to Deepgram’s Voice Agent API for real-time, speech‑to‑speech conversations. You can stream audio from phone numbers, SIP trunks, WhatsApp, or WebRTC into Deepgram’s unified agent environment—combining STT, LLM reasoning, and TTS—and play responses via Voximplant’s serverless runtime with minimal latency.

events

LEAP 2025: AI, Startups, & the Future of Tech – Voximplant Is There

Discover the future of tech at LEAP 2025 in Riyadh! Join Voximplant as we dive into the latest AI innovations, startup ecosystems, and groundbreaking technologies shaping tomorrow. Don’t miss this chance to network, learn, and transform your business.

TTS streaming gemini elevenlabs voice agent

Introducing Gemini 2.0 Flash Live API Client and ElevenLabs Streaming TTS integration

New integrations for Voice AI have arrived: Google's Gemini 2.0 Flash model, featuring seamless voice-to-voice conversation capabilities and ElevenLabs low-latency streaming speech synthesis are now available for Voximplant developers

Voximplant Kit updates. April 2025

Check out the latest useful Voximplant Kit updates — we developed chat analytics, improved call history, added new tools for supervisors, expanded scenario capabilities, and updated the softphone. Below is a brief overview of the essential enhancements.

Ultravox adds SIP to its Voice AI Services using Voximplant

Today Ultravox announced they are directly integrating Voximplant into their platform to provide SIP capabilities. The integration builds on Voximplant’s deep telephony and Voice AI tooling

TTS ASR Integration voice ai

OpenAI Client update: gpt-realtime GA alignment

OpenAI has recently announced GA version of their Realtime API that Voximplant now fully supports

Cartesia Realtime TTS now available in Voximplant

Voximplant now includes a native Cartesia module for streaming, low-latency text-to-speech (TTS). You can use a single VoxEngine API to synthesize speech in real time, connect it to any call (PSTN, SIP, WebRTC, WhatsApp) and control playback from a Large Language Model (LLM) or other source, all inside VoxEngine.

voximplant kit podcast voximplant-kit-cc-news product management voximplant-kit-automation-news web sdk webrtc video kit-updates call center ios sdk sip voximplant pstn api

Tag: speech recognition

What is Automatic Speech Recognition?

Enhanced speech recognition model is now available

High quality Speech Recognition is now available

Sign Up for a free Voximplant developer account or talk to our experts

Grok Voice Agent API now available in Voximplant

Deepgram Voice Agent now available in Voximplant

LEAP 2025: AI, Startups, & the Future of Tech – Voximplant Is There

Introducing Gemini 2.0 Flash Live API Client and ElevenLabs Streaming TTS integration

Voximplant Kit updates. April 2025

Ultravox adds SIP to its Voice AI Services using Voximplant

OpenAI Client update: gpt-realtime GA alignment

Cartesia Realtime TTS now available in Voximplant

Sign Up for a free Voximplant developer account or talk to our experts

Tag: speech recognition

Sign Up for a free Voximplant developer account or talk to our experts

Sign Up for a free Voximplant developer account or talk to our experts

Contact Us