AsyncAPI channel · Deepgram · Deepgram Speech-to-Text Streaming Events

/v1/listen

WebSocket channel for real-time speech-to-text streaming. The client sends binary audio frames and receives JSON transcription events. Connection parameters include model, language, punctuate, diarize, smart_format, interim_results, utterance_end_ms, vad_events, and encoding options.

Provider: Deepgram AsyncAPI: v2.6.0 Spec: Deepgram Speech-to-Text Streaming Events Operations: 2 Messages: 8

Channel address

/v1/listen

Operations

publish
sendAudioData
Send audio data for real-time transcription
receiveTranscriptionEvents
Receive transcription events

Messages

AudioFrame
Binary audio data frame
Content-Type: application/octet-stream
CloseStream
Signal to close the audio stream
Content-Type: application/json
KeepAlive
Keep the connection alive
Content-Type: application/json
TranscriptResult
Real-time transcription result
Content-Type: application/json
SpeechStarted
Speech activity detected
Content-Type: application/json
UtteranceEnd
End of utterance detected
Content-Type: application/json
StreamMetadata
Stream metadata information
Content-Type: application/json
StreamError
Stream error event
Content-Type: application/json

About AsyncAPI

The AsyncAPI specification describes event-driven APIs the way OpenAPI describes request/response APIs. A channel is the named pipe — a webhook URL, a Kafka topic, a WebSocket route, an MQTT subject — that producers and consumers publish or subscribe to. Each channel carries one or more messages with structured payloads, and an operation declares whether a given party sends or receives on that channel.

Browse every event-driven channel on the APIs.io network or compare with the broader Naftiko capability, Agent Skill, and MCP server surfaces of the same providers.