Updated on 2025-08-26 GMT+08:00

API Description

The Real-Time ASR API complies with the WebSocket protocol, and provides three modes: Streaming, Continuous, and Single-sentence.

The handshake request wss-URI of each mode differs, but the formats of requests and responses are the same.

Developers can use a WebSocket software package or library interface in the Java, Python, or C++ language to handshake with the Real-Time ASR engine, send the speech data, and obtain the recognition result. After the recognition work is done, close the WebSocket connection.

When a client uses the WebSocket protocol to access the RASR APIs, the connection can last 5 hours at most. If the WebSocket connection lasts more than 5 hours, the server automatically closes it.