Updated on 2025-09-12 GMT+08:00

Overview

Welcome to Speech Interaction Service API Reference.

Speech Interaction Service (SIS) lets you get instant responses by accessing and calling APIs in real time.

The APIs provided by SIS are proprietary APIs.

Table 1 Real-Time Automatic Speech Recognition (RASR) API

API

Description

RASR

This is a WebSocket API provided by Huawei Cloud, and is used for real-time speech recognition. An audio file is transmitted by fragments. The server can return the intermediate results during the recognition process and the final recognition result after the process is complete.

Table 2 Short Sentence Recognition API

API

Description

Short Sentence Recognition

This API is used for real-time recognition of short sentences. The entire audio is uploaded at a time, and the recognition result is returned immediately.

Table 3 Real-Time TTS

API

Description

Real-Time TTS

Real-Time TTS transforms text into natural-sounding speech using cutting-edge voice technology and deep learning algorithms. You can access and call APIs in real time to generate audio from your input text. By selecting timbres and customizing volume, speed, and pitch, you can tailor the audio format, offering personalized pronunciation services for both businesses and individuals.

Table 4 Hot word management API

API

Description

Hot Word Management

If there are specific terms in your business domain with subpar default recognition performance, consider utilizing hot word management to incorporate these terms into the lexicon, thereby enhancing recognition accuracy.