Updated on 2023-11-06 GMT+08:00

What Is SIS?

Speech Interaction Service (SIS) allows you to obtain speech interaction results by calling application programming interfaces (APIs) in real time. For example, you can use Automatic Speech Recognition (ASR) to convert speech recordings to editable text, and use Text To Speech (TTS) to convert text into lifelike voices. SIS is applicable to scenarios such as voice customer service inspection, conference records, voice SMS messages, audio books, and telephone follow-ups.

Prerequisites

You must have programming capabilities and be familiar with the Java, Python, and iOS programming languages.

SIS provides APIs for you to convert speech into editable text and returns the recognition result in JSON format. You need to encode the recognition result and save it to a service system or save it in TXT or Excel format.

Using SIS for the First Time

If you are a first-time user, the following information will help you get familiar with SIS: