What Is SIS?
Speech Interaction Service (SIS) allows you to obtain speech interaction results by calling application programming interfaces (APIs) in real time. For example, you can use Automatic Speech Recognition (ASR) to convert speech recordings to editable text, and use Text To Speech (TTS) to convert text into lifelike voices. SIS is applicable to scenarios such as voice customer service inspection, conference records, voice SMS messages, audio books, and telephone follow-ups.
Prerequisites
You must have programming capabilities and be familiar with the Java, Python, and iOS programming languages.
SIS provides APIs for you to convert speech into editable text and returns the recognition result in JSON format. You need to encode the recognition result and save it to a service system or save it in TXT or Excel format.
Using SIS for the First Time
If you are a first-time user, the following information will help you get familiar with SIS:
- Functions
Functions describes SIS functions, including Real-time ASR, Short Sentence Recognition, TTS.
- Getting Started
SIS provides services through open APIs. You can learn how to use SIS by referring to the Speech Interaction Service Getting Started.
- Using SIS
If you are a development engineer familiar with code compilation and want to directly call SIS APIs, see the Speech Interaction Service API Reference or Speech Interaction Service SDK Reference.
- From Beginners to Experts
You can learn how to use SIS by referring to Progressive Knowledge.
Feedback
Was this page helpful?
Provide feedbackThank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.
For any further questions, feel free to contact us through the chatbot.
Chatbot