Creating a Voice Modeling Task
You can view the preset voices of MetaStudio on the Video Production or Livestreams page. If the preset voices cannot meet your requirements, you can use models to customize voices.
Constraints
Only enterprise users can customize voices on MetaStudio.
Preparations
Before creating a voice modeling task, prepare the following. For details, see Procedure.
- If you select Script Upload, record the audio in advance by following the recording guide on the voice modeling page.
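Table 1 states that the recording should be a WAV file of 10 to 30 minutes. As a minimal sketch, the length of a recording can be checked locally before upload with Python's standard `wave` module. The function names and the status strings below are illustrative assumptions, not part of MetaStudio, and the console may apply further checks that are not covered here.

```python
import wave

# Duration bounds taken from Table 1 (10 to 30 minutes).
MIN_SECONDS = 10 * 60
MAX_SECONDS = 30 * 60


def wav_duration_seconds(path):
    """Return the duration of a WAV file in seconds."""
    with wave.open(path, "rb") as wav:
        return wav.getnframes() / wav.getframerate()


def check_recording_length(seconds):
    """Classify a duration against the 10 to 30 minute range (hypothetical helper)."""
    if seconds < MIN_SECONDS:
        return "too short"
    if seconds > MAX_SECONDS:
        return "too long"
    return "ok"
```

For example, `check_recording_length(wav_duration_seconds("recording.wav"))` classifies a local file, where `recording.wav` is a placeholder path.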
Procedure
- Log in to the MetaStudio console.
- Click Create under Voice modeling.
On the page displayed, the area on the left is for voice modeling, and the area on the right shows the voice modeling process.
Figure 1 Customizing a voice
- Under the Huawei Models tab, configure voice modeling parameters.
For details, see Table 1.
Table 1 GUI operations

| Parameter | Description |
| --- | --- |
| Voice modeling (advanced edition) | Only 100 script items are required for voice modeling (advanced edition). Record a WAV audio file of 10 to 30 minutes (recommended: 15 minutes). The remaining voice modeling quota is displayed. |
| Voice Settings | Enter a voice name. Example: emotion_joyful_healing |
| Voice Gender | Gender of the voice. Options: Male, Female |
| Input Language | Language of the voice. Options: Chinese, English |
| Voice Tag | Tag of the voice. Options: News, Marketing. A script for each of these tags is preset in MetaStudio, as shown in Script Examples (Advanced Edition). When using a preset script, you must select the corresponding tag. |
| Produce Voice | The voice modeling method is Script Upload. Follow the recording guide on the GUI to record the 100 script items in a WAV file. Upload the WAV file directly; it does not need to be compressed or to include TXT files. If the preset script is not used, the voice tag only indicates the application scenario. |
- Click Submit.
The Information dialog box is displayed, notifying you of the remaining voice modeling quota and indicating that one resource will be consumed this time.
- After confirming the information, click Submit.
After the voice modeling task is submitted, the message Production task submitted is displayed, as shown in Figure 2.
The task review takes about one day. After the task is approved, voice modeling can start. Modeling takes about 1 to 3 working days.
- You can click View Production Tasks to view the review progress of the voice modeling task.
When the status changes to Reviewed, algorithm training is automatically started. If there are multiple algorithm training tasks, queuing and delay may occur.
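The options listed in Table 1 can also be checked locally before the form is filled in on the console. The sketch below is only an illustration: `VoiceSettings` and `validate` are hypothetical names, not MetaStudio types or APIs, and the allowed values are simply those shown in Table 1.

```python
from dataclasses import dataclass

# Allowed values as listed in Table 1.
GENDERS = {"Male", "Female"}
LANGUAGES = {"Chinese", "English"}
TAGS = {"News", "Marketing"}


@dataclass
class VoiceSettings:
    """Hypothetical container for the form fields in Table 1."""
    name: str
    gender: str
    language: str
    tag: str


def validate(settings):
    """Return a list of problems; an empty list means all fields match Table 1."""
    problems = []
    if not settings.name.strip():
        problems.append("voice name is empty")
    if settings.gender not in GENDERS:
        problems.append("unknown gender: " + settings.gender)
    if settings.language not in LANGUAGES:
        problems.append("unknown language: " + settings.language)
    if settings.tag not in TAGS:
        problems.append("unknown tag: " + settings.tag)
    return problems
```

Calling `validate(VoiceSettings("emotion_joyful_healing", "Female", "Chinese", "News"))` returns an empty list, because every field matches an option in Table 1.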