创建基础版语音训练任务
功能介绍
用户创建语音训练基础版任务,该接口会返回一个obs上传地址,用于上传语音文件。
支持2种方式上传语音文件:
-
语音文件和文本文件打包成zip上传:语音文件已经切分成20个wav文件,每个语音文件对应一个txt文本文件,所有文件打包成zip文件。语音文件命名规则:0.wav~19.wav;文本文件命名规则:0.txt~19.txt。
-
语音文件和文本文件逐句上传:每次上传一句语料的语音文件和文本文件,再调用“确认在线录音结果”接口确认语音和文本内容是否一致。确认成功后再上传和确认下一句。
文件上传后,调用“提交语音训练任务”接口,启动审核和训练。
调用方法
请参见如何调用API。
URI
POST /v1/{project_id}/voice-training-manage/user/basic-jobs
参数 |
是否必选 |
参数类型 |
描述 |
---|---|---|---|
project_id |
是 |
String |
项目ID,获取方法请参考获取项目ID。 |
请求参数
参数 |
是否必选 |
参数类型 |
描述 |
---|---|---|---|
X-Auth-Token |
否 |
String |
用户Token。使用Token鉴权方式时必选。 通过调用IAM服务获取用户Token接口获取。 响应消息头中X-Subject-Token的值。 |
Authorization |
否 |
String |
使用AK/SK方式认证时必选,携带的鉴权信息。 |
X-Sdk-Date |
否 |
String |
使用AK/SK方式认证时必选,请求的发生时间。 格式为(YYYYMMDD'T'HHMMSS'Z')。 |
X-Project-Id |
否 |
String |
使用AK/SK方式认证时必选,携带项目ID信息。 |
X-App-UserId |
否 |
String |
第三方用户ID。不允许输入中文。 |
参数 |
是否必选 |
参数类型 |
描述 |
---|---|---|---|
tag |
否 |
String |
任务标签。
|
description |
否 |
String |
一段描述信息,会呈现在资产库中。 |
sex |
否 |
String |
语音性别,是男性声音还是女性声音。
默认取值: FEMALE |
voice_name |
是 |
String |
音色名称。该名称会作为资产库中音色模型资产名称。 |
language |
否 |
String |
训练语言,当前仅支持中文。
默认取值: CN |
create_type |
否 |
String |
任务创建方式。
|
phone |
否 |
String |
手机号 |
dhtms_job_id |
否 |
String |
形象制作任务id |
batch_name |
否 |
String |
批次名称 |
output_language |
否 |
String |
模型输出语言类型 |
custom_text |
否 |
String |
自定义试听文本 |
响应参数
状态码: 200
参数 |
参数类型 |
描述 |
---|---|---|
job_id |
String |
任务id。 |
training_data_uploading_url |
String |
上传训练数据的地址。训练数据需打包成zip文件后,上传至该url。 create_type取值为package时设置。
说明:
通过该obs地址上传时,需设置content-type为application/zip。 |
segment_uploading_url |
segment_uploading_url object |
分句上传任务的上传地址,create_type为segment时设置。 |
authorization_letter_uploading_url |
String |
授权书的上传地址。 |
参数 |
参数类型 |
描述 |
---|---|---|
audio_uploading_url |
Array of strings |
音频上传的地址。 通过该obs地址上传时,需设置content-type为audio/wav |
txt_uploading_url |
Array of strings |
文本上传的地址。 通过该obs地址上传时需设置content-type为text/plain |
状态码: 400
参数 |
参数类型 |
描述 |
---|---|---|
error_code |
String |
错误码。 |
error_msg |
String |
错误描述。 |
请求示例
POST https://{endpoint}/v1/3f0924078d1b471c884a5383d4dec9fa/voice-training-manage/user/basic-jobs { "tag" : "ECOMMERCE", "description" : "这是一段女声", "sex" : "FEMALE", "voice_name" : "温柔女声", "language" : "CN", "create_type" : "PACKAGE" }
响应示例
状态码: 200
处理成功返回。
{ "job_id" : "26f06524-4f75-4b3a-a853-b649a21aaf66", "training_data_uploading_url" : "https://my-bucket/data.zip", "segment_uploading_url" : { "audio_uploading_url" : [ "https://my-bucket/data0.wav" ], "txt_uploading_url" : [ "https://my-bucket/data0.txt" ] }, "authorization_letter_uploading_url" : "https://my-bucket/data" }
SDK代码示例
SDK代码示例如下。
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 55 |
package com.huaweicloud.sdk.test; import com.huaweicloud.sdk.core.auth.ICredential; import com.huaweicloud.sdk.core.auth.BasicCredentials; import com.huaweicloud.sdk.core.exception.ConnectionException; import com.huaweicloud.sdk.core.exception.RequestTimeoutException; import com.huaweicloud.sdk.core.exception.ServiceResponseException; import com.huaweicloud.sdk.metastudio.v1.region.MetaStudioRegion; import com.huaweicloud.sdk.metastudio.v1.*; import com.huaweicloud.sdk.metastudio.v1.model.*; public class CreateTrainingBasicJobSolution { public static void main(String[] args) { // The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. // In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment String ak = System.getenv("CLOUD_SDK_AK"); String sk = System.getenv("CLOUD_SDK_SK"); String projectId = "{project_id}"; ICredential auth = new BasicCredentials() .withProjectId(projectId) .withAk(ak) .withSk(sk); MetaStudioClient client = MetaStudioClient.newBuilder() .withCredential(auth) .withRegion(MetaStudioRegion.valueOf("<YOUR REGION>")) .build(); CreateTrainingBasicJobRequest request = new CreateTrainingBasicJobRequest(); CreateTrainingJobReq body = new CreateTrainingJobReq(); body.withCreateType(CreateTrainingJobReq.CreateTypeEnum.fromValue("PACKAGE")); body.withLanguage("CN"); body.withVoiceName("温柔女声"); body.withSex(CreateTrainingJobReq.SexEnum.fromValue("FEMALE")); body.withDescription("这是一段女声"); body.withTag(CreateTrainingJobReq.TagEnum.fromValue("ECOMMERCE")); request.withBody(body); try { CreateTrainingBasicJobResponse response = client.createTrainingBasicJob(request); System.out.println(response.toString()); } catch (ConnectionException e) { e.printStackTrace(); } catch (RequestTimeoutException e) { e.printStackTrace(); } catch (ServiceResponseException e) { e.printStackTrace(); System.out.println(e.getHttpStatusCode()); System.out.println(e.getRequestId()); System.out.println(e.getErrorCode()); System.out.println(e.getErrorMsg()); } } } |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 |
# coding: utf-8 import os from huaweicloudsdkcore.auth.credentials import BasicCredentials from huaweicloudsdkmetastudio.v1.region.metastudio_region import MetaStudioRegion from huaweicloudsdkcore.exceptions import exceptions from huaweicloudsdkmetastudio.v1 import * if __name__ == "__main__": # The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. # In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment ak = os.environ["CLOUD_SDK_AK"] sk = os.environ["CLOUD_SDK_SK"] projectId = "{project_id}" credentials = BasicCredentials(ak, sk, projectId) client = MetaStudioClient.new_builder() \ .with_credentials(credentials) \ .with_region(MetaStudioRegion.value_of("<YOUR REGION>")) \ .build() try: request = CreateTrainingBasicJobRequest() request.body = CreateTrainingJobReq( create_type="PACKAGE", language="CN", voice_name="温柔女声", sex="FEMALE", description="这是一段女声", tag="ECOMMERCE" ) response = client.create_training_basic_job(request) print(response) except exceptions.ClientRequestException as e: print(e.status_code) print(e.request_id) print(e.error_code) print(e.error_msg) |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 |
package main import ( "fmt" "github.com/huaweicloud/huaweicloud-sdk-go-v3/core/auth/basic" metastudio "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1" "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1/model" region "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1/region" ) func main() { // The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. // In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment ak := os.Getenv("CLOUD_SDK_AK") sk := os.Getenv("CLOUD_SDK_SK") projectId := "{project_id}" auth := basic.NewCredentialsBuilder(). WithAk(ak). WithSk(sk). WithProjectId(projectId). Build() client := metastudio.NewMetaStudioClient( metastudio.MetaStudioClientBuilder(). WithRegion(region.ValueOf("<YOUR REGION>")). WithCredential(auth). Build()) request := &model.CreateTrainingBasicJobRequest{} createTypeCreateType:= model.GetCreateTypeCreateTypeEnum().PACKAGE languageCreateTrainingJobReq:= "CN" sexCreateTrainingJobReq:= model.GetCreateTrainingJobReqSexEnum().FEMALE descriptionCreateTrainingJobReq:= "这是一段女声" tagTag:= model.GetJobTagTagEnum().ECOMMERCE request.Body = &model.CreateTrainingJobReq{ CreateType: &createTypeCreateType, Language: &languageCreateTrainingJobReq, VoiceName: "温柔女声", Sex: &sexCreateTrainingJobReq, Description: &descriptionCreateTrainingJobReq, Tag: &tagTag, } response, err := client.CreateTrainingBasicJob(request) if err == nil { fmt.Printf("%+v\n", response) } else { fmt.Println(err) } } |
更多编程语言的SDK代码示例,请参见API Explorer的代码示例页签,可生成自动对应的SDK代码示例。
状态码
状态码 |
描述 |
---|---|
200 |
处理成功返回。 |
400 |
参数异常。 |
错误码
请参见错误码。