Updated: 2024-11-07 GMT+08:00

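Creating a Basic Voice Training Task

Function

This API creates a basic-edition voice training task. It returns an OBS upload URL to which you upload the voice files.

Voice files can be uploaded in either of two ways:

  • Upload the voice and text files as one ZIP package: The audio has already been split into 20 WAV files, each paired with one TXT file, and all files are packed into a single ZIP file. Voice files are named 0.wav to 19.wav; text files are named 0.txt to 19.txt.

  • Upload the voice and text files sentence by sentence: Upload the voice file and text file for one sentence of the corpus, then call the "Confirm Online Recording Result" API to check that the voice matches the text. After confirmation succeeds, upload and confirm the next sentence.

After the files are uploaded, call the "Submit Voice Training Task" API to start review and training.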
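The ZIP layout described above can be checked locally before uploading. The following is a minimal Python sketch, assuming the 20 file pairs sit at the root of the archive (the page does not say whether subdirectories are accepted). `expected_names` and `missing_from_package` are hypothetical helper names, not part of any SDK.

```python
import io
import zipfile

PAIR_COUNT = 20  # the page specifies 0.wav to 19.wav and 0.txt to 19.txt


def expected_names(count=PAIR_COUNT):
    """Return the file names the package is expected to contain."""
    names = []
    for index in range(count):
        names.append(f"{index}.wav")
        names.append(f"{index}.txt")
    return names


def missing_from_package(zip_bytes):
    """List expected files that are absent from the ZIP archive.

    Assumption: files are stored at the archive root, not in a subfolder.
    """
    with zipfile.ZipFile(io.BytesIO(zip_bytes)) as archive:
        present = set(archive.namelist())
    return [name for name in expected_names() if name not in present]
```

An empty list from `missing_from_package` means every expected name was found; it does not check that the audio and text actually match.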

Calling Method

For details, see "How to Call an API".

URI

POST /v1/{project_id}/voice-training-manage/user/basic-jobs

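Table 1 Path parameters

| Parameter | Mandatory | Type | Description |
|---|---|---|---|
| project_id | | String | Project ID. For how to obtain it, see "Obtaining a Project ID". |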

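Request Parameters

Table 2 Request header parameters

| Parameter | Mandatory | Type | Description |
|---|---|---|---|
| X-Auth-Token | | String | User token. Mandatory for token-based authentication. Obtain it by calling the IAM API for obtaining a user token; the token is the value of X-Subject-Token in the response header. |
| Authorization | | String | Authentication information. Mandatory for AK/SK-based authentication. |
| X-Sdk-Date | | String | Time when the request is sent. Mandatory for AK/SK-based authentication. Format: YYYYMMDD'T'HHMMSS'Z'. |
| X-Project-Id | | String | Project ID. Mandatory for AK/SK-based authentication. |
| X-App-UserId | | String | Third-party user ID. Chinese characters are not allowed. |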
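The X-Sdk-Date format (YYYYMMDD'T'HHMMSS'Z') can be produced as shown in this minimal Python sketch. It assumes the timestamp is expressed in UTC, which the trailing 'Z' suggests but the page does not state outright. `sdk_date` is a hypothetical helper name.

```python
from datetime import datetime, timezone


def sdk_date(moment=None):
    """Format a time as YYYYMMDD'T'HHMMSS'Z'.

    Assumption: the value is in UTC. A naive datetime passed in is
    interpreted as local time by astimezone() before conversion.
    """
    if moment is None:
        moment = datetime.now(timezone.utc)
    return moment.astimezone(timezone.utc).strftime("%Y%m%dT%H%M%SZ")
```

This only matters when requests are assembled by hand; the SDK examples on this page take the AK and SK directly and never set this header in user code.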

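Table 3 Request body parameters

| Parameter | Mandatory | Type | Description |
|---|---|---|---|
| tag | | String | Task tag. ECOMMERCE: e-commerce. NEWS: news. MARKETING: marketing. |
| description | | String | Description text, shown in the asset library. |
| sex | | String | Gender of the voice. FEMALE: female voice. MALE: male voice. Default: FEMALE. |
| voice_name | | String | Timbre name. It is used as the name of the voice model asset in the asset library. |
| language | | String | Training language. Currently only Chinese is supported. CN: Chinese. EN: English. Default: CN. |
| create_type | | String | How the task is created. PACKAGE: all data is uploaded in one ZIP package. SEGMENT: data is uploaded sentence by sentence. |
| phone | | String | Mobile number. |
| dhtms_job_id | | String | Image production task ID. |
| batch_name | | String | Batch name. |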
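The enumerated values in Table 3 lend themselves to a client-side check before the request is sent. The following Python sketch is illustrative only: `ALLOWED_VALUES` and `invalid_fields` are hypothetical names, and treating the values as case-sensitive is an assumption, since the page lists them only in upper case.

```python
# Enumerations copied from Table 3 of this page.
ALLOWED_VALUES = {
    "tag": {"ECOMMERCE", "NEWS", "MARKETING"},
    "sex": {"FEMALE", "MALE"},
    "language": {"CN", "EN"},
    "create_type": {"PACKAGE", "SEGMENT"},
}


def invalid_fields(body):
    """Return the names of enumerated fields whose value is not listed.

    Fields absent from the body are skipped; this sketch does not decide
    which fields are mandatory.
    """
    return sorted(
        key
        for key, allowed in ALLOWED_VALUES.items()
        if key in body and body[key] not in allowed
    )
```

For example, `invalid_fields({"sex": "female"})` flags `sex` under the case-sensitivity assumption, while the request body shown in the example request passes.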

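Response Parameters

Status code: 200

Table 4 Response body parameters

| Parameter | Type | Description |
|---|---|---|
| job_id | String | Task ID. |
| training_data_uploading_url | String | URL for uploading training data. Pack the training data into a ZIP file and upload it to this URL. Set when create_type is PACKAGE. Note: When uploading through this OBS URL, set Content-Type to application/zip. |
| segment_uploading_url | segment_uploading_url object | Upload URLs for a sentence-by-sentence task. Set when create_type is SEGMENT. |
| authorization_letter_uploading_url | String | URL for uploading the authorization letter. |

Table 5 segment_uploading_url

| Parameter | Type | Description |
|---|---|---|
| audio_uploading_url | Array of strings | URLs for uploading audio. When uploading through these OBS URLs, set Content-Type to audio/wav. |
| txt_uploading_url | Array of strings | URLs for uploading text. When uploading through these OBS URLs, set Content-Type to text/plain. |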
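The Content-Type requirements above can be captured when preparing an upload. The following is a minimal Python sketch using only the standard library. It assumes the upload URLs accept HTTP PUT, which is common for pre-signed object storage URLs but is not stated on this page. `CONTENT_TYPES` and `build_upload_request` are hypothetical names.

```python
import urllib.request

# Content types required by this page for each kind of upload URL.
CONTENT_TYPES = {
    "package": "application/zip",  # training_data_uploading_url
    "audio": "audio/wav",          # audio_uploading_url entries
    "text": "text/plain",          # txt_uploading_url entries
}


def build_upload_request(url, payload, kind):
    """Prepare (but do not send) an upload request for one file.

    Assumption: the upload URL accepts HTTP PUT; the page does not
    name the method.
    """
    return urllib.request.Request(
        url,
        data=payload,
        method="PUT",
        headers={"Content-Type": CONTENT_TYPES[kind]},
    )
```

Passing the returned object to `urllib.request.urlopen` would send it; keeping construction separate from sending makes the headers easy to inspect first.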

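Status code: 400

Table 6 Response body parameters

| Parameter | Type | Description |
|---|---|---|
| error_code | String | Error code. |
| error_msg | String | Error description. |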

Example Request

POST https://{endpoint}/v1/3f0924078d1b471c884a5383d4dec9fa/voice-training-manage/user/basic-jobs

{
  "tag" : "ECOMMERCE",
  "description" : "这是一段女声",
  "sex" : "FEMALE",
  "voice_name" : "温柔女声",
  "language" : "CN",
  "create_type" : "PACKAGE"
}

Example Response

Status code: 200

The request was processed successfully.

{
  "job_id" : "26f06524-4f75-4b3a-a853-b649a21aaf66",
  "training_data_uploading_url" : "https://my-bucket/data.zip",
  "segment_uploading_url" : {
    "audio_uploading_url" : [ "https://my-bucket/data0.wav" ],
    "txt_uploading_url" : [ "https://my-bucket/data0.txt" ]
  },
  "authorization_letter_uploading_url" : "https://my-bucket/data"
}
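A caller typically needs only the URLs that match the chosen create_type. The following Python sketch parses a response like the one above. `upload_targets` is a hypothetical helper, and pairing audio and text URLs by index is an assumption; the page does not say how the two arrays correspond.

```python
import json


def upload_targets(response_text, create_type):
    """Return the upload URLs relevant to the chosen create_type.

    PACKAGE yields a one-element list with the ZIP upload URL.
    SEGMENT yields (audio_url, txt_url) tuples, paired by index
    (an assumption of this sketch).
    """
    data = json.loads(response_text)
    if create_type == "PACKAGE":
        return [data["training_data_uploading_url"]]
    if create_type == "SEGMENT":
        segment = data["segment_uploading_url"]
        return list(
            zip(segment["audio_uploading_url"], segment["txt_uploading_url"])
        )
    raise ValueError(f"unsupported create_type: {create_type}")
```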

Example SDK Code

The example SDK code is as follows.

Java

package com.huaweicloud.sdk.test;

import com.huaweicloud.sdk.core.auth.ICredential;
import com.huaweicloud.sdk.core.auth.BasicCredentials;
import com.huaweicloud.sdk.core.exception.ConnectionException;
import com.huaweicloud.sdk.core.exception.RequestTimeoutException;
import com.huaweicloud.sdk.core.exception.ServiceResponseException;
import com.huaweicloud.sdk.metastudio.v1.region.MetaStudioRegion;
import com.huaweicloud.sdk.metastudio.v1.*;
import com.huaweicloud.sdk.metastudio.v1.model.*;


public class CreateTrainingBasicJobSolution {

    public static void main(String[] args) {
        // The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security.
        // In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment
        String ak = System.getenv("CLOUD_SDK_AK");
        String sk = System.getenv("CLOUD_SDK_SK");
        String projectId = "{project_id}";

        ICredential auth = new BasicCredentials()
                .withProjectId(projectId)
                .withAk(ak)
                .withSk(sk);

        MetaStudioClient client = MetaStudioClient.newBuilder()
                .withCredential(auth)
                .withRegion(MetaStudioRegion.valueOf("<YOUR REGION>"))
                .build();
        CreateTrainingBasicJobRequest request = new CreateTrainingBasicJobRequest();
        CreateTrainingJobReq body = new CreateTrainingJobReq();
        body.withCreateType(CreateTrainingJobReq.CreateTypeEnum.fromValue("PACKAGE"));
        body.withLanguage("CN");
        body.withVoiceName("温柔女声");
        body.withSex(CreateTrainingJobReq.SexEnum.fromValue("FEMALE"));
        body.withDescription("这是一段女声");
        body.withTag(CreateTrainingJobReq.TagEnum.fromValue("ECOMMERCE"));
        request.withBody(body);
        try {
            CreateTrainingBasicJobResponse response = client.createTrainingBasicJob(request);
            System.out.println(response.toString());
        } catch (ConnectionException e) {
            e.printStackTrace();
        } catch (RequestTimeoutException e) {
            e.printStackTrace();
        } catch (ServiceResponseException e) {
            e.printStackTrace();
            System.out.println(e.getHttpStatusCode());
            System.out.println(e.getRequestId());
            System.out.println(e.getErrorCode());
            System.out.println(e.getErrorMsg());
        }
    }
}

Python

# coding: utf-8

import os
from huaweicloudsdkcore.auth.credentials import BasicCredentials
from huaweicloudsdkmetastudio.v1.region.metastudio_region import MetaStudioRegion
from huaweicloudsdkcore.exceptions import exceptions
from huaweicloudsdkmetastudio.v1 import *

if __name__ == "__main__":
    # The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security.
    # In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment
    ak = os.environ["CLOUD_SDK_AK"]
    sk = os.environ["CLOUD_SDK_SK"]
    projectId = "{project_id}"

    credentials = BasicCredentials(ak, sk, projectId)

    client = MetaStudioClient.new_builder() \
        .with_credentials(credentials) \
        .with_region(MetaStudioRegion.value_of("<YOUR REGION>")) \
        .build()

    try:
        request = CreateTrainingBasicJobRequest()
        request.body = CreateTrainingJobReq(
            create_type="PACKAGE",
            language="CN",
            voice_name="温柔女声",
            sex="FEMALE",
            description="这是一段女声",
            tag="ECOMMERCE"
        )
        response = client.create_training_basic_job(request)
        print(response)
    except exceptions.ClientRequestException as e:
        print(e.status_code)
        print(e.request_id)
        print(e.error_code)
        print(e.error_msg)

Go

package main

import (
	"fmt"
	"os" // needed for os.Getenv below

	"github.com/huaweicloud/huaweicloud-sdk-go-v3/core/auth/basic"
	metastudio "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1"
	"github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1/model"
	region "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/metastudio/v1/region"
)

func main() {
	// The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security.
	// In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment
	ak := os.Getenv("CLOUD_SDK_AK")
	sk := os.Getenv("CLOUD_SDK_SK")
	projectId := "{project_id}"

	auth := basic.NewCredentialsBuilder().
		WithAk(ak).
		WithSk(sk).
		WithProjectId(projectId).
		Build()

	client := metastudio.NewMetaStudioClient(
		metastudio.MetaStudioClientBuilder().
			WithRegion(region.ValueOf("<YOUR REGION>")).
			WithCredential(auth).
			Build())

	request := &model.CreateTrainingBasicJobRequest{}
	createTypeCreateType := model.GetCreateTypeCreateTypeEnum().PACKAGE
	languageCreateTrainingJobReq := "CN"
	sexCreateTrainingJobReq := model.GetCreateTrainingJobReqSexEnum().FEMALE
	descriptionCreateTrainingJobReq := "这是一段女声"
	tagTag := model.GetJobTagTagEnum().ECOMMERCE
	request.Body = &model.CreateTrainingJobReq{
		CreateType:  &createTypeCreateType,
		Language:    &languageCreateTrainingJobReq,
		VoiceName:   "温柔女声",
		Sex:         &sexCreateTrainingJobReq,
		Description: &descriptionCreateTrainingJobReq,
		Tag:         &tagTag,
	}
	response, err := client.CreateTrainingBasicJob(request)
	if err == nil {
		fmt.Printf("%+v\n", response)
	} else {
		fmt.Println(err)
	}
}

For SDK code examples in more programming languages, see the Code Examples tab in API Explorer, which automatically generates the corresponding SDK code.

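Status Codes

| Status Code | Description |
|---|---|
| 200 | The request was processed successfully. |
| 400 | Invalid parameters. |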

Error Codes

See "Error Codes".
