加载自定义词库
功能介绍
该接口用于加载存放于OBS的自定义词库。
调用方法
请参见如何调用API。
URI
POST /v1.0/{project_id}/clusters/{cluster_id}/thesaurus
参数 |
是否必选 |
参数类型 |
描述 |
---|---|---|---|
project_id |
是 |
String |
项目ID。获取方法请参见获取项目ID和名称。 |
cluster_id |
是 |
String |
指定配置自定义词库的集群ID。 |
请求参数
参数 |
是否必选 |
参数类型 |
描述 |
---|---|---|---|
bucketName |
是 |
String |
词库文件存放的OBS桶(桶类型必须为标准存储或者低频存储,不支持归档存储)。 |
mainObject |
否 |
String |
主词词库文件对象,必须为UTF-8无BOM编码的文本文件,一行一个分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。 |
stopObject |
否 |
String |
停词词库文件对象,必须为UTF-8无BOM编码的文本文件,一行一个分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。 |
synonymObject |
否 |
String |
同义词词库文件,必须为UTF-8无BOM编码的文本文件,一行一组分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。 |
static_main_object |
否 |
String |
静态主词词库文件,必须为UTF-8无BOM编码的文本文件,一行一组分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。仅支持此词库功能上线后的新集群。 |
static_stop_object |
否 |
String |
静态停词词库文件,必须为UTF-8无BOM编码的文本文件,一行一组分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。仅支持此词库功能上线后的新集群。 |
extra_main_object |
否 |
String |
Extra主词词库文件,必须为UTF-8无BOM编码的文本文件,一行一组分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。仅支持此词库功能上线后的新集群。 |
extra_stop_object |
否 |
String |
Extra停词词库文件,必须为UTF-8无BOM编码的文本文件,一行一组分词,文件大小最大支持100M。 7个词库参数至少修改一个词库。注:参数传递""空字符串为清空此词库,不传或传递null为不修改。仅支持此词库功能上线后的新集群。 |
响应参数
无
请求示例
开启并配置词库信息。
POST /v1.0/6204a5bd270343b5885144cf9c8c158d/clusters/4f3deec3-efa8-4598-bf91-560aad1377a3/thesaurus { "bucketName" : "test-bucket", "mainObject" : "word/main.txt", "stopObject" : "word/stop.txt", "synonymObject" : "word/synonym.txt", "static_main_object" : "word/staticMain.txt", "static_stop_object" : "word/staticStop.txt", "extra_main_object" : "word/extraMain.txt", "extra_stop_object" : "word/extraStop.txt" }
响应示例
无
SDK代码示例
SDK代码示例如下。
开启并配置词库信息。
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 48 49 50 51 52 53 54 |
package com.huaweicloud.sdk.test; import com.huaweicloud.sdk.core.auth.ICredential; import com.huaweicloud.sdk.core.auth.BasicCredentials; import com.huaweicloud.sdk.core.exception.ConnectionException; import com.huaweicloud.sdk.core.exception.RequestTimeoutException; import com.huaweicloud.sdk.core.exception.ServiceResponseException; import com.huaweicloud.sdk.css.v1.region.CssRegion; import com.huaweicloud.sdk.css.v1.*; import com.huaweicloud.sdk.css.v1.model.*; public class CreateLoadIkThesaurusSolution { public static void main(String[] args) { // The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. // In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment String ak = System.getenv("CLOUD_SDK_AK"); String sk = System.getenv("CLOUD_SDK_SK"); String projectId = "{project_id}"; ICredential auth = new BasicCredentials() .withProjectId(projectId) .withAk(ak) .withSk(sk); CssClient client = CssClient.newBuilder() .withCredential(auth) .withRegion(CssRegion.valueOf("<YOUR REGION>")) .build(); CreateLoadIkThesaurusRequest request = new CreateLoadIkThesaurusRequest(); request.withClusterId("{cluster_id}"); LoadCustomThesaurusReq body = new LoadCustomThesaurusReq(); body.withSynonymObject("word/synonym.txt"); body.withStopObject("word/stop.txt"); body.withMainObject("word/main.txt"); body.withBucketName("test-bucket"); request.withBody(body); try { CreateLoadIkThesaurusResponse response = client.createLoadIkThesaurus(request); System.out.println(response.toString()); } catch (ConnectionException e) { e.printStackTrace(); } catch (RequestTimeoutException e) { e.printStackTrace(); } catch (ServiceResponseException e) { e.printStackTrace(); System.out.println(e.getHttpStatusCode()); System.out.println(e.getRequestId()); System.out.println(e.getErrorCode()); System.out.println(e.getErrorMsg()); } } } |
开启并配置词库信息。
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 |
# coding: utf-8 import os from huaweicloudsdkcore.auth.credentials import BasicCredentials from huaweicloudsdkcss.v1.region.css_region import CssRegion from huaweicloudsdkcore.exceptions import exceptions from huaweicloudsdkcss.v1 import * if __name__ == "__main__": # The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. # In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment ak = os.environ["CLOUD_SDK_AK"] sk = os.environ["CLOUD_SDK_SK"] projectId = "{project_id}" credentials = BasicCredentials(ak, sk, projectId) client = CssClient.new_builder() \ .with_credentials(credentials) \ .with_region(CssRegion.value_of("<YOUR REGION>")) \ .build() try: request = CreateLoadIkThesaurusRequest() request.cluster_id = "{cluster_id}" request.body = LoadCustomThesaurusReq( synonym_object="word/synonym.txt", stop_object="word/stop.txt", main_object="word/main.txt", bucket_name="test-bucket" ) response = client.create_load_ik_thesaurus(request) print(response) except exceptions.ClientRequestException as e: print(e.status_code) print(e.request_id) print(e.error_code) print(e.error_msg) |
开启并配置词库信息。
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 43 44 45 46 47 |
package main import ( "fmt" "github.com/huaweicloud/huaweicloud-sdk-go-v3/core/auth/basic" css "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/css/v1" "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/css/v1/model" region "github.com/huaweicloud/huaweicloud-sdk-go-v3/services/css/v1/region" ) func main() { // The AK and SK used for authentication are hard-coded or stored in plaintext, which has great security risks. It is recommended that the AK and SK be stored in ciphertext in configuration files or environment variables and decrypted during use to ensure security. // In this example, AK and SK are stored in environment variables for authentication. Before running this example, set environment variables CLOUD_SDK_AK and CLOUD_SDK_SK in the local environment ak := os.Getenv("CLOUD_SDK_AK") sk := os.Getenv("CLOUD_SDK_SK") projectId := "{project_id}" auth := basic.NewCredentialsBuilder(). WithAk(ak). WithSk(sk). WithProjectId(projectId). Build() client := css.NewCssClient( css.CssClientBuilder(). WithRegion(region.ValueOf("<YOUR REGION>")). WithCredential(auth). Build()) request := &model.CreateLoadIkThesaurusRequest{} request.ClusterId = "{cluster_id}" synonymObjectLoadCustomThesaurusReq:= "word/synonym.txt" stopObjectLoadCustomThesaurusReq:= "word/stop.txt" mainObjectLoadCustomThesaurusReq:= "word/main.txt" request.Body = &model.LoadCustomThesaurusReq{ SynonymObject: &synonymObjectLoadCustomThesaurusReq, StopObject: &stopObjectLoadCustomThesaurusReq, MainObject: &mainObjectLoadCustomThesaurusReq, BucketName: "test-bucket", } response, err := client.CreateLoadIkThesaurus(request) if err == nil { fmt.Printf("%+v\n", response) } else { fmt.Println(err) } } |
更多编程语言的SDK代码示例,请参见API Explorer的代码示例页签,可生成自动对应的SDK代码示例。
状态码
状态码 |
描述 |
---|---|
200 |
请求已成功。 |
403 |
请求被拒绝访问。返回该状态码,表明请求能够到达服务端,且服务端能够理解用户请求,但是拒绝做更多的事情,因为该请求被设置为拒绝访问,建议直接修改该请求,不要重试该请求。 |
500 |
表明服务端能被请求访问到,但是不能理解用户的请求。 |
错误码
请参见错误码。