Help Center> Cloud Search Service> User Guide> Simplified-Traditional Chinese Conversion Plugin

Simplified-Traditional Chinese Conversion Plugin

By default, a simplified-traditional Chinese conversion plugin is installed in CSS. The plugin implements conversion between simplified and traditional Chinese. With this plugin, you can search index data containing the corresponding simplified Chinese based on the traditional Chinese keyword, and vice versa.

The simplified-traditional Chinese conversion plugin can be used as the analyzer, tokenizer, token-filter, or char-filter.

The simplified-traditional Chinese conversion plugin provides the following two conversion types:

  • s2t: converts the simplified Chinese to the traditional Chinese.
  • t2s: converts the traditional Chinese to the simplified Chinese.

Examples (Version 6.5.4)

  1. Log in to the CSS management console.
  2. In the left navigation pane, click Clusters to switch to the Clusters page.
  3. In the cluster list, locate the row where the target cluster resides and click Kibana in the Operation column.

    If the target cluster has the security mode enabled, enter the username and password you set when creating the cluster.

  4. In the left navigation pane of the displayed Kibana window, click Dev Tools. Click Get to work to switch to the Console page.
  5. On the Console page, run the following command to create index stconvert and specify a user-defined mapping to define the data type:
    PUT /stconvert
    {
        "settings": {
            "number_of_shards": 1,
            "number_of_replicas": 0,
            "analysis": {
                "analyzer": {
                    "ts_ik": {
                        "tokenizer": "ik_smart",
                        "char_filter": [
                            "tsconvert",
                            "stconvert"
                        ]
                    }
                },
                "char_filter": {
                    "tsconvert": {
                        "type": "stconvert",
                        "convert_type": "t2s"
                    },
                    "stconvert": {
                        "type": "stconvert",
                        "convert_type": "s2t"
                    }
                }
            }
        },
        "mappings": {
            "type": {
                "properties": {
                    "desc": {
                        "type": "text",
                        "analyzer": "ts_ik"
                    }
                }
            }
        }
    }

    The command output is similar to the following:

    {
      "acknowledged" : true,
      "shards_acknowledged" : true,
      "index" : "stconvert"
    }
  6. On the Console page, run the following command to import data to index stconvert:
    POST /stconvert/type/1
    {
      "desc": "Text in traditional Chinese"
    }

    If the value of failed in the command output is 0, the data is imported successfully.

  7. On the Console page, run the following command to search for the keyword and view the search result:
    GET /stconvert/_search
    {
        "query": {
            "match": {
          "desc": "Keyword"    
            }
        }
    }

    The command output is similar to the following:

    {
      "took" : 15,
      "timed_out" : false,
      "_shards" : {
        "total" : 1,
        "successful" : 1,
        "skipped" : 0,
        "failed" : 0
      },
      "hits" : {
        "total" : 1,
        "max_score" : 0.5753642,
        "hits" : [
          {
            "_index" : "stconvert",
            "_type" : "type",
            "_id" : "1",
            "_score" : 0.5753642,
            "_source" : {
              "desc": "Text in traditional Chinese"
            }
          }
        ]
      }
    }