What's New
Function Overview
Product Bulletin
- [Notice] Huawei Cloud ModelArts Has Discontinued the Old Version of Training Management
Service Overview
- Infographics
  - What Is ModelArts
- What Is ModelArts?
- Advantages
- Use Cases
- Functions
- AI Development Basics
- Security
- Notes and Constraints
- Permissions Management
- Billing Description
- Quotas
- ModelArts and Other Services
Billing
- Billing Modes
- Billing Item
- Billing Examples
- Changing the Billing Mode
- Renewal
- Bills
- About Arrears
- Stopping Billing
- Cost Management
- Billing FAQs
Getting Started
- How to Use ModelArts
- Using a Custom Algorithm to Build a Handwritten Digit Recognition Model
- Practices for Beginners
ModelArts User Guide (Standard)
- ModelArts Standard Usage
- ModelArts Standard Preparations
- ModelArts Standard Resource Management
- Using ExeML for Zero-Code AI Development
- Using Workflows for Low-Code AI Development
- Development Environments
- Data Management
- Model Training
- Inference Deployment
- Image Management
- Resource Monitoring
- Viewing Audit Logs
  - ModelArts Key Operations Traced by CTS
  - Viewing ModelArts Audit Logs
ModelArts User Guide (Lite Server)
- Before You Start
- Enabling Lite Server Resources
- Configuring Lite Server Resources
- Using Lite Server Resources
  - PyTorch GPU Training and Inference Guide for GPT-2
- Managing Lite Server Resources
ModelArts User Guide (Lite Cluster)
- Before You Start
- Enabling Lite Cluster Resources
- Configuring Lite Cluster Resources
- Using Lite Cluster Resources
- Managing Lite Server Resources
ModelArts User Guide (AI Gallery)
- AI Gallery
- Free Assets
- My Gallery
- Subscription & Use
- Publish & Share
  - Publishing a Free Algorithm
  - Publishing a Free Model
Best Practices
- Official Samples
- Permissions Management
- Notebook
  - Creating, Migrating, and Managing Conda Virtual Environments Based on SFS
- Model Training
- Model Inference
API Reference
- Before You Start
- API Overview
- Calling APIs
- Development Environment Management
- Training Management
- AI Application Management
- App Authentication Management
- Service Management
- Resource Management
- DevServer Management
- Authorization Management
- Managing DevEnviron Instances
  - Querying All Notebook Instances
- Use Cases
- Permissions Policies and Supported Actions
- Common Parameters
- Historical APIs
- Change History
SDK Reference
- Before You Start
- SDK Overview
- Getting Started
- (Optional) Installing the ModelArts SDK Locally
- Session Authentication
- OBS Management
- Data Management
- Training Management (New Version)
  - Training Jobs
  - APIs for Resources and Engine Specifications
    - Obtaining Resource Flavors
    - Obtaining Engine Types
- Training Management (Old Version)
- Model Management
- Service Management
- Change History
FAQs
- General Issues
- Billing
- ExeML (Old Version)
- Data Management (Old Version)
- Notebook (New Version)
- Training Jobs
- Service Deployment
  - Model Management
    - Importing Models
  - Service Deployment
    - Functional Consulting
    - Real-Time Services
- Resource Pools
- API/SDK
- Using PyCharm Toolkit
Troubleshooting
- General Issues
  - Incorrect OBS Path on ModelArts
- ExeML
- DevEnviron
- Training Jobs
- Inference Deployment
- MoXing
- APIs or SDKs
Videos
Preparations (To Be Offline)
- Creating a Huawei ID and Enabling Huawei Cloud Services
- Logging In to the ModelArts Management Console
- Configuring Access Authorization (Global Configuration)
- Creating an OBS Bucket
- Enabling ModelArts Resources
  - ModelArts Resources
  - Pay-Per-Use
User Guide (ExeML)
- ExeML (New Version)
- ExeML (Old Version)
Workflows
- MLOps Overview
- What Is Workflow?
- How to Use a Workflow?
- How to Develop a Workflow?
DevEnviron
- Introduction to DevEnviron
- Application Scenarios
- Managing Notebook Instances
- JupyterLab
- Local IDE
- ModelArts CLI Command Reference
Model Development (To Be Offline)
- Introduction to Model Development
- Preparing Data
- Preparing Algorithms
- Performing a Training
- Advanced Training Operations
- Distributed Training
- Automatic Model Tuning (AutoSearch)
Image Management
- Image Management
- Using a Preset Image
- Using Custom Images in Notebook Instances
- Using a Custom Image to Train Models (Model Training)
- Using a Custom Image to Create AI applications for Inference Deployment
  - Custom Image Specifications for Creating AI Applications
  - Creating a Custom Image and Using It to Create an AI Application
- FAQs
- Modification History
Model Inference (To Be Offline)
- Introduction to Inference
- Managing AI Applications
- Deploying an AI Application as a Service
- Inference Specifications
- ModelArts Monitoring on Cloud Eye
Resource Management
- Resource Pool
- Elastic Cluster
- Audit Logs
  - Key Operations Recorded by CTS
  - Viewing Audit Logs
- Monitoring Resources
Data Preparation and Analytics
- Introduction to Data Preparation
- Getting Started
- Creating a Dataset
- Importing Data
- Data Analysis and Preview
- Labeling Data
- Publishing Data
- Exporting Data
Data Labeling (To Be Offline)
- Introduction to Data Labeling
- Manual Labeling
- Auto Labeling
  - Creating an Auto Labeling Job
  - Confirming Hard Examples
- Team Labeling
User Guide for Senior AI Engineers (To Be Offline)
- Operation Guide
- Data Management (Old Version to Be Terminated)
- Training Management (Old Version )
- Resource Pools (Old Version to Be Terminated)
- Custom Images
- Permissions Management
  - Creating a User and Granting Permissions
  - Creating a Custom Policy
- Audit Logs
  - Key Operations Recorded by CTS
  - Viewing Audit Logs
- Change History
General Reference
- Glossary
- Service Level Agreement
- White Papers
- Endpoints
- Permissions

On this page

Context
Prerequisites
Constraints
Preparations
Procedure
Uploading the Image to SWR
Creating an AI Application Using the Image
Deploying the AI Application as a Real-Time Service
Calling a WebSocket Real-Time Service

Show all

Help Center/ ModelArts/ Best Practices/ Model Inference/ Full-Process Development of WebSocket Real-Time Services

Full-Process Development of WebSocket Real-Time Services

Updated on 2024-03-05 GMT+08:00

View PDF

Context

WebSocket is a network transmission protocol that supports full-duplex communication over a single TCP connection. It is located at the application layer in an OSI model. The WebSocket communication protocol was established by IETF in 2011 as standard RFC 6455 and supplemented by RFC 7936. The WebSocket API in the Web IDL is standardized by W3C.

WebSocket simplifies data exchange between the client and the server and allows the server to proactively push data to the client. In the WebSocket API, if the initial handshake between the client and the server is successful, a persistent connection will be established between them and data can be transferred bidirectionally.

Prerequisites

You are experienced in developing Java and familiar with JAR packaging.
You have basic knowledge and calling methods of WebSocket.
You are familiar with the method of creating an image using Docker.

Constraints

WebSocket supports only the deployment of real-time services.
WebSocket supports only real-time services deployed using AI applications imported from custom images.

Preparations

Before using WebSocket in ModelArts for inference, bring your own custom image. The custom image must be able to provide complete WebSocket services in a standalone environment, for example, completing WebSocket handshakes and exchanging data between the client to the server. The model inference is implemented in the custom image, including downloading the model, loading the model, performing preprocessing, completing inference, and assembling the response body.

Procedure

To develop a WebSocket real-time service, perform the following operations:

Uploading the Image to SWR
Creating an AI Application Using the Image
Deploying the AI Application as a Real-Time Service
Calling the WebSocket Real-Time Service

Uploading the Image to SWR

Upload the local image to SWR. For details, see How Can I Log In to SWR and Upload Images to It?

Creating an AI Application Using the Image

Log in to the ModelArts management console, choose AI Application Management > AI Applications, and click Create under My AI Applications. The page for creating an AI application is displayed.
Configure the AI application.
- Meta Model Source: Select Container image.
- Container Image Path: Select the path specified in Uploading the Image to SWR.
- Container API: Configure this parameter based on site requirements.
- Health Check: Retain default settings. If health check has been configured in the image, configure the health check parameters based on those configured in the image.
  Figure 1 AI application parameters
Click Create now. In the AI application list that is displayed, check the AI application status. When it changes to Normal, the AI application has been created.

Deploying the AI Application as a Real-Time Service

Log in to the ModelArts management console, choose Service Deployment > Real-Time Services, and click Deploy.
Configure the service.
- AI Application and Version: Select the AI application and version created in Creating an AI Application Using the Image.
- WebSocket: Enable this function.
  Figure 2 WebSocket
Click Next, confirm the configuration, and click Submit. In the real-time service list you will be redirected to, check the service status. When it changes to Running, the real-time service has been deployed.

Calling a WebSocket Real-Time Service

WebSocket itself does not require additional authentication. ModelArts WebSocket is WebSocket Secure-compliant, regardless of whether WebSocket or WebSocket Secure is enabled in the custom image. WebSocket Secure supports only one-way authentication, from the client to the server.

You can use one of the following authentication methods provided by ModelArts:

The following section uses GUI software Postman for prediction and token authentication as an example to describe how to call WebSocket.

Establish a WebSocket connection.
Exchange data between the WebSocket client and the server.

Establish a WebSocket connection.
1. Open Postman of a version later than 8.5, for example, 10.12.0. Click in the upper left corner and choose File > New. In the displayed dialog box, select WebSocket Request (beta version currently).
  Figure 3 WebSocket Request
2. Configure parameters for the WebSocket connection.
  Select Raw in the upper left corner. Do not select Socket.IO (a type of WebSocket implementation, which requires that both the client and the server run on Socket.IO). In the address box, enter the API Address obtained on the Usage Guides tab on the service details page. If there is a finer-grained URL in the custom image, add the URL to the end of the address. If queryString is available, add this parameter in the params column. Add authentication information into the header. The header varies depending on the authentication mode, which is the same as that in the HTTPS-compliant inference service. Click Connect in the upper right corner to establish a WebSocket connection.
  
  Figure 4 Obtaining the API address
  NOTE:
  - If the information is correct, CONNECTED will be displayed in the lower right corner.
  - If establishing the connection failed and the status code is 401, check the authentication.
  - If a keyword such as WRONG_VERSION_NUMBER is displayed, check whether the port configured in the custom image is the same as that configured in WebSocket or WebSocket Secure.
  The following shows an established WebSocket connection.
  
  Figure 5 Connection established
  
  NOTICE:
  
  Preferentially check the WebSocket service provided by the custom image. The type of implementing WebSocket varies depending on the tool you used. Possible issues are as follows: A WebSocket connection can be established but cannot be maintained, or the connection is interrupted after one request and needs to be reconnected. ModelArts only ensures that it will not affect the WebSocket status in a custom image (the API address and authentication mode may be changed on ModelArts).
Exchange data between the WebSocket client and the server.

After the connection is established, WebSocket uses TCP for full-duplex communication. The WebSocket client sends data to the server. The implementation types vary depending on the client, and the lib package may also be different for the same language. Different implementation types are not considered here.

The format of the data sent by the client is not limited by the protocol. Postman supports text, JSON, XML, HTML, and Binary data. Take text as an example. Enter the text data in the text box and click Send on the right to send the request to the server. If the text is oversized, Postman may be suspended.

Figure 6 Sending data

Parent topic: Model Inference

Previous topic: High-Speed Access to Inference Services Through VPC Peering

Feedback

Was this page helpful?

Helpful Not helpful

Provide feedback

Thank you very much for your feedback. We will continue working to improve the documentation.See the reply and handling status in My Cloud VOC.

The system is busy. Please try again later.

Which of the following issues have you encountered?

Content is inconsistent with the product UI

Unclear descriptions

Lack of examples or code

Incorrect steps

Can't find what I need

Lack of best practices

Feedback (optional)

0/500

Select at least one type of issue, and enter your comments or suggestions.

Enter a maximum of 500 characters.

Submit Cancel

For any further questions, feel free to contact us through the chatbot.

Chatbot