Updated on 2024-12-10 GMT+08:00

Development Suggestions

Add HBase Configuration Using WITH Properties When Submitting a Job on the Client

When you submit a job on the Flink client, for example through the SQL client, add the following properties to the WITH clause of the table creation statement:
Table 1 WITH properties for a Flink job

| Configuration | Description |
| --- | --- |
| 'properties.hbase.rpc.protection' = 'authentication' | Must be consistent with the value configured on the HBase server. |
| 'properties.zookeeper.znode.parent' = '/hbase' | Root znode of the HBase service in ZooKeeper. If multiple HBase services coexist (for example, hbase1 and hbase2), specify the znode of the service to be accessed. |
| 'properties.hbase.security.authorization' = 'true' | Enables authorization. |
| 'properties.hbase.security.authentication' = 'kerberos' | Enables Kerberos authentication. |

[Example]

CREATE TABLE hsink1 (
  rowkey STRING,
  f1 ROW<q1 STRING>,
  PRIMARY KEY (rowkey) NOT ENFORCED
) WITH (
  'connector' = 'hbase-2.2',
  'table-name' = 'cc',
  'zookeeper.quorum' = 'x.x.x.x:clientPort',
  'properties.hbase.rpc.protection' = 'authentication',
  'properties.zookeeper.znode.parent' = '/hbase',
  'properties.hbase.security.authorization' = 'true',
  'properties.hbase.security.authentication' = 'kerberos'
);

Enable Asynchronous Lookup Join for Faster Dimension Table Join

Add the following WITH property to the HBase dimension table:

'lookup.async'='true'
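
The following is a minimal sketch of where this property sits in a lookup join, not a definitive job. The table names hdim and orders, the column names, and the datagen source are assumptions made for illustration. The security properties from Table 1 are omitted for brevity and would still be needed on a cluster with Kerberos enabled.

```sql
-- Hypothetical HBase dimension table with asynchronous lookup enabled.
CREATE TABLE hdim (
  rowkey STRING,
  f1 ROW<q1 STRING>,
  PRIMARY KEY (rowkey) NOT ENFORCED
) WITH (
  'connector' = 'hbase-2.2',
  'table-name' = 'cc',
  'zookeeper.quorum' = 'x.x.x.x:clientPort',
  'lookup.async' = 'true'
);

-- Hypothetical stream table; a lookup join needs a processing-time attribute.
CREATE TABLE orders (
  order_id STRING,
  rowkey STRING,
  proctime AS PROCTIME()
) WITH (
  'connector' = 'datagen'
);

-- Each order row is enriched with the matching HBase row, looked up by rowkey.
SELECT o.order_id, d.f1.q1
FROM orders AS o
JOIN hdim FOR SYSTEM_TIME AS OF o.proctime AS d
  ON o.rowkey = d.rowkey;
```

With asynchronous lookup, the join operator does not wait for one HBase response before issuing the next request, which is why it tends to help when lookup latency dominates.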

Increase the Parallelism of the Lookup Join Operator for Faster Dimension Table Join

Add the following WITH property to the HBase dimension table, replacing xx with the required parallelism:

'lookup.parallelism'='xx'

Increase the Parallelism of the Sink HBase Operator for Higher Write Performance

Add the following WITH property to the HBase sink table, replacing xx with the required parallelism:

'sink.parallelism'='xx'
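
As an illustration, the sketch below sets the sink parallelism on an HBase sink table and writes to it. The table names hsink and src, the datagen source, and the value 4 are assumptions. Choose a value that suits the cluster, and add the security properties from Table 1 when Kerberos is enabled.

```sql
-- Hypothetical HBase sink table; '4' is an example parallelism, not a recommendation.
CREATE TABLE hsink (
  rowkey STRING,
  f1 ROW<q1 STRING>,
  PRIMARY KEY (rowkey) NOT ENFORCED
) WITH (
  'connector' = 'hbase-2.2',
  'table-name' = 'cc',
  'zookeeper.quorum' = 'x.x.x.x:clientPort',
  'sink.parallelism' = '4'
);

-- Hypothetical source table used only to make the example complete.
CREATE TABLE src (
  rowkey STRING,
  q1 STRING
) WITH (
  'connector' = 'datagen'
);

-- ROW(q1) packs the column into the f1 column family of the sink.
INSERT INTO hsink
SELECT rowkey, ROW(q1) FROM src;
```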