Help Center/ GaussDB(DWS)/ Product Bulletin/ Product Notice/ Version 8.3.0/ New Features of Version 8.3.0.100
Updated on 2024-09-03 GMT+08:00

New Features of Version 8.3.0.100

For details about new features in 8.3.0.x, see New Features. For details about resolved issues in each patch, see New Features and Resolved Issues in 8.3.0.x.

[V8.3.0.108 Release Date] 2024-07-22

[V8.3.0.105 release date] 2024-07-10 (whitelisted users)

[V8.3.0.103 release date] 2024-06-10 (whitelisted users)

[V8.3.0.101 release date] 2024-04-25 (whitelisted users)

[V8.3.0.100 Release Date] 2024-04-17

New Features

Table 1 8.3.0 feature changes

Category

Feature

Description

Reference

SQL

Last access time of a table object can be recorded.

  • Added the enable_save_dataaccess_timestamp parameter to specify whether to record the last access time of a table.
  • Added the last_reference_timestamp field to the PGXC_STAT_OBJECT view. It can be used to query the last access time of a table.
  • Added the pg_stat_get_all(regclass) and pgxc_stat_get_all(regclass) functions to return table tuple information on CNs recorded in pg_stat_object.
  • Developer Guide > GUC Parameters > Statistics During the Database Running > Query and Index Statistics Collector
  • Developer Guide > System Catalogs and Views > System Catalogs > PGXC_STAT_OBJECT
  • SQL Syntax Reference > Functions and Operators > Statistics Information Functions

Security

OneAccess authentication

  • Added OneAccess authentication to the rolauthinfo field of PG_AUTHID.
  • Added the following description to the AUTHINFO parameter: When the OneAccess authentication mode is supported, authinf must contain oneaccessClientId and domain information in the format of 'oneaccessClientId=xxxx, domain=xxxx'.
  • Developer Guide > System Catalogs > PG_AUTHID
  • SQL Syntax Reference > DDL Syntax > CREATE ROLE

Usability

pg_job can record error information.

Added the PG_JOB_INFO system catalog to record the execution results of scheduled tasks.

Developer Guide > System Catalogs and Views > System Catalogs > PG_JOB_INFO

High Performance

Pooler can connect to the standby node.

The GUC parameter enable_connect_standby is added to set the CN to connect to the standby DN.

Developer Guide > GUC Parameters > Connection and Authentication > Communication Library Parameters

CMC

Sunset of historical system catalog and system function interfaces

Users can learn about the product evolution through the sunset plan. This improves product experience, and improves product reliability and usability.

Added section "Metadata Sunset Description" to summarize sunsetted views and functions.

HA

Enhanced O&M views

Added functions:

1. pgxc_get_xlog_stats(), which runs on CNs and the types and numbers of Xlogs on DNs from the DN startup time.

2. pgxc_get_wal_speed(), which runs on CNs and obtains the WAL generation rate of each DN and the receive, write, flush, and redo rates of the standby DN.

3. pg_xlog_display_one_lsn(start_lsn), which runs on CNs or DNs and parses the Xlog in the current location based on the start LSN.

4. pg_xlogdump (tablename), which runs on CNs or DNs and parses and filters Xlog files based on table names.

5. pg_xlogdump (xid), which runs on CNs or DNs and parses and filters Xlog files based on transaction IDs.

6. pg_xlogdump (start_lsn, end_lsn), which runs on CNs or DNs and parses Xlog files based on the start and end LSNs.

SQL Syntax Reference > Functions and Operators > Statistics Information Functions

HA

Clearing residual files

Added the following functions to scan residual files:

  • pg_scan_residualfiles()
  • pgxc_scan_residualfiles(query_flag)
  • pg_get_scan_residualfiles()
  • pgxc_get_scan_residualfiles(query_flag)
  • pg_archive_scan_residualfiles()
  • pgxc_archive_scan_residualfiles(query_flag)
  • pg_rm_scan_residualfiles_archive()
  • pgxc_rm_scan_residualfiles_archive(query_flag)

SQL Syntax Reference > Functions and Operators > Residual File Management Functions > Functions for Scanning Residual Files

High Performance

Column-store CU level-2 partitions

Added the secondary_part_column and secondary_part_num parameters to specify the column names and number of level-2 partition columns in a column-store table.

Developer Guide > Hybrid Data Warehouse > Hybrid Data Warehouse Syntax > CREATE TABLE

High Performance

Table-level oldestxmin

Added the GUC parameter enable_table_level_oldestxmin to control whether to enable the table-level oldestxmin feature.

Added the GUC parameter old_txn_threshold. When the table-level oldestxmin is calculated, transactions whose running duration exceeds the value of this parameter are regarded as long transactions.

Developer Guide > GUC Parameters > Automatic Cleanup

Foreign tables

sql on hudi

Added the Hudi system function hudi_set_sync_commit to set the start timestamp of the Hudi automatic synchronization task.

Added the cow_improve parameter is added to optimize the COPY_ON_WRITE table. This parameter can be specified only when format is set to hudi.

SQL Syntax Reference > Functions and Operators > Hudi System Functions

SQL Syntax Reference > DDL Syntax > CREATE FOREIGN TABLE (SQL on OBS or Hadoop)

Lock

Distributed deadlock detection

Added the GUC parameters enable_global_deadlock_detector and

global_deadlock_detector_period to specify whether to enable the distributed deadlock detection function and the detection interval.

Developer Guide > GUC Parameters > Lock Management

Real-time queries

Approximate computing: column-store vectorized functions based on the HLL algorithm

Added the approx_count_distinct(col_name) function.

Added the approx_count_distinct_precision parameter, which indicates the number of buckets in the HyperLogLog++ (HLL++) algorithm. This parameter can be used to adjust the error rate of the approx_count_distinct aggregate function.

SQL Syntax Reference > Functions and Operators > Aggregate Functions

Developer Guide > GUC Parameters > Developer Options

Real-time queries

Subplan vectorization

Added the GUC parameter vector_engine_strategy to control the vectorization enhancement policy.

Developer Guide > GUC Parameters > Miscellaneous Parameters

SQL

Funnel functions and retention functions

Added the funnel functions window_funnel, retention, range_retention_count, and range_retention_sum.

SQL Syntax Reference > Functions and Operators > Funnel and Retention Functions

SQL

Job backpressure

Added the GUC parameter max_queue_statements to set the maximum queue length of queuing jobs.

Developer Guide > GUC Parameters > Resource Management

SQL

Releasing locks at query end

Added the GUC parameter enable_release_scan_lock to control whether to release the level-1 lock after the SELECT statement is executed.

Developer Guide > GUC Parameters > Lock Management

SQL

Replication tables

Added disable_update_returning_check to the GUC parameter behavior_compat_options.

Developer Guide > GUC Parameters > Miscellaneous Parameters

Replicability

Optimized the rounding rule of the cast function.

Added options for the GUC parameter behavior_compat_options: enable_banker_round: specifies how numeric types round their values, using either the rounding or the banker method.

Developer Guide > GUC Parameters > Miscellaneous Parameters

SQL

string_agg behavior compatibility

Added the enable_full_string_agg option to the GUC parameter behavior_compat_options.

Developer Guide > GUC Parameters > Miscellaneous Parameters

Import and export

Upgraded the ORC third-party library and optimized its performance.

Added the GUC parameter dfs_max_memory to specify the maximum memory that can be occupied during ORC export.

Developer Guide > GUC Parameters > Resource Consumption > Memory

Partition management

The COPY statement spilling behavior is optimized.

Added the GUC parameter default_partition_cache_strategy to control the default policy of partition cache.

Developer Guide > GUC Parameters > Resource Management

Hybrid data warehouse

Batch upsert is optimized for hstore tables.

Added the GUC parameter enable_hstore_keyby_upsert, which controls the optimization of batch upsert in the hstore table.

Developer Guide > Hybrid Data Warehouse > Hybrid Data Warehouse GUC Parameters

Partition management

The automatic partition reducing operation of automatic partition management blocks real-time data import to the database.

Added the pg_partition_management_time function to modify the invoking time of the automatic partition increasing tasks.

SQL Syntax Reference > Functions and Operators >

Database Object Functions > Partition Management Functions

Behavior Changes

Table 2 Behavior changes

Change Type

No.

Item

Description

Deleted

1

policy_oid

Previously, the value of the policy_oid field was NULL post-upgrade to version 8.3.0. Now, a script automatically populates this field during the upgrade process.

Modified

2

gs_dump

gs_dump includes the unusable status of indexes.

3

pg_get_tabledef

pg_get_tabledef includes the unusable status of indexes.

4

pg_get_indexdef

pg_get_indexdef carries the unusable status of the index.

5

create table like

When copying an index using the INCLUDE clause, the new index inherits the unusable status from the source index.

SQL Syntax Changes

Table 3 SQL syntax changes

Change Type

No.

Item

Description

Modified

1

CREATE TABLE

New support scenario: create <common table >like <partition table >INCLUDING INDEXES. The index creation logic has been updated to set the index type based on the actual table type, not the 'like' table type.

Added

2

last_value ignore nulls

The last_value ignore nulls syntax is introduced to exclude NULL values during calculations.

3

ALTER INDEX index_name REBUILD [ PARTITION partition_name ] [WITHOUT UNUSABLE ]

Enhanced syntax to rebuild unusable indexes.

4

ALTER TABLE REBUILD PARTITION partition_name [ WITHOUT UNUSABLE ]

Enhanced syntax to rebuild unusable indexes.

5

REINDEX { INDEX | [INTERNAL] TABLE } name [ FORCE | WITHOUT UNUSABLE ]

Enhanced syntax to rebuild unusable indexes.

6

REINDEX { INDEX | [INTERNAL] TABLE } name PARTITION partition_name [ FORCE | WITHOUT UNUSABLE ]

Enhanced syntax to rebuild unusable indexes.

7

CREATE INDEX ... [ UNUSABLE ] ... PARTITION index_partition_name [ UNUSABLE ]

Added support for creating indexes in the unusable.

System Catalogs

Table 4 System catalogs

Change Type

No.

Item

Description

Modified

1

pg_stat_object

Renamed the reserved field extra1 to last_autovacuum_csn.

2

gs_wlm_session_info

Added a new field parse_time.

3

pg_job_info

Added a new column jobdb to store the database information of jobs.

System Functions

Table 5 System functions

Change Type

No.

Item

Description

Added

1

dbms_job.submit_node

Added a new function dbms_job.submit_node to designate the execution node, which is defaulted to null, representing the current CN node.

2

dbms_job.change_node

Added a new function dbms_job.change_node to designate the execution node, which is defaulted to null, representing the current CN node.

3

Funnel function

Added the funnel function windowfunnel.

4

Retention function

Added retention functions.

5

Retention extension function

Added the retention extension function range_retention_count.

6

Retention extension function

Added the retention extension function range_retention_sum.

7

approx_count_distinct

Added an approximate calculation aggregate function.

8

pg_scan_residualfiles

Scans all residual file records in the database where the current node resides

9

pgxc_scan_residualfiles

Scans all nodes for the residual files of the current database

10

pg_get_scan_residualfiles

Obtains all residual file records of the current node.

11

pgxc_get_scan_residualfiles

Obtains residual file records on all nodes.

12

pg_archive_scan_residualfiles

Archives all residual file records of the current node.

13

pgxc_archive_scan_residualfiles

Archives residual file records on all nodes.

14

pg_rm_scan_residualfiles_archive

Deletes files from the archived file list on the current node.

15

pgxc_rm_scan_residualfiles_archive

Deletes files in the archive directory from all nodes.

16

pg_partition_management_time

Modifies the invoking time of the auto-increment partition task.

17

uniq

Assists in UV calculation of data, such as precise deduplication.

18

reload_cold_partition

Changes a cold partition into a hot partition.

Modified

19

pgxc_get_small_cu_info

Added the feature of querying the number of CUs in each level-2 partition.

20

pgxc_get_wlm_session_info_bytime

Added the parse_time field.

Deleted

21

pg_sync_cstore_delta(text)

Deprecated the system function.

22

pg_sync_cstore_delta

Deprecated the system function.

23

pgxc_pool_check

Deprecated the system function.

24

pg_delete_audit

Deprecated the system function.

25

pg_log_comm_status

Deprecated the system function.

26

pgxc_log_comm_status

Deprecated the system function.

27

signal_backend

Deprecated the system function.

28

pg_stat_get_realtime_info_internal

Deprecated the system function.

29

pg_stat_get_wlm_session_info_internal

Deprecated the system function.

30

pg_stat_get_wlm_session_info

Deprecated the system function.

31

pg_stat_get_wlm_statistics

Deprecated the system function.

32

pg_user_iostat

Deprecated the system function.

33

pg_stat_get_wlm_session_iostat_info

Deprecated the system function.

System Views

Table 6 System views

Change Type

No.

Item

Description

Added

1

pgxc_memory_debug_info

This view displays the memory error information of each node in the current cluster during job execution, helping locate memory errors.

Modified

2

pgxc_stat_object

Changed with pg_stat_object. The reserved field extra1 is renamed last_autovacuum_csn.

3

gs_wlm_session_statistics

Added the parse_time field.

4

pgxc_wlm_session_statistics

Added the parse_time field.

5

gs_wlm_session_history

Added the parse_time field.

6

pgxc_wlm_session_history

Added the parse_time field.

7

gs_wlm_session_info

Added the parse_time field.

8

information_schema.tables

Modified the view definition and changed the filter criteria so that the views whose reloptions is empty can be queried.

9

gs_table_stat

Optimized the query performance of the gs_table_stat view.

10

gs_row_table_io_stat

Optimized the query performance of equivalence logic rewriting.

11

gs_column_table_io_stat

Optimized the query performance of equivalence logic rewriting.

Deleted

12

gs_wlm_session_info_all

Deprecated the system view.

13

pg_wlm_statistics

Deprecated the system view.

14

pg_session_iostat

Deprecated the system view.

GUC Parameters

Table 7 GUC parameters

Change Type

No.

Item

Description

Added

1

max_queue_statements

Default: -1 (unlimited queue length). Triggers an error and exits the job when the queue length exceeds this value. This parameter does not take effect for newly installed and upgraded clusters.

2

job_retention_time

Maximum days to store pg_job execution results. Default: 30 days.

3

vector_engine_strategy

Controls vectorization of the operator. Default: improve (maximize vectorization). Alternate: force (roll back to row storage plan).

4

enable_release_scan_lock

Determines if the SELECT statements release the level-1 lock post-execution. It is disabled by default.

5

job_queue_naptime

Interval for scheduling task checks and starting task threads. Default: 1 second.

6

approx_count_distinct_precision

Number of buckets in HLL++ algorithm, affecting the error rate of the approx_count_distinct function. The number of buckets affects the precision of estimating the distinct value. More buckets make the estimation more accurate. Default: 17 buckets.

7

llvm_compile_expr_limit

Limits the number of compiled LLVM expressions.

8

llvm_compile_time_limit

Sets a threshold for LLVM compilation time as a percentage of executor running time, triggering an alarm if exceeded.

9

max_opt_sort_rows

Maximum rows for optimized limit+offset in an ORDER BY statement. If the number of rows exceeds the value of this parameter, the original logic is used. If the number of lines is less than the value of this parameter, the optimized logic is used. Default: 0 (use original logic).

10

dfs_max_memory

Maximum memory for ORC export. Default: 262144 KB.

11

default_partition_cache_strategy

Default policy for partition cache control.

12

enable_connect_standby

Allows CN to connect to a standby DN for O&M operations. This parameter applies only to O&M operations.

13

enable_stream_sync_quit

Determines synchronous exit of stream thread after plan execution. This parameter is disabled by default so that sync quit is not blocked.

14

full_group_by_mode

Behavior after enabling disable_full_group_by_mysql:

  • nullpadding indicates that NULL values in a non-aggregation column are replaced with values and non-null values in the column are used. The result set may contain different rows.
  • notpadding indicates that NULL values are not processed for non-aggregation columns and the entire row of data is obtained. The result set of non-aggregation columns is a random row.

Modified

15

behavior_compat_options

Added the alter_distribute_key_by_partition option to control INSERT INTO execution by partition post ALTER TABLE.

16

behavior_compat_options

Added the enable_full_string_agg option for full or incremental aggregation logic in string_agg(a, delimiter) over (partition by b order by c).

17

behavior_compat_options

Added the unsupported_set_function_case option to control case when conditions with multi-result set functions.

18

behavior_compat_options

Added the enable_unknown_datatype option, disallowing creation of tables with unknown type columns if unset.

19

behavior_compat_options

Added the enable_whole_row_var option, intercepting scenarios where a table name is used as an expression (SELECT T FROM T, or SELECT .... FROM T GROUP BY T) if unset.

20

time_track_strategy

Parameter level changed from USERSET to SIGHUP. The default value is timer at the kernel level, which can be set to tsc on the management console.

21

rewrite_rule

Controls whether to use case when rewriting.