最新动态
功能总览
服务公告
- 产品公告
- 产品发布说明
产品介绍
- 图解云数据库GaussDB
- 什么是云数据库GaussDB
- 应用场景
- 常用概念
- 产品优势
- 实例说明
- 数据库实例规格
- 数据库产品类型
- 安全
- 权限管理
- 约束与限制
- GaussDB与其他服务的关系
- 兼容性说明
计费说明
- 计费概述
- 计费项
- 计费模式
- 计费样例
- 变更计费模式
- 续费
- 费用账单
- 欠费说明
- 停止计费
- 成本管理
快速入门
- 购买并通过界面化工具DAS连接GaussDB实例（推荐）
- 购买并通过gsql连接GaussDB实例
- GaussDB入门实践
用户指南
- 权限管理
  - 创建用户并授权使用GaussDB
  - 自定义策略
- 购买GaussDB实例
- 连接GaussDB实例
  - GaussDB实例连接方式介绍
  - 通过数据管理服务DAS连接实例
  - 通过gsql连接实例
  - 通过Navicat连接实例
  - 通过DBeaver连接实例
- 数据库迁移
  - GaussDB迁移方案总览
  - 使用DRS将Oracle数据库迁移至GaussDB
  - 使用DRS将MySQL数据库迁移到GaussDB
  - 使用DAS的导出和导入功能迁移GaussDB数据
  - 使用copy to/from命令导出导入数据
  - JDBC中使用COPY命令导出导入数据
  - 使用gs_dump和gs_dumpall命令导出数据
  - 使用gs_restore命令导入数据
  - 使用gs_loader工具导入数据
- 使用数据库
  - 数据库使用概述
  - 创建GaussDB数据库
  - 创建GaussDB数据库用户
- 实例管理
  - 查看GaussDB实例总览
  - 设置GaussDB实例安全组规则
  - 绑定和解绑GaussDB实例弹性公网IP
  - 设置GaussDB实例回收站策略
  - 导出GaussDB实例列表
  - 退订GaussDB包周期实例
  - 停止GaussDB实例
  - 启动GaussDB实例
  - 重启GaussDB实例
  - 删除GaussDB按需实例
  - 重建GaussDB实例
  - 停止GaussDB节点
  - 启动GaussDB节点
  - 重启GaussDB节点
- 变更实例
  - 修改GaussDB实例名称
  - 修改GaussDB数据库端口
  - 修改M兼容端口
  - 变更GaussDB实例的CPU和内存规格
  - 只读节点变更
  - 同步数据至单副本实例
  - 查看并修改高级特性
  - 扩容和缩容实例
  - 扩容磁盘
  - 变更部署形态
  - DN主备倒换
- 版本升级
  - 版本升级概述
  - 热补丁升级
  - 就地升级
  - 灰度升级
- 插件管理
  - 插件安装
  - 开启或关闭插件开关
  - 查看拓展功能模块
- 数据备份
  - 备份概述
  - 执行备份
  - 管理备份
- 数据恢复
  - GaussDB数据误操作恢复方案
  - 通过备份文件恢复GaussDB实例
  - 通过备份文件恢复GaussDB库/表
  - 恢复GaussDB实例到指定时间点
  - 恢复GaussDB库/表到指定时间点
- 参数管理
  - 数据库实例支持修改的参数
  - 修改GaussDB实例参数
  - 查看GaussDB实例参数修改历史
  - 导出GaussDB实例参数
  - 创建GaussDB实例参数模板
  - 管理GaussDB实例参数模板
- 监控与告警
  - GaussDB支持的监控指标
  - 查看GaussDB监控指标
  - 查看GaussDB监控大盘
  - 创建GaussDB实例告警规则
  - 事件监控
- 日志与审计
  - 下载GaussDB错误日志和慢日志
  - 下载GaussDB强切日志
  - 在CTS查看GaussDB实例审计日志
  - 对接LTS并查看数据库审计日志
- 配额调整
  - 调整GaussDB云服务资源配额
  - 调整企业项目下的GaussDB资源配额
- SQL诊断
  - 配置SQL限流
- 容灾管理
  - 约束限制
  - 创建容灾任务
  - 查询容灾任务
  - 容灾升主
  - 结束容灾
  - 删除容灾任务
  - 主实例故障后重建灾备任务
  - 容灾主备关系切换
  - 容灾回切
  - 灾备实例容灾演练
  - 主实例日志保持
  - 灾备实例限制说明
- GaussDB任务管理
- GaussDB标签管理
- 重置GaussDB实例管理员密码
开发指南
- 开发指南（分布式_V2.0-8.x）
- 开发指南（集中式_V2.0-8.x）
- 开发指南（分布式_V2.0-3.x）
- 开发指南（集中式_V2.0-3.x）
- 开发指南（分布式_V2.0-2.x）
- 开发指南（集中式_V2.0-2.x）
工具参考
- 工具参考（分布式_V2.0-8.x）
- 工具参考（集中式_V2.0-8.x）
- 工具参考（分布式_V2.0-3.x）
- 工具参考（集中式_V2.0-3.x）
  - gsql
  - gs_loader
- 工具参考（分布式_V2.0-2.x）
- 工具参考（集中式_V2.0-2.x）
  - gsql
  - gs_loader
特性指南
- 特性指南（分布式_V2.0-8.x）
- 特性指南（集中式_V2.0-8.x）
- 特性指南（分布式_V2.0-3.x）
- 特性指南（集中式_V2.0-3.x）
最佳实践
- GaussDB安全配置建议
- 扩缩容最佳实践
性能白皮书
- 测试方法
- 测试数据
API参考
- 使用前必读
- API概览
- 如何调用API
- API（推荐）
- 历史API
- 权限策略和授权项
  - 策略及授权项说明
  - GaussDB授权分类
- 附录
SDK参考
- SDK概述
场景代码示例
常见问题
- 产品咨询
- GaussDB资源冻结/解冻/释放/删除/退订
- 资源及磁盘管理
- 数据库连接
- 数据库存储
  1. 数据超过了GaussDB实例的最大存储容量怎么办
  2. 如何查看GaussDB的存储空间使用情况
- 数据库基本使用
- 备份与恢复
- 数据库监控
  1. GaussDB实例的哪些监控指标需要重点关注
  2. GaussDB实例内存使用率指标的计算方法
- 扩容及规格变更
  1. GaussDB实例在扩容和规格变更期间是否一直可用
- 数据库参数修改
- 日志管理
  1. 如何查看GaussDB执行过的所有SQL日志
  2. 如何查看GaussDB数据库的死锁日志
- 网络安全
  1. 如何防止任意源连接GaussDB数据库
  2. 将根证书导入Windows/Linux操作系统
兼容性参考
- 与Oracle兼容性参考（分布式）
- 与Oracle兼容性参考（集中式）
- 与MySQL兼容性参考（分布式）
- 与MySQL兼容性参考（集中式）
视频帮助
文档下载
通用参考
- 产品术语
- 云服务等级协议（SLA）
- 白皮书资源
- 支持区域
- 系统权限

本文导读

展开导读

文档首页/ 云数据库 GaussDB/ 开发指南（分布式_V2.0-8.x）/ SQL调优指南/ 实际调优案例/ 案例：使用DN Gather减少计划中的Stream节点

案例：使用DN Gather减少计划中的Stream节点

更新时间：2024-11-12 GMT+08:00

DN Gather用来把分布式计划中的Stream节点去掉，把数据发送到一个节点进行计算，这样可以减少分布式计划执行时数据重分布的代价，从而提升单个查询效率以及系统整体的吞吐能力。不过DN Gather面向的是TP的小数据量场景，对于小数据量查询因为节省了数据重分布的代价且单个节点的算力完全够用所以可以得到性能提升。对于大数据量的计算，多节点并行计算更有优势。需要通过打开关闭开关来对比哪种情况更快（dngather_min_rows默认为500行，下述案例采用了默认值）。

案例环境准备

为了便于案例演示，需准备建表语句如下：

--清理环境
DROP SCHEMA IF EXISTS dn_gather_test CASCADE;
CREATE SCHEMA dn_gather_test;
SET current_schema=dn_gather_test;
--创建测试表
CREATE TABLE t1(a INT, b INT, c INT, d INT);
CREATE TABLE t2(a INT, b INT, c INT, d INT);
CREATE TABLE t3(a INT, b INT, c INT, d INT);
CREATE TABLE t4(a INT, b INT, c INT, d INT);

Gather Join

要把join的结果收敛到单个DN需要满足：

join前后优化器估算的数据行数在阈值以下。
join的子节点均为Stream节点。

为了方便举例，设置join的子节点都为Stream节点，关闭了Broadcast。

gaussdb=# set enable_broadcast=false;        
SET
gaussdb=# set explain_perf_mode=pretty;
SET
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select count(*) from t1, t2 where t1.b = t2.b;
 id |                    operation                     | E-rows | E-width | E-costs 
----+--------------------------------------------------+--------+---------+---------
  1 | ->  Aggregate                                    |      1 |       8 | 31.46
  2 |    ->  Streaming (type: GATHER)                  |      3 |       8 | 31.46
  3 |       ->  Aggregate                              |      3 |       8 | 31.34
  4 |          ->  Hash Join (5,7)                     |     30 |       0 | 31.30
  5 |             ->  Streaming(type: REDISTRIBUTE)    |     30 |       4 | 15.49
  6 |                ->  Seq Scan on t1                |     30 |       4 | 14.14
  7 |             ->  Hash                             |     29 |       4 | 15.49
  8 |                ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 15.49
  9 |                   ->  Seq Scan on t2             |     30 |       4 | 14.14
(9 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   4 --Hash Join (5,7)
         Hash Cond: (t1.b = t2.b)
(2 rows)
gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select count(*) from t1, t2 where t1.b = t2.b;
 id |                                operation                                | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                            |      1 |       8 | 32.53
  2 |    ->  Aggregate                                                        |      1 |       8 | 32.47
  3 |       ->  Hash Join (4,6)                                               |     30 |       0 | 32.38
  4 |          ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)    |     30 |       4 | 15.69
  5 |             ->  Seq Scan on t1                                          |     30 |       4 | 14.14
  6 |          ->  Hash                                                       |     30 |       4 | 15.69
  7 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 15.69
  8 |                ->  Seq Scan on t2                                       |     30 |       4 | 14.14
(8 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   3 --Hash Join (4,6)
         Hash Cond: (t1.b = t2.b)
(2 rows)
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select * from t1, t2, t3, t4 where t1.b = t2.b and t2.c = t3.c and t3.d = t4.d order by t1.a;
 id |                         operation                         | E-rows | E-width | E-costs 
----+-----------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                              |     30 |     144 | 66.46
  2 |    ->  Sort                                               |     30 |     144 | 65.05
  3 |       ->  Hash Join (4,16)                                |     30 |     144 | 64.86
  4 |          ->  Streaming(type: REDISTRIBUTE)                |     30 |     108 | 49.05
  5 |             ->  Hash Join (6,13)                          |     30 |     108 | 48.08
  6 |                ->  Streaming(type: REDISTRIBUTE)          |     30 |      72 | 32.27
  7 |                   ->  Hash Join (8,10)                    |     30 |      72 | 31.30
  8 |                      ->  Streaming(type: REDISTRIBUTE)    |     30 |      36 | 15.49
  9 |                         ->  Seq Scan on t1                |     30 |      36 | 14.14
 10 |                      ->  Hash                             |     29 |      36 | 15.49
 11 |                         ->  Streaming(type: REDISTRIBUTE) |     30 |      36 | 15.49
 12 |                            ->  Seq Scan on t2             |     30 |      36 | 14.14
 13 |                ->  Hash                                   |     29 |      36 | 15.49
 14 |                   ->  Streaming(type: REDISTRIBUTE)       |     30 |      36 | 15.49
 15 |                      ->  Seq Scan on t3                   |     30 |      36 | 14.14
 16 |          ->  Hash                                         |     29 |      36 | 15.49
 17 |             ->  Streaming(type: REDISTRIBUTE)             |     30 |      36 | 15.49
 18 |                ->  Seq Scan on t4                         |     30 |      36 | 14.14
(18 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   3 --Hash Join (4,16)
         Hash Cond: (t3.d = t4.d)
   5 --Hash Join (6,13)
         Hash Cond: (t2.c = t3.c)
   7 --Hash Join (8,10)
         Hash Cond: (t1.b = t2.b)
(6 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select * from t1, t2, t3, t4 where t1.b = t2.b and t2.c = t3.c and t3.d = t4.d order by t1.a;
 id |                                   operation                                   | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                                  |     30 |     144 | 68.47
  2 |    ->  Sort                                                                   |     30 |     144 | 66.36
  3 |       ->  Hash Join (4,10)                                                    |     30 |     144 | 65.55
  4 |          ->  Hash Join (5,7)                                                  |     30 |      72 | 32.38
  5 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3)       |     30 |      36 | 15.69
  6 |                ->  Seq Scan on t1                                             |     30 |      36 | 14.14
  7 |             ->  Hash                                                          |     30 |      36 | 15.69
  8 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3)    |     30 |      36 | 15.69
  9 |                   ->  Seq Scan on t2                                          |     30 |      36 | 14.14
 10 |          ->  Hash                                                             |     30 |      72 | 32.38
 11 |             ->  Hash Join (12,14)                                             |     30 |      72 | 32.38
 12 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3)    |     30 |      36 | 15.69
 13 |                   ->  Seq Scan on t3                                          |     30 |      36 | 14.14
 14 |                ->  Hash                                                       |     30 |      36 | 15.69
 15 |                   ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3) |     30 |      36 | 15.69
 16 |                      ->  Seq Scan on t4                                       |     30 |      36 | 14.14
(16 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   3 --Hash Join (4,10)
         Hash Cond: (t2.c = t3.c)
   4 --Hash Join (5,7)
         Hash Cond: (t1.b = t2.b)
  11 --Hash Join (12,14)
         Hash Cond: (t3.d = t4.d)
(6 rows)
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select count(*) from t1, t2, t3, t4 where t1.b = t2.b and t2.c = t3.c and t3.d = t4.d group by t1.b order by t1.b;
 id |                          operation                           | E-rows | E-width | E-costs 
----+--------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                 |     30 |      12 | 66.45
  2 |    ->  GroupAggregate                                        |     30 |      12 | 65.20
  3 |       ->  Sort                                               |     30 |       4 | 65.05
  4 |          ->  Hash Join (5,17)                                |     30 |       4 | 64.86
  5 |             ->  Streaming(type: REDISTRIBUTE)                |     30 |       4 | 49.05
  6 |                ->  Hash Join (7,14)                          |     30 |       4 | 48.08
  7 |                   ->  Streaming(type: REDISTRIBUTE)          |     30 |       8 | 32.27
  8 |                      ->  Hash Join (9,11)                    |     30 |       8 | 31.30
  9 |                         ->  Streaming(type: REDISTRIBUTE)    |     30 |       8 | 15.49
 10 |                            ->  Seq Scan on t2                |     30 |       8 | 14.14
 11 |                         ->  Hash                             |     29 |       8 | 15.49
 12 |                            ->  Streaming(type: REDISTRIBUTE) |     30 |       8 | 15.49
 13 |                               ->  Seq Scan on t3             |     30 |       8 | 14.14
 14 |                   ->  Hash                                   |     29 |       4 | 15.49
 15 |                      ->  Streaming(type: REDISTRIBUTE)       |     30 |       4 | 15.49
 16 |                         ->  Seq Scan on t4                   |     30 |       4 | 14.14
 17 |             ->  Hash                                         |     29 |       4 | 15.49
 18 |                ->  Streaming(type: REDISTRIBUTE)             |     30 |       4 | 15.49
 19 |                   ->  Seq Scan on t1                         |     30 |       4 | 14.14
(19 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   4 --Hash Join (5,17)
         Hash Cond: (t2.b = t1.b)
   6 --Hash Join (7,14)
         Hash Cond: (t3.d = t4.d)
   8 --Hash Join (9,11)
         Hash Cond: (t2.c = t3.c)
(6 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select count(*) from t1, t2, t3, t4 where t1.b = t2.b and t2.c = t3.c and t3.d = t4.d group by t1.b order by t1.b;
 id |                                    operation                                     | E-rows | E-width | E-costs 
----+----------------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                                     |     30 |      12 | 68.69
  2 |    ->  GroupAggregate                                                            |     30 |      12 | 66.81
  3 |       ->  Sort                                                                   |     30 |       4 | 66.36
  4 |          ->  Hash Join (5,11)                                                    |     30 |       4 | 65.55
  5 |             ->  Hash Join (6,8)                                                  |     30 |       8 | 32.38
  6 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)       |     30 |       4 | 15.69
  7 |                   ->  Seq Scan on t1                                             |     30 |       4 | 14.14
  8 |                ->  Hash                                                          |     30 |       8 | 15.69
  9 |                   ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)    |     30 |       8 | 15.69
 10 |                      ->  Seq Scan on t2                                          |     30 |       8 | 14.14
 11 |             ->  Hash                                                             |     30 |       4 | 32.38
 12 |                ->  Hash Join (13,15)                                             |     30 |       4 | 32.38
 13 |                   ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)    |     30 |       8 | 15.69
 14 |                      ->  Seq Scan on t3                                          |     30 |       8 | 14.14
 15 |                   ->  Hash                                                       |     30 |       4 | 15.69
 16 |                      ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 15.69
 17 |                         ->  Seq Scan on t4                                       |     30 |       4 | 14.14
(17 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   4 --Hash Join (5,11)
         Hash Cond: (t2.c = t3.c)
   5 --Hash Join (6,8)
         Hash Cond: (t1.b = t2.b)
  12 --Hash Join (13,15)
         Hash Cond: (t3.d = t4.d)
(6 rows)

Gather Groupby/Agg

要把GroupBy/Agg结果收敛到单个DN需要满足：

GroupBy/Agg前后优化器估算的数据行数在阈值以下。
Agg的子节点均为Stream节点。

gaussdb=# set explain_perf_mode=pretty;
SET
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select count(*) from t1 group by b;
 id |                operation                | E-rows | E-width | E-costs 
----+-----------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)            |     30 |      12 | 15.87
  2 |    ->  HashAggregate                    |     30 |      12 | 14.62
  3 |       ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
  4 |          ->  Seq Scan on t1             |     30 |       4 | 14.14
(4 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select count(*) from t1 group by b;
 id |                             operation                             | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                      |     30 |      12 | 16.85
  2 |    ->  HashAggregate                                              |     30 |      12 | 14.97
  3 |       ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 14.46
  4 |          ->  Seq Scan on t1                                       |     30 |       4 | 14.14
(4 rows)

gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select b from t1 group by b;
 id |                operation                | E-rows | E-width | E-costs 
----+-----------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)            |     30 |       4 | 15.84
  2 |    ->  HashAggregate                    |     30 |       4 | 14.59
  3 |       ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
  4 |          ->  Seq Scan on t1             |     30 |       4 | 14.14
(4 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select b from t1 group by b;
 id |                             operation                             | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                      |     30 |       4 | 16.74
  2 |    ->  HashAggregate                                              |     30 |       4 | 14.87
  3 |       ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 14.46
  4 |          ->  Seq Scan on t1                                       |     30 |       4 | 14.14
(4 rows)

Gather 窗口函数

要把窗口函数的结果收敛到单个DN需要满足：

窗口函数前后优化器估算的数据行数在阈值以下。
窗口函数的子节点均为Stream节点。

gaussdb=# set explain_perf_mode=pretty;
SET
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select count(*) over (partition by b) a from t1;
 id |                 operation                  | E-rows | E-width | E-costs 
----+--------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)               |     29 |       4 | 16.71
  2 |    ->  WindowAgg                           |     29 |       4 | 14.96
  3 |       ->  Sort                             |     29 |       4 | 14.75
  4 |          ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
  5 |             ->  Seq Scan on t1             |     30 |       4 | 14.14
(5 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select count(*) over (partition by b) a from t1;
 id |                              operation                               | E-rows | E-width | E-costs 
----+----------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                         |     30 |       4 | 19.07
  2 |    ->  WindowAgg                                                     |     30 |       4 | 16.38
  3 |       ->  Sort                                                       |     30 |       4 | 15.73
  4 |          ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3) |     30 |       4 | 14.46
  5 |             ->  Seq Scan on t1                                       |     30 |       4 | 14.14
(5 rows)

gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select sum(b) over (partition by b) a from t1 group by b;
 id |                   operation                   | E-rows | E-width | E-costs 
----+-----------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                  |     30 |       4 | 16.18
  2 |    ->  WindowAgg                              |     30 |       4 | 14.93
  3 |       ->  Sort                                |     30 |       4 | 14.78
  4 |          ->  HashAggregate                    |     30 |       4 | 14.59
  5 |             ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
  6 |                ->  Seq Scan on t1             |     30 |       4 | 14.14
(6 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select sum(b) over (partition by b) a from t1 group by b;
 id |                                operation                                | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                            |     30 |       4 | 18.00
  2 |    ->  WindowAgg                                                        |     30 |       4 | 16.13
  3 |       ->  Sort                                                          |     30 |       4 | 15.68
  4 |          ->  HashAggregate                                              |     30 |       4 | 14.87
  5 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode3) |     30 |       4 | 14.46
  6 |                ->  Seq Scan on t1                                       |     30 |       4 | 14.14
(6 rows)

Union/Union all

要把Union/Union all的结果收敛到单个DN需要满足：

子节点需要至少一个满足前面3个案例（Gather Join、Gather Groupby/Agg、Gather窗口函数）中的情况。

为了方便举例，设置join的子节点都为Stream节点，关闭了Broadcast。

gaussdb=# set explain_perf_mode=pretty;
SET
gaussdb=# set enable_broadcast=false;        
SET
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select t1.a, t2.b from t1, t2 where t1.b = t2.b union all select t3.a, t3.b from t3, t4 where t3.b = t4.b;
 id |                    operation                     | E-rows | E-width | E-costs 
----+--------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                     |     60 |       8 | 65.31
  2 |    ->  Result                                    |     60 |       8 | 62.81
  3 |       ->  Append(4, 10)                          |     60 |       8 | 62.81
  4 |          ->  Hash Join (5,7)                     |     30 |       8 | 31.30
  5 |             ->  Streaming(type: REDISTRIBUTE)    |     30 |       8 | 15.49
  6 |                ->  Seq Scan on t1                |     30 |       8 | 14.14
  7 |             ->  Hash                             |     29 |       4 | 15.49
  8 |                ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 15.49
  9 |                   ->  Seq Scan on t2             |     30 |       4 | 14.14
 10 |          ->  Hash Join (11,13)                   |     30 |       8 | 31.30
 11 |             ->  Streaming(type: REDISTRIBUTE)    |     30 |       8 | 15.49
 12 |                ->  Seq Scan on t3                |     30 |       8 | 14.14
 13 |             ->  Hash                             |     29 |       4 | 15.49
 14 |                ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 15.49
 15 |                   ->  Seq Scan on t4             |     30 |       4 | 14.14
(15 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   4 --Hash Join (5,7)
         Hash Cond: (t1.b = t2.b)
  10 --Hash Join (11,13)
         Hash Cond: (t3.b = t4.b)
(4 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select t1.a, t2.b from t1, t2 where t1.b = t2.b union all select t3.a, t3.b from t3, t4 where t3.b = t4.b;
 id |                                operation                                | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                            |     60 |       8 | 69.11
  2 |    ->  Append(3, 9)                                                     |     60 |       8 | 65.36
  3 |       ->  Hash Join (4,6)                                               |     30 |       8 | 32.38
  4 |          ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)    |     30 |       8 | 15.69
  5 |             ->  Seq Scan on t1                                          |     30 |       8 | 14.14
  6 |          ->  Hash                                                       |     30 |       4 | 15.69
  7 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 15.69
  8 |                ->  Seq Scan on t2                                       |     30 |       4 | 14.14
  9 |       ->  Hash Join (10,12)                                             |     30 |       8 | 32.38
 10 |          ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1)    |     30 |       8 | 15.69
 11 |             ->  Seq Scan on t3                                          |     30 |       8 | 14.14
 12 |          ->  Hash                                                       |     30 |       4 | 15.69
 13 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode1) |     30 |       4 | 15.69
 14 |                ->  Seq Scan on t4                                       |     30 |       4 | 14.14
(14 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   3 --Hash Join (4,6)
         Hash Cond: (t1.b = t2.b)
   9 --Hash Join (10,12)
         Hash Cond: (t3.b = t4.b)
(4 rows)

gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select t1.a, t2.b from t1, t2 where t1.b = t2.b union select t3.a, t3.b from t3, t4 where t3.b = t4.b order by a, b;
 id |                      operation                      | E-rows | E-width | E-costs 
----+-----------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                        |     60 |       8 | 66.09
  2 |    ->  Sort                                         |     60 |       8 | 63.59
  3 |       ->  HashAggregate                             |     60 |       8 | 63.11
  4 |          ->  Append(5, 11)                          |     60 |       8 | 62.81
  5 |             ->  Hash Join (6,8)                     |     30 |       8 | 31.30
  6 |                ->  Streaming(type: REDISTRIBUTE)    |     30 |       8 | 15.49
  7 |                   ->  Seq Scan on t1                |     30 |       8 | 14.14
  8 |                ->  Hash                             |     29 |       4 | 15.49
  9 |                   ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 15.49
 10 |                      ->  Seq Scan on t2             |     30 |       4 | 14.14
 11 |             ->  Hash Join (12,14)                   |     30 |       8 | 31.30
 12 |                ->  Streaming(type: REDISTRIBUTE)    |     30 |       8 | 15.49
 13 |                   ->  Seq Scan on t3                |     30 |       8 | 14.14
 14 |                ->  Hash                             |     29 |       4 | 15.49
 15 |                   ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 15.49
 16 |                      ->  Seq Scan on t4             |     30 |       4 | 14.14
(16 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   5 --Hash Join (6,8)
         Hash Cond: (t1.b = t2.b)
  11 --Hash Join (12,14)
         Hash Cond: (t3.b = t4.b)
(4 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select t1.a, t2.b from t1, t2 where t1.b = t2.b union select t3.a, t3.b from t3, t4 where t3.b = t4.b order by a, b;
 id |                                   operation                                   | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                                  |     60 |       8 | 71.93
  2 |    ->  Sort                                                                   |     60 |       8 | 68.18
  3 |       ->  HashAggregate                                                       |     60 |       8 | 66.26
  4 |          ->  Append(5, 11)                                                    |     60 |       8 | 65.36
  5 |             ->  Hash Join (6,8)                                               |     30 |       8 | 32.38
  6 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2)    |     30 |       8 | 15.69
  7 |                   ->  Seq Scan on t1                                          |     30 |       8 | 14.14
  8 |                ->  Hash                                                       |     30 |       4 | 15.69
  9 |                   ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2) |     30 |       4 | 15.69
 10 |                      ->  Seq Scan on t2                                       |     30 |       4 | 14.14
 11 |             ->  Hash Join (12,14)                                             |     30 |       8 | 32.38
 12 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2)    |     30 |       8 | 15.69
 13 |                   ->  Seq Scan on t3                                          |     30 |       8 | 14.14
 14 |                ->  Hash                                                       |     30 |       4 | 15.69
 15 |                   ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2) |     30 |       4 | 15.69
 16 |                      ->  Seq Scan on t4                                       |     30 |       4 | 14.14
(16 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   5 --Hash Join (6,8)
         Hash Cond: (t1.b = t2.b)
  11 --Hash Join (12,14)
         Hash Cond: (t3.b = t4.b)
(4 rows)
     

gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select b, count(*) from t1 group by b union all select b, count(*) from t2 group by b order by b;
 id |                    operation                     | E-rows | E-width | E-costs 
----+--------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                     |     60 |      12 | 32.43
  2 |    ->  Sort                                      |     60 |      12 | 29.93
  3 |       ->  Result                                 |     60 |      12 | 29.45
  4 |          ->  Append(5, 8)                        |     60 |      12 | 29.45
  5 |             ->  HashAggregate                    |     30 |      12 | 14.62
  6 |                ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
  7 |                   ->  Seq Scan on t1             |     30 |       4 | 14.14
  8 |             ->  HashAggregate                    |     30 |      12 | 14.62
  9 |                ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
 10 |                   ->  Seq Scan on t2             |     30 |       4 | 14.14
(10 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select b, count(*) from t1 group by b union all select b, count(*) from t2 group by b order by b;
 id |                                operation                                | E-rows | E-width | E-costs 
----+-------------------------------------------------------------------------+--------+---------+---------
  1 | ->  Streaming (type: GATHER)                                            |     60 |      12 | 36.22
  2 |    ->  Sort                                                             |     60 |      12 | 32.47
  3 |       ->  Append(4, 7)                                                  |     60 |      12 | 30.55
  4 |          ->  HashAggregate                                              |     30 |      12 | 14.97
  5 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2) |     30 |       4 | 14.46
  6 |                ->  Seq Scan on t1                                       |     30 |       4 | 14.14
  7 |          ->  HashAggregate                                              |     30 |      12 | 14.97
  8 |             ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2) |     30 |       4 | 14.46
  9 |                ->  Seq Scan on t2                                       |     30 |       4 | 14.14
(9 rows)
gaussdb=# set enable_dngather=false;
SET
gaussdb=# explain select b, count(*) from t1 group by b union all select count(distinct a) a , count(distinct b)b from t2 order by b;
 id |                          operation                           | E-rows | E-width |    E-costs     
----+--------------------------------------------------------------+--------+---------+----------------
  1 | ->  Streaming (type: GATHER)                                 |     33 |      12 | 20000000045.02
  2 |    ->  Sort                                                  |     33 |      12 | 20000000043.65
  3 |       ->  Append(4, 8)                                       |     33 |      12 | 20000000043.43
  4 |          ->  Subquery Scan on "*SELECT* 1"                   |     30 |      12 | 14.72
  5 |             ->  HashAggregate                                |     30 |      12 | 14.62
  6 |                ->  Streaming(type: REDISTRIBUTE)             |     30 |       4 | 14.45
  7 |                   ->  Seq Scan on t1                         |     30 |       4 | 14.14
  8 |          ->  Subquery Scan on "*SELECT* 2"                   |      1 |      16 | 20000000028.73
  9 |             ->  Nested Loop (10,14)                          |      3 |      16 | 20000000028.70
 10 |                ->  Aggregate                                 |      3 |      12 | 10000000014.18
 11 |                   ->  Streaming(type: BROADCAST)             |      9 |      12 | 10000000014.18
 12 |                      ->  Aggregate                           |      3 |      12 | 14.19
 13 |                         ->  Seq Scan on t2                   |     30 |       4 | 14.14
 14 |                ->  Materialize                               |      3 |       8 | 10000000014.49
 15 |                   ->  Aggregate                              |      3 |      12 | 10000000014.48
 16 |                      ->  Streaming(type: BROADCAST)          |      9 |      12 | 10000000014.48
 17 |                         ->  Aggregate                        |      3 |      12 | 14.48
 18 |                            ->  Streaming(type: REDISTRIBUTE) |     30 |       4 | 14.45
 19 |                               ->  Seq Scan on t2             |     30 |       4 | 14.14
(19 rows)

 Predicate Information (identified by plan id) 
-----------------------------------------------
   8 --Subquery Scan on "*SELECT* 2"
         Filter: (Hash By "*SELECT* 2".a)
(2 rows)

gaussdb=# set enable_dngather=true;
SET
gaussdb=# explain select b, count(*) from t1 group by b union all select count(distinct a) a , count(distinct b)b from t2 order by b;
 id |                                 operation                                  | E-rows | E-width |    E-costs     
----+----------------------------------------------------------------------------+--------+---------+----------------
  1 | ->  Streaming (type: GATHER)                                               |     33 |      11 | 20000000046.96
  2 |    ->  Sort                                                                |     33 |      11 | 20000000044.90
  3 |       ->  Append(4, 8)                                                     |     33 |      11 | 20000000043.99
  4 |          ->  Subquery Scan on "*SELECT* 1"                                 |     30 |      12 | 15.27
  5 |             ->  HashAggregate                                              |     30 |      12 | 14.97
  6 |                ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2) |     30 |       4 | 14.46
  7 |                   ->  Seq Scan on t1                                       |     30 |       4 | 14.14
  8 |          ->  Streaming(type: REDISTRIBUTE ng: node_group->datanode2)       |      3 |      16 | 20000000028.83
  9 |             ->  Nested Loop (10,14)                                        |      3 |      16 | 20000000028.70
 10 |                ->  Aggregate                                               |      3 |      12 | 10000000014.18
 11 |                   ->  Streaming(type: BROADCAST)                           |      9 |      12 | 10000000014.18
 12 |                      ->  Aggregate                                         |      3 |      12 | 14.19
 13 |                         ->  Seq Scan on t2                                 |     30 |       4 | 14.14
 14 |                ->  Materialize                                             |      3 |       8 | 10000000014.50
 15 |                   ->  Aggregate                                            |      3 |      12 | 10000000014.48
 16 |                      ->  Streaming(type: BROADCAST)                        |      9 |      12 | 10000000014.48
 17 |                         ->  Aggregate                                      |      3 |      12 | 14.48
 18 |                            ->  Streaming(type: REDISTRIBUTE)               |     30 |       4 | 14.45
 19 |                               ->  Seq Scan on t2                           |     30 |       4 | 14.14
(19 rows)

父主题： 实际调优案例

上一篇：案例：调整基于代价的查询重写GUC参数costbased_rewrite_rule

下一篇：案例：使用全局二级索引加速查询

意见反馈

文档内容是否对您有帮助？

有帮助没帮助

提供反馈

提交成功！非常感谢您的反馈，我们会继续努力做到更好！您可在我的云声建议查看反馈及问题处理状态。

系统繁忙，请稍后重试

在使用文档中是否遇到以下问题

内容与产品页面不一致

内容不易理解

缺失示例代码

步骤不可操作

搜不到想要的内容

缺少最佳实践

意见反馈（选填）

0/500

请至少选择一项反馈信息并填写问题反馈

字符长度不能超过500

直接提交取消

如您有其它疑问，您也可以通过华为云社区问答频道来与我们联系探讨

智能客服提问云社区提问

案例：使用DN Gather减少计划中的Stream节点

案例环境准备

Gather Join

Gather Groupby/Agg

Gather 窗口函数

Union/Union all

相关文档

意见反馈

文档内容是否对您有帮助？

7*24

备案

专业服务

退订

建议反馈

售前咨询热线

文档反馈