Updated on 2024-05-31 GMT+08:00

CCE Node Problem Detector Release History

Table 1 Release history

Add-on Version

Supported Cluster Version

New Feature

Community Version

1.19.1

v1.21

v1.23

v1.25

v1.27

v1.28

v1.29

Fixed some issues.

0.8.10

1.19.0

v1.21

v1.23

v1.25

v1.27

v1.28

Fixed some issues.

0.8.10

1.18.48

v1.21

v1.23

v1.25

v1.27

v1.28

Fixed some issues.

0.8.10

1.18.46

v1.21

v1.23

v1.25

v1.27

v1.28

CCE clusters 1.28 are supported.

0.8.10

1.18.22

v1.19

v1.21

v1.23

v1.25

v1.27

None

0.8.10

1.18.14

v1.19

v1.21

v1.23

v1.25

  • Supported anti-affinity scheduling of pods on nodes in different AZs.
  • Allows adding a taint to a node before the release of a spot ECS for the node to repel a set of pods.
  • Synchronizes time zones used by add-ons and nodes.

0.8.10

1.18.10

v1.19

v1.21

v1.23

v1.25

  • Optimizes the configuration page.
  • Adds threshold configuration to the DiskSlow check item.
  • Adds threshold configuration to the NTPProblem check item.
  • Supported anti-affinity scheduling of pods on nodes in different AZs.
  • Supports interruption detection for spot ECSs and evicts pods on nodes before the interruption.

0.8.10

1.17.4

v1.17

v1.19

v1.21

v1.23

v1.25

Optimizes DiskHung check item.

0.8.10

1.17.3

v1.17

v1.19

v1.21

v1.23

v1.25

  • The maximum number of taint nodes that can be added to the NPC can be configured by percentage.
  • Adds the ProcessZ check item.
  • Adds the time deviation detection to the NTPProblem check item.
  • Fixes the processes consistently in the D state (exist in the BMS node).

0.8.10

1.17.2

v1.17

v1.19

v1.21

v1.23

v1.25

  • Adds the DiskHung check item for disk I/O.
  • Adds the DiskSlow check item for disk I/O.
  • Adds the ProcessD check item.
  • Adds MountPointProblem to check the health of mount points.
  • To avoid conflicts with the service port range, the default health check listening port is changed to 19900, and the default Prometheus metric exposure port is changed to 19901.
  • Supports clusters 1.25.

0.8.10

1.16.4

v1.17

v1.19

v1.21

v1.23

  • Adds the beta check item ScheduledEvent to detect cold and live VM migration events caused by host machine exceptions using the metadata API. This check item is disabled by default.

0.8.10

1.16.3

v1.17

v1.19

v1.21

v1.23

Adds the function of checking the ResolvConf configuration file.

0.8.10

1.16.1

v1.17

v1.19

v1.21

v1.23

  • Adds node-problem-controller. Supports basic fault isolation.
  • Adds the PID, FD, disk, memory, temporary volume pool, and PV pool check items.

0.8.10

1.15.0

v1.17

v1.19

v1.21

v1.23

  • Hardens check items comprehensively to avoid false positives.
  • Supports kernel check. Supports reporting of OOMKilled and TaskHung events.

0.8.10

1.14.11

v1.17

v1.19

v1.21

CCE clusters 1.21 are supported.

0.7.1

1.14.5

v1.17

v1.19

Fixes the issue that monitoring metrics cannot be obtained.

0.7.1

1.14.4

v1.17

v1.19

  • Supports containerd nodes.

0.7.1

1.14.2

v1.17

v1.19

  • CCE clusters 1.19 are supported.
  • Supported Ubuntu OS and Kata containers.

0.7.1

1.13.8

v1.15.11

v1.17

  • Fixes the CNI health check issue on the container tunnel network.
  • Adjusts resource quotas.

0.7.1

1.13.6

v1.15.11

v1.17

Fixes the issue that zombie processes are not reclaimed.

0.7.1

1.13.5

v1.15.11

v1.17

Adds taint tolerance configuration.

0.7.1

1.13.2

v1.15.11

v1.17

Adds resource restrictions and enhances the detection capability of the cni add-on.

0.7.1