Painel
O painel exibe vários gráficos, como gráficos de linhas e dígitos, na mesma tela para apresentar dados de recursos, ajudando você a entender de forma abrangente os dados de monitoramento.
Verificar e alternar visualizações
- Efetue logon no console do CCE e clique no nome do cluster para acessar o console do cluster.
- Escolha Monitoring Center no painel de navegação e clique na guia Dashboard. A visualização de cluster é exibida por padrão.
- Configurar parâmetros relacionados para verificação de visualizações. Os parâmetros disponíveis para configuração variam de acordo com as vistas. Veja Tabela 1 para mais detalhes.
- Especifique a janela de exibição.
Selecione ou personalize segmentos de tempo no canto superior direito da página e clique em
para atualizar a página.
- O painel fornece visualizações predefinidas. Você pode clicar no botão Switch View ao lado do nome da exibição para selecionar os dados de monitoramento a serem exibidos. Tabela 1 descreve as visualizações predefinidas.
Tabela 1 Visualizações predefinidas Nome da visualização
Parâmetro
Métrica de nonitoramento incluída
Visualização de cluster (padrão)
Cluster
- Nodes
- Nodes with Unavailable Disks
- Unavailable Nodes
- CPU Usage
- CPU Request
- CPU Limit
- Memory Usage
- Memory Request
- Memory Limit
- Pods
- Containers
- Used CPUs
- Used Memory
- Network Receive Rate
- Network Transmit Rate
- Average Network Receive Rate
- Average Network Transmit Rate
- Packet Receive Rate
- Packet Transmit Rate
- Packet Loss Rate (Receive)
- Packet Loss Rate (Transmit)
- Disk I/O Rate (Read + Write)
- Disk Throughput (Read + Write)
Visualização do servidor da API
- Cluster
- Pod
- Alived
- QPS
- Request Success Rate (Read)
- Requests Being Processed
- Request Rate (Read)
- Request Error Rate (Read)
- P99 Request Latency (Read)
- Request Rate (Write)
- Request Error Rate (Write)
- P99 Request Latency (Write)
- Work Queue Growth Rate
- Work Queue Depth
- Work Queue Latency (P99)
- Used Memory
- Used CPUs
- Goroutines
Visualização do Pod
- Cluster
- Namespace
- Pod
- Containers
- Running Containers
- Pod Status
- Container Restarts
- Used CPUs
- CPU Throttling
- Used Memory
- Network Receive Rate
- Network Transmit Rate
- Packet Receive Rate
- Packet Transmit Rate
- Packet Loss Rate (Receive)
- Packet Loss Rate (Transmit)
- Pod Disk I/O Rate (Read + Write)
- Pod Disk Throughput (Read + Write)
- Container Disk I/O Rate (Read + Write)
- Container Disk Throughput (Read + Write)
- File System Usage
- Used File System Space
Visualização de host
- Cluster
- Node
- CPU Usage
- Load Average
- Used Memory
- Memory Usage
- Disk Write Rate
- Disk Read Rate
- Disk Space Usage
- Disk I/O
- TCP Connection
- UDP Usage
- Max. File Descriptor
- Used File Descriptors
- Socket Usage
- Abnormal File System
- Disk Rate
- I/O Latency
- I/O Queues
- Process Status
k8s-node
- Cluster
- Nó
- CPU Usage
- CPU Request
- CPU Limit
- Memory Usage
- Memory Request
- Memory Limit
- Used Memory
- Network Receive Rate
- Network Transmit Rate
- Network Receive Rate (Pod)
- Network Transmit Rate (Pod)
- Packet Receive Rate
- Packet Transmit Rate
- Packet Loss Rate (Receive)
- Packet Loss Rate (Transmit)
- Node Disk I/O Rate (Read + Write)
- Node Disk Throughput (Read + Write)
CoreDNS
- Cluster
- Pod
- Request Rate
- Request Rate (Record Type)
- Request Rate (Region)
- Request Rate (DO Flag)
- Request Packet (UDP)
- Request Packet (TCP)
- Response Rate (by rcode)
- Response Rate (duration)
- Response Packet (UDP)
- Response Packet (TCP)
- Cached Records
- Cache Hit Ratio
Visualização de PVC (somente clusters do CCE)
- Cluster
- Namespace
- PV
- PVC
- PV Status
- PVC Status
- Used PVC Capacity
- PVC Usage
- PVC Inodes
- PVC Inodes Usage
- PVC Capacity Used per Hour
- PVC Capacity Used per Day
- PVC Capacity Used per Week
- Used PVC Capacity After One Week
kubelet
- Cluster
- Pod
- Running kubelets
- Running Pods
- Running Containers
- Actual Volumes
- Expected Volumes
- Configuration Errors
- Operation Rate
- Operation Error Rate
- Operation Latency
- Pod Startup Rate
- Pod Startup Latency (P99)
- Storage Operation Rate
- Storage Operation Error Rate
- Storage Operation Latency (P99)
- Cgroup Manager Operation Rate
- Cgroup Manager Operation Latency (P99)
- PLEG Relist Rate
- PLEG Relist Interval
- PLEG Relist Latency (P99)
- RPC Rate
- Request Latency (P99)
- Used Memory
- Used CPUs
- Goroutines
Prometheus
- Cluster
- Tarefa
- Pod
- Target Synchronization
- Targets
- Average Scrape Interval
- Scrape Failures
- Sample Adding Rate
- Series in the Head
- Head Chunks
- Query Rate
- P90 Query Duration
Gravação remota de Prometheus
- Cluster
- Pod
- url
- Remote Sample Lag Ratio
- Remote Write Traffic
- Current Shards
- Max Shards
- Min Shards
- Desired Shards
- Shard Capacity
- Pending Samples
- Current TSDB Segment
- Current Segment of Remote Write
- Sample Discard Rate
- Sample Failure Rate
- Sample Retry Rate
- Retry Rate of Enqueuing
Visualização do pool de nós
- Cluster
- Pool de nós
- Node Pool CPU Allocation Rate
- Node CPU Allocation Rate
- Node Pool Memory Allocation Rate
- Node Memory Allocation Rate
- Node Count Trend
Visualização de XGPU
Cluster
- Cluster - XGPU Device GPU Memory Usage
- Cluster - XGPU Device Computing Usage
- Node - XGPU Device GPU Memory Usage
- Node - XGPU Device Computing Usage
- Node - Number of XGPU Devices
- Node - Allocated XGPU Device GPU Memory
- GPU - XGPU Device GPU Memory Usage
- GPU - Allocated XGPU Device GPU Memory
- GPU - XGPU Device GPU Memory Allocation Rate
- GPU - XGPU Device Computing Usage
- GPU - Number of XGPU Devices
- GPU - Scheduling Policy
- GPU - Number of Unhealthy XGPU Devices
- Container - Allocated GPU Memory
- Container - Computing Usage
- Container - Used GPU Memory
- Container - GPU Memory Usage