【TiDB Environment】
TiDB 4.0.7
tidb-monitor YAML file:
apiVersion: pingcap.com/v1alpha1
kind: TidbMonitor
metadata:
  name: tidb
spec:
  clusters:
  - name: tidb
  persistent: true
  storageClassName: ebs-sc
  storage: 50G
  prometheus:
    baseImage: prom/prometheus
    version: v2.18.1
    service:
      type: NodePort
  grafana:
    baseImage: grafana/grafana
    version: 6.1.6
    service:
      type: NodePort
  initializer:
    baseImage: pingcap/tidb-monitor-initializer
    version: v4.0.13
  reloader:
    baseImage: pingcap/tidb-monitor-reloader
    version: v1.0.1
  imagePullPolicy: IfNotPresent
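【Overview】While using the officially provided monitoring, I found that some of the metrics are missing:
TiDB Binlog metrics: missing
TiFlash: all metrics missing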
PD_incorrect_namespace_region_count
count(changes(pd_server_tso{type="save"}[10m]) > 0) >= 2
PD_no_store_for_making_replica
PD_system_time_slow
go_memstats_heap_inuse_bytes: changed to {cluster="tidb"}
TiDB_monitor_keep_alive: changed to {cluster="tidb"}
TiDB_server_event_error: changed to type="start|hang"
TiDB_server_panic_total: missing
TiDB_tikvclient_gc_action_fail: missing
TiKV_batch_request_snapshot_nums: no {name=~"cop_.*"}
TiKV_channel_full_total: no tikv_channel_full_total
TiKV_coprocessor_pending_request: no tikv_coprocessor_pending_request
TiKV_coprocessor_request_lock_error: no reason="lock"
TiKV_leader_drops: no tikv_pd_heartbeat_tick_total
TiKV_memory_used_too_fast: no component=~"tikv"
TiKV_raft_process_tick_duration_secs: no type="tick"
TiKV_thread_apply_worker_cpu_seconds: no name="apply_worker"
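
For anyone who wants to reproduce a check like the one above, here is a minimal Python sketch, not an official tool. It asks Prometheus for every metric name it knows through the standard HTTP API endpoint /api/v1/label/__name__/values and reports which expected names are absent. The function names, the EXPECTED list (a few metric names taken from the list above), and the Prometheus address in the usage note are assumptions for illustration only. It checks metric names only, not label values such as type="tick".

```python
import json
import urllib.request

# A few metric names taken from the list in this post (illustrative, not exhaustive).
EXPECTED = [
    "pd_server_tso",
    "go_memstats_heap_inuse_bytes",
    "tikv_channel_full_total",
    "tikv_coprocessor_pending_request",
    "tikv_pd_heartbeat_tick_total",
]


def fetch_metric_names(base_url, timeout=10):
    """Return the set of metric names a Prometheus server currently knows."""
    url = base_url.rstrip("/") + "/api/v1/label/__name__/values"
    with urllib.request.urlopen(url, timeout=timeout) as resp:
        payload = json.load(resp)
    if payload.get("status") != "success":
        raise RuntimeError("unexpected Prometheus response: %r" % (payload,))
    return set(payload["data"])


def missing_metrics(expected, available):
    """Return the expected names that are not in `available`, keeping input order."""
    return [name for name in expected if name not in available]


def report(base_url):
    """Print one line per expected metric that Prometheus does not expose."""
    available = fetch_metric_names(base_url)
    for name in missing_metrics(EXPECTED, available):
        print("missing:", name)
```

Usage, assuming the Prometheus NodePort service is reachable at a hypothetical address: report("http://127.0.0.1:9090"). A name that shows up here was never scraped at all, which is a different case from a metric that exists but lacks the label an alert rule expects.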