ZooKeeper Cluster Architecture Document

Contents

1. Environment Information
2. Cluster Architecture Overview
3. Pod Details
4. Container Architecture
5. Storage Architecture
6. Network Architecture
7. ZooKeeper Configuration
8. Scheduling and Affinity
9. Security and Authentication
10. Monitoring and Health Checks
11. Operator Management
12. Resource Summary
13. Custom Resource (CRD)
14. Labels
15. Finalizers
16. Connection Examples
17. Operations Reference
18. Appendix

Cluster Identity

| Property | Value |
|---|---|
| Cluster name | zookeeper-e93aadcf |
| Namespace | qfusion-admin |

1. Environment Information

1.1 Kubernetes Cluster

| Property | Value |
|---|---|
| KUBECONFIG | /bpx/.145-admin.conf |
| Kubernetes version | v1.24.10 |
| Network plugin | Cilium CNI |
| Container runtime | containerd://1.7.3 |

1.2 Worker Node Distribution

| Node | IP address | Operating system | Role | ZooKeeper Pod |
|---|---|---|---|---|
| qfusion1 | x.x.x.145 | openEuler 22.03 (LTS-SP1) | control-plane,master | zookeeper-e93aadcf-0-0 |
| qfusion2 | x.x.x.148 | openEuler 22.03 (LTS-SP1) | control-plane,master | zookeeper-e93aadcf-observer-0-0 |
| qfusion3 | x.x.x.150 | openEuler 22.03 LTS | control-plane,master | zookeeper-e93aadcf-2-0 |
| qfusion4 | x.x.x.147 | Kylin Linux V10 (Halberd) | worker | zookeeper-e93aadcf-1-0 |

2. Cluster Architecture Overview

2.1 Cluster Topology

    ZooKeeper Cluster: zookeeper-e93aadcf (version 3.7.1)

    Quorum ensemble (3 nodes)

    ┌──────────────────┐  ┌──────────────────┐  ┌──────────────────┐
    │ Replica 0        │  │ Replica 1        │  │ Replica 2        │
    │ Participant      │  │ Participant      │  │ Participant      │
    │                  │  │                  │  │                  │
    │ Node: qfusion1   │  │ Node: qfusion4   │  │ Node: qfusion3   │
    │ IP: 245.0.0.94   │  │ IP: 245.0.3.243  │  │ IP: 245.0.2.35   │
    │ myid: 0          │  │ myid: 1          │  │ myid: 2          │
    └────────┬─────────┘  └────────┬─────────┘  └────────┬─────────┘
             └─────────────────────┼─────────────────────┘
                       ┌───────────┴───────────┐
                       │    Leader Election    │
                       │      (Port 3888)      │
                       └───────────────────────┘
                                   │
                                   ▼
    Observer node (non-voting)

    ┌──────────────────┐
    │ Observer-0       │
    │                  │
    │ Node: qfusion2   │
    │ IP: 245.0.1.95   │
    │ myid: 3          │
    └──────────────────┘

The observer does not take part in leader election or in write quorums. It serves reads from its local copy of the data and forwards write requests to the leader, so it is not a strictly read-only replica.

2.2 Cluster Size Summary

| Metric | Value |
|---|---|
| Participant replicas | 3 |
| Observer replicas | 1 |
| Total Pods | 4 |
| Total containers | 8 (2 per Pod) |
| Total storage capacity | 80Gi (4 × 20Gi) |
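
The replica counts above determine how many failures the ensemble can absorb. The following minimal sketch works through the majority-quorum arithmetic for this cluster. The function names are illustrative, not part of any ZooKeeper API, and the sketch assumes the default majority quorum, in which only participants count and the observer is ignored.

```python
# Majority-quorum arithmetic for a ZooKeeper ensemble (illustrative helpers).
# Assumption: default majority quorums; observers never vote.

def quorum_size(participants: int) -> int:
    """Smallest number of voting members that forms a strict majority."""
    return participants // 2 + 1


def tolerated_failures(participants: int) -> int:
    """Voting members that can fail while a majority is still reachable."""
    return participants - quorum_size(participants)


PARTICIPANTS = 3  # Participant replicas, from section 2.2
OBSERVERS = 1     # Observer replicas; excluded from the quorum calculation

print("quorum size:", quorum_size(PARTICIPANTS))
print("tolerated participant failures:", tolerated_failures(PARTICIPANTS))
```

With three participants the quorum is two, so the cluster keeps accepting writes with one participant down. Adding observers raises read capacity without changing these numbers.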

3. Pod Details

3.1 Participant Nodes

Replica 0

| Property | Value |
|---|---|
| Pod name | zookeeper-e93aadcf-0-0 |
| Namespace | qfusion-admin |
| Role | Participant (voting member) |
| StatefulSet | zookeeper-e93aadcf-0 |
| Node | qfusion1 (x.x.x.145) |
| Pod IP | 245.0.0.94 |
| FQDN | zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| Status | Running |
| Ready | 2/2 |
| Restarts | 0 |
| Created | 2025-12-24 22:07:29 |
| myid | 0 |
| Zone | zone1 |

Replica 1

| Property | Value |
|---|---|
| Pod name | zookeeper-e93aadcf-1-0 |
| Namespace | qfusion-admin |
| Role | Participant (voting member) |
| StatefulSet | zookeeper-e93aadcf-1 |
| Node | qfusion4 (x.x.x.147) |
| Pod IP | 245.0.3.243 |
| FQDN | zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| Status | Running |
| Ready | 2/2 |
| Restarts | 2 |
| Created | 2025-12-24 22:07:49 |
| myid | 1 |

Replica 2

| Property | Value |
|---|---|
| Pod name | zookeeper-e93aadcf-2-0 |
| Namespace | qfusion-admin |
| Role | Participant (voting member) |
| StatefulSet | zookeeper-e93aadcf-2 |
| Node | qfusion3 (x.x.x.150) |
| Pod IP | 245.0.2.35 |
| FQDN | zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| Status | Running |
| Ready | 2/2 |
| Restarts | 0 |
| Created | 2025-12-24 22:08:41 |
| myid | 2 |

3.2 Observer Node

| Property | Value |
|---|---|
| Pod name | zookeeper-e93aadcf-observer-0-0 |
| Namespace | qfusion-admin |
| Role | Observer (non-voting; serves reads and forwards writes to the leader) |
| StatefulSet | zookeeper-e93aadcf-observer-0 |
| Node | qfusion2 (x.x.x.148) |
| Pod IP | 245.0.1.95 |
| FQDN | zookeeper-e93aadcf-observer-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local |
| Status | Running |
| Ready | 2/2 |
| Restarts | 0 |
| Created | 2025-12-24 22:09:15 |
| myid | 3 |

4. Container Architecture

4.1 Pod Container Composition

Each ZooKeeper Pod contains 2 containers:

    Pod
    │
    ├── Container 1: zookeeper (main container)
    │     Image:        k8smaster.qfusion.irds/irds/zookeeper-3.7:v1.1.0
    │     Image ID:     sha256:15681693...
    │     Command:      /usr/local/bin/zookeeperStart.sh
    │     Pull Policy:  Always
    │
    │     Resources:
    │       CPU Request:     500m
    │       CPU Limit:       1000m
    │       Memory Request:  2Gi
    │       Memory Limit:    2Gi
    │
    │     Ports:
    │       2181/tcp  client           (client connections)
    │       2888/tcp  quorum           (ensemble communication)
    │       3888/tcp  leader-election  (leader election)
    │       7000/tcp  metrics          (Prometheus metrics)
    │       8080/tcp  admin-server     (Jetty AdminServer)
    │
    │     Health Checks:
    │       Liveness:   exec [zookeeperLive.sh]
    │                   delay=10s, timeout=10s, period=10s
    │       Readiness:  exec [zookeeperReady.sh]
    │                   delay=10s, timeout=10s, period=10s
    │
    │     Lifecycle:
    │       PreStop:    exec [zookeeperTeardown.sh]
    │
    │     Volume Mounts:
    │       /conf           ConfigMap (zookeeper-e93aadcf-configmap)
    │       /data           PVC (persistent data)
    │       /mnt/log        EmptyDir (logs)
    │       /etc/localtime  HostPath (time zone)
    │
    └── Container 2: ssh (backup/management container)
          Image:        k8smaster.qfusion.irds/irds/zookeeper-backup:v3.14.4
          Image ID:     sha256:47b43198...
          Pull Policy:  IfNotPresent

          Resources:
            CPU Request:     250m
            CPU Limit:       500m
            Memory Request:  1Gi
            Memory Limit:    1Gi

          Security Context:
            Privileged: true
            Capabilities:
              - SYS_ADMIN
              - DAC_READ_SEARCH

          Volume Mounts:
            /conf           ConfigMap
            /data           PVC (shares the main container's data for backups)
            /mnt/log        EmptyDir
            /etc/localtime  HostPath

4.2 Container Environment Variables

zookeeper container environment variables

| Variable | Value | Description |
|---|---|---|
| ZOO_ROLE | participant | Node role |
| ZOO_LOG_DIR | /mnt/log | Log directory |
| ZOO_LOG4J_PROP | LOG,CONSOLE,ROLLINGFILE | Log output configuration |
| ZK_SERVER_HEAP | 1536 | Server heap size (MB) |
| SERVER_JVMFLAGS | -Xms512m -XX:+AlwaysPreTouch -Djute.maxbuffer=8388608 -XX:MaxGCPauseMillis=50 | JVM flags |
| ENABLE_ACL | false | Whether ACLs are enabled |
| ZOO_SUPER_ID | (from Secret) | Superuser ID |
| ZOO_SUPER_DIGEST | (from Secret) | Superuser digest |
| CLIENT_JVMFLAGS | (empty) | Client JVM flags |
| ZU_JVMFLAGS | (empty) | Additional JVM flags |

5. Storage Architecture

5.1 Persistent Volume Claims (PVC)

| PVC name | Capacity | Access mode | Storage class | Bound PV | Bound node |
|---|---|---|---|---|---|
| data-zookeeper-e93aadcf-0-0 | 20Gi | RWO | csi-localpv | data-zookeeper-e93aadcf-0-0-qfusion1-f7aa7c0f | qfusion1 |
| data-zookeeper-e93aadcf-1-0 | 20Gi | RWO | csi-localpv | data-zookeeper-e93aadcf-1-0-qfusion4-fc233897 | qfusion4 |
| data-zookeeper-e93aadcf-2-0 | 20Gi | RWO | csi-localpv | data-zookeeper-e93aadcf-2-0-qfusion3-5ea93ba0 | qfusion3 |
| data-zookeeper-e93aadcf-observer-0-0 | 20Gi | RWO | csi-localpv | data-zookeeper-e93aadcf-observer-0-0-qfusion2-3b05d9e5 | qfusion2 |

5.2 Storage Directory Layout

    /mnt/log/                  # EmptyDir: log directory
    └── server.log             # ZooKeeper server log

    /data/                     # PVC: data directory
    ├── version-2/             # snapshots and transaction logs
    │   ├── snapshot.0         # snapshot file
    │   └── log.0              # transaction log
    ├── zoo.cfg.dynamic        # dynamic configuration file
    └── myid                   # server ID file

    /conf/                     # ConfigMap: configuration directory
    ├── zoo.cfg                # main configuration file
    ├── log4j.properties       # logging configuration
    ├── jaas.config            # SASL authentication configuration
    └── env.sh                 # environment variable script

5.3 Storage Policy

| Property | Value |
|---|---|
| Storage type | Persistence (persistent) |
| Reclaim policy | Delete (the PV is deleted automatically when its PVC is deleted) |
| Volume mode | Filesystem |
| IOPS quota | 2000 |
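
The heap setting in section 4.2 and the memory limit in section 4.1 can be checked against each other. The sketch below is illustrative only: it assumes that `ZK_SERVER_HEAP=1536` becomes a JVM maximum heap of 1536 MiB (as the stock `zkEnv.sh` does with `-Xmx${ZK_SERVER_HEAP}m`), which this document does not confirm for the custom image. The helper names are hypothetical.

```python
# Compare the JVM heap with the container memory limit (illustrative helpers).
# Assumption: ZK_SERVER_HEAP is applied as -Xmx<value>m, i.e. a size in MiB.

# Binary suffixes used by Kubernetes resource quantities (subset).
BINARY_UNITS = {"Ki": 1024, "Mi": 1024**2, "Gi": 1024**3}


def parse_quantity(quantity: str) -> int:
    """Convert a quantity such as '2Gi' into a number of bytes."""
    for suffix, factor in BINARY_UNITS.items():
        if quantity.endswith(suffix):
            return int(quantity[: -len(suffix)]) * factor
    return int(quantity)  # plain number of bytes


def heap_headroom_mib(heap_mib: int, memory_limit: str) -> int:
    """MiB left under the container limit once the heap is fully used."""
    return (parse_quantity(memory_limit) - heap_mib * 1024**2) // 1024**2


ZK_SERVER_HEAP = 1536  # from section 4.2
MEMORY_LIMIT = "2Gi"   # zookeeper container limit, from section 4.1

print("headroom (MiB):", heap_headroom_mib(ZK_SERVER_HEAP, MEMORY_LIMIT))
```

Whatever is left under the limit has to cover metaspace, thread stacks, and direct buffers, so a small or negative result would point to a risk of the container being OOM-killed.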

6. Network Architecture

6.1 Service List

| Service | Type | Cluster IP | Ports | Selector |
|---|---|---|---|---|
| zookeeper-e93aadcf-client | ClusterIP | 246.102.223.122 | 2181/TCP | app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-admin-server | ClusterIP | 246.99.195.124 | 8080/TCP | app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-headless | ClusterIP (None) | None | 2181,2888,3888,7000,8080 | app=zookeeper-e93aadcf,kind=ZookeeperMember |
| zookeeper-e93aadcf-client-lb | ClusterIP | 246.104.64.160 | 31144/TCP | app=zookeeper-e93aadcf,kind=ZookeeperMember |

6.2 Port Mapping

| Internal port | Name | Protocol | Purpose |
|---|---|---|---|
| 2181 | client | TCP | Client port; handles client requests |
| 2888 | quorum | TCP | Communication between ensemble members; used for data synchronization |
| 3888 | leader-election | TCP | Leader election |
| 7000 | metrics | TCP | Prometheus metrics export |
| 8080 | admin-server | TCP | Jetty AdminServer; exposes commands and monitoring information |

6.3 Network Endpoints

Internal client endpoint

    zookeeper-e93aadcf-client.qfusion-admin.svc.cluster.local:2181
    246.102.223.122:2181

Headless service endpoints (direct access to each node)

    zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
    zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
    zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181
    zookeeper-e93aadcf-observer-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181

6.4 Endpoints Details

Client Service Endpoints

    245.0.0.94:2181   (Replica-0)
    245.0.3.243:2181  (Replica-1)
    245.0.2.35:2181   (Replica-2)
    245.0.1.95:2181   (Observer-0)

Admin Server Endpoints

    245.0.0.94:8080   (Replica-0)
    245.0.3.243:8080  (Replica-1)
    245.0.2.35:8080   (Replica-2)
    245.0.1.95:8080   (Observer-0)

Headless Service Endpoints

    245.0.0.94:2181,2888,3888,7000,8080   (Replica-0)
    245.0.3.243:2181,2888,3888,7000,8080  (Replica-1)
    245.0.2.35:2181,2888,3888,7000,8080   (Replica-2)
    245.0.1.95:2181,2888,3888,7000,8080   (Observer-0)
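
The headless endpoints in section 6.3 follow a regular naming pattern, so a client connection string can be assembled rather than typed by hand. This is a minimal sketch under that assumption. The helper names are hypothetical, and the pattern is inferred from the endpoints listed in this document, not from the operator's source.

```python
# Build a ZooKeeper connection string from the headless-service naming pattern.
# Assumption: member DNS names follow the pattern listed in section 6.3.

CLUSTER = "zookeeper-e93aadcf"
NAMESPACE = "qfusion-admin"
CLIENT_PORT = 2181


def headless_fqdn(member: str) -> str:
    """DNS name of one member behind the headless service."""
    return f"{CLUSTER}-{member}.{CLUSTER}-headless.{NAMESPACE}.svc.cluster.local"


def connect_string(members: list[str]) -> str:
    """Comma-separated host:port list in the form ZooKeeper clients accept."""
    return ",".join(f"{headless_fqdn(m)}:{CLIENT_PORT}" for m in members)


# Participants only; append "observer-0" to include the observer as well.
print(connect_string(["0", "1", "2"]))
```

Listing several members lets a client fail over on its own when one server becomes unreachable, whereas the single client Service name relies on Kubernetes to pick a backend.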
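
7. ZooKeeper Configuration

7.1 Main Configuration File (zoo.cfg)

    # Basic settings
    # Skip ACL checks
    skipACL=yes
    # Disable standalone mode
    standaloneEnabled=false
    # Data directory
    dataDir=/data
    # AdminServer port
    admin.serverPort=8080

    # Timing (tickTime is in milliseconds; initLimit and syncLimit are counted in ticks)
    # Length of one tick (2 s)
    tickTime=2000
    # Time allowed for followers to connect and sync to the leader (10 × tickTime = 20 s)
    initLimit=10
    # Maximum lag allowed between a follower and the leader (2 × tickTime = 4 s)
    syncLimit=2

    # Performance
    # Limit on outstanding requests
    globalOutstandingLimit=1000
    # Transaction-log preallocation block size, in KB (65536 KB = 64 MB)
    preAllocSize=65536
    # Number of transactions between snapshots
    snapCount=10000
    # Number of committed requests kept in memory
    commitLogCount=500
    # Transaction-log size, in KB, that triggers a snapshot (4 GB)
    snapSizeLimitInKb=4194304

    # Connections
    # Maximum total connections (0 = unlimited)
    maxCnxns=0
    # Maximum concurrent connections from a single client IP
    maxClientCnxns=60

    # Session timeouts
    # Minimum session timeout (4 s)
    minSessionTimeout=4000
    # Maximum session timeout (40 s)
    maxSessionTimeout=40000

    # Autopurge
    # Purge interval, in hours
    autopurge.purgeInterval=1
    # Number of snapshots to retain
    autopurge.snapRetainCount=3

    # Network
    # Do not listen on all IPs for quorum traffic
    quorumListenOnAllIPs=false

    # Four-letter-command whitelist
    4lw.commands.whitelist=cons, envi, conf, crst, srvr, stat, mntr, ruok

    # Prometheus metrics
    # Export JVM information
    metricsProvider.exportJvmInfo=true
    # Metrics port
    metricsProvider.httpPort=7000
    metricsProvider.className=org.apache.zookeeper.metrics.prometheus.PrometheusMetricsProvider

    # Dynamic configuration
    # Enable dynamic reconfiguration
    reconfigEnabled=true
    # Path of the dynamic configuration file
    dynamicConfigFile=/data/zoo.cfg.dynamic

7.2 Environment Configuration (env.sh)

    #!/usr/bin/env bash
    DOMAIN=zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local
    QUORUM_PORT=2888
    LEADER_PORT=3888
    CLIENT_HOST=zookeeper-e93aadcf-client
    CLIENT_PORT=2181
    ADMIN_SERVER_HOST=zookeeper-e93aadcf-admin-server
    ADMIN_SERVER_PORT=8080
    CLUSTER_NAME=zookeeper-e93aadcf
    CLUSTER_SIZE=3

7.3 Logging Configuration (log4j.properties)

    # Root logger
    zookeeper.root.logger=CONSOLE,ROLLINGFILE
    zookeeper.console.threshold=INFO
    zookeeper.log.dir=/mnt/log
    zookeeper.log.file=server.log
    zookeeper.log.threshold=INFO

    # RollingFile appender
    # Maximum size of a single log file: 10 MB
    log4j.appender.ROLLINGFILE.MaxFileSize=10MB
    # Keep 10 rotated files
    log4j.appender.ROLLINGFILE.MaxBackupIndex=10

7.4 SASL Authentication Configuration (jaas.config)

    Server {
        org.apache.zookeeper.server.auth.DigestLoginModule required
        user_qfusion="mcdloHmiDdoI0E9";
    };
    Client {
        org.apache.zookeeper.server.auth.DigestLoginModule required
        username="qfusion"
        password="mcdloHmiDdoI0E9";
    };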
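
The timing values in zoo.cfg interact: `initLimit` and `syncLimit` are multiples of `tickTime`, and the server clamps each client's requested session timeout to the range between `minSessionTimeout` and `maxSessionTimeout`. The sketch below works through that arithmetic with the values from section 7.1. The parser and function names are illustrative helpers, not ZooKeeper code.

```python
# Derive effective timeouts from the zoo.cfg values in section 7.1
# (illustrative helpers; only the settings needed here are included).

ZOO_CFG = """
tickTime=2000
initLimit=10
syncLimit=2
minSessionTimeout=4000
maxSessionTimeout=40000
"""


def parse_properties(text: str) -> dict[str, str]:
    """Parse key=value lines, skipping blank lines and '#' comment lines."""
    props = {}
    for line in text.splitlines():
        line = line.strip()
        if not line or line.startswith("#"):
            continue
        key, _, value = line.partition("=")
        props[key.strip()] = value.strip()
    return props


def limit_ms(props: dict[str, str], name: str) -> int:
    """Convert a limit expressed in ticks (initLimit, syncLimit) to milliseconds."""
    return int(props[name]) * int(props["tickTime"])


def negotiated_session_timeout(requested_ms: int, props: dict[str, str]) -> int:
    """Clamp the client's requested timeout to the server's allowed range."""
    low = int(props["minSessionTimeout"])
    high = int(props["maxSessionTimeout"])
    return max(low, min(requested_ms, high))


props = parse_properties(ZOO_CFG)
print("initLimit (ms):", limit_ms(props, "initLimit"))
print("syncLimit (ms):", limit_ms(props, "syncLimit"))
print("session timeout for a 60 s request (ms):", negotiated_session_timeout(60000, props))
```

A client that asks for a 60-second session on this cluster therefore ends up with 40 seconds, which matters when sizing application-side reconnect logic.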
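
8. Scheduling and Affinity

8.1 Pod Anti-Affinity Rules

    podAntiAffinity:
      preferredDuringSchedulingIgnoredDuringExecution:
        - weight: 20
          podAffinityTerm:
            labelSelector:
              matchExpressions:
                - key: app
                  operator: In
                  values:
                    - zookeeper-e93aadcf
            topologyKey: kubernetes.io/hostname

Note: the scheduler tries to place ZooKeeper Pods on different nodes. This is a soft (preferred) constraint with weight 20, so Pods can still share a node when no other placement is available.

8.2 Node Distribution

| Pod | Node | Zone |
|---|---|---|
| zookeeper-e93aadcf-0-0 | qfusion1 | zone1 |
| zookeeper-e93aadcf-1-0 | qfusion4 | - |
| zookeeper-e93aadcf-2-0 | qfusion3 | - |
| zookeeper-e93aadcf-observer-0-0 | qfusion2 | - |

9. Security and Authentication

9.1 ServiceAccount

| Property | Value |
|---|---|
| Name | dboperator |
| Namespace | qfusion-admin |
| UID | aa4358dc-1705-4f1a-aec2-935076cb84a2 |

9.2 Secret

| Secret name | Type | Data keys |
|---|---|---|
| zookeeper-e93aadcf-acl | Opaque | id, digest (superuser credentials) |

9.3 Security Context

ssh container (backup container)

    privileged: true
    capabilities:
      add:
        - SYS_ADMIN
        - DAC_READ_SEARCH

10. Monitoring and Health Checks

10.1 Health Check Configuration

Liveness Probe

    exec:
      command: [zookeeperLive.sh]
    initialDelaySeconds: 10
    timeoutSeconds: 10
    periodSeconds: 10
    failureThreshold: 3

Readiness Probe

    exec:
      command: [zookeeperReady.sh]
    initialDelaySeconds: 10
    timeoutSeconds: 10
    periodSeconds: 10
    failureThreshold: 3
    successThreshold: 1

10.2 Prometheus Metrics

| Metric type | Port | Path |
|---|---|---|
| JVM Metrics | 7000 | /metrics |
| Admin Server | 8080 | /commands |

10.3 Four-Letter Commands

| Command | Description | Port |
|---|---|---|
| ruok | Checks whether the server is running | 2181 |
| srvr | Lists detailed server information | 2181 |
| stat | Prints server statistics | 2181 |
| mntr | Prints monitoring metrics | 2181 |
| cons | Lists all connected clients | 2181 |
| envi | Prints the server environment | 2181 |
| conf | Prints the configuration | 2181 |
| crst | Resets connection statistics | 2181 |

11. Operator Management

11.1 ZooKeeper Operator

| Property | Value |
|---|---|
| Pod name | zookeeper-operator-bcc44b994-wmqq6 |
| Namespace | qfusion |
| Image | k8smaster.qfusion.irds/irds/zookeeper-operator:b4f4c95e… |
| Node | qfusion3 |
| ServiceAccount | zookeeper-operator |
| CPU Request | 10m |
| CPU Limit | 500m |
| Memory Request | 10Ki |
| Memory Limit | 500Mi |

11.2 Operator Startup Arguments

    zookeeper-operator \
      -metrics-bind-address=127.0.0.1:6000

11.3 Web Management UI

| Property | Value |
|---|---|
| Service name | zookeeper-webserver |
| Namespace | qfusion |
| Type | Deployment |
| Replicas | 2 |
| Cluster IP | 246.98.8.26 |
| Port | 9090 |

12. Resource Summary

12.1 Resources per Pod

| Resource | Request | Limit |
|---|---|---|
| CPU | 750m | 1.5 cores |
| Memory | 3Gi | 3Gi |
| Storage | 20Gi | 20Gi |

12.2 Total Cluster Resources (4 Pods)

| Resource | Request | Limit |
|---|---|---|
| CPU | 3 cores | 6 cores |
| Memory | 12Gi | 12Gi |
| Storage | 80Gi | 80Gi |

12.3 Operator Resources

| Resource | Request | Limit |
|---|---|---|
| CPU | 10m | 500m |
| Memory | 10Ki | 500Mi |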
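
The per-Pod and cluster totals in section 12 follow directly from the container requests and limits in section 4.1. The following sketch reproduces that arithmetic so the tables can be rechecked if the container settings change. The data structure and helper names are illustrative, and the quantity parsers handle only the suffixes that appear in this document.

```python
# Recompute the section 12 totals from the per-container values in section 4.1.
# Illustrative helpers; only the "m", "Mi" and "Gi" suffixes are supported.


def cpu_millicores(quantity: str) -> int:
    """'500m' -> 500, '1' -> 1000."""
    return int(quantity[:-1]) if quantity.endswith("m") else int(quantity) * 1000


def memory_mib(quantity: str) -> int:
    """'2Gi' -> 2048, '500Mi' -> 500."""
    factors = {"Mi": 1, "Gi": 1024}
    return int(quantity[:-2]) * factors[quantity[-2:]]


CONTAINERS = {
    "zookeeper": {"cpu_request": "500m", "cpu_limit": "1000m",
                  "mem_request": "2Gi", "mem_limit": "2Gi"},
    "ssh": {"cpu_request": "250m", "cpu_limit": "500m",
            "mem_request": "1Gi", "mem_limit": "1Gi"},
}
PODS = 4  # 3 participants + 1 observer


def pod_totals(containers: dict) -> dict[str, int]:
    """Sum requests and limits over all containers of one Pod."""
    specs = list(containers.values())
    return {
        "cpu_request_m": sum(cpu_millicores(c["cpu_request"]) for c in specs),
        "cpu_limit_m": sum(cpu_millicores(c["cpu_limit"]) for c in specs),
        "mem_request_mib": sum(memory_mib(c["mem_request"]) for c in specs),
        "mem_limit_mib": sum(memory_mib(c["mem_limit"]) for c in specs),
    }


def cluster_totals(containers: dict, pods: int) -> dict[str, int]:
    """Scale the per-Pod totals by the number of Pods."""
    return {key: value * pods for key, value in pod_totals(containers).items()}


print(pod_totals(CONTAINERS))
print(cluster_totals(CONTAINERS, PODS))
```

The results are in millicores and MiB: 750m and 3072 MiB requested per Pod, 3000m and 12288 MiB across the four Pods, matching the 3 cores and 12Gi in section 12.2.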

13. Custom Resource (CRD)

13.1 ZookeeperCluster CR

    apiVersion: zookeeper.pravega.io/v1beta1
    kind: ZookeeperCluster
    metadata:
      name: zookeeper-e93aadcf
      namespace: qfusion-admin
    spec:
      replicas: 3                  # participant replicas
      observerReplicas: 1          # observer replicas
      maxUnavailableReplicas: 1    # maximum unavailable replicas
      image:
        repository: k8smaster.qfusion.irds/irds/zookeeper-3.7
        tag: v1.1.0
        pullPolicy: Always
      storageType: Persistence     # storage type
      iopsQuota: 2000              # IOPS quota
      persistence:
        reclaimPolicy: Delete
        spec:
          storageClassName: csi-localpv
          accessModes: [ReadWriteOnce]
          resources:
            requests:
              storage: 20Gi

13.2 Cluster Status

    status:
      replicas: 4                  # total replicas
      readyReplicas: 4             # ready replicas
      internalClientEndpoint: 246.102.223.122:2181
      externalClientEndpoint: N/A
      members:
        ready:
          - zookeeper-e93aadcf-0-0
          - zookeeper-e93aadcf-1-0
          - zookeeper-e93aadcf-2-0
          - zookeeper-e93aadcf-observer-0-0
      conditions:
        - type: PodsReady
          status: "True"
        - type: Error
          status: "False"
        - type: Upgrading
          status: "False"

14. Labels

14.1 Cluster-Level Labels

| Key | Value |
|---|---|
| AppName | zookeeper-e93aadcf |
| CreatedBy | woqutech.com |
| DBBranch | zookeeperv3.7.1 |
| DBType | zookeeper |
| ResourceUseType | Database |
| TenantId | 847798ee3db44716b6357b04e5a55c16 |
| app | zookeeper-e93aadcf |
| release | zookeeper-e93aadcf |

14.2 Additional Pod Labels

| Key | Value | Scope |
|---|---|---|
| Type | Database | All Pods |
| kind | ZookeeperMember | All Pods |
| replica | 0/1/2 | Participant Pods |
| zk_role | Participant/Observer | All Pods |
| qfusion/zone | zone1 | Replica-0 |

15. Finalizers

| Finalizer | Description |
|---|---|
| cleanUpZookeeperPVC | Cleans up PVC resources automatically when the cluster is deleted |
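
16. Connection Examples

16.1 Client Connection Strings

    # Cluster connection string (recommended)
    zookeeper-e93aadcf-client.qfusion-admin.svc.cluster.local:2181

    # Direct connection to all nodes (high availability)
    zookeeper-e93aadcf-0.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181,\
    zookeeper-e93aadcf-1.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181,\
    zookeeper-e93aadcf-2.zookeeper-e93aadcf-headless.qfusion-admin.svc.cluster.local:2181

16.2 Getting Cluster Status

    # Set KUBECONFIG
    export KUBECONFIG=/bpx/.145-admin.conf

    # Show the ZooKeeper cluster status
    kubectl get ZookeeperCluster zookeeper-e93aadcf -n qfusion-admin

    # Show Pod status
    kubectl get pods -n qfusion-admin -l app=zookeeper-e93aadcf

    # Show Services
    kubectl get svc -n qfusion-admin -l app=zookeeper-e93aadcf

    # Show PVCs
    kubectl get pvc -n qfusion-admin -l app=zookeeper-e93aadcf

    # Open a shell in a Pod (required before connecting to ZooKeeper from inside it)
    kubectl exec -it -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper -- /bin/bash

17. Operations Reference

17.1 Common Commands

    # Four-letter-command checks
    echo ruok | nc 245.0.0.94 2181    # check whether the node is healthy
    echo srvr | nc 245.0.0.94 2181    # get server details
    echo mntr | nc 245.0.0.94 2181    # get monitoring metrics
    echo cons | nc 245.0.0.94 2181    # list client connections

    # Prometheus metrics
    curl http://245.0.0.94:7000/metrics

    # AdminServer
    curl http://245.0.0.94:8080/commands

17.2 Viewing Logs

    # Logs of the main ZooKeeper container
    kubectl logs -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper

    # Follow the logs in real time
    kubectl logs -f -n qfusion-admin zookeeper-e93aadcf-0-0 -c zookeeper

    # Logs of the backup container
    kubectl logs -n qfusion-admin zookeeper-e93aadcf-0-0 -c ssh

17.3 Troubleshooting

| Symptom | Possible cause | How to investigate |
|---|---|---|
| Pod fails to start | Insufficient resources, image pull failure | kubectl describe pod |
| Connection timeouts | Network problems, firewall | kubectl get svc, endpoints |
| Data loss | PVC problems | kubectl get pvc, pv |
| Leader election fails | Network partition, node failure | Check node status and logs |

18. Appendix

18.1 Document Version

| Version | Date | Notes |
|---|---|---|
| 1.0 | 2025-12-24 | Initial version |

18.2 Related Documents

- ZooKeeper official documentation
- QFusion platform documentation
- Kubernetes StatefulSet documentation

This document was generated automatically by AI from the live configuration of Kubernetes cluster 145.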
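
The `echo mntr | nc` check in section 17.1 can also be scripted. The sketch below is a minimal example, not a supported client: it sends a four-letter command over a plain TCP socket and parses the tab-separated `mntr` reply into a dictionary. The function names are hypothetical, and it assumes the command is on the `4lw.commands.whitelist` and that the Pod IP is reachable from where the script runs.

```python
# Send a four-letter command and parse the mntr reply (illustrative helpers).
import socket


def four_letter_word(host: str, port: int, command: str, timeout: float = 5.0) -> str:
    """Send a command such as 'ruok' or 'mntr' and return the raw reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(command.encode("ascii"))
        chunks = []
        while True:
            data = sock.recv(4096)
            if not data:  # the server closes the connection after replying
                break
            chunks.append(data)
    return b"".join(chunks).decode("utf-8", errors="replace")


def parse_mntr(output: str) -> dict[str, str]:
    """Turn 'key<TAB>value' lines from mntr into a dictionary."""
    metrics = {}
    for line in output.splitlines():
        key, sep, value = line.partition("\t")
        if sep:
            metrics[key] = value
    return metrics


# Example usage against Replica 0 (needs network access to the Pod IP):
# reply = four_letter_word("245.0.0.94", 2181, "mntr")
# print(parse_mntr(reply).get("zk_server_state"))
```

Running the same call against each Pod IP from section 6.4 and comparing `zk_server_state` is a quick way to see which member is currently the leader.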