Monitoring Multiple K8s Clusters
This document was translated by ChatGPT
#1. Introduction
DeepFlow Server can serve DeepFlow Agents in multiple K8s clusters. Assuming you have already deployed DeepFlow Server in one K8s cluster, this chapter explains how to monitor other K8s clusters.
#2. Preparation
#2.1 Deployment Topology
#2.2 Ensure Different K8s Clusters Can Be Distinguished
DeepFlow uses the MD5 value of the K8s CA file to distinguish different clusters. Please check the /run/secrets/kubernetes.io/serviceaccount/ca.crt
file in the Pods of different K8s clusters to ensure that the CA files of different clusters are different.
If your different K8s clusters use the same CA file, you need to use deepflow-ctl domain create
to obtain a K8sClusterID
before deploying deepflow-agent in multiple clusters:
Note: It is uncommon for multiple K8s clusters to have the same CA file. Nevertheless, we still recommend manually connecting the deepflow-agent of other K8s clusters to the deepflow-server cluster. The advantage of manual connection is that you can customize the K8s cluster name displayed in the Grafana dashboard. You can create a custom K8s cluster domain using deepflow-ctl domain create -f custom-domain.yaml
:
# Name (you can customize the cluster name, for example, beijing-prod-k8s)
name: $CLUSTER_NAME # FIXME
# Type of cloud platform
type: kubernetes
config:
## Regional identifier (must use this default value)
#region_uuid: ffffffff-ffff-ffff-ffff-ffffffffffff
## Resource synchronization controller (it is recommended to use the default setting here)
#controller_ip: 127.0.0.1
## Maximum mask for POD subnet IPv4 addresses
#pod_net_ipv4_cidr_max_mask: 16
## Maximum mask for POD subnet IPv6 addresses
#pod_net_ipv6_cidr_max_mask: 64
## Additional routing interface connection
#node_port_name_regex: ^(cni|flannel|vxlan.calico|tunl|en[ospx])
## Synchronization interval, in seconds: minimum 1, maximum 86400, default 60
#sync_timer: 60
## View the specific information of the domain created:
deepflow-ctl domain list $CLUSTER_NAME
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
#3. Deploy deepflow-agent
Use Helm to install deepflow-agent. If the service used by deepflow-server is the default NodePort type, directly fill in the deepflow-server Node IP under deepflowServerNodeIPS
; if the service used by deepflow-server is of LoadBalancer type, directly fill in the LoadBalancer VIP.
cat << EOF > values-custom.yaml
deepflowServerNodeIPS:
- 10.1.2.3 # FIXME
- 10.4.5.6 # FIXME
clusterNAME: $CLUSTER_NAME # FIXME: $CLUSTER_NAME
deepflowK8sClusterID: # FIXME: $CLUSTER_NAME ID
EOF
helm repo add deepflow https://deepflowio.github.io/deepflow
helm repo update deepflow # use `helm repo update` when helm < 3.7.0
helm install deepflow-agent -n deepflow deepflow/deepflow-agent --create-namespace \
-f values-custom.yaml
2
3
4
5
6
7
8
9
10
11
12
cat << EOF > values-custom.yaml
image:
repository: registry.cn-beijing.aliyuncs.com/deepflow-ce/deepflow-agent
deepflowServerNodeIPS:
- 10.1.2.3 # FIXME
- 10.4.5.6 # FIXME
clusterNAME: $CLUSTER_NAME # FIXME: $CLUSTER_NAME
deepflowK8sClusterID: # FIXME: $CLUSTER_NAME ID
EOF
helm repo add deepflow https://deepflow-ce.oss-cn-beijing.aliyuncs.com/chart/stable
helm repo update deepflow # use `helm repo update` when helm < 3.7.0
helm install deepflow-agent -n deepflow deepflow/deepflow-agent --create-namespace \
-f values-custom.yaml
2
3
4
5
6
7
8
9
10
11
12
13
14
We recommend configuring the deepflowServerNodeIPS
of deepflow-agent to one or more relatively fixed Node IPs of the K8s cluster during the above deployment process.
#4. Next Steps
- Universal Service Map - Experience DeepFlow's AutoMetrics Capability
- Distributed Tracing - Experience DeepFlow's AutoTracing Capability
- Eliminate Data Silos - Learn About DeepFlow's AutoTagging and SmartEncoding Capabilities
- Say Goodbye to High Cardinality Issues - Integrate Metrics Data from Prometheus, etc.
- Full-Stack Distributed Tracing - Integrate Tracing Data from OpenTelemetry, etc.