全部操做所有用root使用者进行,高可用通常建议大于等于3台的奇数,咱们使用3台master来作高可用html
练习环境说明: 参考GitHubnode
master: kube-apiserver,kube-controller-manager,kube-scheduler,flanneldlinux
node: kubelet,kube-proxy,flannelnginx
Service_CIDR:10.254.0.0/16 服务网段,部署前路由不可达,部署后集群内部使用IP:Port可达git
Cluster_CIDR:172.30.0.0/16 pod网段,部署前路由不可达,部署后路由可达(flanneld 保证)github
主机名称 | IP地址 | 部署软件 | 备注 |
---|---|---|---|
k8s-m12 | 192.168.10.12 | keepalived+haproxy+etcd+master | master |
k8s-m13 | 192.168.10.13 | keepalived+haproxy+etcd+master | master |
k8s-m14 | 192.168.10.14 | keepalived+haproxy+etcd+master | master |
k8s-n15 | 192.168.10.15 | node+docker | work |
k8s-n16 | 192.168.10.16 | node+docker | work |
VIP | 192.168.10.100 | VIP |
kubernetes的GitHub网址 : https://github.com/kubernetes/kubernetes/releasesweb
下载Server Binaries中的 kubernetes-server-linux-amd64.tar.gz 安装包docker
下载Node Binaries中的 kubernetes-node-linux-amd64.tar.gz 安装包json
下载Client Binares中的 kubernetes-client-linux-amd64.tar.gz 安装包bootstrap
k8s各版本组件下载地址
https://github.com/kubernetes/kubernetes/tree/v1.14.3 #kubernetes wget https://storage.googleapis.com/kubernetes-release/release/v1.14.3/kubernetes-node-linux-amd64.tar.gz wget https://storage.googleapis.com/kubernetes-release/release/v1.14.3/kubernetes-client-linux-amd64.tar.gz wget https://storage.googleapis.com/kubernetes-release/release/v1.14.3/kubernetes-server-linux-amd64.tar.gz wget https://storage.googleapis.com/kubernetes-release/release/v1.14.3/kubernetes.tar.gz #etcd wget https://github.com/etcd-io/etcd/releases/download/v3.3.13/etcd-v3.3.13-linux-amd64.tar.gz #flannel wget https://github.com/coreos/flannel/releases/download/v0.11.0/flannel-v0.11.0-linux-amd64.tar.gz #cni-plugins wget https://github.com/containernetworking/plugins/releases/download/v0.8.1/cni-plugins-linux-amd64-v0.8.1.tgz #docker wget https://download.docker.com/linux/static/stable/x86_64/docker-18.09.6.tgz #cfssl wget https://pkg.cfssl.org/R1.2/cfssl_linux-amd64 wget https://pkg.cfssl.org/R1.2/cfssljson_linux-amd64 wget https://pkg.cfssl.org/R1.2/cfssl-certinfo_linux-amd64 #heapster wget https://github.com/kubernetes-retired/heapster/archive/v1.5.4.tar.gz
#1.12机器上生成密钥,无密码ssh登录 ssh-keygen -t rsa ssh-copy-id 192.168.10.13 #依次拷贝到其余节点上 #2.关闭防火墙,如下点全部机器执行 systemctl stop firewalld systemctl disable firewalld #3.关闭swap分区 swapoff -a sed -i '/ swap / s/^\(.*\)$/#\1/g' /etc/fstab #4.关闭SELinux sestatus #查看selinux状态 setenforce 0 #临时关闭selinux sed -i 's/SELINUX=enforcing/SELINUX=disabled/g' /etc/selinux/config #5.升级内核参考:https://www.cnblogs.com/fan-gx/p/11006762.html #6.修改文件句柄数 cat <<EOF >>/etc/security/limits.conf * soft nofile 65536 * hard nofile 65536 * soft nproc 65536 * hard nproc 65536 * soft memlock unlimited * hard memlock unlimited EOF #7.安装ipvs yum install ipvsadm ipset sysstat conntrack libseccomp -y #开机加载内核模块,并设置开机自动加载 cat > /etc/sysconfig/modules/ipvs.modules <<EOF #!/bin/bash modprobe -- ip_vs modprobe -- ip_vs_rr modprobe -- ip_vs_wrr modprobe -- ip_vs_sh modprobe -- nf_conntrack_ipv4 EOF #而后执行脚本 chmod 755 /etc/sysconfig/modules/ipvs.modules && bash /etc/sysconfig/modules/ipvs.modules lsmod | grep -e ip_vs -e nf_conntrack_ipv4 #8.修改系统参数 cat <<EOF > /etc/sysctl.d/k8s.conf net.bridge.bridge-nf-call-ip6tables = 1 net.bridge.bridge-nf-call-iptables = 1 net.ipv4.ip_nonlocal_bind = 1 net.ipv4.ip_forward = 1 vm.swappiness=0 EOF sysctl --system #-----------下面参考别人的---------# # cat <<EOF > /etc/sysctl.d/k8s.conf net.ipv4.tcp_keepalive_time = 600 net.ipv4.tcp_keepalive_intvl = 30 net.ipv4.tcp_keepalive_probes = 10 net.ipv6.conf.all.disable_ipv6 = 1 net.ipv6.conf.default.disable_ipv6 = 1 net.ipv6.conf.lo.disable_ipv6 = 1 net.ipv4.neigh.default.gc_stale_time = 120 net.ipv4.conf.all.rp_filter = 0 net.ipv4.conf.default.rp_filter = 0 net.ipv4.conf.default.arp_announce = 2 net.ipv4.conf.lo.arp_announce = 2 net.ipv4.conf.all.arp_announce = 2 net.ipv4.ip_forward = 1 net.ipv4.tcp_max_tw_buckets = 5000 net.ipv4.tcp_syncookies = 1 net.ipv4.tcp_max_syn_backlog = 1024 net.ipv4.tcp_synack_retries = 2 net.bridge.bridge-nf-call-ip6tables = 1 net.bridge.bridge-nf-call-iptables = 1 net.netfilter.nf_conntrack_max = 2310720 fs.inotify.max_user_watches=89100 fs.may_detach_mounts = 1 fs.file-max = 52706963 fs.nr_open = 52706963 net.bridge.bridge-nf-call-arptables = 1 vm.swappiness = 0 vm.overcommit_memory=1 vm.panic_on_oom=0 EOF #9.在生产环境建议预留内存,避免因为内存耗尽致使ssh连不上主机(32G的机器留2G,251的留3G, 500G的留5G)。下面是预留5G echo 'vm.min_free_kbytes=5000000' >> /etc/sysctl.conf sysctl -p
二进制部署方式可参考:https://www.kubernetes.org.cn/3831.html 这里为了方便直接yum安装全部节点
#1.安装yum源工具包 yum install -y yum-utils device-mapper-persistent-data lvm2 #2.下载docker-ce官方的yum源配置文件 # yum-config-manager --add-repo https://download.docker.com/linux/centos/docker-ce.repo # yum-config-manager --add-repo http://mirrors.aliyun.com/docker-ce/linux/centos/docker-ce.repo #3.禁用docker-c-edge源配edge是不开发版,不稳定,下载stable版 yum-config-manager --disable docker-ce-edge #4.更新本地YUM源缓存 yum makecache fast #5.安装Docker-ce相应版本 yum -y install docker-ce #6.配置daemon, 由于kubelet的启动环境变量要与docker的cgroup-driver驱动相同,如下是官方推荐处理方式(如今新版二进制kubelet就是cgroup了) #因为国内拉取镜像较慢,配置文件最后追加了阿里云镜像加速配置。 mkdir -p /etc/docker && cat > /etc/docker/daemon.json <<EOF { "exec-opts": ["native.cgroupdriver=systemd"], "log-driver": "json-file", "log-opts": { "max-size": "100m" }, "storage-driver": "overlay2", "storage-opts": [ "overlay2.override_kernel_check=true" ], "registry-mirrors": ["https://uyah70su.mirror.aliyuncs.com"] } EOF #7.设置开机自启动 systemctl restart docker && systemctl enable docker && systemctl status docker #8.能够先在本身电脑下来安装包,本环境安装的是18.09版本 yum install --downloadonly docker-ce-18.09 --downloaddir=/opt #yum下载docker-ce yum localinstall docker-ce -y #而后安装
etcd是用来保存集群全部状态的 Key/Value 存储系统,经常使用于服务发现、共享配置以及并发控制(如 leader 选举、分布式锁等)。kubernetes 使用 etcd 存储全部运行数据。
全部 Kubernetes 组件会经过 API Server 来跟 Etcd 进行沟通从而保存或读取资源状态。有条件的能够单独几台机器跑,不过须要配置apiserver指向etcd集群。
若是不但愿将cfssl工具安装到部署主机上,能够在其余的主机上进行该步骤,生成之后将证书拷贝到部署etcd的主机上便可。不是要证书也能够部署,etcd.service文件和etcd.conf文件不要有https的URL
wget https://pkg.cfssl.org/R1.2/cfssl_linux-amd64 -O /usr/local/bin/cfssl wget https://pkg.cfssl.org/R1.2/cfssljson_linux-amd64 -O /usr/local/bin/cfssljson wget https://pkg.cfssl.org/R1.2/cfssl-certinfo_linux-amd64 -O /usr/local/bin/cfssl-certinfo chmod +x /usr/local/bin/cfssl* #配置CA文件 mkdir /root/ssl && cd /root/ssl cat > ca-config.json <<EOF { "signing": { "default": { "expiry": "8760h" }, "profiles": { "kubernetes": { "usages": [ "signing", "key encipherment", "server auth", "client auth" ], "expiry": "8760h" } } } } EOF #2---------------------------------------------- cat > ca-csr.json <<EOF { "CN": "kubernetes", "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "k8s", "OU": "System" } ] } EOF #3-------------------------------------------------- cat > etcd-csr.json <<EOF { "CN": "etcd", "hosts": [ "127.0.0.1", "192.168.10.12", "192.168.10.13", "192.168.10.14" ], "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "k8s", "OU": "System" } ] } EOF # hosts字段的IP地址是指受权使用证书的etcd地址 #------------------------------------ cfssl gencert -initca ca-csr.json | cfssljson -bare ca cfssl gencert -ca=ca.pem -ca-key=ca-key.pem -config=ca-config.json -profile=kubernetes etcd-csr.json | cfssljson -bare etcd #生产后证书包含文件以下,共9个 ca-config.json ca.csr ca-csr.json ca-key.pem ca.pem etcd.csr etcd-csr.json etcd-key.pem etcd.pem #将生成好的etcd.pem和etcd-key.pem以及ca.pem三个文件拷贝到etcd机器上 mkdir -p /etc/kubernetes/ssl && cp *.pem /etc/kubernetes/ssl/ ssh -n 192.168.10.13 "mkdir -p /etc/kubernetes/ssl && exit" ssh -n 192.168.10.14 "mkdir -p /etc/kubernetes/ssl && exit" scp -r /etc/kubernetes/ssl/*.pem 192.168.10.13:/etc/kubernetes/ssl/ scp -r /etc/kubernetes/ssl/*.pem 192.168.10.14:/etc/kubernetes/ssl/
将下载的etcd二进制文件上传到etcd节点机器上。
#在etcd的机器上安装etcd程序 mkdir -p /var/lib/etcd tar -zxvf etcd-v3.3.13-linux-amd64.tar.gz cp etcd-v3.3.13-linux-amd64/etcd* /usr/local/bin scp etcd-v3.3.13-linux-amd64/etcd* 192.168.10.13:/usr/local/bin scp etcd-v3.3.13-linux-amd64/etcd* 192.168.10.14:/usr/local/bin #1.在12机器上建立etcd.service文件 cat <<EOF >/etc/systemd/system/etcd.service [Unit] Description=Etcd Server After=network.target After=network-online.target Wants=network-online.target Documentation=https://github.com/coreos [Service] Type=notify WorkingDirectory=/var/lib/etcd/ ExecStart=/usr/local/bin/etcd \ --name k8s-m12 \ --cert-file=/etc/kubernetes/ssl/etcd.pem \ --key-file=/etc/kubernetes/ssl/etcd-key.pem \ --trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --peer-cert-file=/etc/kubernetes/ssl/etcd.pem \ --peer-key-file=/etc/kubernetes/ssl/etcd-key.pem \ --peer-trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --initial-advertise-peer-urls https://192.168.10.12:2380 \ --listen-peer-urls https://192.168.10.12:2380 \ --listen-client-urls https://192.168.10.12:2379,http://127.0.0.1:2379 \ --advertise-client-urls https://192.168.10.12:2379 \ --initial-cluster-token etcd-cluster-0 \ --initial-cluster k8s-m12=https://192.168.10.12:2380,k8s-m13=https://192.168.10.13:2380,k8s-m14=https://192.168.10.14:2380 \ --initial-cluster-state new \ --data-dir=/var/lib/etcd Restart=on-failure RestartSec=5 LimitNOFILE=65536 [Install] WantedBy=multi-user.target EOF #2.启动etcd服务 systemctl daemon-reload && systemctl enable etcd.service && systemctl start etcd.service && systemctl status etcd
#1.在13机器上建立etcd.service文件 cat <<EOF >/etc/systemd/system/etcd.service [Unit] Description=Etcd Server After=network.target After=network-online.target Wants=network-online.target Documentation=https://github.com/coreos [Service] Type=notify WorkingDirectory=/var/lib/etcd/ ExecStart=/usr/local/bin/etcd \ --name k8s-m13 \ --cert-file=/etc/kubernetes/ssl/etcd.pem \ --key-file=/etc/kubernetes/ssl/etcd-key.pem \ --peer-cert-file=/etc/kubernetes/ssl/etcd.pem \ --peer-key-file=/etc/kubernetes/ssl/etcd-key.pem \ --trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --peer-trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --initial-advertise-peer-urls https://192.168.10.13:2380 \ --listen-peer-urls https://192.168.10.13:2380 \ --listen-client-urls https://192.168.10.13:2379,http://127.0.0.1:2379 \ --advertise-client-urls https://192.168.10.13:2379 \ --initial-cluster-token etcd-cluster-0 \ --initial-cluster k8s-m12=https://192.168.10.12:2380,k8s-m13=https://192.168.10.13:2380,k8s-m14=https://192.168.10.14:2380 \ --initial-cluster-state new \ --data-dir=/var/lib/etcd Restart=on-failure RestartSec=5 LimitNOFILE=65536 [Install] WantedBy=multi-user.target EOF #2.启动etcd服务 systemctl daemon-reload && systemctl enable etcd.service && systemctl start etcd.service && systemctl status etcd
#1.在14机器上建立etcd.service文件 cat <<EOF >/etc/systemd/system/etcd.service [Unit] Description=Etcd Server After=network.target After=network-online.target Wants=network-online.target Documentation=https://github.com/coreos [Service] Type=notify WorkingDirectory=/var/lib/etcd/ ExecStart=/usr/local/bin/etcd \ --name k8s-m14 \ --cert-file=/etc/kubernetes/ssl/etcd.pem \ --key-file=/etc/kubernetes/ssl/etcd-key.pem \ --peer-cert-file=/etc/kubernetes/ssl/etcd.pem \ --peer-key-file=/etc/kubernetes/ssl/etcd-key.pem \ --trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --peer-trusted-ca-file=/etc/kubernetes/ssl/ca.pem \ --initial-advertise-peer-urls https://192.168.10.14:2380 \ --listen-peer-urls https://192.168.10.14:2380 \ --listen-client-urls https://192.168.10.14:2379,http://127.0.0.1:2379 \ --advertise-client-urls https://192.168.10.14:2379 \ --initial-cluster-token etcd-cluster-0 \ --initial-cluster k8s-m12=https://192.168.10.12:2380,k8s-m13=https://192.168.10.13:2380,k8s-m14=https://192.168.10.14:2380 \ --initial-cluster-state new \ --data-dir=/var/lib/etcd Restart=on-failure RestartSec=5 LimitNOFILE=65536 [Install] WantedBy=multi-user.target EOF #2.启动etcd服务 systemctl daemon-reload && systemctl enable etcd.service && systemctl start etcd.service && systemctl status etcd
#1.查看集群状态 etcdctl --ca-file=/etc/kubernetes/ssl/ca.pem --cert-file=/etc/kubernetes/ssl/etcd.pem --key-file=/etc/kubernetes/ssl/etcd-key.pem cluster-health #返回以下,表明集群正常 member 1af68d968c7e3f22 is healthy: got healthy result from https://192.168.10.12:2379 member 55204c19ed228077 is healthy: got healthy result from https://192.168.10.14:2379 member e8d9a97b17f26476 is healthy: got healthy result from https://192.168.10.13:2379 cluster is healthy #2.查看集群成员 etcdctl --endpoints=https://192.168.10.12:2379,https://192.168.10.13:2379,https://192.168.10.14:2379 --ca-file=/etc/kubernetes/ssl/ca.pem --cert-file=/etc/kubernetes/ssl/etcd.pem --key-file=/etc/kubernetes/ssl/etcd-key.pem member list #返回以下结果 1af68d968c7e3f22: name=k8s-m12 peerURLs=https://192.168.10.12:2380 clientURLs=https://192.168.10.12:2379 isLeader=false 55204c19ed228077: name=k8s-m14 peerURLs=https://192.168.10.14:2380 clientURLs=https://192.168.10.14:2379 isLeader=false e8d9a97b17f26476: name=k8s-m13 peerURLs=https://192.168.10.13:2380 clientURLs=https://192.168.10.13:2379 isLeader=true
全部的节点都须要安装flannel,,主要目的是跨主机的docker可以互相通讯,也是保障kubernetes集群的网络基础和保障
#1.生产TLS证书,是让kubectl当作client证书使用,(证书只须要生成一次) cd /root/ssl cat > flanneld-csr.json <<EOF { "CN": "flanneld", "hosts": [], "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "k8s", "OU": "System" } ] } EOF #2.生成证书和私钥 cfssl gencert -ca=ca.pem -ca-key=ca-key.pem -config=ca-config.json -profile=kubernetes flanneld-csr.json | cfssljson -bare flanneld #包含如下文件 flanneld.csr flanneld-csr.json flanneld-key.pem flanneld.pem #3.而后将证书拷贝到全部节点下。 cp flanneld*.pem /etc/kubernetes/ssl scp flanneld*.pem 192.168.10.13:/etc/kubernetes/ssl scp flanneld*.pem 192.168.10.14:/etc/kubernetes/ssl scp flanneld*.pem 192.168.10.15:/etc/kubernetes/ssl scp flanneld*.pem 192.168.10.16:/etc/kubernetes/ssl
#1.开始安装flannel tar -zvxf flannel-v0.11.0-linux-amd64.tar.gz cp flanneld mk-docker-opts.sh /usr/local/bin scp flanneld mk-docker-opts.sh 192.168.10.13:/usr/local/bin scp flanneld mk-docker-opts.sh 192.168.10.14:/usr/local/bin scp flanneld mk-docker-opts.sh 192.168.10.15:/usr/local/bin scp flanneld mk-docker-opts.sh 192.168.10.16:/usr/local/bin #2.向etcd写入集群Pod网段信息,在etcd集群中任意一台执行一次便可 etcdctl \ --endpoints=https://192.168.10.12:2379,https://192.168.10.13:2379,https://192.168.10.14:2379 \ --ca-file=/etc/kubernetes/ssl/ca.pem \ --cert-file=/etc/kubernetes/ssl/flanneld.pem \ --key-file=/etc/kubernetes/ssl/flanneld-key.pem \ mk /kubernetes/network/config '{"Network":"172.30.0.0/16", "SubnetLen": 24, "Backend": {"Type": "vxlan"}}' #----获得返回信息以下,设置的网络是172.30.0.0/16,子网掩码是24位 {"Network":"172.30.0.0/16", "SubnetLen": 24, "Backend": {"Type": "vxlan"}} #2.1.列出键值存储的目录 etcdctl \ --ca-file=/etc/kubernetes/ssl/ca.pem \ --cert-file=/etc/kubernetes/ssl/flanneld.pem \ --key-file=/etc/kubernetes/ssl/flanneld-key.pem ls -r #2.2.查看键值存储 etcdctl \ --ca-file=/etc/kubernetes/ssl/ca.pem \ --cert-file=/etc/kubernetes/ssl/flanneld.pem \ --key-file=/etc/kubernetes/ssl/flanneld-key.pem get /kubernetes/network/config #2.3查看已分配pod的子网列表 etcdctl \ --ca-file=/etc/kubernetes/ssl/ca.pem \ --cert-file=/etc/kubernetes/ssl/flanneld.pem \ --key-file=/etc/kubernetes/ssl/flanneld-key.pem ls /kubernetes/network/subnets #三、建立flannel.service文件 cat > /etc/systemd/system/flannel.service << EOF [Unit] Description=Flanneld overlay address etcd agent After=network.target After=network-online.target Wants=network-online.target After=etcd.service Before=docker.service [Service] Type=notify ExecStart=/usr/local/bin/flanneld \ -etcd-cafile=/etc/kubernetes/ssl/ca.pem \ -etcd-certfile=/etc/kubernetes/ssl/flanneld.pem \ -etcd-keyfile=/etc/kubernetes/ssl/flanneld-key.pem \ -etcd-endpoints=https://192.168.10.12:2379,https://192.168.10.13:2379,https://192.168.10.14:2379 \ -etcd-prefix=/kubernetes/network ExecStartPost=/usr/local/bin/mk-docker-opts.sh -k DOCKER_NETWORK_OPTIONS -d /run/flannel/docker Restart=on-failure [Install] WantedBy=multi-user.target RequiredBy=docker.service EOF #mk-docker-opts.sh 脚本将分配给flanneld的Pod子网网段信息写入到/run/flannel/docker文件中,后续docker启动时使用这个文件中参数值设置docker0网桥。 #flanneld 使用系统缺省路由所在的接口和其它节点通讯,对于有多个网络接口的机器(如,内网和公网),能够用 -iface=enpxx 选项值指定通讯接口。 #四、启动flannel systemctl daemon-reload && systemctl enable flannel && systemctl start flannel && systemctl status flannel #5.验证flannel cat /run/flannel/docker #/run/flannel/docker是flannel分配给docker的子网信息, #显示以下 DOCKER_OPT_BIP="--bip=172.30.7.1/24" DOCKER_OPT_IPMASQ="--ip-masq=true" DOCKER_OPT_MTU="--mtu=1450" DOCKER_NETWORK_OPTIONS=" --bip=172.30.7.1/24 --ip-masq=true --mtu=1450" cat /run/flannel/subnet.env #/run/flannel/subnet.env包含了flannel整个大网段以及在此节点上的子网段 #显示以下 FLANNEL_NETWORK=172.30.0.0/16 FLANNEL_SUBNET=172.30.7.1/24 FLANNEL_MTU=1450 FLANNEL_IPMASQ=false ip add | grep flannel #查看网卡信息 4: flannel.1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1450 qdisc noqueue state UNKNOWN group default inet 172.30.7.0/32 scope global flannel.1
#1.配置docker支持flannel网络,须要在[Service]标签下新加 vim /etc/systemd/system/multi-user.target.wants/docker.service EnvironmentFile=/run/flannel/docker #这行新加内容,下面行新加$后面的内容 ExecStart=/usr/bin/dockerd -H fd:// --containerd=/run/containerd/containerd.sock $DOCKER_NETWORK_OPTIONS #2.重启docker,而后能够查看到已分配pod的子网列表 systemctl daemon-reload && systemctl restart docker && systemctl status docker ip add | grep docker #docker0网口IP地址,已改变 4: docker0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc noqueue state DOWN group default inet 172.30.7.1/24 brd 172.30.7.255 scope global docker0
tar -zxvf cni-plugins-linux-amd64-v0.8.1.tgz -C /opt/cni mkdir -p /etc/cni/net.d cat > /etc/cni/net.d/10-default.conf <<EOF { "name": "flannel", "type": "flannel", "delegate": { "bridge": "docker0", "isDefaultGateway": true, "mtu": 1400 } } EOF #把相关指令和文件拷贝到全部主机 scp /opt/cni/* 192.168.10.13:/usr/local/bin && scp /etc/cni/net.d/* 192.168.10.13:/etc/cni/net.d/
keepalived 提供 kube-apiserver 对外服务的 VIP;haproxy 监听 VIP,后端链接全部 kube-apiserver 实例,提供健康检查和负载均衡功能;
本文档复用 master 节点的三台机器,haproxy 监听的端口(8443) 须要与 kube-apiserver 的端口 6443 不一样,避免冲突。
keepalived 在运行过程当中周期检查本机的 haproxy 进程状态,若是检测到 haproxy 进程异常,则触发从新选主的过程,VIP 将飘移到新选出来的主节点,从而实现 VIP 的高可用。全部组件(如 kubeclt、apiserver、controller-manager、scheduler 等)都经过 VIP 和 haproxy 监听的 8443 端口访问 kube-apiserver 服务。
yum install -y haproxy #12机器上配置 cat << EOF > /etc/haproxy/haproxy.cfg global log 127.0.0.1 local2 chroot /var/lib/haproxy pidfile /var/run/haproxy.pid maxconn 4000 user haproxy group haproxy daemon defaults mode tcp log global retries 3 timeout connect 10s timeout client 1m timeout server 1m listen admin_stats bind 0.0.0.0:9090 mode http log 127.0.0.1 local0 err stats refresh 30s stats uri /status stats realm welcome login\ Haproxy stats auth admin:123456 stats hide-version stats admin if TRUE frontend kubernetes bind *:8443 mode tcp default_backend kubernetes-master backend kubernetes-master balance roundrobin server k8s-m12 192.168.10.12:6443 check maxconn 2000 server k8s-m13 192.168.10.13:6443 check maxconn 2000 server k8s-m14 192.168.10.14:6443 check maxconn 2000 EOF #13 和 14机器上配置都同样 # 启动haproxy systemctl enable haproxy && systemctl start haproxy && systemctl status haproxy
yum install -y keepalived #10.12机器上配置 cat <<EOF > /etc/keepalived/keepalived.conf global_defs { router_id LVS_k8s } vrrp_script CheckK8sMaster { script "curl -k https://192.168.10.100:8443" interval 3 timeout 9 fall 2 rise 2 } vrrp_instance VI_1 { state MASTER interface ens33 virtual_router_id 100 priority 100 advert_int 1 mcast_src_ip 192.168.10.12 nopreempt authentication { auth_type PASS auth_pass fana123 } unicast_peer { 192.168.10.13 192.168.10.14 } virtual_ipaddress { 192.168.10.100/24 } track_script { CheckK8sMaster } } EOF #13机器keepalived配置 cat <<EOF > /etc/keepalived/keepalived.conf global_defs { router_id LVS_k8s } vrrp_script CheckK8sMaster { script "curl -k https://192.168.10.100:8443" interval 3 timeout 9 fall 2 rise 2 } vrrp_instance VI_1 { state BACKUP interface ens33 virtual_router_id 100 priority 90 advert_int 1 mcast_src_ip 192.168.10.13 nopreempt authentication { auth_type PASS auth_pass fana123 } unicast_peer { 192.168.10.12 192.168.10.14 } virtual_ipaddress { 192.168.10.100/24 } track_script { CheckK8sMaster } } EOF #14机器上keepalived配置 cat <<EOF > /etc/keepalived/keepalived.conf global_defs { router_id LVS_k8s } vrrp_script CheckK8sMaster { script "curl -k https://192.168.10.100:8443" interval 3 timeout 9 fall 2 rise 2 } vrrp_instance VI_1 { state BACKUP interface ens33 virtual_router_id 100 priority 80 advert_int 1 mcast_src_ip 192.168.10.14 nopreempt authentication { auth_type PASS auth_pass fana123 } unicast_peer { 192.168.10.12 192.168.10.13 } virtual_ipaddress { 192.168.10.100/24 } track_script { CheckK8sMaster } } EOF #启动keepalived systemctl restart keepalived && systemctl enable keepalived && systemctl status keepalived #查看vip ip add | grep 10.100
kube-scheduler,kube-controller-manager 和 kube-apiserver 三者的功能紧密相关;同时kube-scheduler 和 kube-controller-manager 只能有一个进程处于工做状态,若是运行多个,则须要经过选举产生一个 leader;
kubectl 是 kubernetes 集群的命令行管理工具,默认从 ~/.kube/config
文件读取 kube-apiserver 地址、证书、用户名等信息,若是没有配置,执行 kubectl 命令时可能会出错。~/.kube/config
只须要部署一次,而后拷贝到其余的master。
#1.解压命令 tar -zxvf kubernetes-server-linux-amd64.tar.gz cd kubernetes/server/bin/ cp kube-apiserver kubeadm kube-controller-manager kubectl kube-scheduler /usr/local/bin scp kube-apiserver kubeadm kube-controller-manager kubectl kube-scheduler 192.168.10.13:/usr/local/bin scp kube-apiserver kubeadm kube-controller-manager kubectl kube-scheduler 192.168.10.14:/usr/local/bin #2.建立CA证书 cd /root/ssl cat > admin-csr.json <<EOF { "CN": "admin", "hosts": [], "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "system:masters", "OU": "System" } ] } EOF #3.生成证书和私钥 cfssl gencert -ca=ca.pem \ -ca-key=ca-key.pem \ -config=ca-config.json \ -profile=kubernetes admin-csr.json | cfssljson -bare admin #4.建立~/.kube/config文件 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=kubectl.kubeconfig #4.1.设置客户端认证参数 kubectl config set-credentials admin \ --client-certificate=admin.pem \ --client-key=admin-key.pem \ --embed-certs=true \ --kubeconfig=kubectl.kubeconfig #4.2.设置上下文参数 kubectl config set-context kubernetes \ --cluster=kubernetes \ --user=admin \ --kubeconfig=kubectl.kubeconfig #4.3.设置默认上下文 kubectl config use-context kubernetes --kubeconfig=kubectl.kubeconfig #4.4.拷贝kubectl.kubeconfig文件 cp kubectl.kubeconfig ~/.kube/config scp kubectl.kubeconfig 192.168.10.13:/root/.kube/config scp kubectl.kubeconfig 192.168.10.14:/root/.kube/config cp admin*.pem /etc/kubernetes/ssl/ scp admin*.pem 192.168.10.13:/etc/kubernetes/ssl/ scp admin*.pem 192.168.10.14:/etc/kubernetes/ssl/
#1.建立CA证书,hosts字段指定受权使用该证书的IP或域名列表,这里列出了VIP/apiserver节点IP/kubernetes服务IP和域名 cd /root/ssl cat > kubernetes-csr.json <<EOF { "CN": "kubernetes", "hosts": [ "127.0.0.1", "192.168.10.12", "192.168.10.13", "192.168.10.14", "192.168.10.100", "10.254.0.1", "kubernetes", "kubernetes.default", "kubernetes.default.svc", "kubernetes.default.svc.cluster", "kubernetes.default.svc.cluster.local" ], "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "k8s", "OU": "System" } ] } EOF #2.生成证书和私钥 cfssl gencert -ca=ca.pem \ -ca-key=ca-key.pem \ -config=ca-config.json \ -profile=kubernetes kubernetes-csr.json | cfssljson -bare kubernetes #3.将证书拷贝到其余master节点 cp kubernetes*.pem /etc/kubernetes/ssl/ scp kubernetes*.pem 192.168.10.13:/etc/kubernetes/ssl/ scp kubernetes*.pem 192.168.10.14:/etc/kubernetes/ssl/ #4.建立加密配置文件 cat > encryption-config.yaml <<EOF kind: EncryptionConfig apiVersion: v1 resources: - resources: - secrets providers: - aescbc: keys: - name: key1 secret: $(head -c 32 /dev/urandom | base64) - identity: {} EOF #4.1建立kube-apiserver使用的客户端令牌文件 cat <<EOF > bootstrap-token.csv $(head -c 32 /dev/urandom | base64),kubelet-bootstrap,10001,"system:kubelet-bootstrap" EOF #5.将加密文件拷贝到其余master节点 cp encryption-config.yaml bootstrap-token.csv /etc/kubernetes/ssl scp encryption-config.yaml bootstrap-token.csv 192.168.10.13:/etc/kubernetes/ssl scp encryption-config.yaml bootstrap-token.csv 192.168.10.14:/etc/kubernetes/ssl #6.建立kube-apiserver.service文件 cat > /etc/systemd/system/kube-apiserver.service << EOF [Unit] Description=Kubernetes API Server Documentation=https://github.com/GoogleCloudPlatform/kubernetes After=network.target [Service] ExecStart=/usr/local/bin/kube-apiserver \ --enable-admission-plugins=NamespaceLifecycle,NodeRestriction,LimitRanger,ServiceAccount,DefaultStorageClass,ResourceQuota \ --anonymous-auth=false \ --experimental-encryption-provider-config=/etc/kubernetes/ssl/encryption-config.yaml \ --advertise-address=0.0.0.0 \ --bind-address=0.0.0.0 \ --insecure-bind-address=127.0.0.1 \ --secure-port=6443 \ --insecure-port=0 \ --authorization-mode=Node,RBAC \ --runtime-config=api/all \ --enable-bootstrap-token-auth \ --service-cluster-ip-range=10.254.0.0/16 \ --service-node-port-range=30000-32700 \ --tls-cert-file=/etc/kubernetes/ssl/kubernetes.pem \ --tls-private-key-file=/etc/kubernetes/ssl/kubernetes-key.pem \ --client-ca-file=/etc/kubernetes/ssl/ca.pem \ --kubelet-client-certificate=/etc/kubernetes/ssl/kubernetes.pem \ --kubelet-client-key=/etc/kubernetes/ssl/kubernetes-key.pem \ --service-account-key-file=/etc/kubernetes/ssl/ca-key.pem \ --etcd-cafile=/etc/kubernetes/ssl/ca.pem \ --etcd-certfile=/etc/kubernetes/ssl/kubernetes.pem \ --etcd-keyfile=/etc/kubernetes/ssl/kubernetes-key.pem \ --etcd-servers=https://192.168.10.12:2379,https://192.168.10.13:2379,https://192.168.10.14:2379 \ --enable-swagger-ui=true \ --allow-privileged=true \ --apiserver-count=3 \ --audit-log-maxage=30 \ --audit-log-maxbackup=3 \ --audit-log-maxsize=100 \ --audit-log-path=/var/log/kubernetes/kube-apiserver-audit.log \ --event-ttl=1h \ --alsologtostderr=true \ --logtostderr=false \ --log-dir=/var/log/kubernetes \ --v=2 Restart=on-failure RestartSec=5 Type=notify LimitNOFILE=65536 [Install] WantedBy=multi-user.target EOF mkdir -p /var/log/kubernetes #建立日志目录而后拷贝到其余master scp /etc/systemd/system/kube-apiserver.service 192.168.10.13:/etc/systemd/system/ scp /etc/systemd/system/kube-apiserver.service 192.168.10.14:/etc/systemd/system/ #7.启动服务 systemctl daemon-reload && systemctl enable kube-apiserver && systemctl start kube-apiserver && systemctl status kube-apiserver #8.授予kubernetes证书访问kubelet api权限。在执行kubectl exec、run、logs 等命令时,apiserver会转发到kubelet。这里定义 RBAC规则,受权apiserver调用kubelet API。 kubectl create clusterrolebinding kube-apiserver:kubelet-apis --clusterrole=system:kubelet-api-admin --user kubernetes #8.1预约义的ClusterRole system:kubelet-api-admin授予访问kubelet全部 API 的权限: kubectl describe clusterrole system:kubelet-api-admin #9.检查api-server和集群状态 netstat -ptln | grep kube-apiserver tcp 0 0 192.168.10.12:6443 0.0.0.0:* LISTEN 13000/kube-apiserve kubectl cluster-info #显示以下 Kubernetes master is running at https://192.168.10.100:8443 To further debug and diagnose cluster problems, use 'kubectl cluster-info dump'. kubectl get all --all-namespaces #显示以下 NAMESPACE NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE default service/kubernetes ClusterIP 10.254.0.1 <none> 443/TCP 12m kubectl get componentstatuses #显示以下,因scheduler和controller-manager尚未部署 NAME STATUS MESSAGE ERROR scheduler Unhealthy Get http://127.0.0.1:10251/healthz: dial tcp 127.0.0.1:10251: connect: connection refused controller-manager Unhealthy Get http://127.0.0.1:10252/healthz: dial tcp 127.0.0.1:10252: connect: connection refused etcd-2 Healthy {"health":"true"} etcd-1 Healthy {"health":"true"} etcd-0 Healthy {"health":"true"}
该集群包含 3 个节点,启动后将经过竞争选举机制产生一个 leader 节点,其它节点为阻塞状态。当 leader 节点不可用后,剩余节点将再次进行选举产生新的 leader 节点,从而保证服务的可用性。
#1.建立CA证书 cd /root/ssl cat > kube-controller-manager-csr.json << EOF { "CN": "system:kube-controller-manager", "key": { "algo": "rsa", "size": 2048 }, "hosts": [ "127.0.0.1", "192.168.10.12", "192.168.10.13", "192.168.10.14" ], "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "system:kube-controller-manager", "OU": "System" } ] } EOF #2.生成证书 cfssl gencert -ca=ca.pem \ -ca-key=ca-key.pem \ -config=ca-config.json \ -profile=kubernetes kube-controller-manager-csr.json | cfssljson -bare kube-controller-manager #3.将证书拷贝到其余master节点 cp kube-controller-manager*.pem /etc/kubernetes/ssl/ scp kube-controller-manager*.pem 192.168.10.13:/etc/kubernetes/ssl/ scp kube-controller-manager*.pem 192.168.10.14:/etc/kubernetes/ssl/ #4.建立kubeconfig文件 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=kube-controller-manager.kubeconfig kubectl config set-credentials system:kube-controller-manager \ --client-certificate=kube-controller-manager.pem \ --client-key=kube-controller-manager-key.pem \ --embed-certs=true \ --kubeconfig=kube-controller-manager.kubeconfig kubectl config set-context system:kube-controller-manager \ --cluster=kubernetes \ --user=system:kube-controller-manager \ --kubeconfig=kube-controller-manager.kubeconfig kubectl config use-context system:kube-controller-manager --kubeconfig=kube-controller-manager.kubeconfig #5.拷贝kube-controller-manager.kubeconfig到其余master节点 cp kube-controller-manager.kubeconfig /etc/kubernetes/ssl/ scp kube-controller-manager.kubeconfig 192.168.10.13:/etc/kubernetes/ssl/ scp kube-controller-manager.kubeconfig 192.168.10.14:/etc/kubernetes/ssl/ #6.建立kube-controller-manager.service文件 cat > /etc/systemd/system/kube-controller-manager.service << EOF [Unit] Description=Kubernetes Controller Manager Documentation=https://github.com/GoogleCloudPlatform/kubernetes [Service] ExecStart=/usr/local/bin/kube-controller-manager \ --address=127.0.0.1 \ --master=https://192.168.10.100:8443 \ --kubeconfig=/etc/kubernetes/ssl/kube-controller-manager.kubeconfig \ --allocate-node-cidrs=true \ --authentication-kubeconfig=/etc/kubernetes/ssl/kube-controller-manager.kubeconfig \ --service-cluster-ip-range=10.254.0.0/16 \ --cluster-cidr=172.30.0.0/16 \ --cluster-name=kubernetes \ --cluster-signing-cert-file=/etc/kubernetes/ssl/ca.pem \ --cluster-signing-key-file=/etc/kubernetes/ssl/ca-key.pem \ --experimental-cluster-signing-duration=8760h \ --leader-elect=true \ --feature-gates=RotateKubeletServerCertificate=true \ --controllers=*,bootstrapsigner,tokencleaner \ --horizontal-pod-autoscaler-use-rest-clients=true \ --horizontal-pod-autoscaler-sync-period=10s \ --tls-cert-file=/etc/kubernetes/ssl/kube-controller-manager.pem \ --tls-private-key-file=/etc/kubernetes/ssl/kube-controller-manager-key.pem \ --service-account-private-key-file=/etc/kubernetes/ssl/ca-key.pem \ --root-ca-file=/etc/kubernetes/ssl/ca.pem \ --use-service-account-credentials=true \ --alsologtostderr=true \ --logtostderr=false \ --log-dir=/var/log/kubernetes \ --v=2 Restart=on Restart=on-failure RestartSec=5 [Install] WantedBy=multi-user.target EOF #7.拷贝到其余master节点,而后启动服务 scp /etc/systemd/system/kube-controller-manager.service 192.168.10.13:/etc/systemd/system/ scp /etc/systemd/system/kube-controller-manager.service 192.168.10.14:/etc/systemd/system/ systemctl daemon-reload && systemctl enable kube-controller-manager && systemctl start kube-controller-manager && systemctl status kube-controller-manager #8.检查服务 netstat -lnpt|grep kube-controll tcp 0 0 127.0.0.1:10252 0.0.0.0:* LISTEN 14492/kube-controll tcp6 0 0 :::10257 :::* LISTEN 14492/kube-controll kubectl get cs #显示以下 NAME STATUS MESSAGE ERROR scheduler Unhealthy Get http://127.0.0.1:10251/healthz: dial tcp 127.0.0.1:10251: connect: connection refused controller-manager Healthy ok etcd-1 Healthy {"health":"true"} etcd-2 Healthy {"health":"true"} etcd-0 Healthy {"health":"true"} #检查leader所在机器 kubectl get endpoints kube-controller-manager --namespace=kube-system -o yaml #显示以下,k8s-m12选为leader apiVersion: v1 kind: Endpoints metadata: annotations: control-plane.alpha.kubernetes.io/leader: '{"holderIdentity":"k8s-m12_6f9b09e6-995b-11e9-b2bf-000c29959a05","leaseDurationSeconds":15,"acquireTime":"2019-06-28T04:16:00Z","renewTime":"2019-06-28T04:21:32Z","leaderTransitions":0}' creationTimestamp: "2019-06-28T04:16:00Z" name: kube-controller-manager namespace: kube-system resourceVersion: "1481" selfLink: /api/v1/namespaces/kube-system/endpoints/kube-controller-manager uid: 6f9d838f-995b-11e9-9cb7-000c29959a05
关于 controller 权限和 use-service-account-credentials 参数
kublet 认证和受权
该集群包含 3 个节点,启动后将经过竞争选举机制产生一个 leader 节点,其它节点为阻塞状态。当 leader 节点不可用后,剩余节点将再次进行选举产生新的 leader 节点,从而保证服务的可用性
#1.建立CA证书 cd /root/ssl cat > kube-scheduler-csr.json << EOF { "CN": "system:kube-scheduler", "hosts": [ "127.0.0.1", "192.168.10.12", "192.168.10.13", "192.168.10.14" ], "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "system:kube-scheduler", "OU": "System" } ] } EOF #2.生成证书 cfssl gencert -ca=ca.pem \ -ca-key=ca-key.pem \ -config=ca-config.json \ -profile=kubernetes kube-scheduler-csr.json | cfssljson -bare kube-scheduler #3.建立kube-scheduler.kubeconfig文件 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=kube-scheduler.kubeconfig kubectl config set-credentials system:kube-scheduler \ --client-certificate=kube-scheduler.pem \ --client-key=kube-scheduler-key.pem \ --embed-certs=true \ --kubeconfig=kube-scheduler.kubeconfig kubectl config set-context system:kube-scheduler \ --cluster=kubernetes \ --user=system:kube-scheduler \ --kubeconfig=kube-scheduler.kubeconfig kubectl config use-context system:kube-scheduler --kubeconfig=kube-scheduler.kubeconfig #4.拷贝kubeconfig到其余master节点 cp kube-scheduler.kubeconfig kube-scheduler*.pem /etc/kubernetes/ssl/ scp kube-scheduler.kubeconfig kube-scheduler*.pem 192.168.10.13:/etc/kubernetes/ssl/ scp kube-scheduler.kubeconfig kube-scheduler*.pem 192.168.10.14:/etc/kubernetes/ssl/ #5.建立kube-scheduler.service文件 cat > /etc/systemd/system/kube-scheduler.service << EOF [Unit] Description=Kubernetes Scheduler Documentation=https://github.com/GoogleCloudPlatform/kubernetes [Service] ExecStart=/usr/local/bin/kube-scheduler \ --address=127.0.0.1 \ --master=https://192.168.10.100:8443 \ --kubeconfig=/etc/kubernetes/ssl/kube-scheduler.kubeconfig \ --leader-elect=true \ --alsologtostderr=true \ --logtostderr=false \ --log-dir=/var/log/kubernetes \ --v=2 Restart=on-failure RestartSec=5 [Install] WantedBy=multi-user.target EOF #6.将kube-scheduler.service拷贝到其余master节点,而后启动服务 scp /etc/systemd/system/kube-scheduler.service 192.168.10.13:/etc/systemd/system scp /etc/systemd/system/kube-scheduler.service 192.168.10.14:/etc/systemd/system systemctl daemon-reload && systemctl enable kube-scheduler && systemctl start kube-scheduler && systemctl status kube-scheduler #7.检查服务 netstat -lnpt|grep kube-sche tcp 0 0 127.0.0.1:10251 0.0.0.0:* LISTEN 15137/kube-schedule tcp6 0 0 :::10259 :::* LISTEN 15137/kube-schedule kubectl get cs #显示以下 NAME STATUS MESSAGE ERROR scheduler Healthy ok controller-manager Healthy ok etcd-2 Healthy {"health":"true"} etcd-1 Healthy {"health":"true"} etcd-0 Healthy {"health":"true"} kubectl get endpoints kube-scheduler --namespace=kube-system -o yaml #显示以下,k8s-m12选为leader apiVersion: v1 kind: Endpoints metadata: annotations: control-plane.alpha.kubernetes.io/leader: '{"holderIdentity":"k8s-m12_1c3f7882-995f-11e9-a5c1-000c29959a05","leaseDurationSeconds":15,"acquireTime":"2019-06-28T04:42:19Z","renewTime":"2019-06-28T04:45:18Z","leaderTransitions":0}' creationTimestamp: "2019-06-28T04:42:19Z" name: kube-scheduler namespace: kube-system resourceVersion: "2714" selfLink: /api/v1/namespaces/kube-system/endpoints/kube-scheduler uid: 1cda2b3a-995f-11e9-ac7d-000c2928fce6
kubectl get componentstatuses NAME STATUS MESSAGE ERROR controller-manager Healthy ok scheduler Healthy ok etcd-2 Healthy {"health":"true"} etcd-1 Healthy {"health":"true"} etcd-0 Healthy {"health":"true"}
node节点运行kubelet kube-proxy docker flannel。
kubelet运行在每一个 worker 节点上,接收 kube-apiserver 发送的请求,管理 Pod 容器,执行交互式命令,如 exec、run、logs 等。kubelet 启动时自动向 kube-apiserver注册节点信息,内置的 cadvisor 统计和监控节点的资源使用状况。
#1.解压包,拷贝命令 tar -zxvf kubernetes-node-linux-amd64.tar.gz cd /opt/kubernetes/node/bin cp kubectl kubelet kube-proxy /usr/local/bin scp kubectl kubelet kube-proxy 192.168.10.16:/usr/local/bin #2.建立kubelet-bootstrap.kubeconfig文件(也是在12机器上执行)要建立3次分别是(k8s-m12,k8s-m13,k8s-m14) #2.1.建立 token cd /root/ssl export BOOTSTRAP_TOKEN=$(kubeadm token create \ --description kubelet-bootstrap-token \ --groups system:bootstrappers:k8s-m12 \ --kubeconfig ~/.kube/config) #2.2.设置集群参数 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=kubelet-bootstrap-k8s-m12.kubeconfig #2.3.设置客户端认证参数 kubectl config set-credentials kubelet-bootstrap \ --token=${BOOTSTRAP_TOKEN} \ --kubeconfig=kubelet-bootstrap-k8s-m12.kubeconfig #2.4.设置上下文参数 kubectl config set-context default \ --cluster=kubernetes \ --user=kubelet-bootstrap \ --kubeconfig=kubelet-bootstrap-k8s-m12.kubeconfig #2.5.设置默认上下文 kubectl config use-context default --kubeconfig=kubelet-bootstrap-k8s-m12.kubeconfig #3.查看kubeadm为各节点建立的token kubeadm token list --kubeconfig ~/.kube/config #显示以下 11rq5j.3f628cf6ura1hf2x 20h 2019-06-29T13:01:52+08:00 authentication,signing kubelet-bootstrap-token system:bootstrappers:k8s-m14 8zamvk.rfat8wyzh8311f89 20h 2019-06-29T12:59:26+08:00 authentication,signing kubelet-bootstrap-token system:bootstrappers:k8s-m12 lhxalz.busnf6izk82e0xqx 20h 2019-06-29T13:01:03+08:00 authentication,signing kubelet-bootstrap-token system:bootstrappers:k8s-m13 #3.1.r若是须要删除建立的token kubeadm token --kubeconfig ~/.kube/config delete lhxalz.busnf6izk82e0xqx # 建立的token有效期为 1 天,超期后将不能再被使用,且会被kube-controller-manager的tokencleaner清理(若是启用该 controller 的话)。 # kube-apiserver接收kubelet的bootstrap token后,将请求的user设置为system:bootstrap;group设置为 system:bootstrappers; #3.2.查看各token关联的secret kubectl get secrets -n kube-system #4.拷贝bootstrap kubeconfig文件到各个node机器上 scp kubelet-bootstrap-kube12.kubeconfig 192.168.10.15:/etc/kubernetes/ssl/kubelet-bootstrap.kubeconfig scp kubelet-bootstrap-kube12.kubeconfig 192.168.10.16:/etc/kubernetes/ssl/kubelet-bootstrap.kubeconfig #5.建立kubelet配置文件 cd /root/ssl cat > kubelet.config.json <<EOF { "kind": "KubeletConfiguration", "apiVersion": "kubelet.config.k8s.io/v1beta1", "authentication": { "x509": { "clientCAFile": "/etc/kubernetes/ssl/ca.pem" }, "webhook": { "enabled": true, "cacheTTL": "2m0s" }, "anonymous": { "enabled": false } }, "authorization": { "mode": "Webhook", "webhook": { "cacheAuthorizedTTL": "5m0s", "cacheUnauthorizedTTL": "30s" } }, "address": "0.0.0.0", "port": 10250, "readOnlyPort": 0, "cgroupDriver": "cgroupfs", "hairpinMode": "promiscuous-bridge", "serializeImagePulls": false, "featureGates": { "RotateKubeletClientCertificate": true, "RotateKubeletServerCertificate": true }, "clusterDomain": "cluster.local", "clusterDNS": ["10.254.0.2"] } EOF #6.拷贝到其余主机,注意,能够修改address为本机IP地址 cp kubelet.config.json /etc/kubernetes/ssl scp kubelet.config.json 192.168.10.15:/etc/kubernetes/ssl scp kubelet.config.json 192.168.10.16:/etc/kubernetes/ssl #7.建立kubelet.service文件 mkdir -p /var/log/kubernetes && mkdir -p /var/lib/kubelet #先建立目录 cat <<EOF > /etc/systemd/system/kubelet.service [Unit] Description=Kubernetes Kubelet Documentation=https://github.com/GoogleCloudPlatform/kubernetes After=docker.service Requires=docker.service [Service] WorkingDirectory=/var/lib/kubelet ExecStart=/usr/local/bin/kubelet \ --bootstrap-kubeconfig=/etc/kubernetes/ssl/kubelet-bootstrap.kubeconfig \ --cert-dir=/etc/kubernetes/ssl \ --network-plugin=cni \ --cni-conf-dir=/etc/cni/net.d \ --cni-bin-dir=/usr/local/bin/ \ --fail-swap-on=false \ --kubeconfig=/etc/kubernetes/ssl/kubelet.kubeconfig \ --config=/etc/kubernetes/ssl/kubelet.config.json \ --hostname-override=192.168.10.15 \ --pod-infra-container-image=registry.cn-hangzhou.aliyuncs.com/google_containers/pause-amd64:3.1 \ --allow-privileged=true \ --alsologtostderr=true \ --logtostderr=false \ --cgroup-driver=systemd \ --log-dir=/var/log/kubernetes \ --v=2 Restart=on-failure RestartSec=5 [Install] WantedBy=multi-user.target EOF #拷贝到其余主机,注意修改hostname-override为本机IP地址 #8.Bootstrap Token Auth 和授予权限 ,须要先将bootstrap-token文件中的kubelet-bootstrap用户赋予system:node-bootstrapper角色,而后kubelet才有权限建立认证请求 kubectl create clusterrolebinding kubelet-bootstrap --clusterrole=system:node-bootstrapper --group=system:bootstrappers #9.启动kubele服务 systemctl daemon-reload && systemctl enable kubelet && systemctl restart kubelet && systemctl status kubelet #10.检查服务 netstat -lantp|grep kubelet tcp 0 0 192.168.10.15:46936 192.168.10.100:8443 ESTABLISHED 15299/kubelet #8.经过kubelet 的TLS 证书请求,kubelet 首次启动时向kube-apiserver 发送证书签名请求,必须经过后kubernetes 系统才会将该 Node 加入到集群。查看未受权的CSR 请求 kubectl get csr NAME AGE REQUESTOR CONDITION node-csr-ZyWLfyY4nBb1GPBCCNGf2pCjbFKGHt04q50R1_3oprU 16m system:bootstrap:rhwf4g Pending node-csr-hiZbOHizDYsE_n36kfuSxWTmUzobCEnCpIXfN54Lh6Y 18m system:bootstrap:rhwf4g Pending
approve kubelet csr请求
#1.手动approve csr请求(推荐自动的方式) kubectl certificate approve node-csr-ZyWLfyY4nBb1GPBCCNGf2pCjbFKGHt04q50R1_3oprU #手动建立 #显示以下 certificatesigningrequest.certificates.k8s.io/node-csr-ZyWLfyY4nBb1GPBCCNGf2pCjbFKGHt04q50R1_3oprU approved #1.1.查看Approve结果 kubectl describe csr node-csr-ZyWLfyY4nBb1GPBCCNGf2pCjbFKGHt04q50R1_3oprU #显示以下 Name: node-csr-ZyWLfyY4nBb1GPBCCNGf2pCjbFKGHt04q50R1_3oprU Labels: <none> Annotations: <none> CreationTimestamp: Wed, 26 Jun 2019 15:12:40 +0800 Requesting User: system:bootstrap:rhwf4g Status: Approved,Issued Subject: Common Name: system:node:192.168.10.16 Serial Number: Organization: system:nodes Events: <none> #1.2.特别多能够用这样的方式 kubectl get csr|grep 'Pending' | awk 'NR>0{print $1}'| xargs kubectl certificate approve kubectl get csr|awk 'NR==3{print $1}'| xargs kubectl describe csr #查看Approve结果 #2.自动approve csr请求(推荐),建立ClusterRoleBinding,分别用于自动 approve client、renew client、renew server 证书 cd /root/ssl cat > csr-crb.yaml <<EOF # Approve all CSRs for the group "system:bootstrappers" kind: ClusterRoleBinding apiVersion: rbac.authorization.k8s.io/v1 metadata: name: auto-approve-csrs-for-group subjects: - kind: Group name: system:bootstrappers apiGroup: rbac.authorization.k8s.io roleRef: kind: ClusterRole name: system:certificates.k8s.io:certificatesigningrequests:nodeclient apiGroup: rbac.authorization.k8s.io --- # To let a node of the group "system:bootstrappers" renew its own credentials kind: ClusterRoleBinding apiVersion: rbac.authorization.k8s.io/v1 metadata: name: node-client-cert-renewal subjects: - kind: Group name: system:bootstrappers apiGroup: rbac.authorization.k8s.io roleRef: kind: ClusterRole name: system:certificates.k8s.io:certificatesigningrequests:selfnodeclient apiGroup: rbac.authorization.k8s.io --- # A ClusterRole which instructs the CSR approver to approve a node requesting a # serving cert matching its client cert. kind: ClusterRole apiVersion: rbac.authorization.k8s.io/v1 metadata: name: approve-node-server-renewal-csr rules: - apiGroups: ["certificates.k8s.io"] resources: ["certificatesigningrequests/selfnodeserver"] verbs: ["create"] --- # To let a node of the group "system:nodes" renew its own server credentials kind: ClusterRoleBinding apiVersion: rbac.authorization.k8s.io/v1 metadata: name: node-server-cert-renewal subjects: - kind: Group name: system:nodes apiGroup: rbac.authorization.k8s.io roleRef: kind: ClusterRole name: approve-node-server-renewal-csr apiGroup: rbac.authorization.k8s.io EOF #3.拷贝到其余节点 cp csr-crb.yaml /etc/kubernetes/ssl scp csr-crb.yaml 192.168.10.13:/etc/kubernetes/ssl scp csr-crb.yaml 192.168.10.14:/etc/kubernetes/ssl #4.生效配置 kubectl apply -f /etc/kubernetes/ssl/csr-crb.yaml #5.验证 kubectl get csr #等待一段时间,查看CSR都被自动approve #显示以下 NAME AGE REQUESTOR CONDITION node-csr-cF4D5xoTEQCkK5QCsCAmsHGItlZ2cJ43RjkGXpM4BNw 38m system:bootstrap:8zamvk Approved,Issued node-csr-lUIuS1_ggYM8Q95rgsUrBawzrsAXQ4QfYcP3BbPnWl8 36m system:bootstrap:lhxalz Approved,Issued kubectl get --all-namespaces -o wide nodes #全部节点均 ready #显示以下 NAME STATUS ROLES AGE VERSION INTERNAL-IP EXTERNAL-IP OS-IMAGE KERNEL-VERSION CONTAINER-RUNTIME 192.168.10.15 Ready <none> 5m33s v1.14.3 192.168.10.15 <none> CentOS Linux 7 (Core) 4.4.103-1.el7.elrepo.x86_64 docker://18.9.6 192.168.10.16 Ready <none> 54s v1.14.3 192.168.10.16 <none> CentOS Linux 7 (Core) 4.4.103-1.el7.elrepo.x86_64 docker://18.9.6 kubectl get nodes NAME STATUS ROLES AGE VERSION 192.168.10.15 Ready <none> 6m55s v1.14.3 192.168.10.16 Ready <none> 2m16s v1.14.3 netstat -lnpt|grep kubelet tcp 0 0 127.0.0.1:10248 0.0.0.0:* LISTEN 20302/kubelet tcp 0 0 192.168.10.15:10250 0.0.0.0:* LISTEN 20302/kubelet tcp 0 0 127.0.0.1:37706 0.0.0.0:* LISTEN 20302/kubelet tcp 0 0 192.168.10.15:60332 192.168.10.100:8443 ESTABLISHED 20302/kubelet #10248: healthz http 服务,10250; https API 服务;注意:未开启只读端口 10255;因为关闭了匿名认证,同时开启了 webhook 受权,全部访问 10250 端口 https API 的请求都须要被认证和受权。
kublet api 认证和受权
kublet的配置文件kubelet.config.json配置了以下认证参数:
同时配置了以下受权参数:
# kubelet 收到请求后,使用 clientCAFile 对证书签名进行认证,或者查询 bearer token 是否有效。若是二者都没经过,则拒绝请求,提示 Unauthorized curl -s --cacert /etc/kubernetes/ssl/ca.pem https://127.0.0.1:10250/metrics curl -s --cacert /etc/kubernetes/ssl/ca.pem -H "Authorization: Bearer 123456" https://192.168.10.15:10250/metrics #经过认证后,kubelet 使用 SubjectAccessReview API 向 kube-apiserver 发送请求,查询证书或 token 对应的 user、group 是否有操做资源的权限(RBAC); #1.证书认证和受权 #权限不足的证书; curl -s --cacert /etc/kubernetes/ssl/ca.pem --cert /etc/kubernetes/ssl/kube-controller-manager.pem --key /etc/kubernetes/ssl/kube-controller-manager-key.pem https://192.168.10.15:10250/metrics #使用部署 kubectl 命令行工具时建立的、具备最高权限的 admin 证书; curl -s --cacert /etc/kubernetes/ssl/ca.pem --cert /etc/kubernetes/ssl/admin.pem --key /etc/kubernetes/ssl/admin-key.pem https://192.168.10.15:10250/metrics|head #2.bear token认证和受权: # 建立一个ServiceAccount,将它和ClusterRole system:kubelet-api-admin绑定,从而具备调用kubelet API的权限: kubectl create sa kubelet-api-test kubectl create clusterrolebinding kubelet-api-test --clusterrole=system:kubelet-api-admin --serviceaccount=default:kubelet-api-test SECRET=$(kubectl get secrets | grep kubelet-api-test | awk '{print $1}') TOKEN=$(kubectl describe secret ${SECRET} | grep -E '^token' | awk '{print $2}') echo ${TOKEN} curl -s --cacert /etc/kubernetes/ssl/ca.pem -H "Authorization: Bearer ${TOKEN}" https://192.168.10.15:10250/metrics|head # cadvisor 和 metrics # cadvisor 统计所在节点各容器的资源(CPU、内存、磁盘、网卡)使用状况,分别在本身的 http web 页面(4194 端口)和 10250 以 promehteus metrics 的形式输出。 # 浏览器访问 http://192.168.10.15:4194/containers/ 能够查看到 cadvisor 的监控页面: # 浏览器访问 https://192.168.10.15:10250/metrics 和 https://192.168.10.15:10250/metrics/cadvisor 分别返回 kublet 和 cadvisor 的 metrics。
注意:kublet.config.json 设置 authentication.anonymous.enabled 为 false,不容许匿名证书访问 10250 的 https 服务;参考A.浏览器访问kube-apiserver安全端口.md,建立和导入相关证书,而后访问上面的 10250 端口;
#1.须要安装jdk而后使用keytool工具 .\keytool -import -v -trustcacerts -alias appmanagement -file "E:\ca.pem" -storepass password -keystore cacerts #2.而后在linux上执行 openssl pkcs12 -export -out admin.pfx -inkey admin-key.pem -in admin.pem -certfile ca.pem #3.而后把证书导进去,就能够正常访问了
kube-proxy 运行在全部 worker 节点上,,它监听 apiserver 中 service 和 Endpoint 的变化状况,建立路由规则来进行服务负载均衡。
#1.建立CA证书 cd /root/ssl cat > kube-proxy-csr.json <<EOF { "CN": "system:kube-proxy", "key": { "algo": "rsa", "size": 2048 }, "names": [ { "C": "CN", "ST": "ShangHai", "L": "ShangHai", "O": "k8s", "OU": "System" } ] } EOF #2.生成证书和私钥 cfssl gencert -ca=ca.pem \ -ca-key=ca-key.pem \ -config=ca-config.json \ -profile=kubernetes kube-proxy-csr.json | cfssljson -bare kube-proxy #3.建立kubeconfig文件 #3.1.设置集群参数 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=kube-proxy.kubeconfig #3.2.设置客户端认证参数 kubectl config set-credentials kube-proxy \ --client-certificate=kube-proxy.pem \ --client-key=kube-proxy-key.pem \ --embed-certs=true \ --kubeconfig=kube-proxy.kubeconfig #3.3.设置上下文参数 kubectl config set-context default \ --cluster=kubernetes \ --user=kube-proxy \ --kubeconfig=kube-proxy.kubeconfig #3.4.设置默认上下文 kubectl config use-context default --kubeconfig=kube-proxy.kubeconfig #4.拷贝到其余节点 cp kube-proxy*.pem kube-proxy.kubeconfig /etc/kubernetes/ssl/ scp kube-proxy*.pem kube-proxy.kubeconfig 192.168.10.15:/etc/kubernetes/ssl/ scp kube-proxy*.pem kube-proxy.kubeconfig 192.168.10.16:/etc/kubernetes/ssl/ #5.建立kube-proxy配置文件 cd /root/ssl cat >kube-proxy.config.yaml <<EOF apiVersion: kubeproxy.config.k8s.io/v1alpha1 bindAddress: 192.168.10.15 clientConnection: kubeconfig: /etc/kubernetes/ssl/kube-proxy.kubeconfig clusterCIDR: 172.30.0.0/16 healthzBindAddress: 192.168.10.15:10256 hostnameOverride: 192.168.10.15 kind: KubeProxyConfiguration metricsBindAddress: 192.168.10.15:10249 mode: "ipvs" EOF #6.拷贝到其余节点 cp kube-proxy.config.yaml /etc/kubernetes/ssl/ scp kube-proxy.config.yaml 192.168.10.15:/etc/kubernetes/ssl/ scp kube-proxy.config.yaml 192.168.10.16:/etc/kubernetes/ssl/ #7.建立kube-proxy.service文件,而后拷贝到其余节点 cat << EOF > /etc/systemd/system/kube-proxy.service [Unit] Description=Kubernetes Kube-Proxy Server Documentation=https://github.com/GoogleCloudPlatform/kubernetes After=network.target [Service] WorkingDirectory=/var/lib/kube-proxy ExecStart=/usr/local/bin/kube-proxy \ --config=/etc/kubernetes/ssl/kube-proxy.config.yaml \ --alsologtostderr=true \ --logtostderr=false \ --log-dir=/var/log/kubernetes/kube-proxy \ --v=2 Restart=on-failure RestartSec=5 LimitNOFILE=65536 [Install] WantedBy=multi-user.target EOF #8.启动kube-proxy服务 mkdir -p /var/lib/kube-proxy && mkdir -p /var/log/kubernetes/kube-proxy systemctl daemon-reload && systemctl enable kube-proxy && systemctl restart kube-proxy && systemctl status kube-proxy netstat -lnpt|grep kube-proxy #查看端口 ipvsadm -ln #查看ipvs路由规则 #显示以下 IP Virtual Server version 1.2.1 (size=4096) Prot LocalAddress:Port Scheduler Flags -> RemoteAddress:Port Forward Weight ActiveConn InActConn TCP 10.254.0.1:443 rr -> 192.168.10.12:6443 Masq 1 0 0 -> 192.168.10.13:6443 Masq 1 0 0 -> 192.168.10.14:6443 Masq 1 0 0
kubectl get nodes #查看节点状态 # 一、建立nginx 测试文件 cat << EOF > nginx-web.yml apiVersion: v1 kind: Service metadata: name: nginx-web labels: tier: frontend spec: type: NodePort selector: tier: frontend ports: - name: http port: 80 targetPort: 80 --- apiVersion: extensions/v1beta1 kind: Deployment metadata: name: nginx-con labels: tier: frontend spec: replicas: 3 template: metadata: labels: tier: frontend spec: containers: - name: nginx-pod image: nginx ports: - containerPort: 80 EOF #2.执行文件 kubectl create -f nginx-web.yml #显示已建立 service/nginx-web created deployment.extensions/nginx-con created #3.查看pod状态 kubectl get pod -o wide #显示以下 NAME READY STATUS RESTARTS AGE IP NODE NOMINATED NODE READINESS GATES nginx-con-7dc84bdfb6-h6bt6 1/1 Running 0 105s 172.30.85.2 192.168.10.16 <none> <none> nginx-con-7dc84bdfb6-nt5qs 1/1 Running 0 105s 172.30.34.3 192.168.10.15 <none> <none> nginx-con-7dc84bdfb6-sfg87 1/1 Running 0 105s 172.30.34.2 192.168.10.15 <none> <none> #4.测试IP是否ping通 ping -c4 172.30.34.2 PING 172.30.34.2 (172.30.34.2) 56(84) bytes of data. 64 bytes from 172.30.34.2: icmp_seq=1 ttl=63 time=0.543 ms 64 bytes from 172.30.34.2: icmp_seq=2 ttl=63 time=0.684 ms 64 bytes from 172.30.34.2: icmp_seq=3 ttl=63 time=0.886 ms 64 bytes from 172.30.34.2: icmp_seq=4 ttl=63 time=0.817 ms #5.查看server集群IP kubectl get svc #显示以下 NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE kubernetes ClusterIP 10.254.0.1 <none> 443/TCP 37h nginx-web NodePort 10.254.153.104 <none> 80:31808/TCP 4m19s # 10.254.153.104是nginx的集群IP,代理前面3个pod,80是集群IP的端口31808是nodeport端口 #6.curl访问node_ip:nodeport curl -I 192.168.10.15:31808 #状态200表示访问成功 HTTP/1.1 200 OK Server: nginx/1.17.0 Date: Sat, 29 Jun 2019 05:03:15 GMT Content-Type: text/html Content-Length: 612 Last-Modified: Tue, 21 May 2019 14:23:57 GMT Connection: keep-alive ETag: "5ce409fd-264" Accept-Ranges: bytes #7.在flannel网络主机上访问集群IP ip add | grep 10.254 inet 10.254.0.1/32 brd 10.254.0.1 scope global kube-ipvs0 inet 10.254.153.104/32 brd 10.254.153.104 scope global kube-ipvs0 curl -I http://10.254.153.104:80 #返回以下 HTTP/1.1 200 OK Server: nginx/1.17.0 Date: Sat, 29 Jun 2019 05:05:56 GMT Content-Type: text/html Content-Length: 612 Last-Modified: Tue, 21 May 2019 14:23:57 GMT Connection: keep-alive ETag: "5ce409fd-264" Accept-Ranges: bytes
插件是集群的附件组件,丰富和完善了集群的功能
#1.将kubernetes-server-linux-amd64.tar.gz解压后,再解压其中的 kubernetes-src.tar.gz 文件 tar -zxvf kubernetes-src.tar.gz -C src #coredns对应的目录是:cluster/addons/dns #2.修改配置文件 cd src/cluster/addons/dns/coredns cp coredns.yaml.base /etc/kubernetes/coredns.yaml sed -i "s/__PILLAR__DNS__DOMAIN__/cluster.local/g" /etc/kubernetes/coredns.yaml sed -i "s/__PILLAR__DNS__SERVER__/10.254.0.2/g" /etc/kubernetes/coredns.yaml #3.建立coredns kubectl create -f /etc/kubernetes/coredns.yaml #4.检查codedns功能 kubectl -n kube-system get all -o wide #显示以下 NAME READY STATUS RESTARTS AGE pod/coredns-8854569d4-5vshp 1/1 Running 0 58m # NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE service/kube-dns ClusterIP 10.254.0.2 <none> 53/UDP,53/TCP,9153/TCP 81m # NAME READY UP-TO-DATE AVAILABLE AGE deployment.apps/coredns 1/1 1 1 58m # NAME DESIRED CURRENT READY AGE replicaset.apps/coredns-8854569d4 1 1 1 58m #4.1 kubectl -n kube-system describe pod coredns #4.2 kubectl -n kube-system logs coredns-8854569d4-5vshp #5.使用容器验证 kubectl run dns-test --rm -it --image=alpine /bin/sh #进入容器 ping 百度正常 ping www.baidu.com PING www.baidu.com (182.61.200.6): 56 data bytes 64 bytes from 182.61.200.6: seq=0 ttl=127 time=41.546 ms 64 bytes from 182.61.200.6: seq=1 ttl=127 time=35.043 ms 64 bytes from 182.61.200.6: seq=2 ttl=127 time=38.977 ms 64 bytes from 182.61.200.6: seq=3 ttl=127 time=40.633 ms #查看全部集群pod kubectl get --all-namespaces pods #6.若是遇到镜像下载不下来,能够修改文件 sed -i "s/k8s.gcr.io/coredns/g" /etc/kubernetes/coredns.yaml
参考
https://github.com/kubernetes/dashboard/wiki/Access-control
https://github.com/kubernetes/dashboard/issues/2558
https://kubernetes.io/docs/concepts/configuration/organize-cluster-access-kubeconfig/
#1.将kubernetes-server-linux-amd64.tar.gz 解压后,再解压其中的 kubernetes-src.tar.gz 文件。dashboard 对应的目录是:cluster/addons/dashboard ,拷贝dashboard的文件 mkdir -p /etc/kubernetes/dashboard cp -a /opt/kubernetes/src/cluster/addons/dashboard/{dashboard-configmap.yaml,dashboard-controller.yaml,dashboard-rbac.yaml,dashboard-secret.yaml,dashboard-service.yaml} /etc/kubernetes/dashboard #2.修改配置文件 sed -i "s@image:.*@image: registry.cn-hangzhou.aliyuncs.com/google_containers/kubernetes-dashboard-amd64:v1.10.1@g" /etc/kubernetes/dashboard/dashboard-controller.yaml sed -i "/spec/a\ type: NodePort" /etc/kubernetes/dashboard/dashboard-service.yaml sed -i "/targetPort/a\ nodePort: 32700" /etc/kubernetes/dashboard/dashboard-service.yaml #3.执行全部定义文件 kubectl create -f /etc/kubernetes/dashboard #4.查看分配的NodePort kubectl -n kube-system get all -o wide # NAME READY STATUS RESTARTS AGE pod/coredns-8854569d4-5vshp 1/1 Running 0 119m pod/kubernetes-dashboard-7d5f7c58f5-mr8zn 1/1 Running 0 5m1s # NAME TYPE CLUSTER-IP EXTERNAL-IP PORT(S) AGE service/kube-dns ClusterIP 10.254.0.2 <none> 53/UDP,53/TCP,9153/TCP 142m service/kubernetes-dashboard NodePort 10.254.63.16 <none> 443:32700/TCP 51s # NAME READY UP-TO-DATE AVAILABLE AGE deployment.apps/coredns 1/1 1 1 119m deployment.apps/kubernetes-dashboard 1/1 1 1 5m4s # NAME DESIRED CURRENT READY AGE replicaset.apps/coredns-8854569d4 1 1 1 119m replicaset.apps/kubernetes-dashboard-7d5f7c58f5 1 1 1 5m4s kubectl -n kube-system describe pod kubernetes-dashboard #NodePort映射到dasrd pod 443端口; #dashboard的 --authentication-mode 支持 token、basic,默认为 token。若是使用 basic,则 kube-apiserver 必须配置 '--authorization-mode=ABAC' 和 '--basic-auth-file' 参数。 #5.查看 dashboard 支持的命令行参数 kubectl exec --namespace kube-system -it kubernetes-dashboard-7d5f7c58f5-mr8zn -- /dashboard --help #6.访问dashboard # 为了集群安全,从1.7开始,dashboard只容许经过https访问,若是使用kube proxy则必须监听localhost或 127.0.0.1,对于NodePort没有这个限制,可是仅建议在开发环境中使用。对于不知足这些条件的登陆访问,在登陆成功后浏览器不跳转,始终停在登陆界面。 参考1:https://github.com/kubernetes/dashboard/wiki/Accessing-Dashboard---1.7.X-and-above 参考2:https://github.com/kubernetes/dashboard/issues/2540 # 三种访问 dashboard 的方式 # 经过NodePort访问dashboard: # 经过kubectl proxy访问dashboard: # 经过kube-apiserver访问dashboard; #7.经过NodePort访问dashboard # kubernetes-dashboard服务暴露了NodePort,可使用http://NodeIP:NodePort地址访问dashboard; #8.经过 kubectl proxy 访问 dashboard #启动代理: kubectl proxy --address='localhost' --port=8086 --accept-hosts='^*$' # --address 必须为 localhost 或 127.0.0.1; # 须要指定 --accept-hosts 选项,不然浏览器访问 dashboard 页面时提示 “Unauthorized”; # 浏览器访问 URL:http://127.0.0.1:8086/api/v1/namespaces/kube-system/services/https:kubernetes-dashboard:/proxy #9.经过 kube-apiserver 访问 dashboard # 获取集群服务地址列表: kubectl cluster-info # 必须经过 kube-apiserver 的安全端口(https)访问 dashbaord,访问时浏览器须要使用自定义证书,不然会被 kube-apiserver 拒绝访问。 # 建立和导入自定义证书的步骤,参考:A.浏览器访问kube-apiserver安全端口 # 浏览器访问 URL:https://192.168.10.100:8443/api/v1/namespaces/kube-system/services/https:kubernetes-dashboard:/proxy #10.建立登陆 Dashboard 的 token 和 kubeconfig 配置文件 # 上面提到,Dashboard 默认只支持 token 认证,因此若是使用 KubeConfig 文件,须要在该文件中指定 token,不支持使用 client 证书认证。 # 建立登陆 token,访问 dashboard时使用 kubectl create sa dashboard-admin -n kube-system kubectl create clusterrolebinding dashboard-admin --clusterrole=cluster-admin --serviceaccount=kube-system:dashboard-admin ADMIN_SECRET=$(kubectl get secrets -n kube-system | grep dashboard-admin | awk '{print $1}') DASHBOARD_LOGIN_TOKEN=$(kubectl describe secret -n kube-system ${ADMIN_SECRET} | grep -E '^token' | awk '{print $2}') echo ${DASHBOARD_LOGIN_TOKEN} #使用输出的 token 登陆 Dashboard。 #建立使用 token 的 KubeConfig 文件 cd /root/ssl #设置集群参数 kubectl config set-cluster kubernetes \ --certificate-authority=ca.pem \ --embed-certs=true \ --server=https://192.168.10.100:8443 \ --kubeconfig=dashboard.kubeconfig #设置客户端认证参数,使用上面建立的 Token kubectl config set-credentials dashboard_user \ --token=${DASHBOARD_LOGIN_TOKEN} \ --kubeconfig=dashboard.kubeconfig #设置上下文参数 kubectl config set-context default \ --cluster=kubernetes \ --user=dashboard_user \ --kubeconfig=dashboard.kubeconfig #设置默认上下文 kubectl config use-context default --kubeconfig=dashboard.kubeconfig #生成的 dashboard.kubeconfig 登陆 Dashboard。 #因为缺乏 Heapster 插件,当前 dashboard 不能展现 Pod、Nodes 的 CPU、内存等统计数据和图表;
Heapster是一个收集者,将每一个Node上的cAdvisor的数据进行汇总,而后导到第三方工具(如InfluxDB)。Heapster 是经过调用 kubelet 的 http API 来获取 cAdvisor 的 metrics 数据的。因为 kublet 只在 10250 端口接收 https 请求,故须要修改 heapster 的 deployment 配置。同时,须要赋予 kube-system:heapster ServiceAccount 调用 kubelet API 的权限。
参考:配置 heapster:https://github.com/kubernetes/heapster/blob/master/docs/source-configuration.md
heapster下载地址:https://github.com/kubernetes-retired/heapster/releases
#1.解压heapster mkdir /opt/heapster tar -xzvf heapster-1.5.4.tar.gz -C /opt/heapster #2.修改配置 mkdir -p /etc/kubernetes/heapster cp -a /opt/heapster/deploy/kube-config/influxdb/{grafana.yaml,heapster.yaml,influxdb.yaml} /etc/kubernetes/heapster sed -i "s@image:.*@image: registry.cn-hangzhou.aliyuncs.com/google_containers/heapster-grafana-amd64:v4.4.3@g" /etc/kubernetes/heapster/grafana.yaml sed -i "67a\ type: NodePort" /etc/kubernetes/heapster/grafana.yaml sed -i "/targetPort/a\ nodePort: 32699" /etc/kubernetes/heapster/grafana.yaml sed -i "s@image:.*@image: registry.cn-hangzhou.aliyuncs.com/google_containers/heapster-amd64:v1.5.3@g" /etc/kubernetes/heapster/heapster.yaml # 因为 kubelet 只在 10250 监听 https 请求,故添加相关参数; sed -i "s@source=.*@source=kubernetes:https://kubernetes.default?kubeletHttps=true\&kubeletPort=10250@g" /etc/kubernetes/heapster/heapster.yaml sed -i "s@image:.*@image: registry.cn-hangzhou.aliyuncs.com/google_containers/heapster-influxdb-amd64:v1.3.3@g" /etc/kubernetes/heapster/influxdb.yaml # 将 serviceAccount kube-system:heapster 与 ClusterRole system:kubelet-api-admin 绑定,授予它调用 kubelet API 的权限; cp -a /opt/heapster/deploy/kube-config/rbac/heapster-rbac.yaml /etc/kubernetes/heapster cat > /etc/kubernetes/heapster/heapster-rbac.yaml <<EOF kind: ClusterRoleBinding apiVersion: rbac.authorization.k8s.io/v1beta1 metadata: name: heapster-kubelet-api roleRef: apiGroup: rbac.authorization.k8s.io kind: ClusterRole name: system:kubelet-api-admin subjects: - kind: ServiceAccount name: heapster namespace: kube-system EOF #3.执行全部定义文件 kubectl create -f /etc/kubernetes/heapster kubectl apply -f /etc/kubernetes/heapster/heapster-rbac.yaml #4.检查执行结果 kubectl -n kube-system get all -o wide | grep -E 'heapster|monitoring' kubectl -n kube-system describe pod heapster kubectl -n kube-system describe pod monitoring # 检查 kubernets dashboard 界面,能够正确显示各 Nodes、Pods 的 CPU、内存、负载等统计数据和图表: kubectl -n kube-system get all -o wide kubectl -n kube-system logs heapster-7bdc95b5cc-8h7zt #5.访问 grafana,经过 NodePort 访问: kubectl get svc -n kube-system|grep -E 'monitoring|heapster' #显示以下,grafana 监听 NodePort 32699; heapster ClusterIP 10.254.159.62 <none> 80/TCP 12m k8s-app=heapster monitoring-grafana NodePort 10.254.167.38 <none> 80:32699/TCP 4m29s k8s-app=grafana monitoring-influxdb ClusterIP 10.254.155.141 <none> 8086/TCP 12m k8s-app=influxdb kubectl get pod -n kube-system -o wide |grep -E 'monitoring|heapster' #显示以下,而后浏览器访问 URL:http://192.168.10.16:32699/?orgId=1 heapster-7bdc95b5cc-8h7zt 1/1 Running 0 13m 172.30.34.4 192.168.10.15 monitoring-grafana-6cf5948cd4-rstxk 1/1 Running 0 5m 172.30.85.11 192.168.10.16 monitoring-influxdb-7d6c5fb944-qfd65 1/1 Running 0 13m 172.30.85.10 192.168.10.16 #6.经过 kube-apiserver 访问: 获取 monitoring-grafana 服务 URL: kubectl cluster-info #查到浏览器访问URL:https://192.168.10.100:8443/api/v1/namespaces/kube-system/services/monitoring-grafana/proxy #经过 kubectl proxy 访问:建立代理 kubectl proxy --address='192.168.10.16' --port=8086 --accept-hosts='^*$' # 浏览器访问 URL:http://192.168.10.16:8086/api/v1/namespaces/kube-system/services/monitoring-grafana/proxy/?orgId=1