Installing Kubernetes 1.10.0 with kubeadm

  1. Environment preparation
# System environment
lsb_release -a
Distributor ID:	CentOS
Description:	CentOS Linux release 7.3.1611 (Core)
Release:	7.3.1611
Codename:	Core
Note: if this reports "-bash: lsb_release: command not found", install it with yum install -y redhat-lsb.
# Check the SELinux status
getenforce
Disabled
# Disable the firewall
systemctl disable firewalld
systemctl stop firewalld
systemctl status firewalld
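If SELinux is still enabled on your hosts (the nodes above already show Disabled), a minimal sketch for turning it off, assuming the standard /etc/selinux/config layout:
# Switch SELinux to permissive for the current boot
setenforce 0
# Disable it persistently across reboots
sed -i 's/^SELINUX=enforcing$/SELINUX=disabled/' /etc/selinux/config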
  2. Installing Docker. The required Docker packages can be downloaded from the 17.03 link.
# Install Docker
yum install -y yum-utils device-mapper-persistent-data lvm2
yum install -y docker-ce-selinux-17.03.1.ce-1.el7.centos.noarch.rpm
yum install -y docker-ce-17.03.1.ce-1.el7.centos.x86_64.rpm
# Start Docker
systemctl enable docker
systemctl start docker
systemctl status docker
# Docker version
docker --version
Docker version 17.03.1-ce, build c6d412e
  3. Preparing the base images. The images this Kubernetes walkthrough depends on are listed below:
k8s.gcr.io/kube-apiserver-amd64:v1.10.0
k8s.gcr.io/kube-controller-manager-amd64:v1.10.0
k8s.gcr.io/kube-scheduler-amd64:v1.10.0
k8s.gcr.io/kube-proxy-amd64:v1.10.0
k8s.gcr.io/k8s-dns-dnsmasq-nanny-amd64:1.14.8
k8s.gcr.io/k8s-dns-sidecar-amd64:1.14.8
k8s.gcr.io/k8s-dns-kube-dns-amd64:1.14.8
k8s.gcr.io/pause-amd64:3.1
quay.io/coreos/flannel:v0.9.1
quay.io/calico/node:v2.6.2
quay.io/calico/cni:v1.11.0
k8s.gcr.io/etcd-amd64:3.1.12
k8s.gcr.io/heapster-amd64:v1.5.3
k8s.gcr.io/heapster-influxdb-amd64:v1.3.3
k8s.gcr.io/heapster-grafana-amd64:v4.4.3
k8s.gcr.io/kubernetes-dashboard-amd64:v1.8.3
k8s.gcr.io/kubernetes-dashboard-init-amd64:v1.0.1

The required images can be downloaded via the images link.
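If k8s.gcr.io is not directly reachable, a common workaround is to pull the same images from a mirror registry and retag them locally. A minimal sketch, where the mirror prefix registry.cn-hangzhou.aliyuncs.com/google_containers is only an example and must actually carry these tags:
#!/bin/bash
# Pull each image from a mirror and retag it to the k8s.gcr.io name kubeadm expects
MIRROR=registry.cn-hangzhou.aliyuncs.com/google_containers   # example mirror, adjust as needed
for img in \
    kube-apiserver-amd64:v1.10.0 kube-controller-manager-amd64:v1.10.0 \
    kube-scheduler-amd64:v1.10.0 kube-proxy-amd64:v1.10.0 \
    k8s-dns-dnsmasq-nanny-amd64:1.14.8 k8s-dns-sidecar-amd64:1.14.8 \
    k8s-dns-kube-dns-amd64:1.14.8 pause-amd64:3.1 etcd-amd64:3.1.12; do
    docker pull "$MIRROR/$img"
    docker tag  "$MIRROR/$img" "k8s.gcr.io/$img"
    docker rmi  "$MIRROR/$img"
done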

  4. System configuration. Following the limitations section of the official documentation, configure each node as follows:
# Set kernel parameters: enable IP forwarding and let iptables see bridged traffic
# Create the file /etc/sysctl.d/k8s.conf:
touch /etc/sysctl.d/k8s.conf
# Add the following content:
net.bridge.bridge-nf-call-ip6tables = 1
net.bridge.bridge-nf-call-iptables = 1
net.ipv4.ip_forward = 1
# Apply the settings so they take effect:
sysctl -p /etc/sysctl.d/k8s.conf
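The same file can also be written in one step with a heredoc, in the same style as the yum repo definition below; a minimal sketch:
# If the bridge sysctls are reported as missing, load the kernel module first
modprobe br_netfilter
cat <<EOF > /etc/sysctl.d/k8s.conf
net.bridge.bridge-nf-call-ip6tables = 1
net.bridge.bridge-nf-call-iptables = 1
net.ipv4.ip_forward = 1
EOF
sysctl -p /etc/sysctl.d/k8s.conf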
  5. Installing kubeadm and kubelet. Next, install kubeadm and kubelet on each node.
# Configure the yum repository
cat <<EOF > /etc/yum.repos.d/kubernetes.repo
[kubernetes]
name=Kubernetes
baseurl=https://mirrors.aliyun.com/kubernetes/yum/repos/kubernetes-el7-x86_64
enabled=1
gpgcheck=1
repo_gpgcheck=1
gpgkey=https://mirrors.aliyun.com/kubernetes/yum/doc/yum-key.gpg https://mirrors.aliyun.com/kubernetes/yum/doc/rpm-package-key.gpg
EOF
# Rebuild the yum cache
yum clean all
yum makecache
# Check the latest available versions of kubeadm, kubelet and kubectl
yum list kubeadm  --showduplicates |sort -r
kubeadm.x86_64                        1.10.0-0                        kubernetes
yum list kubelet  --showduplicates |sort -r
kubelet.x86_64                        1.10.0-0                        kubernetes
yum list kubectl  --showduplicates |sort -r
kubectl.x86_64                        1.10.0-0                        kubernetes
Then install them directly:
yum install -y kubelet kubeadm kubectl kubernetes-cni
# Enable and start kubelet.service
systemctl enable kubelet.service
systemctl start kubelet.service
systemctl status kubelet.service
Note ⚠️: the Alibaba Cloud Kubernetes yum repository is continuously updated, so before running yum install, double-check that the package version is still 1.10.0!
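If the repository has already moved past 1.10.0, the versions can be pinned explicitly; a minimal sketch based on the package versions shown by yum list above:
# Install the exact 1.10.0 packages rather than the latest ones
yum install -y kubelet-1.10.0-0 kubeadm-1.10.0-0 kubectl-1.10.0-0 kubernetes-cni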
  6. Preparing to initialize the master
# Notes:
(1) Installing the kubeadm-related RPMs creates the /etc/kubernetes directory, and kubeadm init checks whether these directories already exist and aborts if they do, so clean them up first.
# Cleanup command
kubeadm reset

See the tear down section of the official documentation.

(2) Make sure kubelet is started before initializing.

# Start it as follows:
systemctl enable kubelet
systemctl start kubelet

(3) Install the ebtables package. On newer versions, running init directly reports an "ebtables not found in system path" error, so install this package first and then initialize.

# Install ebtables
yum install -y ebtables

(4) Adjust the kubelet configuration file

kubelet and Docker each support two cgroup drivers, cgroupfs and systemd. Make sure
both applications use the same driver.
# Check kubelet's driver
cat /etc/systemd/system/kubelet.service.d/10-kubeadm.conf
# The file contains, among other things:
Environment="KUBELET_CGROUP_ARGS=--cgroup-driver=systemd"
# Check Docker's driver
docker info
Server Version: 17.03.1-ce
...
Cgroup Driver: cgroupfs
...
# Change kubelet's driver to match:
Environment="KUBELET_CGROUP_ARGS=--cgroup-driver=cgroupfs"

(5) Disable swap. Starting with Kubernetes 1.8, kubelet requires system swap to be disabled; with the default configuration it will not start otherwise. The restriction can be lifted with the kubelet startup flag --fail-swap-on=false.

# Disable swap
swapoff -a
# Edit /etc/fstab and comment out the swap mount so it is not re-enabled on reboot; confirm with free -m that swap is off.
# Also adjust swappiness by adding the following line to /etc/sysctl.d/k8s.conf:
vm.swappiness=0
# Apply the change so it takes effect:
sysctl -p /etc/sysctl.d/k8s.conf
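A minimal sketch of the /etc/fstab edit; the sed expression assumes the swap entry is an uncommented line containing the word swap, so review the file before running anything like this:
# Turn swap off now and comment out any swap entries in /etc/fstab (backup kept as /etc/fstab.bak)
swapoff -a
sed -i.bak '/\sswap\s/ s/^/#/' /etc/fstab
# Verify: the Swap line should show 0 everywhere
free -m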

Note: because the hosts used for this test also run other services, disabling swap could affect them, so here we instead use the kubelet startup flag --fail-swap-on=false to lift the restriction. Note ⚠️: if you hit the following error:

[kubelet-check] The HTTP call equal to 'curl -sSL http://localhost:10255/healthz' failed with error: Get http://localhost:10255/healthz: dial tcp [::1]:10255: getsockopt: connection refused.

Workaround:

# Edit /etc/systemd/system/kubelet.service.d/10-kubeadm.conf and add:
Environment="KUBELET_EXTRA_ARGS=--fail-swap-on=false"
# Make the configuration change take effect:
systemctl daemon-reload
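After the reload you will most likely also want to restart kubelet so the new flag is picked up (an extra step not shown in the original list):
systemctl restart kubelet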
  7. Initializing the cluster with kubeadm init
# Initialize the cluster by running the following on the master node:
kubeadm init \
  --kubernetes-version=v1.10.0 \
  --pod-network-cidr=10.244.0.0/16 \
  --apiserver-advertise-address=10.0.0.39   # this host's (the master's) IP

# Note: we use flannel as the pod network plugin, which is why the command passes --pod-network-cidr=10.244.0.0/16.
# Warnings seen during execution:

[WARNING FileExisting-crictl]: crictl not found in system path
Suggestion: go get github.com/kubernetes-incubator/cri-tools/cmd/crictl

# If the run reports:

[preflight] Some fatal errors occurred:
[ERROR Swap]: running with swap on is not supported. Please disable swap
add the --ignore-preflight-errors=Swap flag to ignore this error and run it again,
i.e.:
kubeadm init \
   --kubernetes-version=v1.10.0 \
   --pod-network-cidr=10.244.0.0/16 \
   --apiserver-advertise-address=10.0.0.39 \
   --ignore-preflight-errors=Swap

# The full initialization run

kubeadm init --kubernetes-version=v1.10.0 --pod-network-cidr=10.244.0.0/16 \
  --apiserver-advertise-address=10.0.0.39
[init] Using Kubernetes version: v1.10.0
[init] Using Authorization modes: [Node RBAC]
[preflight] Running pre-flight checks.
	[WARNING FileExisting-crictl]: crictl not found in system path
Suggestion: go get github.com/kubernetes-incubator/cri-tools/cmd/crictl
[preflight] Starting the kubelet service
[certificates] Generated ca certificate and key.
[certificates] Generated apiserver certificate and key.
[certificates] apiserver serving cert is signed for DNS names [swarm2 kubernetes
kubernetes.default kubernetes.default.svc kubernetes.default.svc.cluster.local]
and IPs [10.96.0.1 10.0.0.39]
[certificates] Generated apiserver-kubelet-client certificate and key.
[certificates] Generated etcd/ca certificate and key.
[certificates] Generated etcd/server certificate and key.
[certificates] etcd/server serving cert is signed for DNS names [localhost] and IPs
[127.0.0.1]
[certificates] Generated etcd/peer certificate and key.
[certificates] etcd/peer serving cert is signed for DNS names [swarm2] and IPs
[10.0.0.39]
[certificates] Generated etcd/healthcheck-client certificate and key.
[certificates] Generated apiserver-etcd-client certificate and key.
[certificates] Generated sa key and public key.
[certificates] Generated front-proxy-ca certificate and key.
[certificates] Generated front-proxy-client certificate and key.
[certificates] Valid certificates and keys now exist in "/etc/kubernetes/pki"
[kubeconfig] Wrote KubeConfig file to disk: "/etc/kubernetes/admin.conf"
[kubeconfig] Wrote KubeConfig file to disk: "/etc/kubernetes/kubelet.conf"
[kubeconfig] Wrote KubeConfig file to disk: "/etc/kubernetes/controller-manager.conf"
[kubeconfig] Wrote KubeConfig file to disk: "/etc/kubernetes/scheduler.conf"
[controlplane] Wrote Static Pod manifest for component kube-apiserver to
"/etc/kubernetes/manifests/kube-apiserver.yaml"
[controlplane] Wrote Static Pod manifest for component kube-controller-manager 
to "/etc/kubernetes/manifests/kube-controller-manager.yaml"
[controlplane] Wrote Static Pod manifest for component kube-scheduler 
to "/etc/kubernetes/manifests/kube-scheduler.yaml"
[etcd] Wrote Static Pod manifest for a local etcd instance
to "/etc/kubernetes/manifests/etcd.yaml"
[init] Waiting for the kubelet to boot up the control plane as Static Pods 
from directory "/etc/kubernetes/manifests".
[init] This might take a minute or longer if the control plane images have to be pulled.
[apiclient] All control plane components are healthy after 22.002127 seconds
[uploadconfig] Storing the configuration used in ConfigMap "kubeadm-config" in 
the "kube-system" Namespace
[markmaster] Will mark node swarm2 as master by adding a label and a taint
[markmaster] Master swarm2 tainted and labelled with key/value: 
node-role.kubernetes.io/master=""
[bootstraptoken] Using token: 4g0p8w.w5p29ukwvitim2ti
[bootstraptoken] Configured RBAC rules to allow Node Bootstrap tokens to post 
CSRs in order for nodes to get long term certificate credentials
[bootstraptoken] Configured RBAC rules to allow the csrapprover controller
automatically approve CSRs from a Node Bootstrap Token
[bootstraptoken] Configured RBAC rules to allow certificate rotation for all node 
client certificates in the cluster
[bootstraptoken] Creating the "cluster-info" ConfigMap in the "kube-public" 
namespace
[addons] Applied essential addon: kube-dns
[addons] Applied essential addon: kube-proxy
Your Kubernetes master has initialized successfully!
To start using your cluster, you need to run the following as a regular user:
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
You should now deploy a pod network to the cluster.
Run "kubectl apply -f [podnetwork].yaml" with one of the options listed at:
https://kubernetes.io/docs/concepts/cluster-administration/addons/
You can now join any number of machines by running the following on each node
as root:
kubeadm join 10.0.0.39:6443 --token 4g0p8w.w5p29ukwvitim2ti --discovery-token-ca-cert-hash sha256:21d0adbfcb409dca97e655641573b2ee51c77a212f194e20a307cb459e5f77c8

# Notes on the key output

[certificates] generates all of the required certificates.
[kubeconfig] then generates the certificates and the related kubeconfig files;
this is the same thing we already do in the Kubernetes 1.6 HA cluster deployment,
so nothing new here.
[bootstraptoken] generates the bootstrap token; write it down, it is needed later
when adding nodes to the cluster with kubeadm join.
The following commands set up kubectl access to the cluster for a regular user:
mkdir -p $HOME/.kube
sudo cp -i /etc/kubernetes/admin.conf $HOME/.kube/config
sudo chown $(id -u):$(id -g) $HOME/.kube/config
Finally, the output gives the command for joining nodes to the cluster:
kubeadm join 10.0.0.39:6443 --token 4g0p8w.w5p29ukwvitim2ti \
  --discovery-token-ca-cert-hash sha256:21d0adbfcb409dca97e655641573b2ee51c77a212f194e20a307cb459e5f77c8
Make sure to save this command!!
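If the join command is lost, it can usually be regenerated on the master. A minimal sketch, assuming the installed kubeadm supports --print-join-command:
# Print a fresh join command with a newly created bootstrap token
kubeadm token create --print-join-command
# The CA certificate hash can also be recomputed by hand if needed
openssl x509 -pubkey -in /etc/kubernetes/pki/ca.crt | \
  openssl rsa -pubin -outform der 2>/dev/null | \
  openssl dgst -sha256 -hex | sed 's/^.* //'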

If you need to run kubeadm init again, it is best to clean up the previous initialization and the related containers first. The steps are:

(1)kubeadm reset
(2)systemctl stop kubelet
(3)docker stop $(docker ps -qa) && docker rm $(docker ps -qa) ## If other services are running on this Docker host, do not use this command; instead, identify the Kubernetes-related containers manually and remove only those
(4)systemctl start kubelet
(5)kubeadm init 
  8. Verifying the master node
# Check the cluster status
kubectl get cs
error: the server doesn't have a resource type "cs"
kubectl get nodes
Unable to connect to the server: x509: certificate signed by unknown authority
(possibly because of "crypto/rsa: verification error" while trying to verify 
candidate authority certificate "kubernetes")

# Fix:

mkdir -p /root/.kube/
cp -i /etc/kubernetes/admin.conf /root/.kube/config
chown root:root /root/.kube/config
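For a regular (non-root) user, use the mkdir/cp/chown steps from the init output; as root you can also simply point kubectl at the admin kubeconfig. A minimal sketch:
# Use the admin kubeconfig directly for the current shell session
export KUBECONFIG=/etc/kubernetes/admin.conf
kubectl get nodes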

# Check the cluster status again

kubectl get cs
NAME                 STATUS    MESSAGE              ERROR
controller-manager   Healthy   ok
scheduler            Healthy   ok
etcd-0               Healthy   {"health": "true"}

# List the cluster nodes

kubectl get nodes
NAME      STATUS    ROLES     AGE       VERSION
swarm2    Ready     master    1d        v1.10.0

# Verifying the API server from the command line

Command:
curl --cacert /etc/kubernetes/pki/ca.crt --cert /etc/kubernetes/pki/apiserver-kubelet-client.crt --key /etc/kubernetes/pki/apiserver-kubelet-client.key https://10.0.0.39:6443
  9. Checking the current network. After kubeadm init succeeds, all of the core Kubernetes components on the master node are up, and most of them run as containers.
docker ps
CONTAINER ID        IMAGE                        COMMAND                  CREATED             STATUS              PORTS               NAMES
c2eb8ca152a2        k8s.gcr.io/pause-amd64:3.1   "/pause"                 20 minutes ago      Up 20 minutes                           k8s_POD_kube-flannel-ds-nhjx5_kube-system_a3016d6d-51dd-11e8-a243-0017fa00e437_0
eb98616f7458        6f7f2dc7fab5                 "/sidecar --v=2 --..."   25 minutes ago      Up 25 minutes                           k8s_sidecar_kube-dns-86f4d74b45-ztrbs_kube-system_17bb3bf9-50e1-11e8-921b-0017fa00e437_4
912942318fed        c2ce1ffb51ed                 "/dnsmasq-nanny -v..."   25 minutes ago      Up 25 minutes                           k8s_dnsmasq_kube-dns-86f4d74b45-ztrbs_kube-system_17bb3bf9-50e1-11e8-921b-0017fa00e437_4
d5b193a4d60e        80cc5ea4b547                 "/kube-dns --domai..."   25 minutes ago      Up 25 minutes                           k8s_kubedns_kube-dns-86f4d74b45-ztrbs_kube-system_17bb3bf9-50e1-11e8-921b-0017fa00e437_4
c120c761b764        9df3c00f55e6                 "kube-apiserver --..."   25 minutes ago      Up 25 minutes                           k8s_kube-apiserver_kube-apiserver-swarm2_kube-system_659a6e4a0a2629e2c62563857da54a7f_3
81533f3ea2a7        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_kube-dns-86f4d74b45-ztrbs_kube-system_17bb3bf9-50e1-11e8-921b-0017fa00e437_4
867ce7e1d7f8        ceecd7155649                 "kube-scheduler --..."   25 minutes ago      Up 25 minutes                           k8s_kube-scheduler_kube-scheduler-swarm2_kube-system_0ede54c0e24ebcdc8ec84ec2aa830bfc_4
01474f2a8879        6e6237849607                 "/usr/local/bin/ku..."   25 minutes ago      Up 25 minutes                           k8s_kube-proxy_kube-proxy-2tghd_kube-system_17c7aa1b-50e1-11e8-921b-0017fa00e437_4
be4c6d21dc4d        52920ad46f5b                 "etcd --peer-key-f..."   25 minutes ago      Up 25 minutes                           k8s_etcd_etcd-swarm2_kube-system_11d7cb74cd31e890a93f59d783573f27_3
912413c032e1        8401bb3ff261                 "kube-controller-m..."   25 minutes ago      Up 25 minutes                           k8s_kube-controller-manager_kube-controller-manager-swarm2_kube-system_c9074b2decba8a970cedd0fa9c4dd366_3
6609d479936e        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_kube-apiserver-swarm2_kube-system_659a6e4a0a2629e2c62563857da54a7f_3
ddfb3e0b37f5        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_kube-scheduler-swarm2_kube-system_0ede54c0e24ebcdc8ec84ec2aa830bfc_4
67c0c0001635        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_kube-proxy-2tghd_kube-system_17c7aa1b-50e1-11e8-921b-0017fa00e437_4
5dac6220b6dc        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_etcd-swarm2_kube-system_11d7cb74cd31e890a93f59d783573f27_5
7c88b16b245d        k8s.gcr.io/pause-amd64:3.1   "/pause"                 25 minutes ago      Up 25 minutes                           k8s_POD_kube-controller-manager-swarm2_kube-system_c9074b2decba8a970cedd0fa9c4dd366_4

Note that these core components are not running on the pod network (indeed, the pod network has not been created yet); they use the host network instead. Take the kube-apiserver pod as an example:

kubectl get pods -n kube-system
NAME                                   READY     STATUS    RESTARTS   AGE
kube-apiserver-swarm2                  1/1       Running   3          1d

# Find the kube-apiserver container id

docker ps |grep apiserver
c120c761b764        9df3c00f55e6                 "kube-apiserver --..."   33 minutes ago      Up 33 minutes                           k8s_kube-apiserver_kube-apiserver-swarm2_kube-system_659a6e4a0a2629e2c62563857da54a7f_3

# Check the network mode of the corresponding container

docker inspect c120c761b764
"NetworkMode": "host",
  10. Installing the flannel pod network
# Create a working directory for the flannel manifest and switch into it
mkdir /etc/kubernetes/manifests/my.conf
cd /etc/kubernetes/manifests/my.conf
# Download the kube-flannel.yml file
wget https://raw.githubusercontent.com/coreos/flannel/master/Documentation/kube-flannel.yml
# Edit the file, changing
image: quay.io/coreos/flannel:v0.10.0-amd64
# to the version prepared earlier:
image: quay.io/coreos/flannel:v0.9.1
# Deploy flannel
kubectl create -f kube-flannel.yml
  11. Checking the status of all pods on the master
kubectl get pod --all-namespaces -o wide
NAMESPACE     NAME                             READY     STATUS    RESTARTS   AGE       IP           NODE
kube-system   etcd-swarm2                      1/1       Running   3          1d        10.0.0.39    swarm2
kube-system   kube-apiserver-swarm2            1/1       Running   3          1d        10.0.0.39    swarm2
kube-system   kube-controller-manager-swarm2   1/1       Running   3          1d        10.0.0.39    swarm2
kube-system   kube-dns-86f4d74b45-ztrbs        3/3       Running   12         1d        10.244.0.6   swarm2
kube-system   kube-flannel-ds-nhjx5            1/1       Running   0          36m       10.0.0.39    swarm2
kube-system   kube-proxy-2tghd                 1/1       Running   4          1d        10.0.0.39    swarm2
kube-system   kube-scheduler-swarm2            1/1       Running   4          1d        10.0.0.39    swarm2
Or use:
kubectl get pods -n kube-system
NAME                             READY     STATUS    RESTARTS   AGE
etcd-swarm2                      1/1       Running   3          1d
kube-apiserver-swarm2            1/1       Running   3          1d
kube-controller-manager-swarm2   1/1       Running   3          1d
kube-dns-86f4d74b45-ztrbs        3/3       Running   12         1d
kube-flannel-ds-nhjx5            1/1       Running   0          37m
kube-proxy-2tghd                 1/1       Running   4          1d
kube-scheduler-swarm2            1/1       Running   4          1d
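By default the master carries a NoSchedule taint, so ordinary workloads will not run on it; for a single-node test cluster the taint can be removed with the standard kubectl command below (an optional step not in the original walkthrough):
# Allow ordinary pods to be scheduled on the master node
kubectl taint nodes --all node-role.kubernetes.io/master-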
  12. Adding and removing cluster nodes. For these operations, see the Node operations link.
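In outline, joining uses the kubeadm join command saved earlier, and removal follows the official tear-down steps; a minimal sketch, where the node name worker1 is only a placeholder:
# On the new node: join the cluster with the command printed by kubeadm init
kubeadm join 10.0.0.39:6443 --token 4g0p8w.w5p29ukwvitim2ti \
  --discovery-token-ca-cert-hash sha256:21d0adbfcb409dca97e655641573b2ee51c77a212f194e20a307cb459e5f77c8
# On the master: drain and delete a node that should be removed
kubectl drain worker1 --delete-local-data --force --ignore-daemonsets
kubectl delete node worker1
# Then on the removed node itself: wipe the kubeadm state
kubeadm reset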

  13. References

Installing a multi-master highly available Kubernetes 1.11 cluster on CentOS 7 with kubeadm