etcd-operator/README.md

173 строки
5.2 KiB
Markdown
Исходник Обычный вид История

2016-08-05 03:12:08 +03:00
# kube-etcd-controller
2016-08-26 02:51:12 +03:00
Project status: pre-alpha
2016-08-10 03:58:05 +03:00
Managed etcd clusters on Kubernetes:
- creation
- destroy
- resize
- recovery
- backup
- rolling upgrade
2016-08-10 04:13:58 +03:00
## Requirements
2016-08-17 06:39:18 +03:00
- Kubernetes 1.4+
- etcd 3.0+
## Limitations
- Backup only works for data in etcd3 storage, not etcd2 storage.
2016-08-10 04:13:58 +03:00
2016-08-24 00:26:04 +03:00
## Deploy kube-etcd-controller
2016-08-05 03:12:08 +03:00
```bash
2016-08-24 00:26:04 +03:00
$ kubectl create -f example/etcd-controller.yaml
pod "kubeetcdctrl" created
2016-08-05 03:12:08 +03:00
```
2016-08-24 00:26:04 +03:00
kube-etcd-controller will create a TPR automatically.
2016-08-05 03:12:08 +03:00
```bash
2016-08-05 03:35:46 +03:00
$ kubectl get thirdpartyresources
2016-08-05 03:12:08 +03:00
NAME DESCRIPTION VERSION(S)
2016-08-24 00:26:04 +03:00
etcd-cluster.coreos.com Managed etcd clusters v1
2016-08-05 03:30:46 +03:00
```
2016-08-05 03:12:08 +03:00
## Create an etcd cluster
```bash
2016-08-05 03:35:46 +03:00
$ cat example/example-etcd-cluster.yaml
2016-08-05 03:12:08 +03:00
```
```yaml
apiVersion: "coreos.com/v1"
kind: "EtcdCluster"
metadata:
name: "example-etcd-cluster"
size: 3
```
```bash
2016-08-05 03:35:46 +03:00
$ kubectl create -f example/example-etcd-cluster.yaml
2016-08-05 03:12:08 +03:00
```
```bash
2016-08-05 03:35:46 +03:00
$ kubectl get pods
2016-08-23 21:18:32 +03:00
NAME READY STATUS RESTARTS AGE
etcd-cluster-0000 1/1 Running 0 11s
etcd-cluster-0001 1/1 Running 0 11s
etcd-cluster-0002 1/1 Running 0 11s
2016-08-05 03:12:08 +03:00
```
```bash
2016-08-05 03:35:46 +03:00
$ kubectl get services
2016-08-23 21:18:32 +03:00
NAME CLUSTER-IP EXTERNAL-IP PORT(S) AGE
etcd-cluster-0000 10.0.104.18 <none> 2380/TCP,2379/TCP 45s
etcd-cluster-0001 10.0.243.108 <none> 2380/TCP,2379/TCP 45s
etcd-cluster-0002 10.0.45.68 <none> 2380/TCP,2379/TCP 45s
kubernetes 10.0.0.1 <none> 443/TCP 8m
2016-08-05 03:12:08 +03:00
```
```bash
2016-08-23 21:18:32 +03:00
$ kubectl log etcd-cluster-0000
2016-08-05 03:35:46 +03:00
...
2016-08-05 00:33:32.453768 I | api: enabled capabilities for version 3.0
2016-08-05 00:33:32.454178 N | etcdmain: serving insecure client requests on 0.0.0.0:2379, this is strongly discouraged!
2016-08-05 03:12:08 +03:00
```
2016-08-10 00:20:42 +03:00
2016-08-26 01:40:55 +03:00
## Resize an etcd cluster
2016-08-25 23:17:25 +03:00
`kubectl apply` doesn't work for TPR at the moment. See [kubernetes/#29542](https://github.com/kubernetes/kubernetes/issues/29542).
2016-08-26 00:32:46 +03:00
In this example, we use cURL to update the cluster as a workaround.
The following command changed the cluster size from 3 to 5.
2016-08-25 23:17:25 +03:00
```
2016-08-26 00:32:46 +03:00
$ curl -H 'Content-Type: application/json'-X PUT http://127.0.0.1:8080/apis/coreos.com/v1/namespaces/default/etcdclusters/etcd-cluster -d '{"apiVersion":"coreos.com/v1", "kind": "EtcdCluster", "metadata": {"name": "etcd-cluster", "namespace": "default"}, "spec": {"size": 5}}'
2016-08-25 23:17:25 +03:00
{"apiVersion":"coreos.com/v1","kind":"EtcdCluster","metadata":{"name":"etcd-cluster","namespace":"default","selfLink":"/apis/coreos.com/v1/namespaces/default/etcdclusters/etcd-cluster","uid":"e5828789-6b01-11e6-a730-42010af00002","resourceVersion":"32179","creationTimestamp":"2016-08-25T20:24:17Z"},"spec":{"size":5}}
```
2016-08-26 00:32:46 +03:00
We should see
2016-08-25 23:17:25 +03:00
```
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
etcd-cluster-0000 1/1 Running 0 43m
etcd-cluster-0001 1/1 Running 0 43m
etcd-cluster-0002 1/1 Running 0 43m
etcd-cluster-0003 1/1 Running 0 17s
etcd-cluster-0004 1/1 Running 0 12s
```
2016-08-26 00:32:46 +03:00
Now we can decrease the size of cluster from 5 back to 3.
```
$ curl -H 'Content-Type: application/json'-X PUT http://127.0.0.1:8080/apis/coreos.com/v1/namespaces/default/etcdclusters/etcd-cluster -d '{"apiVersion":"coreos.com/v1", "kind": "EtcdCluster", "metadata": {"name": "etcd-cluster", "namespace": "default"}, "spec": {"size": 3}}'
{"apiVersion":"coreos.com/v1","kind":"EtcdCluster","metadata":{"name":"etcd-cluster","namespace":"default","selfLink":"/apis/coreos.com/v1/namespaces/default/etcdclusters/etcd-cluster","uid":"e5828789-6b01-11e6-a730-42010af00002","resourceVersion":"32179","creationTimestamp":"2016-08-25T20:24:17Z"},"spec":{"size":3}}
```
We should see
```
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
etcd-cluster-0000 1/1 Running 0 43m
etcd-cluster-0002 1/1 Running 0 43m
etcd-cluster-0004 1/1 Running 0 12s
```
2016-08-10 03:58:05 +03:00
## Destroy an existing etcd cluster
2016-08-10 00:20:42 +03:00
```bash
$ kubectl delete -f example/example-etcd-cluster.yaml
```
```bash
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
```
2016-08-17 19:51:45 +03:00
## Try cluster recovery
2016-08-10 03:45:48 +03:00
Simulate a pod failure by simply delete it
```bash
2016-08-23 21:18:32 +03:00
$ kubectl delete pod etcd-cluster-0000
2016-08-10 03:45:48 +03:00
```
2016-08-23 21:18:32 +03:00
etcd controller will recover the failure by creating a new pod `etcd-cluster-0003`
2016-08-10 03:45:48 +03:00
```bash
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
2016-08-23 21:18:32 +03:00
etcd-cluster-0001 1/1 Running 0 5s
etcd-cluster-0002 1/1 Running 0 5s
etcd-cluster-0003 1/1 Running 0 5s
2016-08-10 03:45:48 +03:00
```
2016-08-24 03:07:45 +03:00
2016-08-24 03:09:16 +03:00
## Try controller recovery
etcd controller can recover itself from restart or a crash. Continued from above, you can try to simulate a controller crash:
2016-08-24 03:07:45 +03:00
```bash
$ kubectl delete -f example/etcd-controller.yaml
pod "kubeetcdctrl" deleted
$ kubectl delete etcd-cluster-0003
pod "etcd-cluster-0003" deleted
```
Then restart etcd controller. It should automatically recover itself. It also recovers the etcd cluster!
2016-08-24 03:09:16 +03:00
2016-08-24 03:07:45 +03:00
```bash
$ kubectl create -f example/example-etcd-cluster.yaml
$ kubectl get pods
NAME READY STATUS RESTARTS AGE
etcd-cluster-0001 1/1 Running 0 4m
etcd-cluster-0002 1/1 Running 0 4m
etcd-cluster-0004 1/1 Running 0 6s
```