CacheCloud Operations Management Platform: Study Notes

Date: 2020-06-08


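I have recently been reading the book 《Redis开发与运维》 (Redis Development and Operations) and found that CacheCloud, a platform released by Sohu TV (搜狐视频), is quite good. Below are my own study notes. The platform is written in Java, though, while development and operations work today commonly uses Python. Later I would like to see whether I can build a similar operations management platform in Python, to improve my own development skills.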

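Contents:

- Platform overview
- Platform installation
  - Download the installation package
  - Database configuration
  - Application configuration file
  - Build the application
  - Start the application
  - Login test
- Redis installation
  - Create the cachecloud user
  - Initialize the Redis environment
  - Add machines
- Practical examples
  - Redis Sentinel cluster
  - Redis Cluster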

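1 Platform Overview

CacheCloud is an open-source Redis cloud management platform from Sohu TV (搜狐视频).

It automates the deployment of several Redis types (Redis Standalone, Redis Sentinel, Redis Cluster), addresses the fragmentation of Redis instances, and provides comprehensive statistics, monitoring, and operations features. It reduces developers' operations cost and operator mistakes, improves machine utilization, offers flexible scaling, and gives clients a convenient way to connect. Its main features are: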

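Monitoring and statistics: provides monitoring and statistics views across the machine, application, and instance dimensions.
One-click provisioning: creates Redis Standalone, Redis Sentinel, and Redis Cluster applications without manual configuration or initialization.
Failover: supports the Sentinel and Cluster high-availability modes.
Scaling: provides online vertical and horizontal scaling.
Operations: provides automated and simplified operations tasks, avoiding the errors of purely manual work.
Client access: clients can be connected quickly and easily.
Metadata management: manages machine, application, instance, and user information.
Workflow: provides complete workflows for requests, operations, scaling, and configuration changes.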

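2 Platform Installation

2.1 Download the installation package

Download the package from the official repository at https://github.com/sohutv/cachecloud and extract it under /opt. Make sure the following are already installed in the environment: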

Java 7
Maven 3
MySQL
Redis 3

2.2 Database configuration

Install MySQL beforehand, then create the cachecloud database and import the cachecloud database script. The commands are as follows:

Welcome to the MariaDB monitor. Commands end with ; or \g.

Your MySQL connection id is 1

Server version: 5.5.24-patch-1.0 MySQL Community Server (GPL)

Type 'help;' or '\h' for help. Type '\c' to clear the current input statement.

MySQL [(none)]> create database cachecloud;

Query OK, 1 row affected (0.00 sec)

MySQL [(none)]> grant all on cachecloud.* to 'admin'@'localhost' identified by 'admin123';

Query OK, 0 rows affected (0.01 sec)

MySQL [(none)]> flush privileges;

Query OK, 0 rows affected (0.00 sec)

MySQL [(none)]> quit

[root@localhost opt]# mysql -uroot cachecloud < /opt/cachecloud-master/script/cachecloud.sql

2.3 Edit the application configuration file

Set the database address, account, and password here.

[root@localhost opt]# vi /opt/cachecloud-master/cachecloud-open-web/src/main/swap/online.properties

cachecloud.db.url = jdbc:mysql://127.0.0.1:3306/cachecloud

cachecloud.db.user = admin

cachecloud.db.password = admin123

cachecloud.maxPoolSize = 20
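Before building, it can help to confirm that the database settings are actually present in the properties file. The following is a minimal sketch, not part of CacheCloud: `parse_properties` and `missing_db_settings` are hypothetical helper names, and the parser only handles plain `key = value` lines like the ones shown above.

```python
# Minimal sketch: parse simple "key = value" lines and report missing DB settings.
# The helper names are illustrative and do not belong to CacheCloud.

REQUIRED_KEYS = (
    "cachecloud.db.url",
    "cachecloud.db.user",
    "cachecloud.db.password",
)


def parse_properties(text):
    """Return a dict from `key = value` lines, skipping blanks and # comments."""
    props = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if sep:  # ignore lines without an '=' sign
            props[key.strip()] = value.strip()
    return props


def missing_db_settings(props):
    """List the required keys that are absent or empty."""
    return [key for key in REQUIRED_KEYS if not props.get(key)]


if __name__ == "__main__":
    sample = """
    cachecloud.db.url = jdbc:mysql://127.0.0.1:3306/cachecloud
    cachecloud.db.user = admin
    cachecloud.db.password = admin123
    cachecloud.maxPoolSize = 20
    """
    print(missing_db_settings(parse_properties(sample)))
```

In practice the text would be read from online.properties instead of the inline sample.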

2.4 Build CacheCloud

The source is not prebuilt, so build the application with mvn to produce the war package.

# mvn clean compile install -Ponline

[root@localhost cachecloud-master]# mkdir -p /opt/cachecloud

[root@localhost cachecloud-master]# cp cachecloud-open-web/target/cachecloud-open-web-1.0-SNAPSHOT.war /opt/cachecloud/

[root@localhost cachecloud-master]# cp cachecloud-open-web/src/main/resources/cachecloud-web.conf /opt/cachecloud/

[root@localhost cachecloud-master]# ln -s /opt/cachecloud/cachecloud-open-web-1.0-SNAPSHOT.war /etc/init.d/cachecloud

[root@localhost cachecloud-master]#

2.5 Start the CacheCloud application

[root@localhost cachecloud-master]# /etc/init.d/cachecloud start

which: no start-stop-daemon in (/usr/local/sbin:/usr/local/bin:/usr/sbin:/usr/bin:/home/pycharm-4.5.4/bin:/usr/local/php/bin:/usr/local/mysql/bin/:/root/bin)

Started [4551]

[root@localhost cachecloud-master]#

2.6 Login test

Open a browser (IE here), go to the login address http://192.168.100.53:8585, and enter the username and password admin/admin to log in.

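3 Redis Installation

3.1 Create the cachecloud user on the client machine

The username and password created here must match those in the CacheCloud configuration, as shown in the figure below:

3.2 Initialize the Redis environment

On a newly installed machine, use the script below to install Redis and initialize the environment: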

/opt/cachecloud-master/script/cachecloud-init.sh redis

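3.3 Add machines

After logging in as administrator, go to the admin console and add the machine; you can then see that it has been added.

4 Practical Examples

4.1 Redis Sentinel

Redis Sentinel is the high-availability (HA) solution officially recommended by Redis. It monitors the running state of the master and slave instances. In a master-slave setup, neither Redis itself nor many of its clients switch automatically to the standby when the master goes down. Redis Sentinel runs as a separate process, can monitor multiple master-slave groups, and performs an automatic switchover when it detects that a master is down, which provides high availability. Multiple sentinels can be configured; when several are deployed and the master fails, the sentinels vote on whether to fail over.

The server addresses are listed below. Note that after adding the machines to CacheCloud, you need to run the following script on them to initialize the Redis environment and to compile and install Redis: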

/opt/cachecloud-master/script/cachecloud-init.sh

192.168.199.202 master sentinel1

192.168.199.203 slave sentinel2

192.168.199.204 sentinel3

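The machines have been added to CacheCloud.

Submit a request to create a redis-sentinel application.

Approve the request and configure it: enter the configuration below in the sentinel format and click the format check; if there are no problems, click to start the deployment.

The format must be exact, and the IPs below must belong to machines that have already been added to CacheCloud.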

192.168.199.202:512:192.168.199.203

192.168.199.202

192.168.199.203

192.168.199.204
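The deployment text above can be checked before it is pasted into the form. This is a minimal sketch that assumes the first line reads `masterIp:memSize:slaveIp` and that each following line is one sentinel IP. That reading, the assumption that the memory size is in MB, and the name `parse_sentinel_plan` are mine, not confirmed by the notes.

```python
# Minimal sketch: validate a sentinel deployment text of the assumed form
#   masterIp:memSize:slaveIp
#   sentinelIp (one per line)
import ipaddress


def parse_sentinel_plan(text):
    """Parse the deployment text and return its parts as a dict."""
    lines = [ln.strip() for ln in text.splitlines() if ln.strip()]
    if not lines:
        raise ValueError("empty deployment text")
    parts = lines[0].split(":")
    if len(parts) != 3:
        raise ValueError("first line must be masterIp:memSize:slaveIp")
    master, mem_size, slave = parts
    sentinels = lines[1:]
    for ip in [master, slave, *sentinels]:
        ipaddress.ip_address(ip)  # raises ValueError on a malformed address
    return {
        "master": master,
        "mem_size": int(mem_size),  # unit assumed to be MB
        "slave": slave,
        "sentinels": sentinels,
    }


if __name__ == "__main__":
    plan = parse_sentinel_plan(
        "192.168.199.202:512:192.168.199.203\n"
        "192.168.199.202\n192.168.199.203\n192.168.199.204\n"
    )
    print(plan)
```

The check is purely syntactic; whether an IP has already been added to CacheCloud still has to be verified in the platform itself.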

After clicking to start the deployment, you can check the servers in the admin console,

or log in to a server and check there:

[root@autoserver ~]# ps -ef | grep redis

cachecl+ 8999 1 0 11:31 ? 00:00:00 redis-server *:6386

cachecl+ 9085 1 0 11:32 ? 00:00:00 redis-server *:6387 [sentinel]

[root@node01 ~]# redis-cli -h 192.168.199.202 -p 6386 info | grep role

role:master

[root@node01 ~]# redis-cli -h 192.168.199.203 -p 6386 info | grep role

role:slave

Test that Sentinel works

First set a value:

[root@node01 ~]# redis-cli -h 192.168.199.202 -p 6386 set cw test01

[root@node01 ~]# redis-cli -h 192.168.199.203 -p 6386 get cw

"test01"

[root@node01 ~]#

Shut down the master node:

[root@node01 ~]# redis-cli -h 192.168.199.202 -p 6386 shutdown

Check that the process is really gone:

[root@autoserver ~]# ps -ef | grep redis

cachecl+ 9085 1 0 11:32 ? 00:00:03 redis-server *:6387 [sentinel]

Check the log:

17092:X 23 Aug 11:46:16.995 # +new-epoch 3

17092:X 23 Aug 11:46:16.997 # +vote-for-leader 1c076f3969391012eb246c4723c40f593244462f 3

17092:X 23 Aug 11:46:17.027 # Next failover delay: I will not start a failover before Wed Aug 23 11:52:17 2017

17092:X 23 Aug 11:46:18.041 # +config-update-from sentinel 192.168.199.202:6387 192.168.199.202 6387 @ sentinel-192.168.199.202-6386 192.168.199.202 6386

17092:X 23 Aug 11:46:18.041 # +switch-master sentinel-192.168.199.202-6386 192.168.199.202 6386 192.168.199.203 6386

17092:X 23 Aug 11:46:18.042 * +slave slave 192.168.199.202:6386 192.168.199.202 6386 @ sentinel-192.168.199.202-6386 192.168.199.203 6386

17092:X 23 Aug 11:46:38.127 # +sdown slave 192.168.199.202:6386 192.168.199.202 6386 @ sentinel-192.168.199.202-6386 192.168.199.203 6386
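The line that matters most in this log is `+switch-master`, whose payload is the master name followed by the old and the new address. As a small sketch of how such a log could be scanned for failovers (the function name `find_switch_master` is hypothetical):

```python
# Minimal sketch: extract +switch-master events from Sentinel log text.
# Event payload: <master name> <old ip> <old port> <new ip> <new port>
import re

SWITCH_MASTER = re.compile(r"\+switch-master (\S+) (\S+) (\d+) (\S+) (\d+)")


def find_switch_master(log_text):
    """Return one dict per +switch-master line found in the log text."""
    events = []
    for line in log_text.splitlines():
        match = SWITCH_MASTER.search(line)
        if match:
            name, old_ip, old_port, new_ip, new_port = match.groups()
            events.append({
                "master_name": name,
                "old": f"{old_ip}:{old_port}",
                "new": f"{new_ip}:{new_port}",
            })
    return events


if __name__ == "__main__":
    line = ("17092:X 23 Aug 11:46:18.041 # +switch-master "
            "sentinel-192.168.199.202-6386 192.168.199.202 6386 192.168.199.203 6386")
    print(find_switch_master(line))
```

For the log above this reports a switch from 192.168.199.202:6386 to 192.168.199.203:6386, which matches the role check that follows.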

Read the value from Redis:

[root@node01 conf]# redis-cli -h 192.168.199.203 -p 6386 info | grep role

role:master

[root@node01 conf]# redis-cli -h 192.168.199.203 -p 6386 get cw

"test01"

[root@node01 conf]#

After starting Redis on 192.168.199.202 again, a check shows that it has become a slave.

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6386 info | grep role

role:slave

4.2 Redis Cluster

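A Redis Cluster achieves sharding and high availability through master-slave pairs. The official description:

Redis Cluster is a set of programs that shares data across multiple Redis nodes.
Redis Cluster does not support commands that operate on multiple keys, because that would require moving data between nodes, which would keep it from matching the performance of standalone Redis and could cause unpredictable errors under high load. (Multi-key commands do work when all the keys involved map to the same hash slot.)
Redis Cluster provides a degree of availability through partitioning: in practice it keeps processing commands when a node goes down or becomes unreachable. Advantages of Redis Cluster:
Data is split automatically across different nodes.
Commands continue to be processed when some nodes of the cluster fail or become unreachable.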
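The automatic data split works by hashing: Redis Cluster maps each key to one of 16384 hash slots using CRC16 of the key modulo 16384, and each master owns a range of slots. The sketch below reproduces that mapping in Python, including the hash-tag rule (if the key contains a non-empty `{...}` section, only that part is hashed). It relies on `binascii.crc_hqx` with an initial value of 0, which is the CRC16 variant Redis Cluster uses; the function name `key_hash_slot` is my own.

```python
# Minimal sketch: compute the Redis Cluster hash slot of a key.
import binascii

TOTAL_SLOTS = 16384


def key_hash_slot(key: bytes) -> int:
    """Return the hash slot (0..16383) for a key, honoring {hash tags}."""
    start = key.find(b"{")
    if start != -1:
        end = key.find(b"}", start + 1)
        # Only a non-empty tag between the first '{' and the next '}' counts.
        if end != -1 and end != start + 1:
            key = key[start + 1:end]
    return binascii.crc_hqx(key, 0) % TOTAL_SLOTS


if __name__ == "__main__":
    for name in (b"cw0001", b"{user1000}.following", b"{user1000}.followers"):
        print(name.decode(), key_hash_slot(name))
```

Two keys that share a hash tag land in the same slot, which is what makes multi-key commands on them possible in a cluster.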

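The environment to build is as follows:

A master and its slave must not be placed on the same machine; the three pairs are cross-deployed: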

192.168.199.202:200:192.168.199.203

192.168.199.203:200:192.168.199.204

192.168.199.204:200:192.168.199.202
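The cross-deployment rule can be checked mechanically. The sketch below assumes each line has the form `masterIp:memSize:slaveIp` (my reading of the format; the helper names are hypothetical) and reports any pair whose master and slave share a machine, plus how many instances each machine would host.

```python
# Minimal sketch: check that no master/slave pair in a cluster plan
# is placed on the same machine. Line format assumed: masterIp:memSize:slaveIp
from collections import Counter


def parse_cluster_plan(text):
    """Return a list of (master_ip, mem_size, slave_ip) tuples."""
    pairs = []
    for raw in text.splitlines():
        line = raw.strip()
        if not line:
            continue
        parts = line.split(":")
        if len(parts) != 3:
            raise ValueError(f"expected masterIp:memSize:slaveIp, got {line!r}")
        master, mem_size, slave = parts
        pairs.append((master, int(mem_size), slave))
    return pairs


def colocated_pairs(pairs):
    """Pairs whose master and slave are on the same machine (should be empty)."""
    return [(m, s) for m, _, s in pairs if m == s]


def instances_per_machine(pairs):
    """Count how many Redis instances each machine would host."""
    return Counter(ip for m, _, s in pairs for ip in (m, s))


if __name__ == "__main__":
    plan = parse_cluster_plan(
        "192.168.199.202:200:192.168.199.203\n"
        "192.168.199.203:200:192.168.199.204\n"
        "192.168.199.204:200:192.168.199.202\n"
    )
    print(colocated_pairs(plan), dict(instances_per_machine(plan)))
```

For the plan above every machine hosts one master and one slave of a different pair, so losing a single machine never takes out both copies of a shard.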

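First, request the creation of the redis-cluster001 cluster in CacheCloud.

As with Redis Sentinel, click the format check first and then click to start the deployment.

Check whether the creation succeeded; processes tagged [cluster], as below, indicate that it did: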

[root@node01 conf]# ps -ef | grep redis

cachecl+ 17230 1 0 11:31 ? 00:00:10 redis-server *:6386

cachecl+ 17316 1 0 11:32 ? 00:00:19 redis-server *:6387 [sentinel]

cachecl+ 18847 1 0 12:04 ? 00:00:00 redis-server *:6388 [cluster]

cachecl+ 18948 1 0 12:04 ? 00:00:00 redis-server *:6389 [cluster]

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6388 cluster info

cluster_state:ok

cluster_slots_assigned:16384

cluster_slots_ok:16384

cluster_slots_pfail:0

cluster_slots_fail:0

cluster_known_nodes:6

cluster_size:3

cluster_current_epoch:4

cluster_my_epoch:1

cluster_stats_messages_sent:420

cluster_stats_messages_received:417

[root@node01 conf]#
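The `cluster info` output is a list of `key:value` lines, so the health check done by eye above can be scripted. A minimal sketch follows; the function names and the choice of fields to check are my own.

```python
# Minimal sketch: parse `cluster info` output and apply a simple health check.

def parse_cluster_info(text):
    """Return a dict of the key:value lines in `cluster info` output."""
    info = {}
    for line in text.splitlines():
        key, sep, value = line.strip().partition(":")
        if sep:
            info[key] = value
    return info


def cluster_is_healthy(info, total_slots=16384):
    """True when the state is ok, all slots are assigned, and none have failed."""
    return (
        info.get("cluster_state") == "ok"
        and int(info.get("cluster_slots_assigned", 0)) == total_slots
        and int(info.get("cluster_slots_fail", 0)) == 0
    )


if __name__ == "__main__":
    sample = (
        "cluster_state:ok\n"
        "cluster_slots_assigned:16384\n"
        "cluster_slots_ok:16384\n"
        "cluster_slots_pfail:0\n"
        "cluster_slots_fail:0\n"
        "cluster_known_nodes:6\n"
        "cluster_size:3\n"
    )
    print(cluster_is_healthy(parse_cluster_info(sample)))
```

The text could come from running `redis-cli -h <host> -p <port> cluster info` and capturing its output.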

The front-end monitoring page also shows that the deployment succeeded.

Verify cluster availability

First, generate some data:

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6388 -c set cw0001 chenwei0001

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6388 -c get cw0001

"chenwei0001"

[root@node01 conf]#

Check the cluster node information; it shows that the slave of 202:6388 is 203:6388:

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6388 cluster nodes

a53f3d619aebc611b79650f8dd8b37f00076fd4d 192.168.199.204:6380 slave d5523f7cdebe8ee551fa86b66a63689457fa7ab7 0 1503486623834 2 connected

301cbb1c365cc6a60f50cb6f9f0d295f54c80912 192.168.199.203:6388 slave 7e01a9ac4087ef10d28fa7412f4bd9907f97e868 0 1503486624844 3 connected

c7b89ca5a92d605730c78f93fd83d6a22fa9eafc 192.168.199.202:6389 slave 4468e49f7ec3049a9d3452199123d686bb2f1df0 0 1503486623331 4 connected

7e01a9ac4087ef10d28fa7412f4bd9907f97e868 192.168.199.202:6388 myself,master - 0 0 1 connected 0-5461

4468e49f7ec3049a9d3452199123d686bb2f1df0 192.168.199.204:6381 master - 0 1503486625855 3 connected 10924-16383

d5523f7cdebe8ee551fa86b66a63689457fa7ab7 192.168.199.203:6389 master - 0 1503486622827 0 connected 5462-10923

[root@node01 conf]#
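Finding the slave of a master by eye means matching node IDs across lines. The sketch below does the same matching in code, assuming the field order shown in the output above (id, address, flags, master id, ping, pong, epoch, link state, slots). `Node`, `parse_cluster_nodes`, and `replicas_of` are illustrative names.

```python
# Minimal sketch: parse `cluster nodes` output and list a master's replicas.
from collections import namedtuple

Node = namedtuple("Node", "node_id addr flags master_id link_state slots")


def parse_cluster_nodes(text):
    """Return a Node for each well-formed line of `cluster nodes` output."""
    nodes = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) < 8:
            continue  # skip prompts, blank lines, or truncated lines
        nodes.append(Node(
            node_id=fields[0],
            addr=fields[1].split("@")[0],  # newer Redis appends @cluster-port
            flags=set(fields[2].split(",")),
            master_id=None if fields[3] == "-" else fields[3],
            link_state=fields[7],
            slots=fields[8:],
        ))
    return nodes


def replicas_of(nodes, master_addr):
    """Addresses of the nodes that replicate the node at master_addr."""
    master_ids = {n.node_id for n in nodes if n.addr == master_addr}
    return [n.addr for n in nodes if n.master_id in master_ids]


if __name__ == "__main__":
    sample = (
        "301cbb1c365cc6a60f50cb6f9f0d295f54c80912 192.168.199.203:6388 slave "
        "7e01a9ac4087ef10d28fa7412f4bd9907f97e868 0 1503486624844 3 connected\n"
        "7e01a9ac4087ef10d28fa7412f4bd9907f97e868 192.168.199.202:6388 "
        "myself,master - 0 0 1 connected 0-5461\n"
    )
    print(replicas_of(parse_cluster_nodes(sample), "192.168.199.202:6388"))
```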

[root@node01 conf]# redis-cli -h 192.168.199.203 -p 6388 info | grep role

role:slave

Shut down the master node 192.168.199.202:6388:

[root@node01 conf]# redis-cli -h 192.168.199.202 -p 6388 shutdown

[root@node01 conf]# redis-cli -h 192.168.199.203 -p 6388 cluster nodes

a53f3d619aebc611b79650f8dd8b37f00076fd4d 192.168.199.204:6380 slave d5523f7cdebe8ee551fa86b66a63689457fa7ab7 0 1503486875906 2 connected

301cbb1c365cc6a60f50cb6f9f0d295f54c80912 192.168.199.203:6388 myself,master - 0 0 5 connected 0-5461

d5523f7cdebe8ee551fa86b66a63689457fa7ab7 192.168.199.203:6389 master - 0 1503486874894 0 connected 5462-10923

c7b89ca5a92d605730c78f93fd83d6a22fa9eafc 192.168.199.202:6389 slave 4468e49f7ec3049a9d3452199123d686bb2f1df0 0 1503486876916 4 connected

7e01a9ac4087ef10d28fa7412f4bd9907f97e868 192.168.199.202:6388 master,fail - 1503486840733 1503486839525 1 disconnected

4468e49f7ec3049a9d3452199123d686bb2f1df0 192.168.199.204:6381 master - 0 1503486877924 3 connected 10924-16383
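Two facts in this output confirm the failover: the old master carries the `fail` flag, and its slot range 0-5461 is now listed on 192.168.199.203:6388. A short sketch of checking both from the raw text (function names are hypothetical; the slot range is matched as a literal string, not expanded):

```python
# Minimal sketch: after a failover, find failed nodes and the current owner
# of a slot range in raw `cluster nodes` output.

def failed_nodes(text):
    """Addresses of nodes whose flags include 'fail'."""
    failed = []
    for line in text.splitlines():
        fields = line.split()
        if len(fields) >= 8 and "fail" in fields[2].split(","):
            failed.append(fields[1])
    return failed


def owner_of_range(text, slot_range):
    """Address of the healthy master that lists slot_range, or None."""
    for line in text.splitlines():
        fields = line.split()
        if len(fields) <= 8:
            continue  # no slots listed on this line
        flags = fields[2].split(",")
        if "master" in flags and "fail" not in flags and slot_range in fields[8:]:
            return fields[1]
    return None


if __name__ == "__main__":
    sample = (
        "301cbb1c365cc6a60f50cb6f9f0d295f54c80912 192.168.199.203:6388 "
        "myself,master - 0 0 5 connected 0-5461\n"
        "7e01a9ac4087ef10d28fa7412f4bd9907f97e868 192.168.199.202:6388 "
        "master,fail - 1503486840733 1503486839525 1 disconnected\n"
    )
    print(failed_nodes(sample), owner_of_range(sample, "0-5461"))
```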

Verify that the data can still be read; querying through another master shows that it can:

[root@node01 conf]# redis-cli -h 192.168.199.204 -p 6381 -c get cw0001

"chenwei0001"

[root@node01 conf]#