朝花夕拾之–大数据平台CDH集群离线搭建
标签: Cloudera-Manager CDH Hadoop 部署 集群html
摘要:管理、部署Hadoop集群须要工具,Cloudera Manager即是其一。本文先是简要对比了当前的相似工具,然后详细记录了以离线方式部署CDH集群的步骤。最后对“讲究”一词提出了本身的观点。java
前言
以Apache Hadoop为主导的大数据技术的出现,使得中小型公司对于大数据的存储与处理也拥有了武器。目前Hadoop有很多发行版:华为发行版 收费、Intel发行版 收费、Cloudera发行版(Cloudera’s Distribution Including Apache Hadoop,简称 CDH)免费、Hortonworks发行版(Hortonworks Data Platform
,简称 HDP)免费 等,全部这些发行版均是基于Apache Hadoop社区版衍生出来的。node
部署、管理拥有数十数百甚至更多节点的Hadoop集群,也须要先进武器。Hortonworks公司的Apache Ambari项目的目的就是经过软件来配置、监控和管理Hadoop(HDP)集群,以使Hadoop的管理更加简单。Ambari提供了一个基于它自身RESTful的api实现的直观的、简单易用的web界面。Cloudera公司也提供了相似的工具:Cloudera Manager(简称 CM)来配置、监控和管理CDH集群。python
本文主要内容便是本人早先搭建CDH集群之记录,故称做朝花夕拾。需特别注意的是Cloudera Manager与操做系统的版本关系 el7暂不支持,按照官方文档的要求来,不然安装会有问题。mysql
注意用户。
本文是基于操做系统CentOS 6.5, 64-bit;Cloudera Manager 5.3.6;JDK 1.7 版本进行部署的。linux
部署步骤
网络配置(全部节点)
[root@cdh-server ~]# vi /etc/sysconfig/network #修改hostname: NETWORKING=yes HOSTNAME=cdh-server [root@cdh-server ~]# vi /etc/hosts #修改ip与主机名的对应关系: 192.168.180.173 cdh-server 192.168.180.175 node175 [root@cdh-server ~]# service network restart #重启网络服务生效
安装JDK(全部节点)
#卸载OpenJDK [root@cdh-server user1]# rpm -qa | grep java [root@cdh-server user1]# rpm -e --nodeps java-1.5.0-gcj-1.5.0.0-29.1.el6.x86_64 [root@cdh-server user1]# rpm -e --nodeps java-1.6.0-openjdk-1.6.0.0-1.66.1.13.0.el6.x86_64 [root@cdh-server user1]# rpm -e --nodeps java-1.7.0-openjdk-1.7.0.45-2.4.3.3.el6.x86_64 #安装JDK [root@cdh-server user1]# chmod a+x jdk-7u79-linux-x64.rpm [root@cdh-server user1]# rpm -ivh jdk-7u79-linux-x64.rpm [root@cdh-server user1]# echo "JAVA_HOME=/usr/java/jdk1.7.0_79/" >>
安装MySQL(主节点)
[user1@cdh-server]$ cd /home/user1 [user1@cdh-server]$ tar -zxvf mysql-5.6.26-linux-glibc2.5-x86_64.tar.gz [user1@cdh-server]$ mv mysql-5.6.26-linux-glibc2.5-x86_64 mysql-5.6.26 [user1@cdh-server]$ cd mysql-5.6.26/ [user1@cdh-server]$ vi support-files/my.cnf #新建文件
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ [mysqld] character-set-server=utf8 default-storage-engine=INNODB # Uncomment the following if you are using InnoDB tables innodb_data_home_dir = /home/user1/mysql-5.6.26/data innodb_data_file_path = ibdata1:10M:autoextend innodb_log_group_home_dir = /home/user1/mysql-5.6.26/data # You can set .._buffer_pool_size up to 50 - 80 % # of RAM but beware of setting memory usage too high innodb_buffer_pool_size = 16M innodb_additional_mem_pool_size = 2M # Set .._log_file_size to 25 % of buffer pool size innodb_log_file_size = 5M innodb_log_buffer_size = 8M innodb_flush_log_at_trx_commit = 1 innodb_lock_wait_timeout = 50 +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
初始化MySQL(主节点)
[user1@cdh-server]$ ./scripts/mysql_install_db --defaults-file=/home/user1/mysql-5.6.26/support-files/my.cnf --basedir=/home/user1/mysql-5.6.26 --datadir=/home/user1/mysql-5.6.26/data --user=user1 [user1@cdh-server]$ ./bin/mysqld --defaults-file=/home/user1/mysql-5.6.26/support-files/my.cnf --basedir=/home/user1/mysql-5.6.26 --datadir=/home/user1/mysql-5.6.26/data > mysql.log 2>&1 & [user1@cdh-server]$ ./bin/mysqladmin -u root password '123456'
[user1@cdh-server mysql-5.6.26]$ ./bin/mysql -uroot -p'123456' #hive mysql> create database hive DEFAULT CHARSET utf8 COLLATE utf8_general_ci; Query OK, 1 row affected (0.00 sec) #Activity Monitor使用 mysql> create database amon DEFAULT CHARSET utf8 COLLATE utf8_general_ci; Query OK, 1 row affected (0.01 sec) #Navigator Audit Server使用 mysql> create database audit DEFAULT CHARSET utf8 COLLATE utf8_general_ci; Query OK, 1 row affected (0.01 sec) #Navigator Metadata Server mysql> create database metadata DEFAULT CHARSET utf8 COLLATE utf8_general_ci; Query OK, 1 row affected (0.01 sec) mysql> grant all privileges on *.* to 'root'@'localhost' identified by '123456' with grant option; Query OK, 0 rows affected (0.00 sec) mysql> grant all privileges on *.* to 'root'@'cdh-server' identified by '123456' with grant option; Query OK, 0 rows affected (0.00 sec) #this user scm is for cloudera manager mysql> grant all privileges on *.* to 'scm'@'localhost' identified by 'scm' with grant option; Query OK, 0 rows affected (0.00 sec) mysql> grant all privileges on *.* to 'scm'@'cdh-server' identified by 'scm' with grant option; Query OK, 0 rows affected (0.00 sec) mysql> flush privileges; Query OK, 0 rows affected (0.00 sec)
部署/启动CM Server(主节点)
[user1@cdh-server ~]$ tar -zxvf cloudera-manager-el6-cm5.3.6_x86_64.tar.gz [user1@cdh-server ~]$ cp mysql-connector-java-5.1.33-bin.jar ./cm-5.3.6/share/cmf/lib/ [user1@cdh-server ~]$ su - root [root@cdh-server ~]# cd /home/user1/ [root@cdh-server user1]# cp -rf cloudera /opt [root@cdh-server user1]# mv CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel /opt/cloudera/parcel-repo/CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel [root@cdh-server user1]# mv CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha /opt/cloudera/parcel-repo/CDH-5.3.6-1.cdh5.3.6.p0.11-el6.parcel.sha [root@cdh-server user1]# mv manifest.json /opt/cloudera/parcel-repo/manifest.json [root@cdh-server user1]# ./cm-5.3.6/share/cmf/schema/scm_prepare_database.sh mysql cm -hlocalhost:3306 -uroot -p123456 --scm-host localhost scm scm scm [root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-server start Starting cloudera-scm-server: [ OK ] [root@cdh-server user1]# tail -f ./cm-5.3.6/log/cloudera-scm-server/cloudera-scm-server.log
关闭防火墙(全部节点)
#中止iptables [root@cdh-server user1]# service iptables stop #经过浏览器访问验证 http://192.168.180.173:7180/
部署/启动CM Agent(从节点)
[root@cdh-server user1]# tar -zxvf cloudera-manager-el6-cm5.3.6_x86_64.tar.gz [root@cdh-server user1]# vi cm-5.3.6/etc/cloudera-scm-agent/config.ini
+++++++++++++++++++++++++++++++++++++++++++++++++++++++++++ # Hostname of the CM server. #server_host=localhost server_host=cdh-server +++++++++++++++++++++++++++++++++++++++++++++++++++++++++++
[root@cdh-server user1]# useradd -G sys --home=/home/user1/cm-5.3.6/run/cloudera-scm-server --no-create-home --comment "Cloudera SCM User" cloudera-scm [root@cdh-server user1]# useradd --comment "Cloudera SCM User" cloudera-scm #若上一步执行正确,则此步省略 [root@cdh-server user1]# echo 0 > /proc/sys/vm/swappiness [root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-agent start Starting cloudera-scm-agent: [ OK ] [root@cdh-server user1]# tail -f ./cm-5.3.6/log/cloudera-scm-agent/cloudera-scm-agent.log
配置CDH
登录Cloudera Manager http://192.168.180.173:7180/,并新建集群Cluster_user1,进行各服务的配置启动。
web
#安装配置hive出错时,在hiveServer上: [root@hive-server user1]# cp mysql-connector-java-5.1.33-bin.jar /opt/cloudera/parcels/CDH-5.3.6-1.cdh5.3.6.p0.11/lib/hive/lib/ #同理:use this jar for Navigator Audit Server and Navigator Metadata Server or Activity Server [root@cdh-server user1]# cp mysql-connector-java-5.1.33-bin.jar /usr/share/java/mysql-connector-java.jar
其余
中止集群步骤
- 中止Cloudera Management Service和Cluster_user1
- 从节点中止Agent
[root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-agent stop
- 主节点中止Server
[root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-server stop
启动集群步骤
- 主节点启动MySQL
[user1@cdh-server]$ ./bin/mysqld --defaults-file=/home/user1/mysql-5.6.26/support-files/my.cnf --basedir=/home/user1/mysql-5.6.26 --datadir=/home/user1/mysql-5.6.26/data > mysql.log 2>&1 & [user1@cdh-server]$ ps -a | grep mysql
- 从节点启动Agent
[root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-agent start Starting cloudera-scm-agent: [ OK ] [root@cdh-server user1]# tail -f ./cm-5.3.6/log/cloudera-scm-agent/cloudera-scm-agent.log
- 主节点启动Server
[root@cdh-server user1]# ./cm-5.3.6/etc/init.d/cloudera-scm-server start Starting cloudera-scm-server: [ OK ] [root@cdh-server user1]# tail -f ./cm-5.3.6/log/cloudera-scm-server/cloudera-scm-server.log
- 启动各服务
登录Cloudera Manager http://192.168.180.173:7180/,进行各服务的检查启动。
题外话–论讲究
讲究 是一种情怀,也是一项技能。当处于卖方市场时,公司可能不须要太多的讲究就能够赚得盆满钵盈,然而若是它有更高的追求也就是情怀,可能会更讲究,好比更加注重细节、用户体验或者极力完美;而处于买方市场时,供需关系会使得公司不得不讲究起来,这对他们来讲是一项必须技能。sql
讲究 既是推进人类社会发展的动力,也是人类社会发展的成果。当前不少互联网公司都处于买方市场,谁能赢得用户谁就拼得胜利,如履薄冰、当心翼翼、极致体验之讲究将他们推向了社会发展的浪尖,先进技术的发明和使用也是层出不穷。json
讲究 不光体现于公司,也体现于我的、地域、国家等。愈是讲究的国家,愈是发达;讲究细节的公司,竞争力就越强。然而因为影响因素众多,讲究的个体有钱、赢钱与不然另当别论了。api
做者 @王安琪
aitanjupt@hotmail.com 2015 年 10月 20日