环境:centos + hadoop2.5.2 +scala-2.10.5 + spark1.3.1
html
一、从http://spark.apache.org/downloads.html 下载编译好的spark java
二、准备scalashell
从http://www.scala-lang.org/ 下载scala-2.10.5.rpm。不下载2.11,由于下载2.11要从新编译sparkapache
安装scala rpm -ivh scala-2.10.5.rpm
三、解压spark
centos
tar -zxvf spark-1.3.1-bin-hadoop2.4.tar.gz
四、配置环境变量oop
在/etc/profile最后面增长 export SPARK_HOME=/usr/local/spark-1.3.1-bin-hadoop2.4 export PATH=$PATH:$SPARK_HOME/bin # 生效 source /etc/profile
五、配置sparkspa
vi /usr/local/spark-1.3.1-bin-hadoop2.4/conf/spark-env.sh 在最后面增长: export JAVA_HOME=/usr/java/jdk1.7.0_76 export SPARK_MASTER_IP=192.168.1.21 export SPARK_WORKER_MEMORY=2g export HADOOP_CONF_DIR=/usr/local/hadoop-2.5.2/etc/hadoop
六、配置slave节点scala
vi /usr/local/spark-1.3.1-bin-hadoop2.4/conf/slaves master slaver1
七、复制配置文件到slave节点code
scp -r /usr/local/spark-1.3.1-bin-hadoop2.4/ root@slaver1:/usr/local/
八、启动集群htm
cd /usr/local/spark-1.3.1-bin-hadoop2.4/sbin ./start-all.sh
九、查看集群是否启动成功
jps # master 查看是否有:Master,Worker # slaver 查看是否有:Worker