1、安装JDK、Scalajava
2、安装zookeepernode
3、安装Hadoopoop
4、安装Sparkurl
一、修改spark/conf/spark-env.sh spa
export JAVA_HOME=/usr/java/jdk1.8.0_65
export SCALA_HOME=/usr/scala-2.11.8
export HADOOP_HOME=/usr/hadoop-2.7.2
export HADOOP_CONF_DIR=/usr/hadoop-2.7.2/etc/hadoop
export SPARK_MASTER_IP=node1
export SPARK_DAEMON_JAVA_OPTS="-Dspark.deploy.recoveryMode=ZOOKEEPER -Dspark.deploy.zookeeper.url=node1:2181,node2:2181,node3:2181 -Dspark.deploy.zookeeper.dir=/spark"
export SPARK_WORKER_MEMORY=1g
export SPARK_EXECUTOR_MEMORY=1g
export SPARK_DRIVER_MEMORY=1G
export SPARK_WORKER_CORES=2scala
二、修改 spark/conf/slaves server
node2
node3
node4three
三、修改 spark/conf/spark-defaults.confhadoop
spark.executor.extraJavaOptions -XX:+PrintGCDetails -Dkey=value -Dnumbers="one two three"
spark.eventLog.enabled true
spark.eventLog.dir hdfs://mycluster/historyServerforSpark
spark.yarn.historyServer.address node1:18080
spark.history.fs.logDirectory hdfs://mycluster/historyServerforSparkspark
四、须要到hdfs 系统上建立/historyServerforSpark目录
五、复制到各个机器上
六、启动spark集群和启动history-serve
./start-all.sh
./start-history-server.sh
PS:其余机器的master须要在其余机器运行./start-master.sh