大数据平台搭建:Hadoop

To construct big data distributed platform based on Hadoop is a common method. Hadoop comes fron Google File System  and is its open source realization. Here list the references for studying big data technology, especially on Hadoop. apache

基于Hadoop进行大数据分布式平台搭建是经常使用的方法,源于谷歌的GFS,为其开源实现。此处总结了学习大数据技术相关参考资料,尤为是Hadoop环境搭建时的参考文献,供你们参考,在后续学习过程当中若发现更好的参考文献,会不断更新完善。分布式

参考资料:ide

1.《大数据技术原理与应用—概念、存储、处理、分析与应用》oop

(林子雨 编著,人民邮电出版社,2017年2月第2版)学习

2. Hadoop: The Definitive Guide, Tom White, 4th Edition, 2015.4.大数据

(http://vdisk.weibo.com/s/u5ntMYF7_5pe)ui

3. https://www.tutorialspoint.com/hadoop/index.htmspa

(Introduced the basic knowledge about the basic concept of big data,and mainly focus on the environment setup  of Hadoop in detail) orm

4. http://www.apache.org/htm

"The Apache Software Foundation is a cornerstone of the modern Open Source software ecosystem – supporting some of the most widely used and important software solutions powering today's Internet economy." – Mark Driver, Research Vice President, Gartner

 From Apache Project List you can find most thing about big data technologyfor example HadoopSparkMahout,  ZooKeeper,  Sqoop, Pig,  Hive,  Hbase , Flume and so on. You can download the file data what you want,  and then learn to install the software based on the guide. This is the basic requirement to study big data technology.  

5. https://en.wikipedia.org/wiki/Big_data

相关文章
相关标签/搜索