CentOS-6.7 编译 hadoop-2.6.5

(只需输入红色命令即可)

目录

1、阅读编译文档 .............................................................................................................................1

2、准备编译环境 .............................................................................................................................2

3、安装 gcc .....................................................................................................................................2

4、安装 gcc-c++ ..............................................................................................................................2

5、安装 make ..................................................................................................................................3

6、安装 maven(必须) ................................................................................................................. 3

7、安装 jdk(必须) ........................................................................................................................4

8、安装 ant(重要) ........................................................................................................................4

9、安装 findbugs(可选,最好装) ..................................................................................................5

10、安装 cmake(重要) .................................................................................................................5

11、安装 protobuf(重要) ..............................................................................................................5

12、安装 Snappy .............................................................................................................................6

13、编译 hadoop .............................................................................................................................6

1 、 阅读编译文档

(1)准备一个 hadoop 源码包,我选择的 hadoop 版本是:hadoop-2.6.5-src.tar.gz,在hadoop-2.6.5 的源码包的根目录下有一个文档叫做 BUILDING.txt,这其中说明了编译

hadoop 所需要的一些编译环境相关的东西。不同的 hadoop 版本的要求都不一样。对应的版本参照 BUILDING.txt

    请仔细阅读:

 

Requirements:
 
    * Unix System
    * JDK 1.6+
    * Maven 3.0 or later
    * Findbugs 1.3.9 (if running findbugs)
    * ProtocolBuffer 2.5.0
    * CMake 2.6 or newer (if compiling native code), must be 3.0 or newer on Mac
    * Zlib devel (if compiling native code)
    * openssl devel ( if compiling native hadoop-pipes )
    * Internet connection for first build (to fetch all Maven and Hadoop dependencies

)

  (2)对应以上需求,我们准备好所要求版本的这些软件

      1. 准备一台 Unix 类型操作系统,在这里我们选用的是 CentOS-6.7,初次编译要求必须

 联网,切记

 以下这些东西都是需要的,详细安装在下面,这里只介绍我准备这些软件所选择的版本

    

2. 安装 openssl-devel
      3. 安装 gcc
      4. 安装 gcc-c++
      5. JDK:jdk-7u80-linux-x64.tar.gz
      6. Maven:apache-maven-3.3.3-bin.tar.gz
      7. Ant:apache-ant-1.9.4-bin.tar.gz
      8. FindBugs:findbugs-3.0.0.tar.gz
      9. Cmake:cmake-2.8.12.2.tar.gz
      10. Protobuf:protobuf-2.5.0.tar.gz
      11. Snappy:snappy-1.1.1.tar.gz

2 、 准备编译环境

  安装以下软件:

yum -y install svn

  yum -y install autoconf automake libtool cmake

  yum -y install ncurses-devel

  yum -y install openssl-devel

3 、 安装 gcc

先使用命令检测一下看 gcc 是否已经安装过了

gcc -v

 

  gcc version 4.4.7 20120313 (Red Hat 4.4.7-16) (GCC)

  如果最后一行出现如上的 gcc 版本信息日志,表示已经安装成功过了。

  不然使用命令安装:

yum install gcc

4 、 安装 gcc-c++

直接使用命令安装:yum install gcc-c++

 

5 、 安装 make

make -version

  

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop

yum install -y make

6 、 安装 maven (必须)

  (1) 解压

tar -zxvf /root/soft/apache-maven-3.3.3-bin.tar.gz -C /root/apps/
  (2)修改配置文件(如果需要更改默认的 maven 仓库路径的话)
/root/apps/apache-maven-3.3.3/conf
    2. 修改配置文件 settings.xml

      

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop_02

    在配置文件的中部找到 localRepository 这个标签,它本来是注释了的,并且有一个默认仓库路径,我们最好自己设置一个,

所以我自己加了一个,我的路径是:<localRepository>/root/mavenrepo/</localRepository>
    3. 配置环境变量
export M2_HOME=/root/apps/apache-maven-3.3.3
      export PATH=$PATH:$M2_HOME/bin
    4. 检测是否成功
source /etc/profile
      mvn -version

      

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop_03

7 、 安装 jdk (必须)

  1. 解压

tar -zxvf /root/soft/jdk-7u80-linux-x64.tar.gz -C /root/apps/
  2. 配置环境变量
export JAVA_HOME=/root/apps/jdk1.7.0_80
    export PATH=$PATH:$JAVA_HOME/bin
    export CLASSPATH=.:/root/apps/jdk1.7.0_80/lib/dt.jar:/root/apps/jdk1.7.0_80/lib/tools.jar
  3. 检测是否安装
source /etc/profile
    java -version

    

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop_04

8 、 安装 ant (重要)

  1. 解压

tar -zxvf /root/soft/apache-ant-1.9.4-bin.tar.gz -C /root/apps/
  2. 配置环境变量
 export ANT_HOME=/root/apps/apache-ant-1.9.4
    export PATH=$PATH:$ANT_HOME/bin
  3. 检测是否安装成功
source /etc/profile
    ant -version

    

hadoop安装完成后可执行文件一般位于哪个目录中_hadoop_05

9 、 安装 findbugs (可选 ,最好装 )

1. 解压
tar -zxvf /root/soft/findbugs-3.0.0.tar.gz -C /root/apps/
  2. 配置环境变量
export FINDBUGS_HOME=/root/apps/findbugs-3.0.0
    export PATH=$PATH:$FINDBUGS_HOME/bin
  3. 检测是否安装成功
 findbugs -version

    

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop_06

10 、 安装 cmake (重要)

1. 解压
 tar -zxvf /root/soft/cmake-2.8.12.2.tar.gz -C /root/apps/
  2. 安装
    首先进入到根目录:
cd /root/apps/cmake-2.8.12.2/
    然后依次执行以下命令:
./bootstrap
    gmake & gmake install
  3. 检测是否安装成功
cmake -version

    

hadoop安装完成后可执行文件一般位于哪个目录中_hadoop_07

11 、 安装 protobuf (重要)

  1. 解压

tar -zxvf /root/soft/protobuf-2.5.0.tar.gz -C /root/apps/
  2. 安装
    首先进入到根目录:
cd /root/apps/protobuf-2.5.0/
    然后依次执行以下命令:
 ./configure --prefix=/root/apps/protobuf (表示安装到这个目录)
    make
    make check
    make install
  3. 配置环境变量
export PROTOBUF_HOME=/root/apps/protobuf
    export PATH=$PATH:$PROTOBUF_HOME/bin
  注意:PROTOBUF_HOME 就是我们在执行 configure 命令的时候指定的目录
  4. 检测是否安装成功
protoc --version

    

hadoop安装完成后可执行文件一般位于哪个目录中_Hadoop_08

12 、 安装 Snappy

  1. 使用 root 用户安装 Snappy:

tar -zxvf /root/soft/snappy-1.1.1.tar.gz -C /root/apps/
  2. 编译安装:
cd snappy-1.1.1/
    ./configure
    make
    make install
  3. 查看 snappy 库文件
ls -lh /usr/local/lib |grep snappy

    

hadoop安装完成后可执行文件一般位于哪个目录中_hadoop_09

13 、 编译 hadoop

1. 解压 hadoop 源码包
tar -zxvf /root/soft/hadoop-2.6.5-src.tar.gz -C /root/apps/
  2. 在编译之前防止 java.lang.OutOfMemoryError: Java heap space 堆栈问题,在 centos 系
export MAVEN_OPTS="-Xms256m -Xmx512m"
  3. 进入到源码包根目录下
cd /root/apps/hadoop-2.6.5-src
  4. 执行命令编译
    Create binary distribution with native code and with documentation:
mvn package -Pdist,native,docs -DskipTests –Dtar
  如果中途编译失败,并且不要文档的话,请使用这个命令:
mvn clean package -Pdist,native -DskipTests -Dtar -Dsnappy.lib=/usr/local/lib -Dbundle.snappy -Drequire.openssl
tar 和 dist包放到 hadoop-dist 目录下,
native 和 docs包到该.tar.gz 文件中,skipTests

 

  5. 静静等待编译……. 第一次编译预估一个小时左右。

 

  6. 编译成功了的话,最后的日志信息:
 

[INFO] Reactor Summary:
  [INFO]
  [INFO] Apache Hadoop Main ................................. SUCCESS [05:52 min]
  [INFO] Apache Hadoop Project POM .......................... SUCCESS [02:23 min]
  [INFO] Apache Hadoop Annotations .......................... SUCCESS [01:11 min]
  [INFO] Apache Hadoop Assemblies ........................... SUCCESS [ 0.267 s]
  [INFO] Apache Hadoop Project Dist POM ..................... SUCCESS [01:08 min]
  [INFO] Apache Hadoop Maven Plugins ........................ SUCCESS [01:13 min]
  [INFO] Apache Hadoop MiniKDC .............................. SUCCESS [06:36 min]
  [INFO] Apache Hadoop Auth ................................. SUCCESS [04:32 min]
  [INFO] Apache Hadoop Auth Examples ........................ SUCCESS [ 22.255 s]
  [INFO] Apache Hadoop Common ............................... SUCCESS [08:12 min]
  [INFO] Apache Hadoop NFS .................................. SUCCESS [ 8.060 s]
  [INFO] Apache Hadoop KMS .................................. SUCCESS [01:37 min]
  [INFO] Apache Hadoop Common Project ....................... SUCCESS [ 0.039 s]
  [INFO] Apache Hadoop HDFS ................................. SUCCESS [05:54 min]
  [INFO] Apache Hadoop HttpFS ............................... SUCCESS [01:24 min]
  [INFO] Apache Hadoop HDFS BookKeeper Journal .............. SUCCESS [01:11 min]
  [INFO] Apache Hadoop HDFS-NFS ............................. SUCCESS [ 4.370 s]
  [INFO] Apache Hadoop HDFS Project ......................... SUCCESS [ 0.048 s]
  [INFO] hadoop-yarn ........................................ SUCCESS [ 0.040 s]
  [INFO] hadoop-yarn-api .................................... SUCCESS [01:26 min]
  [INFO] hadoop-yarn-common ................................. SUCCESS [02:11 min]
  [INFO] hadoop-yarn-server ................................. SUCCESS [ 0.048 s]
  [INFO] hadoop-yarn-server-common .......................... SUCCESS [ 32.832 s]
  [INFO] hadoop-yarn-server-nodemanager ..................... SUCCESS [03:11 min]
  [INFO] hadoop-yarn-server-web-proxy ....................... SUCCESS [ 2.603 s]
  [INFO] hadoop-yarn-server-applicationhistoryservice ....... SUCCESS [ 6.006 s]
  [INFO] hadoop-yarn-server-resourcemanager ................. SUCCESS [ 19.331 s]
  [INFO] hadoop-yarn-server-tests ........................... SUCCESS [ 5.008 s]
  [INFO] hadoop-yarn-client ................................. SUCCESS [ 7.812 s]
  [INFO] hadoop-yarn-applications ........................... SUCCESS [ 0.049 s]
  [INFO] hadoop-yarn-applications-distributedshell .......... SUCCESS [ 2.211 s]
  [INFO] hadoop-yarn-applications-unmanaged-am-launcher ..... SUCCESS [ 1.773 s]
  [INFO] hadoop-yarn-site ................................... SUCCESS [ 0.033 s]
  [INFO] hadoop-yarn-registry ............................... SUCCESS [ 4.927 s]
  [INFO] hadoop-yarn-project ................................ SUCCESS [ 2.988 s]
  [INFO] hadoop-mapreduce-client ............................ SUCCESS [ 0.052 s]
  [INFO] hadoop-mapreduce-client-core ....................... SUCCESS [ 22.567 s]
  [INFO] hadoop-mapreduce-client-common ..................... SUCCESS [ 16.208 s]
  [INFO] hadoop-mapreduce-client-shuffle .................... SUCCESS [ 4.122 s]
  [INFO] hadoop-mapreduce-client-app ........................ SUCCESS [ 9.984 s]
  [INFO] hadoop-mapreduce-client-hs ......................... SUCCESS [ 7.873 s]
  [INFO] hadoop-mapreduce-client-jobclient .................. SUCCESS [ 31.351 s]
  [INFO] hadoop-mapreduce-client-hs-plugins ................. SUCCESS [ 1.769 s]
  [INFO] Apache Hadoop MapReduce Examples ................... SUCCESS [ 5.106 s]
  [INFO] hadoop-mapreduce ................................... SUCCESS [ 2.335 s]
  [INFO] Apache Hadoop MapReduce Streaming .................. SUCCESS [ 11.850 s]
  [INFO] Apache Hadoop Distributed Copy ..................... SUCCESS [ 33.553 s]
  [INFO] Apache Hadoop Archives ............................. SUCCESS [ 1.983 s]
  [INFO] Apache Hadoop Rumen ................................ SUCCESS [ 5.720 s]
  [INFO] Apache Hadoop Gridmix .............................. SUCCESS [ 4.326 s]
  [INFO] Apache Hadoop Data Join ............................ SUCCESS [ 2.572 s]
  [INFO] Apache Hadoop Ant Tasks ............................ SUCCESS [ 1.878 s]
  [INFO] Apache Hadoop Extras ............................... SUCCESS [ 2.936 s]
  [INFO] Apache Hadoop Pipes ................................ SUCCESS [ 6.255 s]
  [INFO] Apache Hadoop OpenStack support .................... SUCCESS [ 4.872 s]
  [INFO] Apache Hadoop Amazon Web Services support .......... SUCCESS [05:32 min]
  [INFO] Apache Hadoop Client ............................... SUCCESS [ 4.333 s]
  [INFO] Apache Hadoop Mini-Cluster ......................... SUCCESS [ 0.094 s]
  [INFO] Apache Hadoop Scheduler Load Simulator ............. SUCCESS [ 3.894 s]
  [INFO] Apache Hadoop Tools Dist ........................... SUCCESS [ 6.618 s]
  [INFO] Apache Hadoop Tools ................................ SUCCESS [ 0.047 s]
  [INFO] Apache Hadoop Distribution ......................... SUCCESS [ 13.077 s]
  [INFO] ------------------------------------------------------------------------
  [INFO] BUILD SUCCESS
  [INFO] ------------------------------------------------------------------------
  [INFO] Total time: 01:01 h
  [INFO] Finished at: 2016-12-31T01:43:51-08:00
  [INFO] Final Memory: 93M/395M
  [INFO] ------------------------------------------------------------------------

  7. 编译成功之后,hadoop-2.6.5.tar.gz 位于/root/apps/hadoop-2.6.5-src/hadoop-dist/target 目录下,这是编译后该文件夹的状态:

    

hadoop安装完成后可执行文件一般位于哪个目录中_Apache_10

 

至此,大功告成。恭喜。