Apache Hadoop -2.2.0 - How to Install a Three Nodes Cluster

http://tonylixu.blogspot.ca/2014/02/apache-hadoop-how-to-install-three.html

Centos 6.5 hadoop 2.2.0 全分布式安装

http://xjliao.me/2014/03/21/hadoop-2.2.0-cluster-setup.html

==============================

cluster: n0,n1,n2

n0:NameNode,ResourceManager ;

n1.n2:DataNode,NodeManager;

1. prerequiration

1.1 添加用户hm

#useradd hm

#passwd hm

1.2 jdk 1.6/1.7

Remove OpenJDK.
    yum -y remove  *jdk*
    yum -y remove  *java*

1.3 ssh 无密码登录

  1.所有机器: 使用hm用户登录
$cd /home/hm
$mkdir .ssh 2. 在namenode上生成密钥对
$ ssh-keygen -t dsa -P '' -f ~/.ssh/id_dsa
$ cat ~/.ssh/id_dsa.pub >> ~/.ssh/authorized_keys
2.1 .ssh目录要设成700 有执行权限
2.2 authorized_keys要设成600 否则会出错
2.3 还有ssh 登陆要加入用户名的 比如(需要密码)
$ssh n1
$ssh n2 3. 复制公钥(需要密码)
$cd .ssh
$scp authorized_keys n1:/home/hm/.ssh
$scp authorized_keys n2:/home/hm/.ssh
4.测试 (!!不需要密码)
ssh n1
ssh n2

2. hadoop 通用配置

2.1 hadoop-env.sh

2.2 slave 工作节点

3. hadoop四大组件配置

3.1 组件core-site.xml

<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration>
<property>
<name>fs.defaultFS</name>
<value>hdfs://n0:9000</value>
</property>
<property>
<name>io.file.buffer.size</name>
<value>131072</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>file:/home/hm/temp</value>
</property>
<property>
<name>hadoop.proxyuser.hm.hosts</name>
<value>*</value>
</property>
<property>
<name>hadoop.proxyuser.hm.groups</name>
<value>*</value>
</property>
</configuration>

3.2 组件 hdfs-site.xml

<?xml version="1.0" encoding="UTF-8"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration>
<property>
<name>dfs.namenode.secondary.http-address</name>
<value>n0:9001</value>
</property>
<property>
<name>dfs.namenode.name.dir</name>
<value>file:/home/hm/dfs/name</value>
</property>
<property>
<name>dfs.datanode.data.dir</name>
<value>file:/home/hm/dfs/data</value>
</property>
<property>
<name>dfs.replication</name>
<value>2</value>
</property>
<property>
<name>dfs.webhdfs.enabled</name>
<value>true</value>
</property>
</configuration>

3.3 组件yarn-site.xml

<?xml version="1.0"?>

<configuration>

<!-- Site specific YARN configuration properties -->
<property>
<name>yarn.nodemanager.aux-services</name>
<value>mapreduce_shuffle</value>
</property>
<property>
<name>yarn.nodemanager.aux-services.mapreduce.shuffle.class</name>
<value>org.apache.hadoop.mapred.ShuffleHandler</value>
</property>
<property>
<name>yarn.resourcemanager.address</name>
<value>n0:8032</value>
</property>
<property>
<name>yarn.resourcemanager.scheduler.address</name>
<value>n0:8030</value>
</property>
<property>
<name>yarn.resourcemanager.resource-tracker.address</name>
<value>n0:8031</value>
</property>
<property>
<name>yarn.resourcemanager.admin.address</name>
<value>n0:8033</value>
</property>
<property>
<name>yarn.resourcemanager.webapp.address</name>
<value>n0:8088</value>
</property>
</configuration>

3.4 组件mapred-site.xml

<?xml version="1.0"?>
<?xml-stylesheet type="text/xsl" href="configuration.xsl"?> <configuration>
<property>
<name>mapreduce.framework.name</name>
<value>yarn</value>
</property>
<property>
<name>mapreduce.jobhistory.address</name>
<value>n0:10020</value>
</property>
<property>
<name>mapreduce.jobhistory.webapp.address</name>
<value>n0:19888</value>
</property>
</configuration>

4. 启动和停止

4.1 启动

sbin/start-dfs.sh

sbin/start-yarn.sh

4.2 停止

sbin/stop-dfs.sh

sbin/stop-yarn.sh

5.测试

运行wordcount单词计数案例:

$ mkdir input
$ cat > input/file
This is word count example
using hadoop 2.2.0

将目录加入hadoop:

$ bin/hadoop hdfs -copyFromLocal input /input

在HADOOP_HOME运行wordcount案例::
$ bin/hadoop jar share/hadoop/mapreduce/hadoop-mapreduce-examples-2.3.0.jar wordcount /input /output
检查输出:
$ bin/hadoop dfs -cat /out/*

===================

最新文章

  1. haproxy利用ACL规则封禁自定义IP地址拒绝访问
  2. window 安装grunt
  3. 安装恢复Windows10各种坑
  4. 【MySQL】MySQL忘记root密码解决方案
  5. java中String,StringBuffer,StringBuilder之间的区别
  6. 用count(*)还是count(列名) || Mysql中的count()与sum()区别
  7. 保留你的dSYM文件
  8. HBase 事务和并发控制机制原理
  9. PNPOLY - Point Inclusion in Polygon W. Randolph Franklin
  10. 视频编码器评测系统:VideoCodecRank
  11. 【PAT】A1034Head of a Gang
  12. UVa LA 4094 WonderTeam 构造 难度: 1
  13. Python第八章(北理国家精品课 嵩天等)
  14. ubuntu下编译小知识点
  15. Servlet学习记录3
  16. maven项目引入依赖之后,jar包没有自动导入报错
  17. shell脚本实例-nginx日志分析
  18. 关于hibernate中的session与数据库连接关系以及getCurrentSession 与 openSession() 的区别
  19. Win10巧用自带输入法轻松打出特殊字符
  20. leetCode题解之旋转数字

热门文章

  1. oracle数据库的一个表中,怎么设置字段的默认值
  2. for和while循环的区别
  3. P2519 [HAOI2011]problem a
  4. ComponentOne 2017 V1 发布
  5. 4、iptables扩展匹配及网络防火墙功能
  6. Win10远程桌面可能是由于CredSSP加密Oracle修正
  7. Java单例设计模式(实现Java的一个类只有一个对象)
  8. 网页中Div的打印
  9. 基于 Python 和 Pandas 的数据分析(3) --- 输入/输出 基础
  10. _instance_reset