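To learn Hadoop you first have to sharpen your tools, which means getting Hadoop installed. Hadoop can run in three modes: local mode, pseudo-distributed mode, and cluster mode. Local and pseudo-distributed modes are mainly used to test programs on a single machine, while cluster mode is used for real workloads. Local and pseudo-distributed modes are very similar and differ only slightly in how Hadoop is configured. This post starts with the installation steps for pseudo-distributed mode.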
1. Installing the runtime environment
1) Ubuntu Linux 12.04. Installation is straightforward, so it is not covered here.
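2) Install Java 1.6 or later
# Java environment variables (this JDK lives under /usr/lib/jdk1.6.0_37)
export JAVA_HOME=/usr/lib/jdk1.6.0_37
export JRE_HOME=/usr/lib/jdk1.6.0_37/jre
export CLASSPATH=.:$JAVA_HOME/lib:$JRE_HOME/lib:$CLASSPATH
export PATH=$JAVA_HOME/bin:$JRE_HOME/bin:$PATH
4' Reboot the machine, then type java -version in a shell. If it reports java version "1.6.0_37", the Java environment is installed correctly.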
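The java -version check can also be scripted rather than read by eye. The following is only a sketch: the function name java_version_ok is made up for this post, and it assumes the version is reported in the quoted form shown above (java version "1.6.0_37").

```shell
#!/usr/bin/env bash
# java_version_ok LINE
# LINE is the first line printed by `java -version` (hypothetical helper).
# Succeeds when the quoted version is 1.6 or newer.
java_version_ok() {
  local ver major minor
  # Pull out the text between the double quotes after the word "version".
  ver=$(printf '%s\n' "$1" | sed -n 's/.*version "\([^"]*\)".*/\1/p')
  [ -n "$ver" ] || return 1
  major=${ver%%.*}          # text before the first dot
  minor=${ver#*.}           # text after the first dot
  minor=${minor%%.*}        # keep only the second component
  major=${major%%[!0-9]*}   # drop any non-numeric suffix
  minor=${minor%%[!0-9]*}
  [ -n "$major" ] || return 1
  if [ "$major" -gt 1 ]; then return 0; fi
  [ "$major" -eq 1 ] && [ "${minor:-0}" -ge 6 ]
}
```

Because java -version writes to standard error, a call would look something like java_version_ok "$(java -version 2>&1 | head -n 1)" && echo "Java is new enough".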
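# Create a dedicated hadoop group and an hduser account in it
sudo addgroup hadoop
sudo adduser --ingroup hadoop hduser
# Switch to hduser and create a passphrase-less RSA key for SSH logins to localhost
su - hduser
ssh-keygen -t rsa -P ""
cat ~/.ssh/id_rsa.pub >> ~/.ssh/authorized_keys
Once this is configured, run ssh localhost to test it.
6) Add a directory for Hadoop to use by running the following commands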
sudo mkdir -p /usr/local/app/hadoop/tmp
sudo chown hduser:hadoop /usr/local/app/hadoop/tmp
sudo chmod 750 /usr/local/app/hadoop/tmp
2. Installing Hadoop
cd /usr/local
sudo tar xzf /replace-this-with-the-directory-holding-your-hadoop-download/hadoop-1.0.4.tar.gz
sudo mv hadoop-1.0.4 hadoop
sudo chown -R hduser:hadoop hadoop
3) Edit the shell startup file ~/.bashrc and add the following lines
export HADOOP_HOME=/usr/local/hadoop
export JAVA_HOME=/usr/lib/jdk1.6.0_37
export PATH=$PATH:$HADOOP_HOME/bin
# Shorthand aliases for a few frequently used commands
unalias fs &>/dev/null
alias fs="hadoop fs"
unalias hls &>/dev/null
alias hls="fs -ls"
unalias start-all &>/dev/null
alias start-all="/usr/local/hadoop/bin/start-all.sh"
unalias stop-all &>/dev/null
alias stop-all="/usr/local/hadoop/bin/stop-all.sh"
unalias start-dfs &>/dev/null
alias start-dfs="/usr/local/hadoop/bin/start-dfs.sh"
unalias stop-dfs &>/dev/null
alias stop-dfs="/usr/local/hadoop/bin/stop-dfs.sh"
unalias start-mapred &>/dev/null
alias start-mapred="/usr/local/hadoop/bin/start-mapred.sh"
unalias stop-mapred &>/dev/null
alias stop-mapred="/usr/local/hadoop/bin/stop-mapred.sh"
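The six start/stop aliases above all follow the same pattern, so they could also be generated in a loop. This is only a sketch of an equivalent form, not something the original setup requires; it assumes the same /usr/local/hadoop install path, and the loop variable name is arbitrary.

```shell
#!/usr/bin/env bash
# Fall back to the install path used in this post if HADOOP_HOME is not set.
HADOOP_HOME=${HADOOP_HOME:-/usr/local/hadoop}

# One alias per control script, e.g. start-dfs -> $HADOOP_HOME/bin/start-dfs.sh
for script in start-all stop-all start-dfs stop-dfs start-mapred stop-mapred; do
  unalias "$script" 2>/dev/null || true   # ignore "not found" on first load
  alias "$script=$HADOOP_HOME/bin/$script.sh"
done
```

The explicit list in ~/.bashrc is easier to scan, while the loop is easier to extend if more scripts need a shortcut.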
<?xml version="1.0"?>
<!-- /usr/local/hadoop/conf/core-site.xml -->
<configuration>
<property>
<name>fs.default.name</name>
<value>hdfs://localhost:54310</value>
</property>
<property>
<name>hadoop.tmp.dir</name>
<value>/usr/local/app/hadoop/tmp</value>
</property>
</configuration>
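The temporary directory named in this file (/usr/local/app/hadoop/tmp) has to exist and be writable by the user that runs Hadoop, otherwise formatting or starting HDFS is likely to fail. A minimal pre-flight check might look like the sketch below; the function name check_tmp_dir is invented for illustration.

```shell
#!/usr/bin/env bash
# check_tmp_dir DIR
# Hypothetical helper: reports whether DIR exists and is writable
# by the user running this script.
check_tmp_dir() {
  local dir=$1
  [ -d "$dir" ] || { echo "missing: $dir" >&2; return 1; }
  [ -w "$dir" ] || { echo "not writable: $dir" >&2; return 1; }
  echo "ok: $dir"
}
```

Run as hduser, the call would be check_tmp_dir /usr/local/app/hadoop/tmp.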
<?xml version="1.0"?>
<!-- /usr/local/hadoop/conf/hdfs-site.xml -->
<configuration>
<property>
<name>dfs.replication</name>
<value>1</value>
</property>
</configuration>
<?xml version="1.0"?>
<!-- /usr/local/hadoop/conf/mapred-site.xml -->
<configuration>
<property>
<name>mapred.job.tracker</name>
<value>localhost:54311</value>
</property>
</configuration>
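After editing the configuration files it can be handy to read a property back and confirm the value is what was intended. The helper below is a rough sketch, not a real XML parser: get_prop is a made-up name, and it assumes each name and value sits on one line, as in the files above.

```shell
#!/usr/bin/env bash
# get_prop FILE NAME
# Hypothetical helper: prints the <value> that follows <name>NAME</name> in FILE.
# Relies on the simple one-tag-per-line layout used in this post.
get_prop() {
  awk -v name="$2" '
    index($0, "<name>" name "</name>") { found = 1 }
    found && match($0, /<value>[^<]*<\/value>/) {
      # Strip the 7-character <value> and the 8-character </value>.
      print substr($0, RSTART + 7, RLENGTH - 15)
      exit
    }
  ' "$1"
}
```

For example, get_prop /usr/local/hadoop/conf/core-site.xml fs.default.name should print hdfs://localhost:54310 with the configuration shown here.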
hadoop namenode -format
start-all
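Before uploading any data it is worth confirming that all five daemons came up. In a pseudo-distributed Hadoop 1.x setup, start-all.sh should leave NameNode, DataNode, SecondaryNameNode, JobTracker and TaskTracker running, and the JDK's jps tool lists them. The sketch below takes jps output as text and prints whichever daemons are absent; missing_daemons is a name chosen for this example.

```shell
#!/usr/bin/env bash
# missing_daemons JPS_OUTPUT
# Hypothetical helper: JPS_OUTPUT is the text printed by `jps`
# (lines of "PID ClassName"). Prints the expected daemons that are not listed.
missing_daemons() {
  local jps_output=$1 d missing=""
  for d in NameNode DataNode SecondaryNameNode JobTracker TaskTracker; do
    # Compare the second column exactly so that NameNode
    # is not satisfied by SecondaryNameNode.
    printf '%s\n' "$jps_output" \
      | awk -v d="$d" '$2 == d { ok = 1 } END { exit !ok }' \
      || missing="$missing $d"
  done
  printf '%s\n' "${missing# }"
}
```

A call such as missing_daemons "$(jps)" prints an empty line when everything is running; any name it does print points at the log file to look into under /usr/local/hadoop/logs.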
hadoop fs -put /usr/local/hadoop/conf /input
2' Run a Hadoop job
hadoop jar /usr/local/hadoop/hadoop-examples-1.0.4.jar wordcount /input /output
If the job completes successfully, the installation is working.
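To sanity-check the result, a small local word count can be compared with what the job wrote under /output. This is a rough equivalent only: local_wordcount is a name invented here, it splits on any whitespace, and it sorts in byte order, which should be close to but is not guaranteed to match the example job in every edge case.

```shell
#!/usr/bin/env bash
# local_wordcount
# Hypothetical helper: reads text on stdin and prints "word<TAB>count",
# one word per line, sorted by word in byte order.
local_wordcount() {
  tr -s '[:space:]' '\n' \
    | grep -v '^$' \
    | LC_ALL=C sort \
    | uniq -c \
    | awk '{ print $2 "\t" $1 }'
}
```

One way to use it is cat /usr/local/hadoop/conf/* | local_wordcount > local.txt, followed by a diff against the job output fetched with hadoop fs -cat (the part file is typically named part-r-00000, but check with hadoop fs -ls /output first).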
stop-all