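Setting Up a Hadoop Pseudo-Distributed Cluster

Reference: https://www.jianshu.com/p/1352ce8c8d73
Written from memory, so a few steps may be missing.

========== Environment ==========
CentOS 7 + hadoop-2.7.7

Network: host-only adapter + shared network

IP: 192.168.137.91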
[root@vm91 ~]# ifconfig enp0s3
enp0s3: flags=4163<UP,BROADCAST,RUNNING,MULTICAST>  mtu 1500
        inet 192.168.137.91  netmask 255.255.255.0  broadcast 192.168.137.255

Hostname: vm91
[root@vm91 ~]# hostnamectl 
   Static hostname: vm91
   
[root@vm91 ~]# vim /etc/hosts
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
127.0.0.1   vm91
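
The Hadoop configuration further down refers to the machine as vm91, so it can help to confirm what that name maps to before going on. The following is only a minimal sketch: `lookup_host` is a hypothetical helper written for this note, not part of any tool.

```shell
# Print the address that a hosts-format file maps a hostname to.
# Arguments: $1 = hostname, $2 = hosts file (defaults to /etc/hosts).
lookup_host() {
    awk -v h="$1" '
        /^[[:space:]]*#/ { next }    # skip comment lines
        {
            # Field 1 is the address, the remaining fields are names.
            for (i = 2; i <= NF; i++)
                if ($i == h) { print $1; exit }
        }
    ' "${2:-/etc/hosts}"
}

# Example: lookup_host vm91
```

On a real system, `getent hosts vm91` answers the same question through the system resolver.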

Java version
[root@vm91 ~]# echo $JAVA_HOME 
/usr/local/jdk1.8.0_201

========== Installation Steps ==========
1. Extract the archive

2. Rename the extracted directory
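
The notes give no commands for steps 1 and 2. A possible sketch, assuming the official release tarball (which unpacks to a directory named hadoop-2.7.7) and the /usr/local/hadoop target shown in step 3. `install_hadoop` is a hypothetical helper and the tarball location in the example is an assumption.

```shell
# Extract a Hadoop tarball under a prefix and rename the versioned
# directory to plain "hadoop".
# Arguments: $1 = tarball, $2 = install prefix, $3 = version string.
install_hadoop() {
    tarball=$1 prefix=$2 version=$3
    tar -xzf "$tarball" -C "$prefix" &&
        mv "$prefix/hadoop-$version" "$prefix/hadoop"
}

# Example (the tarball path is an assumption):
# install_hadoop /root/hadoop-2.7.7.tar.gz /usr/local 2.7.7
```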

3. Configure environment variables in the profile
[root@vm91 ~]# echo $HADOOP_HOME 
/usr/local/hadoop
[root@vm91 ~]# echo $PATH 
/usr/local/hadoop/sbin:/usr/local/hadoop/bin:.....
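
The output above is what the profile should produce. One way to get there is sketched below. `write_hadoop_profile` is a hypothetical helper, and the file under /etc/profile.d is an assumption; appending the same two export lines to /etc/profile would work as well.

```shell
# Append the Hadoop environment variables to a profile script.
# Arguments: $1 = profile file, $2 = Hadoop install directory.
write_hadoop_profile() {
    # \$ keeps the variable references literal in the written file,
    # so they are expanded when the profile is sourced.
    cat >> "$1" <<EOF
export HADOOP_HOME=$2
export PATH=\$HADOOP_HOME/sbin:\$HADOOP_HOME/bin:\$PATH
EOF
}

# Example (the file name under /etc/profile.d is an assumption):
# write_hadoop_profile /etc/profile.d/hadoop.sh /usr/local/hadoop
# source /etc/profile.d/hadoop.sh
```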

4. Edit the configuration files
    a. Set JAVA_HOME in hadoop-env.sh, mapred-env.sh, and yarn-env.sh
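
Editing the three scripts by hand works fine. As a rough sketch, the same change can be scripted. `set_java_home` is a hypothetical helper; it assumes GNU sed and a JDK path without `|` or `&` characters, and it rewrites either an active or a commented-out `export JAVA_HOME=` line.

```shell
# Point a Hadoop *-env.sh script at a specific JDK.
# Arguments: $1 = env script, $2 = JDK directory.
set_java_home() {
    if grep -Eq '^#? *export JAVA_HOME=' "$1"; then
        # Rewrite the existing export line (active or commented out).
        sed -i -E "s|^#? *export JAVA_HOME=.*|export JAVA_HOME=$2|" "$1"
    else
        # No export line at all: append one.
        echo "export JAVA_HOME=$2" >> "$1"
    fi
}

# Example, using the paths from the environment section:
# for f in hadoop-env.sh mapred-env.sh yarn-env.sh; do
#     set_java_home "/usr/local/hadoop/etc/hadoop/$f" /usr/local/jdk1.8.0_201
# done
```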
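    b. Edit the properties in core-site.xml, hdfs-site.xml, and yarn-site.xml

core-site.xml:
<configuration>
    <property>
        <name>hadoop.tmp.dir</name>
        <value>/usr/local/hadoop/data/tmp</value>
    </property>
    <property>
        <name>fs.defaultFS</name>
        <value>hdfs://vm91:9000</value>
    </property>
</configuration>

hdfs-site.xml:
<configuration>
    <property>
        <name>dfs.replication</name>
        <value>1</value>
    </property>
    <property>
        <name>dfs.namenode.name.dir</name>
        <value>file:///usr/local/hadoop/data/dfs/name</value>
    </property>
    <property>
        <name>dfs.datanode.data.dir</name>
        <value>file:///usr/local/hadoop/data/dfs/data</value>
    </property>
    <property>
        <name>dfs.permissions</name>
        <value>false</value>
    </property>
</configuration>

yarn-site.xml:
<configuration>
    <property>
        <name>yarn.resourcemanager.admin.address</name>
        <value>vm91:8033</value>
    </property>
    <property>
        <name>yarn.nodemanager.aux-services</name>
        <value>mapreduce_shuffle</value>
    </property>
    <property>
        <name>yarn.nodemanager.aux-services.mapreduce_shuffle.class</name>
        <value>org.apache.hadoop.mapred.ShuffleHandler</value>
    </property>
    <property>
        <name>yarn.resourcemanager.resource-tracker.address</name>
        <value>vm91:8025</value>
    </property>
    <property>
        <name>yarn.resourcemanager.scheduler.address</name>
        <value>vm91:8030</value>
    </property>
    <property>
        <name>yarn.resourcemanager.address</name>
        <value>vm91:8050</value>
    </property>
    <property>
        <name>yarn.resourcemanager.webapp.address</name>
        <value>vm91:8088</value>
    </property>
    <property>
        <name>yarn.resourcemanager.webapp.https.address</name>
        <value>vm91:8090</value>
    </property>
</configuration>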
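
After editing, a quick read-back of a few keys can catch typos in the site files. The sketch below is a hypothetical helper, `get_prop`, that assumes each `<name>` and `<value>` tag sits on its own line; it is not a general XML parser.

```shell
# Print the <value> that follows a given <name> in a Hadoop *-site.xml.
# Arguments: $1 = site file, $2 = property name.
get_prop() {
    awk -v n="$2" '
        # Remember that the wanted <name> line has been seen.
        index($0, "<name>" n "</name>") { found = 1; next }
        # The next <value> line belongs to that property.
        found && /<value>/ {
            sub(/.*<value>/, "")
            sub(/<\/value>.*/, "")
            print
            exit
        }
    ' "$1"
}

# Example:
# get_prop /usr/local/hadoop/etc/hadoop/core-site.xml fs.defaultFS
```

Once the environment is set up, `hdfs getconf -confKey fs.defaultFS` reports the value Hadoop itself resolves, which is the more authoritative check.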

5. Configure passwordless SSH login
ssh-keygen -t rsa
ssh-copy-id vm91
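
`ssh-keygen -t rsa` prompts for a file name and passphrase. For a scripted setup, a non-interactive variant might look like the sketch below. `ensure_ssh_key` is a hypothetical helper; it uses an empty passphrase and leaves an existing key untouched.

```shell
# Create an RSA key pair with an empty passphrase unless one already exists.
# Argument: $1 = private key file (the public key is written to $1.pub).
ensure_ssh_key() {
    if [ ! -f "$1" ]; then
        mkdir -p "$(dirname "$1")"
        ssh-keygen -q -t rsa -N '' -f "$1"
    fi
}

# Example:
# ensure_ssh_key ~/.ssh/id_rsa
# ssh-copy-id vm91
# ssh -o BatchMode=yes vm91 true && echo "passwordless login works"
```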

6. Start the cluster
start-all.sh
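
One step these notes do not list: on a fresh install the NameNode has to be formatted once with `hdfs namenode -format` before the first start. The sketch below guards that step and then checks the `jps` output for the five daemons a pseudo-distributed setup normally runs. Both function names are hypothetical helpers.

```shell
# Succeed when the NameNode metadata directory has not been formatted yet,
# i.e. it contains no current/VERSION file.
# Argument: $1 = directory from dfs.namenode.name.dir (without file://).
namenode_needs_format() {
    [ ! -f "$1/current/VERSION" ]
}

# Read `jps` output on stdin and print every expected daemon that is absent.
missing_daemons() {
    running=$(awk '{ print $2 }')    # second column is the process name
    for d in NameNode DataNode SecondaryNameNode ResourceManager NodeManager; do
        printf '%s\n' "$running" | grep -qx "$d" || echo "$d"
    done
}

# Example:
# namenode_needs_format /usr/local/hadoop/data/dfs/name && hdfs namenode -format
# start-all.sh
# jps | missing_daemons    # prints nothing when all five daemons are up
```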

7. Test
alias wordcount='hadoop jar /root/hadoop-mapreduce-examples-2.7.7.jar wordcount'
wordcount input/  ./output/00
http://vm91:50070/explorer.html#/output/00
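
The job reads input/ from HDFS, so the input has to be uploaded first, and the output directory must not exist yet. To cross-check the result, the same count can be computed locally. The sketch below is a rough local equivalent, not the Hadoop implementation: `local_wordcount` is a hypothetical helper, words.txt is a hypothetical input file, and the part-r-00000 name assumes a single reducer.

```shell
# Count whitespace-separated words on stdin and print "word<TAB>count",
# sorted by word in byte order.
local_wordcount() {
    tr -s '[:space:]' '\n' | grep -v '^$' | LC_ALL=C sort | uniq -c |
        awk '{ print $2 "\t" $1 }'
}

# Example (words.txt is a hypothetical input file):
# hdfs dfs -mkdir -p input
# hdfs dfs -put words.txt input/
# wordcount input/ ./output/00
# local_wordcount < words.txt > expected.txt
# hdfs dfs -cat output/00/part-r-00000 | diff - expected.txt
```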
