hadoop2.2.0 eclipse 运行

具体步骤如下:

  1. 在本地或者远程安装部署hadoop2.2.0;
  2. 安装eclipse以及hadoop插件;
  3. 在eclipse上配置hadoop;
  4. 新建mapreduce工程,编写wordcount测试程序
  5. 在eclipse run-configure中配置hadoop参数
1安装部署hadoop2.2.0参考官方教程
2安装hadoop插件
    将hadoop-eclipse-plugin-2.2.0.jar 拷贝到eclipse下的plugins文件夹下:文件下载:http://download.csdn.net/detail/anbo724/7380295

3eclipse上配置hadoop

hadoop2.2.0 eclipse 运行_第1张图片

然后打开eclipse的resource界面,就可以看到eclipse获取的hdfs文件信息:

hadoop2.2.0 eclipse 运行_第2张图片

4编写mapreduce程序

新建mapreduce程序:

package com.test.mapr;


import java.io.IOException;
import java.util.StringTokenizer;


import org.apache.hadoop.conf.Configuration;
import org.apache.hadoop.fs.Path;
import org.apache.hadoop.io.IntWritable;
import org.apache.hadoop.io.LongWritable;
import org.apache.hadoop.io.Text;
import org.apache.hadoop.mapreduce.Job;
import org.apache.hadoop.mapreduce.Mapper;
import org.apache.hadoop.mapreduce.Reducer;
import org.apache.hadoop.mapreduce.lib.input.FileInputFormat;
import org.apache.hadoop.mapreduce.lib.input.TextInputFormat;
import org.apache.hadoop.mapreduce.lib.output.FileOutputFormat;
import org.apache.hadoop.mapreduce.lib.output.TextOutputFormat;


public class WordCount {
	
	public static class Map extends Mapper {
        private final static IntWritable one = new IntWritable(1);
        private Text word = new Text();
        public void map(LongWritable key, Text value, Context context) 
            throws IOException, InterruptedException{
                String line = value.toString();
                StringTokenizer tokenizer = new StringTokenizer(line);
                while (tokenizer.hasMoreTokens()) {
                    word.set(tokenizer.nextToken());
                    context.write(word, one);
                }
            }
    }
    public static class Reduce extends Reducer {


        @Override
            public void reduce(Text key, Iterable values, Context context)
            throws IOException, InterruptedException {
                // TODO Auto-generated method stub
                int sum = 0;
                for (IntWritable val : values)
                    sum += val.get();
                context.write(key, new IntWritable(sum));
            }
    }
    public static void main(String[] args) throws Exception {
        Configuration conf = new Configuration();


        Job job = new Job(conf, "mywordcount");
        job.setJarByClass(WordCount.class);


        job.setOutputKeyClass(Text.class);
        job.setOutputValueClass(IntWritable.class);


        job.setMapperClass(Map.class);
        job.setReducerClass(Reduce.class);


        job.setInputFormatClass(TextInputFormat.class);
        job.setOutputFormatClass(TextOutputFormat.class);


        FileInputFormat.addInputPath(job, new Path(args[0]));
        FileOutputFormat.setOutputPath(job, new Path(args[1]));


        job.waitForCompletion(true);


    }
	


}
5配置运行参数:

在eclipse的run-configure中配置:

hdfs://an:54310/user/an/input hdfs://an:54310/user/an/output

输入文件要自己建好,并且要放如要处理的文件,Output文件夹要求不存在;


eclipse源代码:

http://download.csdn.net/detail/anbo724/7380351

你可能感兴趣的:(云计算)