Hadoop代写MapReduce Map Reduce代写php语言程序、R课程设计帮做

& AimThis assignment aims to assess your understanding of Map Reduce framework and programming a distributed program using this framework.DescriptionThis assignment consist of 2 parts: Theoritical part (asssessed by a Quiz) and Practical part.Part 1: QuizA closed book Mylo Quiz will be conducted during lecture 8. Its weightage will be 3%. The multiple type questions will be asked from Lecture slides 6 and 7.Part 2: Practical PartHere, you need to implement a Map Reduce code for Hadoop that analyses given weather data. This part of the assignment consists of two further sub-tasks: Basic level and Advance level.DataInput data will be several .csv files for different years. Each file contains several rows giving information about weather conditions at different weather stations on different days of the year. The data is from ftp://ftp.ncdc.noaa.gov/pub/data/ghcn/daily/by_year/ There are at least two measurements each day, one for the maximum temperature (TMAX) and one for the minimum temperature (TMIN), and sometimes one for the precipitation (PRCP). Each row contains following relevant information:The weather station idthe date in format yyyymmddtype of measurement (for this homework we care about the maximum temperature TMAX and TMIN)temperature in tens of degrees (e.g. -90 = -9.0 deg. C., -184 = -18.4 deg. C.)Outline of TasksBasic level: Finding AverageIn first task, your goal is to write a Map Reduce program that can find the average maximum temperature at each station in different years. The input to your program will be the csv files for different years provided to you. The ouptut should have rows with three fields: Stationid Year AverageTemp. For example a sample output file will look like:ITE00100554 1789, -63ITE00100554 1789 -90GM000010962 1789 4EZE00100082 1789 -103Advanced level: finding similarity between different stationsThe goal of this task is to implement a MapReduce program that can find similarity between different weather stations. Similarity between two stations is calculated based on the following:You can assume output from the previous task as input to this task. Output for this task will be in following format:weatherStationID1 weatherStationID2 SimilarityScore.Submissiona) Source Codes of 2 Tasksb) A report explaining map/reduce program. If any optimisation such as using combiner to reduce number of keys, is done to improve the performance, please also specify with that explaination. If you have taken inspiration from some MapReduce programs to complete these tasks, please give their reference.本团队核心人员组成主要包括BAT一线工程师,精通德英语!我们主要业务范围是代做编程大作业、课程设计等等。我们的方向领域:window编程 数值算法 AI人工智能 金融统计 计量分析 大数据 网络编程 WEB编程 通讯编程 游戏编程多媒体linux 外挂编程 程序API图像处理 嵌入式/单片机 数据库编程 控制台 进程与线程 网络安全 汇编语言 硬件编程 软件设计 工程标准规等。其中代写编程、代写程序、代写留学生程序作业语言或工具包括但不限于以下范围:C/C++/C#代写Java代写IT代写Python代写辅导编程作业Matlab代写Haskell代写Processing代写Linux环境搭建Rust代写Data Structure Assginment 数据结构代写MIPS代写Machine Learning 作业 代写Oracle/SQL/PostgreSQL/Pig 数据库代写/代做/辅导Web开发、网站开发、网站作业ASP.NET网站开发Finance Insurace Statistics统计、回归、迭代Prolog代写Computer Computational method代做因为专业,所以值得信赖。如有需要,请加QQ:99515681 或邮箱:[email protected] 微信:codehelp QQ:99515681 或邮箱:[email protected] 微信:codehelp

你可能感兴趣的:(Hadoop代写MapReduce Map Reduce代写php语言程序、R课程设计帮做)