APACHE SPARK作业代做、代写MLLIB/ML留学生作业、java程序设计作业代做、代写java语言作业代做Database|调试Matlab程序

ASSIGNMENT 2 – APACHE SPARKIntroductionIn this assignment, you will use MLLIB/ML, which are Apache Spark based machinelearning libraries on real world datasets.Before you start working on the assignment, you must have completed the in-classexercise (based on http://spark.apache.org/docs/latest/quick-start.html) and the MachineLearning Library (MLlib) at http://spark.apache.org/docs/latest/mllib-guide.htmlDatasets1. US fatal road accident data for automobiles, 1998 to 2010.2. Consumer ComplaintsDownload the datasets from: FACULTY COURSE RESOURCESBig Data and LargescaleComputingDataSetsforAssignment2M19. The datasets are easy tounderstand. Just study the header row for attribute information.Task 1 (50 points) – Write a SPARK program for classification Select any two classification learning algorithms available in Spark’s MachineLearning Library. Select a target attribute from each of the datasets provided and learn aclassification model to predict the target attribute. Use 70% training and 30% test splits of data. For both datasets, print the test error rates. A useful JAVA example for decision tree learning can be found here:https://github.com/apache/spark/blob/master/examples/src/main/java/org/apache/spark/examples/mllib/JavaDecisionTreeClassificationExample.java2Task 2 (50 points) – Write a SPARK program to cluster data Select K-means and Gaussian mixture clustering algorithms from Spark’sMachine Learning Library. Select appropriate attributes to cluster the data in each of the two datasets. Apply the clustering algorithms to the transformed datasets. For the Gaussian mixture clustering your program should output the parametersof the mixture model and for K-means the “Within Set Sum of Squared Errors”. A useful JAVA example for k-means can be found here:https://github.com/apache/spark/blob/master/examples/src/main/java/org/apache/spark/examples/mllib/JavaKMeansExample.javaSubmission requirements and grading Upload the source code for your program in a zipped file to Canvas. Demonstrateboth tasks to the TA during the Lab or consultation hours.Remember that all work must be your own.本团队核心人员组成主要包括BAT一线工程师,精通德英语!我们主要业务范围是代做编程大作业、课程设计等等。我们的方向领域:window编程 数值算法 AI人工智能 金融统计 计量分析 大数据 网络编程 WEB编程 通讯编程 游戏编程多媒体linux 外挂编程 程序API图像处理 嵌入式/单片机 数据库编程 控制台 进程与线程 网络安全 汇编语言 硬件编程 软件设计 工程标准规等。其中代写编程、代写程序、代写留学生程序作业语言或工具包括但不限于以下范围:C/C++/C#代写Java代写IT代写Python代写辅导编程作业Matlab代写Haskell代写Processing代写Linux环境搭建Rust代写Data Structure Assginment 数据结构代写MIPS代写Machine Learning 作业 代写Oracle/SQL/PostgreSQL/Pig 数据库代写/代做/辅导Web开发、网站开发、网站作业ASP.NET网站开发Finance Insurace Statistics统计、回归、迭代Prolog代写Computer Computational method代做因为专业,所以值得信赖。如有需要,请加QQ:99515681 或邮箱:[email protected] 微信:codehelp QQ:99515681 或邮箱:[email protected] 微信:codehelp

你可能感兴趣的:(APACHE SPARK作业代做、代写MLLIB/ML留学生作业、java程序设计作业代做、代写java语言作业代做Database|调试Matlab程序)