COMP4331 Data Mining: Python Programming Assignment Help

Data mining assignment: implement K-means, Fuzzy Clustering EM, and other clustering algorithms.

Submission Guidelines

You need to zip the following two files together:
- A3_itsc_stuid_report.pdf/.docx: Please put all your reports in this file. (Attachments should be original .pdf or .docx files, NOT compressed.)
- A3_itsc_stuid_code.zip: This zip file contains all your source code for this assignment.

All attachments, including report and code, should be named in the format Ax_itsc_stuid.zip.
Submissions not following the rules above are NOT accepted.
20 marks will be deducted for every 24 hours after the deadline.
Your grade will be based on correctness, efficiency, and clarity.

Major Tasks

This assignment consists of the following tasks:
- To acquire a better understanding of clustering methods.
- To learn to implement K-means for clustering.
- To learn to implement Fuzzy Clustering EM for clustering.
- To learn to use a DBSCAN model for clustering.

Clustering Methods

You are required to implement the three clustering methods and report the running time of each. Please use Euclidean distance as the distance metric.

- K-means: You are required to implement the K-means clustering method by yourself. You are not allowed to use any existing K-means package (but basic computation packages are okay, e.g., NumPy, SciPy). You should test your algorithm with different K values, K ∈ {2, 10, 20, 30}.
- Fuzzy Clustering EM: You are required to implement the fuzzy clustering method using the EM algorithm (refer to slides 11ClusAdvanced.pdf). You are not allowed to use any existing EM package (but basic computation packages are okay, e.g., NumPy, SciPy). Please test your algorithm by setting K = 2.
- DBSCAN: You are required to use the DBSCAN model for clustering. You may use the DBSCAN model implemented by scikit-learn. Please test your model on the dataset by setting ε = 0.12 and MinPts = 3.

Each model is required to output a text file with clusters.
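The K-means requirement above can be sketched roughly as follows. This is a minimal illustration of Lloyd's algorithm with Euclidean distance, not a reference solution: the function name `kmeans`, the `max_iter` and `seed` parameters, the random initialization from data points, and the synthetic stand-in data are all assumptions, since the assignment text does not specify them. The sketch uses only the standard library so that it runs anywhere; NumPy, which the assignment permits, would be the natural choice for a vectorized version.

```python
import math
import random
import time


def kmeans(points, k, max_iter=100, seed=0):
    """Lloyd's algorithm with Euclidean distance.

    Returns (centers, labels). `max_iter` and `seed` are illustrative
    parameters, not part of the assignment text.
    """
    rng = random.Random(seed)
    # Initialise the centers with k distinct points drawn from the data.
    centers = [tuple(p) for p in rng.sample(points, k)]
    labels = [-1] * len(points)
    for _ in range(max_iter):
        # Assignment step: each point joins its nearest center.
        new_labels = [
            min(range(k), key=lambda j: math.dist(p, centers[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments stopped changing: converged
        labels = new_labels
        # Update step: move each center to the mean of its members.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous center
                centers[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return centers, labels


if __name__ == "__main__":
    # Synthetic stand-in for the 500-point 2-D dataset (file format not given).
    rng = random.Random(42)
    data = [(rng.random(), rng.random()) for _ in range(500)]
    for k in (2, 10, 20, 30):
        start = time.perf_counter()  # timing excludes data loading
        centers, labels = kmeans(data, k)
        elapsed = time.perf_counter() - start
        print(f"K={k}: {len(set(labels))} non-empty clusters, {elapsed:.4f} s")
```

Stopping when the assignments no longer change is one common convergence test; a threshold on center movement would work as well.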
Your programs should be written in such a way that the TA can easily run them to verify the results you report.

Data Set

You are required to test your models on the given dataset. The dataset contains 500 two-dimensional points.

Report Writing

You are also expected to report the time (using the Python time package) required by each method to complete the task, excluding the time needed to load the data files. For K-means, please report the running time for each K setting. For Fuzzy Clustering EM, please report the sum of squared errors (SSE) and the center points in each iteration.

Grading Scheme

K-means:
- Build the K-means model. (15 points)
- Output 4 text files with cluster information based on different K. (10 points)

Fuzzy Clustering EM:
- Build the Fuzzy Clustering EM model. (15 points)
- Print SSE and center points in each iteration. (5 points)
- Output the text file with cluster information. (10 points)

DBSCAN:
- Build the DBSCAN model. (5 points)
- Output the text file with cluster information. (10 points)

Project Report (30 points)

While you may discuss general ideas about the assignment with your classmates, your submission should be based on your own independent effort. If you seek help from any person or reference source, you should state it clearly in your submission. Failure to do so is considered plagiarism, which will lead to appropriate disciplinary action.
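The Report Writing requirements for Fuzzy Clustering EM (printing the SSE and center points in each iteration, and timing the run without data loading) might look roughly like the sketch below. It assumes the common fuzzy c-means style formulation: membership weights proportional to inverse squared distance in the E-step, and centers as weighted means with weights raised to a fuzzifier `p = 2` in the M-step. The slides cited in the assignment may define the updates differently. The name `fuzzy_em`, the parameters `p`, `max_iter`, `tol`, and `seed`, and the synthetic data are assumptions for illustration only.

```python
import math
import random
import time


def sq_dist(x, c):
    """Squared Euclidean distance between two points."""
    return sum((a - b) ** 2 for a, b in zip(x, c))


def fuzzy_em(points, k=2, p=2, max_iter=100, tol=1e-6, seed=0):
    """Fuzzy clustering via alternating E- and M-steps (assumes p > 1)."""
    rng = random.Random(seed)
    centers = [tuple(x) for x in rng.sample(points, k)]
    dim = len(points[0])
    prev_sse = math.inf
    weights = []
    for it in range(1, max_iter + 1):
        # E-step: membership weights from inverse squared distances.
        weights = []
        for x in points:
            d2 = [sq_dist(x, c) for c in centers]
            if 0.0 in d2:
                # The point coincides with a center: full membership there.
                row = [1.0 if d == 0.0 else 0.0 for d in d2]
            else:
                row = [(1.0 / d) ** (1.0 / (p - 1)) for d in d2]
            total = sum(row)
            weights.append([v / total for v in row])
        # M-step: each center is the mean of all points weighted by w ** p.
        new_centers = []
        for j in range(k):
            wp = [w[j] ** p for w in weights]
            denom = sum(wp)
            new_centers.append(tuple(
                sum(w * x[d] for w, x in zip(wp, points)) / denom
                for d in range(dim)
            ))
        centers = new_centers
        # SSE of the current weights against the updated centers.
        sse = sum(
            (w[j] ** p) * sq_dist(x, centers[j])
            for x, w in zip(points, weights)
            for j in range(k)
        )
        print(f"iter {it}: SSE={sse:.6f}, centers={centers}")
        if abs(prev_sse - sse) < tol:
            break  # SSE has stabilised
        prev_sse = sse
    return centers, weights


if __name__ == "__main__":
    # Synthetic stand-in for the assignment dataset: two Gaussian blobs.
    rng = random.Random(1)
    data = [(rng.gauss(0, 1), rng.gauss(0, 1)) for _ in range(250)]
    data += [(rng.gauss(5, 1), rng.gauss(5, 1)) for _ in range(250)]
    start = time.perf_counter()  # timing starts after the data is ready
    fuzzy_em(data, k=2)
    print(f"elapsed: {time.perf_counter() - start:.4f} s")
```

Each row of the returned `weights` sums to 1, so a hard cluster label for the output text file can be taken as the index of the largest weight in each row.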
