讲解:STAT 3675Q、STATISTICAL、R、RR|Python

STAT 3675Q - STATISTICAL COMPUTING UCONNFall 2019 Marcos Prates1. ObjectiveThe IMDB Movies Dataset (file imdb.csv) contains information about over 10,000 movies.The names of the first twelve columns are self-explanatory (the duration is in seconds). Therest of the variables (Action, Adult, Adventure, . . .) are dummy variables (0/1) indicatingif the movie has the given genre.In this project, you will apply a number of statistical methods that have been covered duringthe course using R.• Projects are to be completed individually, or with someone.• The project is worth 25% of the final grade.Directions. You are asked to write a preliminary report and a report. Please follow carefullythe following guidelines2. Preliminary report [30 points]• Provide a single file with the format name_3675_prelim.pdf (or name1_name2_3675_prelim.pdfif you work with someone), where name is your full name.• The preliminary report is due on Sunday, November 22, 2019 at 11:59 PM. Submit itvia HuskyCT. The pdf must be generated using Rmarkdown.Your preliminary report must contain the following elements.(a) A preliminary exploratory analysis including summary statistics and basic graphs (4代做STAT 3675Q、代写STATISTICAL、代做Rpages max)(b) Pose scientific questions that are interesting to you and indicate what statistical methodsmay help answer those questions (1 page)(c) Include the R code and all outputs.3. Report [70 points]For the report, provide a single file with the format name_3675_report.pdf (orname1_name2_3675_report.pdf), where name is your full name. The pdf must begenerated using Rmarkdown.• The report must be at least 10 pages long, without exceeding 30 pages (including thecode and the graphs).• The report is due on Sunday, December 8, 2019 at 11:59 PM. Submit it via HuskyCT.1(a) Include the preliminary report(b) Include at least one regression method(c) Include at least one ANOVA analysis(d) Include at least one classification methodFor each method,• Express all statistical models using mathematical formulae, and clearly state the meaningof the notations, and the assumptions.• Insert R code and necessary comments. Your output must contain the R code (do notuse the echo=FALSE option).• Interpret extensively all outputs and graphs that you include.4. Important dates• November 22, 2019: Preliminary report is due• Decebmer 8, 2019: Report is due2转自:http://www.3daixie.com/contents/11/3444.html

你可能感兴趣的:(讲解:STAT 3675Q、STATISTICAL、R、RR|Python)