Search for question
Question

DATA 220 Final Project Guidelines In this assignment, you will go through the full process of data science that you have learned throughout the semester in this class. You need to write a 12-font, Times New Roman, double spaced six(or more)- page paper (without the R scripts) under the following headings; introduction, data collection and description, methods, results, and conclusion. Data: You can either research and find a dataset from online, or data for this final project. Basic structure of your research in this project: 1) You pick and form your research question. 2) Collect/find your dataset. 3) Clean your data. 4) Explore and learn your data. you can collect your own 5) Run Hypothesis test, T-test and/or Linear regression analysis based on your data 6) Make some conclusions and Explain. *** This paper is informal: No bibliography or citations, etc. (except a link to data set used) need to be included. 1 DATA 220 Paper structure: Topic/ Title: Introduction (40 points): Briefly (one paragraph) summarize what your paper is all about; the purpose, data used, what you analyzed and your finding (s). Also, indicate what question you want to answer (this is called research question). The research question can be a few but needs to be related to the same dataset. If applicable, indicate your hypotheses here. Data Collection and Descriptions (80 points): Briefly describe your data collection method and your dataset. 1) In this course, we spent a few weeks exploring data collection methods. Specify the method/process for your data collection. And explain the method. (If you found your dataset online and there is no description about it, try to make your assumption and explain.) Please pretend that your audience does not have any knowledge about data science when you explain things. 2) What is your dataset about and what does it contain? Describe kinds of variables; qualitative or quantitative, continuous vs. discrete, and scales (nominal, ordinal, ratio, interval), etc. Methodology (80 points): Based on your research question to achieve the purpose of the paper/research, describe what statistical procedure is used. Explain the steps you took (you do not need to repeat the data collection procedure since you already talked about it under data collection and description). You need to include what statistical test used (T-test or Linear regression analysis). Explain what it is. How it is appropriate to answer your research question. Again, pretend that your audience do not have any knowledge about data science when you explain things. *** Remember to include any output(s) under results section in this paper. Results (80 points): Report graphs and statistics computed here. For each of the graphs mention what they contain—a sentence will suffice. This is the main part of the paper. 1) The first part of this section should be about the descriptive statistics. Share the outcome and explain what they mean. 2 DATA 220 2) After the descriptive statistics, it's finally the time to talk about what you found in this project. Share your graphs and describe any trends/changes of the graphed data here. • Show the outcome and explain what they mean. • Please tell me about your research story here! Do not just simply show the graphs and outcomes. 3) Feel free to add more here. The things listed here are minimum requirements. Conclusion (40 points): This section is the climax of your paper. What did you learn from the results? Do you find any limitations in your project? Your conclusion should strictly be derived from your results, not what you think or anticipate. Interpret the results in the context to your research question. Appendix (40 points): NOT included in 6 pages requirement Add all the R scripts here. It does not need to be double-spaced here. These should not be screenshots. Rubric: Worth 400 points. • Introduction 40 points ● Data Description—80 points • Methodology—80 points Results 80 points ● Conclusion 40 points Appendix 40 points Mechanics - 40 points—grammar, number of pages (6th page needs to be fully written. A few sentences on the 6th page are not considered as meeting the requirement), double spaces, Times New Roman, and 12-point font size Do not wait until last minute!! Please let me know if you have any questions. 3