data 220 final project guidelines in this assignment you will go throu
Search for question
Question
DATA 220
Final Project Guidelines
In this assignment, you will go through the full process of data science that you
have learned throughout the semester in this class. You need to write a 12-font,
Times New Roman, double spaced six(or more)- page paper (without the R
scripts) under the following headings; introduction, data collection and description,
methods, results, and conclusion.
Data:
You can either research and find a dataset from online, or
data for this final project.
Basic structure of your research in this project:
1) You pick and form your research question.
2) Collect/find your dataset.
3) Clean your data.
4) Explore and learn your data.
you can collect
your own
5) Run Hypothesis test, T-test and/or Linear regression analysis based on
your data
6) Make some conclusions and Explain.
*** This paper
is informal:
No bibliography or citations, etc. (except a link to data set used) need to be
included.
1 DATA 220
Paper structure:
Topic/ Title:
Introduction (40 points):
Briefly (one paragraph) summarize what your paper is all about; the purpose, data
used, what you analyzed and your finding (s). Also, indicate what question you want
to answer (this is called research question). The research question can be a few but
needs to be related to the same dataset. If applicable, indicate your hypotheses here.
Data Collection and Descriptions (80 points):
Briefly describe your data collection method and your dataset.
1) In this course, we spent a few weeks exploring data collection methods.
Specify the method/process for your data collection. And explain the method.
(If you found your dataset online and there is no description about it, try to
make your assumption and explain.) Please pretend that your audience does
not have any knowledge about data science when you explain things.
2) What is your dataset about and what does it contain? Describe kinds of
variables; qualitative or quantitative, continuous vs. discrete, and scales
(nominal, ordinal, ratio, interval), etc.
Methodology (80 points):
Based on your research question to achieve the purpose of the paper/research,
describe what statistical procedure is used. Explain the steps you took (you do
not need to repeat the data collection procedure since you already talked about it
under data collection and description).
You need to include what statistical test used (T-test or Linear regression
analysis). Explain what it is. How it is appropriate to answer your research
question. Again, pretend that your audience do not have any knowledge about data
science when you explain things.
*** Remember to include any output(s) under results section in this paper.
Results (80 points):
Report graphs and statistics computed here.
For each of the graphs mention what they contain—a sentence will suffice. This is
the main part of the paper.
1) The first part of this section should be about the descriptive statistics.
Share the outcome and explain what they mean.
2 DATA 220
2) After the descriptive statistics, it's finally the time to talk about what you
found in this project. Share your graphs and describe any trends/changes of
the graphed data here.
•
Show the outcome and explain what they mean.
•
Please tell me about your research story here! Do not just simply show
the graphs and outcomes.
3) Feel free to add more here. The things listed here are minimum
requirements.
Conclusion (40 points):
This section is the climax of your paper. What did you learn from the results? Do
you find any limitations in your project? Your conclusion should strictly be derived
from your results, not what you think or anticipate. Interpret the results in the
context to your research question.
Appendix (40 points): NOT included in 6 pages requirement
Add all the R scripts here. It does not need to be double-spaced here. These should not be
screenshots.
Rubric: Worth 400 points.
•
Introduction 40 points
● Data Description—80 points
• Methodology—80 points
Results 80 points
● Conclusion 40 points
Appendix 40 points
Mechanics - 40 points—grammar, number of pages (6th page needs to be fully
written. A few sentences on the 6th page are not considered as meeting the
requirement), double spaces, Times New Roman, and 12-point font size
Do not wait until last minute!!
Please let me know if you have any questions.
3