STT 277 Project

Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due

STT 277 Project

Purpose

The focus of the course is for the student to be able to:
• Perform exploratory data analysis
• Execute statistical analyses using SAS, R and JMP
• Communicate the statistical results from analysis

The purpose of this Project assignment is to allow the student to apply concepts, methods and techniques covered in this class to a problem and topic of interest to them.

This activity will give the students the opportunity to formulate a question, research information to see what is available regarding this question, gather data, apply analytic methods, summarize the analysis and possibly formulate a hypothesis that would lead to more data collection or more research.

The intent of this activity is not to find an answer to your question but to investigate and analyze available data that may help answer the question or lead to formulation of a hypothesis which would require more data collection.

Requirements

Students will work individually and will use SAS to perform the analyses. Since many students have taken STT 276 (the R course), programming SAS will enable students to learn another program. If this is problematic let the instructor know.

1. Decide on a question you would like answered regarding a topic of interest to you. Select a topic that you would still be interested in investigating even after you have completed this class. Choose a topic you are passionate about and have a high level of interest.

2. Use the library resources to conduct research on your topic.
3. Record a brief presentation (5 minutes) introducing your topic to the class.
4. Submit a written introductory report.
5. Obtain a data set through the Internet for you to explore. Develop an Analytical Plan on how you will be analyzing the data to help answer your question.
6. Perform an Exploratory Data Analysis. Create graphical displays and summary statistics to see distributions, trends, and comparisons between the variables in your data set.
7. Write code and perform the analysis.
8. You will record a 15-minute presentation on your project as a final deliverable.
9. Submit your data, analysis, code and presentation deck.10/02/2024
Your presentation will contain the following sections:
A. Introduction – This section will provide background and the question you are exploring.
B. Research
C. Methodology
D. Analysis – Provide analysis of the data you observed by providing graphical displays and summary statistics.
E. Summary / Recommendations / Limitations / References – Provide a summary of your findings.

NOTE

It is not required for you to answer the question you are looking into. Your analysis could lead to another question or to a hypothesis you would want to test. You can explain this in your summary.
Project Schedule

Your final presentation is due at 11:59 PM November 23rd. You need to provide portions of this assignment according to the following schedule.

Item
Section
Percent of grade
Start Date
Due Date
A.
Record and load a video explaining your question (no mor than 5 minutes.)
2
9/24
10/6
B.
Written Introduction
12
10/7
10/16
C.
Written Introduction and Analytical Plan
10
10/17
10/31
D.
Submit Exploratory Data Analysis
15
11/01
11/09
E.
Submit presentation recording
30
11/10
11/23
F.
Submit presentation deck, data, and data analysis
5
11/10
11/23
G.
Submit Code
20
11/10
11/23
H.
Submit individual student evaluation on assigned projects
5
11/24
12/4
H.
Grade from peer reviews
3


Your assignment will be rated based on discussion and explanation of your question, types of graphics, analysis and tables, coding, and the overall summary.

Library Resources
1. Librarians by Subject: https://www.library.rochester.edu/research/subject-librarians
2. Business Section: https://libguides.lib.rochester.edu/business
Industry & Market Research
Academic Articles
3. Open Data Sets By Topic - * Data and statistics - LibGuides at University of Rochester
4. If you are using datasets from Kaggle make sure your topic has not been analyzed already.

Deliverables

The following deliverables are meant to build the different pieces for the final presentation.

They are also structured so you work on the project throughout the semester in pieces and not have to do everything at the end of the semester.

1. Video recording

Video record using Panopto a five-minute session describing the question you will be investigating.

You can use slides to focus your delivery. Include in your presentation the following:

• Background on your topic
• The question you are investigating
• The population you will be making inferences on
• Research related to your topic
• Reasons as to why studying this topic is relevant

2. Written Introduction

The purpose of the written Introduction is to provide a solid background on the question you are investigating. You will take the information you have presented in the video presentation and enhance it into a written report.

In addition to the background, you will discuss your research pertaining to the question you are addressing. State the question you are looking to answer specifically and succinctly. Mention why this topic is relevant and needs to be addressed. Be specific with respect to which population you are studying and identify a timeframe / scope (if applicable).

Relying on your research specify your hypotheses in plain language. Be thorough in your discussion on your research. Explain how your research leads you in developing your hypothesis.

Specify which measure you will be your main response. Specify what data sources you will be using. And provide a list of references.

You will also submit a Group Grade Allocation. See the last item in this section (Group Grade Allocation).

The rubric for grading the Written Introduction is found in Appendix B.

3. Written Introduction and Analytical Plan

The Analytical Plan will mainly map out how you will be analyzing the data. In your Introduction you have stated the population of interest, a scope for your project and stated your hypothesis in plain language.

Your Analytical Plan does not need to be written in paragraph form. Include the following in your Analytical Plan:

• State the question / problem you are studying
• Convert the high-level hypothesis stated in the Introduction Report into a statistical set of hypotheses.
• Discuss the data source you will be using. Explain how the data was collected, by whom, what time period it covers and what was the purpose of the data collection (why was it collected). Also specify what sampling criteria was used. If this information is not available, try to find it by contacting the owner of the data.
• Mention how many observations and variables are in your dataset. Mention your response variable and any exploratory variables (if applicable).
• Provide a data dictionary – a table listing the variables in your dataset, the type of variable (numeric, character, date), a description of the variable, and possible values the variables may take.
• Mention you will execute an Exploratory Data Analysis.
• Identify which analytic technique you will use to answer your question specifying the significance level (if applicable) and how you will confirm validation of assumptions of the technique used (if applicable).

4. Exploratory Data Analysis (EDA)

Determine which variables in your dataset are relevant for your project. Perform an EDA on your data set as demonstrated in class. Include summary statistics for all your variables in your reduced data set, histograms and boxplots, scatter plots, and correlations.

The rubric for grading the EDA is found in Appendix C.
5. Final Presentation

Your main deliverable will be a Panopto video presentation (no more than 15 minutes). It will consist of a title slide with your name provided. Followed by the Introduction, provide background information on your project illustrating that the topic has been researched. The next section would consist of the Question that is being addressed and why it is relevant to study this. And hypotheses are stated in general terms.

The next section will include the Methodology mentioning the source of the data, how the data was collected, who collected it, what was collected, why was it collected, and the time period. The description of the variables in the dataset and the sample size are also specified (data dictionary).

Then the analysis will follow showing relevant and concise information with graphical displays and results from statistical tests. 10/02/2024

Then the Conclusions will follow connecting the question with the analysis and the results.

This section would also include any additional insights or recommendations of future work that can be carried out. (This item is optional). Include a limitation of your study and finish with references.

Again, make sure your voice is audible, that you are in a private setting where outsiders will not interfere with the audio, and your frame does not cover your slides. Review your presentation before submitting it.

The rubric for grading the Final Presentation is found in Appendix D.
6. Submit your presentation deck, data, analysis output, and programming code on Blackboard.

7. Review Classmates Presentations

You will be asked to view two or three other team’s presentation and grade them on specified criteria.

Appendix A – Written Introduction

Introduction
Points
Clear introduction
1
Provides bibliography
0.25
Researches topic
2
Connects research to own problem
0.75
States Question
1
States hypothesis in plain language
1
Population of interest specified and scope defined
1
Response identified
1
Data set specified
1
Why is this topic relevant?
1
Total

Appendix B – Analytical Plan Rubric

Analytic Plan
Points
Hypotheses stated in statistical terminology

Data source
0.5
Info about data: Why, who, when, what and how
2
Sampling criteria if appropriate
0.5
Sample size and number of variables
1
Data Dictionary
2
What is the response and exploratory variables
1
Graphical display and for which variables
1
Analysis technique
1
Total
10

Appendix C – Exploratory Data Analysis Rubric

Exploratory Data Analysis
Points
Summary / Overview
4
Data Dictionary
1
Summary Statistics
1
Frequency Tables
1
Histograms / Boxplots
3
Bivariate Plots
3
Correlation Table
2
Total
15

Appendix D – Final Presentation Rubric

Presentation Total Score
30


Title Slide with student name
1


Research
5
Clear background and research information as it sets up the question.

Sufficient research has been presented showing the topic has been investigated.



Question / Problem Definition

The question investigated is clearly stated.
2
The question investigated is relevant, interesting and a topic worthwhile to be investigated.
1
Hypotheses have been stated as to what the analysis may provide.
1


Analysis

The source of the data is provided (Info about data: Why, who, when, what and how)
2
Description of the variables in the data set is provided (Data Dictionary)
2
Number of observations is included
1
The appropriate techniques to answer the question were used.
2
The analysis will help answer the question at hand.
3


Conclusions

There is a connection between the question proposed and the conclusion.
6


Caveats / Limitations of your study
2


Presentation Delivery / Professionalism
1


References
1


发表评论

电子邮件地址不会被公开。 必填项已用*标注