Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
COMM5000 Data Literacy Vinho Verde Final Report
Assessment Task |
Weighting |
Due Date* |
Course Learning Outcomes
|
Milestone 1: Case Study Preliminary Insight Development |
20% |
Week 4 |
1, 2 |
Milestone 2: Case Study project proposal: hypothesis tests |
20% |
Week 7 (Friday 5PM) – 25 October |
1, 2, 3, 4 |
Smarthinking/Studiosity/Feedback Hub feedback (formative task) |
0% |
Week 9 (Thursday 5PM) – 7 November |
|
Case Study business report |
60% |
Week 10 (Friday 5PM) – 15 November |
2, 3, 4, 5 |
|
|
|
|
* Due dates are set at Australian Eastern Standard/Daylight Time (AEST/AEDT). If you are located in a different time-zone, you can use the time and date converter. |
If you are instructed to submit your assessment via Turnitin, you will find the link to the Turnitin submission in your Moodle course site. You can submit your assessment well before the deadline and use the Similarity Report to improve your academic writing skills before submitting your final version.
The parameters for late submissions are outlined in the UNSW Assessment Implementation Procedure. For COMM5000, if you submit your assessments after the due date, you will incur penalties for late submission unless you have Special Consideration (see below). Late submission is 5% per day (including weekends), calculated from the marks allocated to that assessment (not your grade). Assessments will not be accepted more than 5 days late.
You are expected to manage your time to meet assessment due dates. If you do require an extension to your assessment, please make a request as early as possible before the due date via the special consideration portal on myUNSW (My Student profile > Special Consideration). You can find more information on Special Consideration and the application process below. Lecturers and tutors do not have the ability to grant extensions.
Smarthinking/Studiosity/Feedback Hub
The Feedback Hub at UNSW is a valuable resource for students looking to improve their writing skills. It offers personalized feedback on drafts of essays, reports, and literature reviews, focusing on structure, grammar, referencing, and language choice. The service is free for all UNSW students and is available 24/7, ensuring help is always accessible. Students can expect comprehensive feedback within 24 hours, making it a quick and efficient way to enhance their writing. For more details, visit the Feedback Hub website.
What are circumstances beyond my control?
These are exceptional circumstances or situations that may:
Available here is a list of circumstances that may be beyond your control. This is only a list of examples, and your exact circumstances may not be listed.
You can find more detail and the application form on the Special Consideration site, or in the UNSW Special
CASESTUDYINFORMATION—Vinho Verde
Business context: In recent years, the growing interest in wine has fuelled the expansion of the wine industry. As a result, companies are investing in new technologies to enhance both wine production and sales. Quality certification plays a vital role in these processes and currently relies heavily on wine tasting by human experts.
Case/Scenario: You consult a winery and help this company to predict or estimate human wine taste preferences at the certification step. Knowing the wine quality will allow the winery to be better positioned to predict available amounts and yearly sales. It will also support the oenologist wine tasting evaluations by potentially improving the quality and speed of their decisions, and improve wine production. Furthermore, similar techniques can help in target marketing by modelling consumer tastes from niche markets. In order to predict wine quality you will use a dataset consisting of 4898 white and 1599 red vinho verde samples from Portugal's northwest region, and the statistical methods covered in this course.UNSW Business School
Description of Final Report task
Hypothesis testing is very informative, but does not provide us with a means to control for confounding effects. For example, suppose you conclude that you reject a null hypothesis of equal population means of alcohol contentbetween wine types. In that case, you can’t conclude that this difference is solely due to alcohol content. Other factors may affect alcohol content even if you control for wine type. In other words, rejecting the null hypothesis here is driven by factors other than differences in wine type.
To provide a ceteris paribus analysis, students are asked to use a regression model to identify and estimate the wine type effect (if any) on wine quality, while controlling for other factors. Other factors are defined by the data included in the Vinho Verde dataset. No variables outside of the provided Vinho Verde dataset are to be considered.
Data Considerations
In your Final Report, your dependent variable (Y) will be wine quality, from your dataset. The independent variables (X) will be one of more other variables from the dataset, and must include a dummy for wine type, when appropriate for the model specification.
Statistical Analysis Required for the Final Report
Using the results from the three estimated regression models, discuss what the explanatory variables - or determinant factors - for wine quality are, if any. More specifically:
Ethics Considerations
Report Structure
The final report unifies the insights you have gathered in M1 and M2 and this final analysis. It showcases all the statistical techniques you have learnt in COMM5000. This report is a final take on this problem and should include the key results you wish the winery to get from the analysis. Your report will consist of the following core components. You can restructure the section as you see fit for your report.
- Executive summary This is the punch line of this case study. This should tell the winery what you have found in clear and precise language. This should not be technical. It should clearly address the relevance of the findings for the winery, the usefulness, or lack thereof, of the statistical findings.
- Introduction: The introduction should present the problem analysed in this case study. The background and the expected outcomes or target questions of the analysis. Your introduction should include some key conclusions from M1 or M2 and how these conclusions provide some basis for developing the model for wholesale company profit characteristics. This is not a copy/paste of all your reports for the first two milestones. It should report the take-home message(s) from the detailed analysis you have completed—no need to re- report the tables and graphs from those milestones.
- Section 1: Data and analytical considerations, any major limitations in and approach to the data or analysis performed. This will inform the reader about the kind of approach that you have taken.
- Section 2: Analysis of the determinants of wine quality, including your analysis of the wine type effect. Make sure you correctly identify the unit of measurement when you discuss coefficients. If not statistical relationships are detected, make sure you check the statistical significance of the overall model via a discussion of the F-test. This is where you would include Table 1 with the results of the three models. Remember to discuss not only statistical significance, but also the size of the coefficients.
- Section 3: Robustness analysis and model limitations. Any statistical analysis is based on assumptions that ensure that the inference you perform, and the estimation of the model parameters is also correct. SomeUNSW Business School points to discuss in your analysis are the key assumptions:
- Zero conditional means: does your model satisfies the no omitted variable bias condition? This will ensure that no confounding factors will bias the estimates of your key target parameters in the model. A brief explanation of what this assumption implies for the context of the model(s) you are estimating. Whether you believe it holds and why? What can be done to ensure it is satisfied if you had more data and more time to develop the model? If it isn’t satisfied, what are the implications on the model estimation and inference?
- Multicollinearity: With the selected set of regressors in your models, check whether the regressors satisfy the assumption of no perfect collinearity and/or no large (multi)collinearity. Explain what it means in your context and the implications of the regressors failing the condition on the validity of the inference.
General advice
Submission instructions