Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
Assignment 2
For SOC252H1, Fall 2024
Regression analysis using data from Canadian Election Study 2021
For this assignment, you will use Canadian Election Study 2021 data. The complete survey can be found here: http://www.ces-eec.ca/2021-canadian-election-study/
In this assignment, you have been provided with a selection of variables from this survey.
The variables are minimally cleaned but may require additional modifications before analysis. For instance, variables may be part of a skip pattern with many missing values, so use caution when selecting your variables.
The file is called ces.Rdata. There is a code book associated with the CES named ces_codebook.pdf. The survey also includes technical documentation consisting of the questionnaire and is available here.
The assignment is out of 100 points.
. Caution |
Your submission must follow the submission method policy. If it is not readable, i.e., it does not include the code, associated output, and interpretation in PDF/Word format, a 10% penalty will be imposed. |
Scenario
Imagine you are a data analyst at a company that advises new candidates.
Your client is a new candidate who wants to enter the Canadian electoral field. The client wanted to remain anonymous so your perception of them does not affect the answers you may give them.
Your client is interested in Canadians’ perceptions of political parties and the demographics and reasons that influence these perceptions. You have access to the Canadian Election Study 2021, and you will respond to the following questions for your client.
Question 1:
Your client wants to know about people’s demographic characteristics based on whether they identify with the Liberal party ornot (hint: the variable for federal party identification is cps21__fed__id). Using a summary table and two visualizations, describe the demographic characteristics (namely, gender, age, education, income, and province of residence).
Grading criteria: 30 points total
• At least 1 table
• At least two visualizations
• Max 150 words of explanation, emphasizing comparing Liberal party supporters versus not.
Question 2:
Your client is interested in how people’s views of the Liberal Party are shaped by their char- acteristics and opinions (hint: Perception about the Liberal Party is measured on a scale of 1-100 using cps21_party_rating_liberal).
Using five independent/predictor variables, describe using a linear regression how people who participated in this survey differ in their views of the Liberal party.
Grading criteria: 40 points total
-
In the five variables you choose, include at least three demographic variables and two variables related to opinion on political issues - see codebook for help.
- Provide a regression table.
- Interpret coefficients and 95% confidence intervals.
- Write a 150-word summary for your client about people’s views of the Liberal Party based on the regression findings.
Question 3:
In expanding on the previous question, you are also asked to provide the information to your client based on whether people identify with the Liberal or the Conservative party (hint: a variable representation of Liberal or Conservative can be created from cps21_fed_id). Using the same five variables from question 2 as independent variables, construct a logistic regression with Liberal or Conservative as the dependent variable and describe the findings.
Grading criteria: 40 points total
• Use the same five variables you chose in question 2.
• Provide a regression table.
• Interpret coefficients (log-odds and odds-ratios) and 95% confidence intervals.
• Write a 150-word summary for your client about people’s identification with the Liberal or the Conservative Party.