STAT7055 Introductory Statistics for Business and Finance


Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due


Research School of Finance, Actuarial Studies and Statistics

ASSIGNMENT

Semester 1, 2024

STAT7055 Introductory Statistics for Business and Finance

INSTRUCTIONS TO STUDENTS

Due Date

.  The assignment is due at 9:00am on Friday May 10.

. Late submission of the assignment is not permitted.  An assignment submitted without an extension after the due date will receive a mark of 0.

Writing your Assignment

. The assignment is an individual piece of assessment and must be completed on your own.

. You are not permitted to use any form of tutoring services (e.g., online, in-person, etc.) or any AI tools (e.g., ChatGPT, etc.).

. You will be required to write a report in an R Markdown document that contains R code (with R code comments), R output and written text. An example of an R Markdown document, which you can use as a template, has been provided on Wattle.

. All R code must have accompanying R code comments that sufficiently describe what the code is doing.

. When answering the assignment questions in your report, you will need to include all your R code and R output that you used to calculate any answers and you must also write your answers in proper sentences.  For example, if you are required to calculate a sample mean, then you would include your R code for calculating the sample mean and the R output of the sample mean value and you would also write a proper sentence in the report such as “The sample mean is equal to ...” .

.  Make sure to be clear and concise in your answers.

. A good way to approach writing your report is to imagine that you are a statistical consultant and that a client has asked you to do some statistical analyses. When presenting the results of your analyses to the client, you wouldn ’t just give them pages of R code, R output, calculations, etc.  Rather, you should give them a proper report which clearly outlines and explains the results of the analyses and which also includes the R code and R output used to produce the results.

. Therefore, presentation is very important. Marks will be deducted for poorly presented reports.

.  Once you have finished writing your report in your R Markdown document, you will need to render the document by pressing the Knit button in RStudio to create a HTML file of your report.

. Further to the above point, it is good practice to regularly Knit your R Markdown document as you write your report.   This  is useful for  checking that it ’s rendering properly.

Submitting your Assignment

.  Submission of the assignment will be through Wattle and further details regarding assignment submission will be provided on Wattle.

. For submission you will need to submit two files: the R Markdown file of your report (i.e., a “ .Rmd” file) and the rendered HTML file of your report produced by pressing the Knit button in RStudio (i.e., a “ .html” file).

. Please name your two files as “uNNNNNNN.Rmd” and “uNNNNNNN.html” , where uNNNNNNN is your student number.

. No other file types will be accepted or marked, e.g., “ .R” , “ .docx” , “ .RData” , “ .zip” , etc. In particular, do not submit any compressed files.

Other Important Details

. You may only use built-in functions available in the default installation of R and you are not permitted to use functions in any additional R packages (e.g., ggplot2).

. You must use the appropriate R functions (and not the statistical tables) to calculate any critical values or p-values used when performing any hypothesis tests.

. You must use R for all calculations.

.  Round all final numeric answers to 4 decimal places. However, as you will be using R, keep all decimals during all intermediate steps to ensure the accuracy of your final numeric answer.

.  Please use the help function if you want to learn more about a particular R function, e.g., enter help(mean) in the R console to learn more about the mean function.

.  For questions that require writing mathematical symbols, you are welcome to use short- hand notation, provided you make the meaning clear (e.g., using “Mu” for μ , or “ !=” for ≠).

. Answers (including hypotheses, explanations, conclusions, etc.) need to be written in the text of the R Markdown document and not in the R code comments or the R output.

. Do not print out entire data sets in your R Markdown document or HTML file, as this will only take up unnecessary space.

Question 1   [11 marks]

As part of a study on salaries in different industries, a random sample of people were selected and for each person, their salary in thousands of dollars (X1) and the industry in which they work (X2) were recorded. You have been tasked to perform some analyses on the data.  The data are stored in the file AssignmentData.RData in the data frame salaries.df.

(a)  [4 marks]  Based on what was covered in the course, create a single plot that would be most appropriate for describing the overall distribution of salaries for all people not in the secondary education industry. Make  sure to  give your plot a proper descriptive title and appropriate axis labels  (do not just use the default title or label). Provide a clear description of the overall distribution of salaries for all people not in the secondary education industry. Be specific in your description, making sure to mention any interesting and/or important aspects of the distribution.

(b)  [3 marks]  Calculate an 87% confidence interval for the population mean salary of people in the education industry (i.e., secondary or tertiary education). Give a clear interpretation of the confidence interval. Do not use any R functions that are designed to calculate confidence intervals or perform hypothesis tests.

(c)  [4 marks]  Test whether the population proportion of salaries in the finance indus- try that are less than 85.15 thousand dollars is equal to the population proportion of salaries in the technology industry that are greater 102.65 thousand dollars. Clearly state your hypotheses,  making sure to define any parameters,  and  use a signifi- cance level of α = 4%. Do not use any R functions that are designed to perform hypothesis tests.

Presentation   [4 marks]

Marks will be allocated for how well presented your report is, e.g., clear and distinct headings, concise answers with information clearly communicated, all R code sufficiently commented, etc.

发表评论

电子邮件地址不会被公开。 必填项已用*标注