Hello, if you have any need, please feel free to consult us, this is my wechat: wx91due
MAST20005/MAST90058 Statistics
Assignment 1
Semester 2, 2024
Due time: 11PM, Wednesday August 21
Writing and submission:
• Write your answers on A4 paper. Page 1 should only have your student number, the subject code and the subject name. Write on one side of each sheet only. Each question should start on a new page. The question number must be written at the top of each page.
• Put the pages in question order and all the same way up. Use a scanning app to scan all pages to PDF - do not take photos with your phone camera ! Scan directly from above. Crop pages to A4.
• Submit your scanned assignment as a single PDF file. Gradescope will give you an opportunity to match questions with pages in your submission - please take the time to do this as it will make marking more efficient. Once submitted, carefully review the submission in Gradescope. Scan again and resubmit if necessary.
Exercise 1 (6 marks): Find a graph representing a data set from a reputable source. This can be from an academic journal, a government report, a reputable news website, or a professional organisation. Ensure the graph is complex enough to allow for a detailed analysis. You can use a graph that shows a good example or a graph that demonstrates a bad example.
1. Provide context in no more than 100 words: Briefly describe the graph you have chosen. Include the source, the main topic or subject of the graph, and its intended message or purpose.
2. Using the five principles of good graphic design that you learned in Module 1, evaluate the graph and write a detailed commentary addressing each principle (Total number of words 250 for all the questions below):
• Is the data shown clearly in the graph and why?
• Does the graphic use good alignment on common scales for quantities to be compared and why?
• Does the graph use simplicity in design while successfully conveying the message and why?
• Does the graph have good visual encoding and why?
• Does the graph use standard forms?
Exercise 2 (12 marks): Let X1, X2, . . . , Xm be a random sample from a Binomial distribution Binomial(n, √p).
1. Compute the maximum likelihood estimator for p. (Please note that √p) ≥ 0). Please show all your workings.
2. For the particular case when X1 = 1, X2 = 3, X3 = 3 are the realisations of a random sample from a Binomial(n = 5, √p), what is the maximum likelihood estimate for the parameter p? Please justify your answer.
Exercise 3 (12 marks): Given a random sample of size n from a distribution with density function given by
(in what follows we assume that θ ≥ 1). Hint:
1. Using the method of moments, find an estimator for θ. Please show all your workings.
2. We have now two samples drawn from a the distribution:
• Sample 1: X1 = 1.333, X2 = 1.577, X3 = 1.523, X4 = 1.645, X5 = 1.447, X6 = 1.684, X7 = 1.529, X8 = 1.556, X9 = 1.414.
• Sample 2: X1 = 1.333, X2 = 1.333, X3 = 1.523, X4 = 1.333, X5 = 1.333, X6 = 1.333, X7 = 1.333, X8 = 1.333, X9 = 1.333.
In addition to the variance, what summary statistics could we use to compare the variability between these samples? Compute the summary statistics for those two samples and (by hand), draw two plots displaying the variability of the summary statistics. Please show all your workings. What can you say about the variability between these two samples? Hint : Consider the following definition of the sample median and how you can use it to find the summary statistics in this question:
Exercise 4 (18 marks): An Australian Olympic athlete has been training with a tennis ball launcher whose speed follows a Normal distribution N(µ, σ2 = 25 km/h2 ). Two independent random samples are taken from the machine: X1, X2, . . . , X6 and Y1, Y2, . . . , Y6.
1. Are the estimators T1 = X, T2 = Y, and T3 = Yi unbiased estimators of µ? Please justify your answer and show all your workings.
2. Given the estimator T4 = aT1 + (1 − a)T2, compute its mean square error (MSE). Please show all your workings.
3. What is the sampling distribution of T4? Please justify your answer and show all your workings.
Exercise 5 (6 marks): Use R to generate a dataset of 1000 random numbers from a skewed distribu-tion. Create a histogram of the data and a QQ plot comparing the quantiles of your skewed dataset to the quantiles of a standard normal distribution. Submit the QQ plot and include the R code used to generate the plot. Explain the resulting curvature of the points in the QQ plot. What does this tell you about the relationship between the skewed distribution and the normal distribution?