MCD2080 BUSINESS STATISTICS Trimester Three 2022 Sample Exam
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
BUSINESS STATISTICS - MCD2080
Trimester Three 2022 Sample Exam
Question 1 [2 + 2 + 2 + 4 + 1 + 1 + 3 + 1 + 1 + 5 = 22 marks]
a) Shown below is the table illustrating the number of employees from two different
depart5ments: Production and Engineering (P&E) ad Purchasing by gender and the
educational attainment (in years).
Use Exhibit 1 below to answer the following questions.
Using the two pivot tables in Exhibit 1,
(i) Calculate the proportion of the female and the proportion of the male employees from the P&E department. Show all workings to 4 decimal places.
(ii) Similarly, calculate the proportion of the female and male employees from the Purchasing
department. Show all workings to 4 decimal places.
(iii) From part (i) and (ii), compare the percentage of the employees from the two departments.
b) Next, we want to examine the variable of employee’s experience and the distribution of employees. Pivot tables (Exhibit 2) below show the employee experience from P&E and Purchasing departments.
Based on the above reported probabilities, discuss how the employees prior experience affects the distribution of the employees as their experience increases. Give examples.
c) The pivot table below (Exhibit 3) gives the employee’s prior experience by employees’ gender for all company’s departments.
From this Exhibit 3, answer the following questions.
(i) What does the value 5.39 present?
(ii) Explain the meaning of the value 9.80%.
d) Below is a chart of showing the distribution of employees’ salaries by their prior experience for all departments.
Exhibit 4.
Employees' salaries by prior experience
0-4 5-9 10- 14 15-20 Prior Experience (Years) 0-50 50- 100 100- 150 150-200 |
For the following questions, refer to above Exhibit 4.
(i) Does the prior experience really matter in terms of employee’s salary? Discuss giving the evidence in terms of values and the percentage estimates.
(ii) Which range of years of experience has the highest percentage of group of annual salaries?
State the salary group and the percentage as well.
(iii) Which is the most salary group with the most years of experience?
What percentage is that group?
e) The pivot table below (Exhibit 5), examines the distribution of employees’ salaries by their prior experience for all departments.
Here we want to find out if the employees’ prior years of experience is independent of their annual salaries.
Specifically, using Exhibit 5, establish if “Salaries=0-50” is independent of “Experience=0-4”
In support of your explanation, write down the required probability statements supporting.
Question 2 [3 + 2 + 2.5 + 4 + 5 + 3 + 4 = 23.5 marks]
a) Summary statistics and the distribution of the employees’ salaries for all departments are provided below (Exhibit 6).
Exhibit 6
Statistics |
Annual Salary ($'000) |
Mean |
71.27 |
Median |
68.40 |
Standard Deviation |
30.25 |
Minimum |
12.40 |
Maximum |
163.90 |
Count |
204.00 |
Range |
151.50 |
First Quartile |
51.55 |
Third Quartile |
86.98 |
Interquartile Range (IQR) |
35.43 |
Coefficient of Variation (CV) |
42.45% |
Exhibit 7
Total
0-20 20-40 40-60 60-80 80-100 100-120 120-140 140-160 160-180 |
(i) What type of chart given in the Exhibit 7
Give three essential items/elements that will improve this type of chart.
(ii) Using Exhibit 6 and 7, describe the shape of the “Annual Salaries” of the employees.
(iii) It is hard to state the mode in the continuous data. However, from Exhibit 7 what would be
the annual salaries modal class?
Obtain the percentage of this modal class. Show your calculations.
(iv) Using Exhibit 7 interpret in context the central tendency of the annual salaries.
(v) To describe the variation of data, we use Range, interquartile range (IQR) and coefficient of variation (CV).
From Exhibit 7, illustrate how we calculate Range, IQR and CV
Give interpretations for two of any of these three measures.
b) To use the Empirical Rule we assume the data is approximately normally distributed. If this rule is applied to the employee’s annual salaries, use Exhibit 7 to answer the following questions.
(i) Calculate the annual salary that 2.5% of employees who receive a salary equal to or lower than this value. Show your workings.
(ii) What is an approximate probability of a randomly selected employee who receive a salary
between $41,022 and $101,527?
Show your workings.
Question 3 [1 + 3 + 4 + 2.5 + 2 + 6 = 18.5 marks]
a) Using the data “US Income.xlsx” let us define the following variables:
Income (Per Year) = Individual annual income ($).
Female = 1 if an individual is a female and 0 if a male
Proportion = values of 1’s
Using excel we obtain the regression of variable the “Female” on the variable “Proportion” . The out is given below, Exhibit 8.
Exhibit 8
Use the above Exhibit 8 to answer the following question
(i) State and interpret the coefficient of the “proportion” variable.
(ii) Calculate the male proportion and compare it with the female proportion.
(iii) State and interpret the 95% confidence interval for the proportion variable in Exhibit 8 in context.
b) (i) As we are dealing with the sample proportion, explain the concept of “repeated sampling” and give a “mathematical” interpretation of the 95% confidence interval in Exhibit 7.
(ii) Explain what effect the sample size being decreased will have on the confidence interval in
Exhibit 1 if all other factors are assumed constant.
Explain your answer.
(iii) What would happen to the confidence interval in Exhibit 1 if we instead constructed a 90%
confidence level given that all other factors remain constant?
Explain your answer.
c) Next, we use Excel to obtain a regression of Income (Per Year) on Female. The output is given below in the Exhibit 9.
Exhibit 9
Use the above Exhibit 9 to answer parts i, ii & iii
(i) State and give an interpretation of the intercept coefficient.
(ii) State and give an interpretation of the slope coefficient.
(iii) Carry out a hypothesis test to establish if the difference between the male and female annual
incomes is significant at 1% level of significance
Use the p-value approach and show all the steps.
Question 4 [3 + 3.5 + 4.5 + 2.5 = 13.5 marks]
We want to explore the variables that might influence the amount the customer spends at HyTex on purchases. We define the following variables:
Y variable:
AmountSpent = Total amount spent on purchasesfrom HyTex ($).
X variables:
Male = 1 if a person is a male and 0 iffemale.
OwnHome |
= 1: if person owns home and 0 if renting. |
Married |
= 1 if person is married and 0 otherwise. |
Salary |
= Combined annual salary of person and spouse (if any). |
Children |
= Number of children living with person. |
Catalogs |
= Number of catalogs sent to person this year. |
a) (i) First, carry out a correlation matrix of variable of “AmountSpent” on the above variables” . Name that Exhibit 10.
Exhibit 10
Male OwnHome Married Salary Children Catalogs AmountSpent |
|||||||||
Male |
1 |
|
|
|
|||||
OwnHome |
0.084433 |
1 |
|
|
|||||
Married |
0.116057 |
0.264009 |
1 |
|
|||||
Salary |
0.261492 |
0.460736 |
0.675633 |
1 |
|||||
Children |
-0.10547 |
-0.03227 |
0.00977 |
0.049663 |
1 |
||||
Catalogs |
0.087351 |
0.093132 |
0.13706 |
0.183551 |
-0.11346 |
1 |
|||
AmountSpent |
0.201687 |
0.350811 |
0.475879 |
0.699598 |
-0.2223 |
0.472644 |
1 |
(ii) From Exhibit 10, which variable have the most influence and the least impacts on the
“AmountSpent”?
Sate and interpret one of these two values.
b) Excel is used to regress “AmountSpent” on Male, OwnHome, Married, Salary, Children, and Catalogues. The output is given below in the Exhibit 11.
Exhibit 11
SUMMARY OUTPUT |
||||||
Regression Statistics |
||||||
Multiple R R Square Adjusted R Square Standard Error Observations |
0.8119 0.6592 0.6571 562.7842 1000 |
|||||
ANOVA |
||||||
df |
2023-01-19