Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

BBA3047 Applied Mathematics and Statistics in Hospitality Business

Business Analytical Report

CASE STUDY: Project 2 - Arabian cities

Context:

Your consulting team has been asked to analyse data collected in July 2019 for samples of accommodations in three Arabian cities.

As a part of your research, you should analyse the following variables: the ACCOMMODATION'S DISTANCE TO CENTRE (in km) and the ACCOMMODATION PRICE (in CHF). Moreover, you are also asked to conduct a correlation and regression study between the ACCOMMODATION'S RATING SCORE and the ACCOMMODATION PRICE (in CHF).

Some important information about the data:

- The accommodations’ distances to centre, the accommodations’ rating scores and the accommodation prices were extracted from the Booking.com website.

- The accommodations’ rating scores are based on the average evaluation of accommodations’ quality and operations, rated by customers on a scale from 1 to 10,
with 1 indicating the worst and 10 the best result.

- All the accommodations are of at least 4-star standard.

All the data needed for these studies can be found in the Excel file:
P02 - ARABIAN CITIES.xlsx

You should analyse the data from JEDDAH with the corresponding set of hotels (from the Excel file provided).

Variable: ACCOMMODATION'S DISTANCE TO CENTRE (in km)

1. Summarise this variable using a grouped frequency distribution and present it on a histogram.

2. Calculate measures of central tendency (mean, median, mode) for this variable.

3. Calculate measures of spread (range, variance, standard deviation and coefficient of variation) for this variable.

Variable: ACCOMMODATION PRICE (in CHF)

1. Calculate measures of central tendency (mean, median, mode) for this variable.

2. Calculate measures of spread (range, variance, standard deviation and coefficient of variation) for this variable.

3. Calculate the five-number summary for the boxplot. Calculate the lower limit and the upper limit for the outliers. Identify if there are any outliers and, if yes, list them on the summary table with the results.

Variables: ACCOMMODATION'S RATING SCORE and ACCOMMODATION PRICE (in CHF)

1. In your analysis use variable ACCOMMODATION'S RATING SCORE as the independent variable, and ACCOMMODATION PRICE (in CHF) as the dependent variable.

2. Present the underlying data on the scatter diagram.

3. Calculate the correlation coefficient r, regression coefficients a and b, and the coefficient of determination R2. Add the regression line, the regression equation and the R2 value to the scatter diagram.

Important: All the results of calculations (presented with two decimals) should be summarized in ONE table (provided on the next page) and presented in a first section of the summary report: Data and Results Presentation, together with the histograms and the scatter diagrams for all three Arabian cities.  

For the details concerning the report structure, please refer to the project outline.

As only the content of your summary report will be graded, you must present all the results of calculations from the Excel file in the word document. Excel file MUST NOT be submitted and WILL NOT be graded.

Table to include in the summary report (in this way AND order exactly, all the results should fit to ONE PAGE in your final report):

 

JEDDAH

RIYADH

DOHA

Variable: ACCOMMODATION'S DISTANCE TO CENTRE (in km)

Mode(s)/Modal class

 

1-2.9; 3-4.9

3-4.9

Median

 

3.10

3.88

Mean

 

3.18

3.83

Shape of distribution

 

Right-skewed,
Bimodal

Left-skewed,
Unimodal

Range

 

7.00

7.00

Sample

Variance

 

3.42

2.97

Sample st. deviation

 

1.85

1.72

Coefficient of variation

 

58.27%

45.06%

Variable: ACCOMMODATION PRICE (in CHF)

Mode(s)/Modal class

 

140

No mode

Median

 

191.50

189.50

Mean

 

246.63

211.40

Shape of distribution

 

Right-skewed,
Unimodal

Right-skewed,
Uniform

Range

 

581.00

405.00

Sample

Variance

 

22760.24

7609.43

Sample st. deviation

 

150.86

87.23

Coefficient of variation

 

61.17%

41.26%

Five-number summary

Min for boxplot

 

81.00

96.00

Q1

 

140.00

151.00

Q2

 

191.50

189.50

Q3

 

316.50

238.75

Max for boxplot

 

560.00

359.00

Lower limit for outliers

 

-124.75

19.38

Upper limit for outliers

 

581.25

370.38

Outliers (values)

 

587.00 and 662.00

500.00 and 501.00

Correlation and Regression

r

 

0.51

0.35

a

 

-737.29

-635.13

b

 

122.72

98.49

R2

 

26.40%

11.96%

1. In the ANALYSIS AND DISCUSSION section, you should compare your individual results with the data provided in the case study (in the above table) and address the following issues:

ACCOMMODATION'S DISTANCE TO CENTRE (in km):

In which Arabian city is the ACCOMMODATION'S DISTANCE TO CENTRE (in km) typically the lowest and in which Arabian city is the highest, and how this analysis can be linked to the shape of the underlying distributions?

In which Arabian city is the variability of ACCOMMODATION'S DISTANCE TO CENTRE (in km) the largest and which measure of spread would be the most appropriate to analyse this variable?

ACCOMMODATION PRICE (in CHF):

In which Arabian city is the ACCOMMODATION PRICE (in CHF) typically the lowest and in which Arabian city is the highest?

Which measure of central tendency should be used for each Arabian city to represent their typical ACCOMMODATION PRICE (in CHF)?

In which Arabian city is the variability of the ACCOMMODATION PRICE (in CHF) the largest and which measure of spread would be the most appropriate to analyse this variable?

Present all boxplots (for all three Arabian cities) on one diagram and analyse this variable using boxplots: what similarities and differences can you notice?

ACCOMMODATION'S RATING SCORE and ACCOMMODATION PRICE (in CHF)

Compare the scatter diagrams and the values of correlation coefficient, r, and analyse in which Arabian city the ACCOMMODATION'S RATING SCORE has the greatest impact on the ACCOMMODATION PRICE (in CHF).

Analyse whether the intercepts from the regression equations are useful for the practical analysis of the underlying data.

How would you interpret the differences/similarities in the values of the slopes from the regression equations calculated for the underlying data?

What can influence the accuracy of the forecast conducted using the regression equations calculated for the underlying data? Comment on the other factors than the ACCOMMODATION'S RATING SCORE that may have impact on the variability in the ACCOMMODATION PRICE (in CHF).

Remember to always justify your analysis and answers.

2. In the RECOMMENDATIONS section propose practical recommendations arising from the data analysis.