Summer 2021 MIEF Quantitative Methods I (Basic Econometrics) Final Exam
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
Final Exam
Summer 2021 MIEF Quantitative Methods I (Basic Econometrics)
August 24-26, 2021
Instructions. This is an open book, open note exam. Please handwrite your answers on plain paper or lined notebook paper. Please refer to any books or notes on a computer or notebook that you wish. Please do watch your time. For any calculations you are asked to carry out, please use 3 to 4 significant figures (not more, not less). Please write formulas and show calculations where necessary; draw pictures of probability distributions as appropriate.
Please copy the following statement at the beginning of the first page of your answers which is in addition to the Honor Code quiz:
I will complete this assessment in accordance with the rules and spirit of the school’s Honor Code and Academic Integrity Policy as outlined in the Red Book. I will not receive or give assistance to another student completing this assessment, not will I consult any internet resources, or other unauthorized materials before, during, or after the test.
The questions below lead you through an analysis of the Stata computer output contained in two exam appendices. Each question is worth 5 points so please try to answer all questions. Please watch your time. You have 4.5 hours for the exam. Assuming you use 30 minutes to scan and upload your written answers, you will have 4 hours or 240 minutes or 12 minutes per question on average. Keep in mind that some questions will take longer than other questions to answer.
Questions 1 through 16 use the data and analysis from Appendix 1 which is based on a modified Wooldridge data set “airplane” that consists of data on a large number of airline routes. The data for 2017 and 2020 are used in this analysis. See page 1 for the first 40 observations for 2020 and page 13 for both years listed. Questions 1-14 use only the data for 2020. Questions 15-18 use the data for both time periods. The variables are:
fare average one-way fare ($)
year 2017 or 2020
Id route identifier (1 to 1147 – therefore 2298 observations)
dist distance in miles
passen average number of passengers per day
bmktshr fraction of market, biggest carrier (ie, big market share)
y20 =1 if year == 2020, otherwise = 0
Note: Other variables are computed in the Stata output
Questions 1-18 use Appendix 1
1. Regression #1 on Page 2 sets out to explain fare in terms of the distance (dist), number of passengers on that route per day (passen), and a measure of competition on a route (bmktshr). Write out the function estimated with standard errors and t-values in parentheses. Put in words the regression coefficient on dist and bmktshr (pay attention to the units of bmktshr). Put in words the R squared
2. Regression #1 continued. Stata produces a 95 percent confidence interval for the population coefficient on passen. Compute a 99 percent confidence interval. Explain why it is narrower or wider. Please show a clear picture of the probability distribution.
3. Regression #1 continued. What hypothesis is being tested with the F statistic displayed? Show the hypothesis test with a picture and critical F value (use α = .01).
4. Following Regression #1 are some predictions and other calculations and a listing of some of the values below and on page 3, including the r variable. Describe the r variable and what this tells you about certain airline routes.
5. Regression #2 on page 3 carries out a test. What is this test called, what is it testing, and what conclusions do you draw from this test?
6. Some new variables were computed on the top of page 4 and added to those in Regression #1 leading to Regression #3 on page 4. Write out the function estimated with the standard errors below each slope coefficient. (Leave space for an additional standard errors). Discuss the statistical significance of the two new variables and what that says about the functional form. Compare the adjusted R squared between Regression #1 and Regression #3. What role did the statistical significance plan in that change? Using Regression #3, if there is a route with 500 passengers per day, and the number of passengers increases by one, what is the predicted increase in the fare?
7. Consider a route with dist = 1000 and bmktshr =.5. Using Regression #3 on page 4, compute the predicted change in fare when dist increases by 1. Continuing using Regression #3 on page 4, compute the predicted change in fare when bmktshr increases by 1.
8. State in symbols and words the homoscedasticity assumption when Regression #3 was carried out. Regression 4 and 5 on pages 4 and 5 are run testing whether the homoscedasticity assumption was violated. State clearly the names of these two tests and the assumptions involved for each test. From Regression #4, can you tell which variable is causing the problem? Explain briefly.
9. Regression #6 on page 5 uses the same variables as Regression #3. Go back to Question 6 and add the standard errors from Regression #6 written appropriately below each standard error for each slope coefficient. Describe briefly why the standard errors from Regression #6 are more appropriate than the standard errors reported in Regression #1.
10. Discuss the assumptions made with Regression #7 on page 6. Do you think those assumptions are appropriate? Regression #8 on page 6 and Regression #9 on page 7 follow. Explain why Regression #9 is better than Regression #7 (if it is). Discuss briefly the variables that are used in Regression #7 and Regression #9.
11. What is the analysis of Regression #10 on page 8 and Regression #11 on page 8 called? Be specific about where else the slope coefficient in Regression #11 is found.
12. Some subtraction variables are generated prior to Regression #12 on page 9. They are used in Regression #12. Use the analysis of Regression #12 to answer the following question. From Regression #1 on page 2, what is the predicted fare for a new route, not in the original sample, with dist = 1000, passen = 500, and bmktshr = .5? Develop a 90 percent confidence interval for that prediction.
13. Regression#13 on page 9 uses a different functional form from Regression #1 on page 2. Put in words the slope coefficient on lndist. Regression #14 on page 10 uses the same independent variables as Regression #3 on page 4. Put in words the slope coefficient on dist. Show the calculations for the exact change. Following Regression #15 on page 10 are some calculations. Briefly explain those calculations and from the result identify which function fits the data better,
Regression #3 or Regression #14.
14. Some new variables are calculated and listed on page 11 and used in Regression #16 on page 12. Put in words the first value of zfare listed (-.846) on page 11. Put in words the coefficient on zpassen in Regression #16. Why is the constant essentially zero?
Data for 2017 are added to the analysis; several new variables are generated; and the data is listed on page 13
15. Using Regression #17, Regression #18, and Regression #19 on page 14, carry out the Chow test (SSR version) to see if the coefficients in the function estimated change between 2017 and 2020. Use a .01 level of significance. Show your formula, fill in the formula, show some calculations and your calculated F. Then show this F value on a picture of the F distribution with as statement of the hypothesis and conclusion.
16. Regression #20 on page 15 adds two additional variables to that of Regression #17 on page14. Explain clearly the meaning of the constant, the coefficient on bmktshr, and the coefficients on the two new variables.
17. Write out the model being estimated in Regression #21 on page 15. Following that regression is a test. State the null and alternative hypothesis for this test. What is your conclusion? Are the results the same as Question 15? Use the same level of significance as Question 15.
18. Page 16 carries out some rearrangement of the data to prepare to carry out Regression #22 on page 16. Discuss briefly what type of data is being constructed and how that data is used in Regression #16. What are the assumptions made in Regression #22. What happened to the variable “dist”? What other variable might have been appropriate to include in the analysis from Regression #1 through Regression #10 and how would that variable be dealt with in Regression #11? What assumption of OLS would be violated if there were another appropriate variable that was correlated with one of the independent variables in Regression #17?
Questions 19 and 20 use Appendix 2
19. Appendix 2 presents the value of Ecommerce sales as quarterly, not seasonally adjusted values in millions of U.S.$. Also listed is the percent of overall retail sales carried out by Ecommerce but that variable was not used. Put in words the coefficient on the constant, Time, Q1, Q2, and Q3 from the Regression on page 2.
20. Continuing with Appendix 2. On page 3 is a new variable generated that is included in the Regression on page 3. Put into words the coefficient on Time and Q3. Make sure that for Time and for Q3 you include two values for the impact on ECOM (Ecommerce).
. list Year id fare dist passen bmktshr in 1/40, clean
2022-08-22