APH420 Generalized Linear Models – Project 3
Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit
APH420 Generalized Linear Models – Project 3
Data: You are given a dataset UNLifeExpecatancy.csv and related data for human development reports, available at https://hdr.undp.org/en/data. Please refer to the data description file and Table 1 for the region description.
Table 1: Description of Region 1-8 in the data set UNLifeExpectancy.csv.
Region |
Description |
Region |
Description |
1 |
Arab states |
5 |
Southern Europe |
2 |
East Asian and the Pacific |
6 |
Sub-Saharan Africa |
3 |
Latin American and the Carribean |
7 |
Central and Eastern Europe |
4 |
South Africa |
8 |
High-income OECD |
Tasks: You are asked to explore how the national life expectancy changes and differs in regions and in potential health care systems, using at least a linear regression model and a generalized linear regression model. You should also include an explorative analysis to motivate the model building. Interpret your model and apply it if possible.
Instructions: Please attach/copy your R code at the end of your report. No Code No Grade! File type should be either WORD or PDF. Page limit: 20 pages. The final marks of the statistical reports consist of two parts: 60 marks for the scientific questions and 40 marks for the report structure. The quality of your analysis will be assessed according to the grading rubrics in Table 1, and your report should follow the basic criteria for an academic report mentioned in Section 1.
Late penalty: 5% of the total marks will be deducted for each day past the due date. Work submitted after 5 days (i.e., 120 hours past the due date) will normally receive a mark 0.
1 Structure of the Report (40 marks)
A complete, well-structured report convinces readers of the scientific findings of significance. Please write an informative and brief report based on your analysis of national life expectancy. The report’s structure and its criteria are stated as below.
1. (5%) Title, abstract and key words
The title should be concise and attractive, with informative abstract convincing the readers of the questions, methodology and findings, with the help of three to five key words. Please also include your name and student ID as the author in this cover page.
2. (10%) Introduction
For a general report, the introduction should be partitioned into three sections:
a. Orientation material (describe purpose).
b. Key aspects of the report.
c. A plan of the paper.
3. (20%) Data characteristics
a. Describe the data provided and the data used in the analysis. Document source of the data.
b. In the data characteristics section, identify the nature of data (longitudinal versus cross-sectional, observational versus experimental, etc).
c. Quality of scatter plots to indicate primary relationships in cross-sectional data or time series plots to indicate most important trends in longitudinal data.
d. Quality of statistics to indicate primary relationships or to emphasize trends.
e. Do the plots, and concomitant summary statistics, serve to identify unusual points that are worthy of special consideration?
f. Do the details presented in this section foreshadow the development of the model in the subsequent section?
4. (40%) Model selection and interpretation
a. An outline of the section to address the following questions: Is the purpose behind this model selection clear (understanding versus forecasting)? A statement of the recommended model. Is the model reasonable?
b. An interpretation of the model: parameter estimates and any broad implications of the model. Have the variables and coefficients of the model been adequately interpreted?
c. The basic justifications of the model. Has the model been justified (coefficient of determination(R2 ), s, t-ratios, residual plots, out-of-sample validation)?
d. An outline of a thought process that would lead up to this model.
e. A discussion of alternative models. Have alternative models been explored (transforms, deleting parts of data, analysis of components)?
5. (15%) Summary and concluding remarks
a. Rehash the results of the report in a concise fashion, in different words than the executive summary.
b. Comments on quality of data and reliability of concomitant inferential statements (what type of data would improve the reliability of your statements)?
c. Include ideas that you have about future investigations.
6. (5%) References
The reference list should include at least those recent studies on the similar topics, concerning its methods, data and models:
a. Is each reference helpful and relevant for the literature review for the discussions involved in the topic?
b. Is each reference be clearly written with all compulsory elements, e.g., authors, titles, journals, pages, years?
7. (5%) Appendix
In a statistical report or an academic paper, we may put the techniques or main theory in Appendix. In general, we may put all codes in the Appendix. To have a brief appendix, keep the following two points in mind:
a. Is there a good relationship between the discussion in the main report and the presentation in the Appendix?
b. Is each portion of the Appendix be clearly identified, especially with respect to its relation to the main body of the report?
2 Grading rubrics
Responses will be marked as follows.
Figure 1: Grading Rubrics.
2023-04-18