关键词 > EC204

EC 204-A1, C1, E1 Empirical Project Information Fall 2025

发布时间:2025-12-13

Hello, dear friend, you can consult us at any time if you have any questions, add WeChat: daixieit

Empirical Project Information

EC 204-A1, C1, E1

Fall 2025

The empirical paper is a very important assessment for this class. Doing a good job on this paper can have long term benefits in terms of your job and/or graduate school prospects. This is a group project; no individual papers are allowed. Unless necessitated by class size or other circumstances, all groups must have 3 students. Since this is a big project, there are huge positive externalities from group work. Additionally, learning to work in groups is very valuable for your future professional career, be it industry or academics. According to Finley (2021), ability to work effectively in teams is listed as the most important skill among college graduates by employers.

Everyone in the group gets the same points, unless group member(s) bring to my attention about non-participation of one or more members for any of the assessments related to the project. In that case, the non-participating group member(s) will not be earning the same grade as the group on the paper assessments. In the extreme case, their grade may even be a 0 on the entire paper. If you are unable to contact your group mates or they aren’t doing their assigned work on time, please bring this to my attention ASAP, so that I can intervene early on.

The empirical paper requires you to use Current Population Survey (CPS) data. This data is housed in the Integrated Public Use Microdata Series (IPUMS) data repository. All projects MUST  use  CPS  data,  no  other  data  set  use  is  allowed  for  this  project.  There  are  some exceptions, for example, you may use macro level price data to convert nominal variables into real terms. The following are some steps that need to be completed before starting work on your paper.

1.   Have access to Stata either by buying it or from public computers on campus (check options for this semester).

2.   Create    a    login   to    access    IPUMS   –   CPS    by    visiting   the    following   website https://cps.ipums.org/cps/.

3.   Start familiarizing yourself with the  CPS data by reading DOCUMENTATION on the left on the above website.

4.   Practice  downloading  data. You will need some basic familiarity with Stata, a data extraction software, and your login. Details about downloading data from IPUMS-CPS has been  explained  in  a  video  uploaded  under  “Videos  related  to  the  class” folder on blackboard.

5.   Understand the variables in the downloaded data by reading the variable definitions.

6.   Practice cleaning data, creating variables, and running some regressions in Stata.

7.   Learn how to read and analyze regression outputs.

There are three required assessments for this project and the project in total is 30% of your final grade:

1.   Submission of your paper proposal on blackboard (3% of your total grade) by 11:59 pm on W, 10/08

2.   Presentations on Simple Regression Models (7% of your total grade) in-class on W, 11/12; F, 11/14; M, 11/17

3.   Final version of the paper (20% of your total grade) due on turnitin by 11:59 pm on W, 12/10

Details on each of these assessments are below:

1.   Paper Proposal Submission:

You  must  spend  time  as  a  group  examining  the  data  set  before  submitting  your proposal.  A  sample  proposal  form  has  been  uploaded  on  blackboard  under  the “Empirical  Paper”  folder  with  the  questions  below.  Each  group  will  submit  ONE proposal form with a few lines for each of the questions listed below. The document must not exceed one page with a font size of 12. The second page can be for citations.

i)          What is the relationship that you will be examining in your paper? You must be able to examine this relationship using CPS data. (0.5 point)

ii)         Why is the relationship important? Evidence and citation must be presented to support this importance. (0.5 point)

iii)        Time point(s) for your analyses and whether it is annual or monthly and why choosing those time points would be interesting. Note: both x and y need to be available in the same time points. (0.5 point)

iv)        Appropriate main x and y variables to capture the relationship, and more

importantly, how you would clean your x and y variables (what observations would you drop, would you need to create dummy variables etc.). (1 point)

v)          Mention the other variables from the data set that you will be including to

control for endogeneity and omitted variables bias in your relationship. (0.5 point)

The points assigned above are approximate.

Please don’t choose both your main x and y as dummy variables, one of them as a dummy variable is fine. Some popular and doable topics are based on health, fertility (nchild), unemployment, citizenship, voting behavior, racial or gender differences. I talk about gender and racial differences in wages extensively in class, so ideally, you should choose a different dependent variable, if you are passionate about focusing on these  demographic  differentials.  Unique  topics  are  always  welcome,  increase  the chances of publishing, and will earn you some bonus points, but ambition needs to be matched  with  attainability.  To  fulfill  the  requirements  of  the  class,  it  is  more important to have a polished paper by the end of the semester.

This proposal submission will help me give you feedback on your topic and let you know early on whether you are on the right track, and potentially suggest a revised topic based on your interests.

2.   Presentations on Simple Regression Models:

The aim of these presentations isto gauge whether you have correctly estimated your simple  regression  models  and  know  how  to  interpret  the  estimates  from  them. Presentations  are  short,   no  more  than   5-6   minutes  with  some  additional  time allocated for my feedback.

Your slides must contain the summary statistics table, summary statistics according to your  main x,  all  possible  functional  forms  and  how you  cleaned  your x  and y variables. However, you can focus on explaining coefficient, intercept, slope, elasticity and semi-elasticity from only one main regression model. Presentations must include a title for your project.

All group members must be participating equally, not necessarily presenting equally, but must be able to answer all questions that I direct at them.

At this point, copying and pasting the Stata tables on the slides would be enough, make sure the font size is big enough to be visible at the back (my eyes are not great). It is your group’s job to note down my comments.

You will be graded on the following. The points assigned are approximate.

i)           Clarity of your slides (1 point)

ii)         Presentation of tables – this includes steps before getting your tables like data cleaning, paying careful attention to sample size etc. (2 points)

iii)        Interpretation of coefficients (3 points)

iv)        Groupmates participating equally (1 point)

3.   Final version of the paper: The final paper should have the following sections in the order listed:

Title Page:

Must have:

i)          your name(s)

ii)         title

iii)        abstract (no more than 200 words): Abstract must have your research question, the relationship that you are examining, anything new that you did with respect to past research, and your findings. You must also include information on your data set and variables. When you explain your findings, choose the ones that are most interesting/statistically significant. Do not exclude findings from your interaction terms.

Introduction:

Must include the following:

i)           Research question, which is the relationship that you are examining in your paper

ii)         Why that question is relevant (motivation). The motivation needs to be backed by citations. In this section, citations can be from news, policy reports etc.

iii)        A brief description of the economic model(s) used in the analyses

iv)        Summary of the major findings ofthe analyses.

v)          Final paragraph should have the outline of the different sections in the paper (section II is lit review, section III is econometric model etc..)

Literature Review:

Must contain a summary of 3 similar studies conducted in the same area as your paper. When you summarize a paper, explain the innovation in the paper relative to past studies, data set, important variables and results that are relevant to your paper. You should also mention how your paper is similar and different from these studies that you are reviewing. Try to include regression-based papers, which will make it easier to compare similarities and differences with your paper. Explain in terms of the data set, variables, research question etc. If you cannot find papers that analyze the  exact  relationship  that  you  are  looking  at,  try  to  find  close  enough  papers. Literature review should be at most 2 pages.

Econometric Model:

Write the equation for the simple and multiple population regression models and state the parameter of interest (which is the β associated with the main x). Explain the econometric models that you are running – log linear, log log etc, interaction terms, hypothesis tests, specification tests (if we are able to do) that you are doing. Use Microsoft Equation Editor for writing equations.

Explain the economic theory behind the econometric model that you are estimating. How do the additional x variables help you control for omitted variables bias and endogeneity? Why have you chosen to create these interaction terms? Why are you running the hypothesis tests? If you can support your reasoning with citations, that is even better.

There  is  no  need  to  write  regression  equations  for  all  the  models  that  you  are estimating.  In  this  section,  you  should  be  describing  all  the  analyses  that  you’re conducting in your paper. Do not explain the results in this section, that is done in the “Description of Results” section.

Data and Descriptive Statistics:

Explain your data  (years  chosen, description of data set, sample size etc) and the description  of  the  dependent  and  independent  variables  used  in  your  analyses. Explain how you cleaned data from missing observations. Do not devote too many pages in describing the variables, since Table 1 in appendix will have the variable definitions. All tables are in the appendix section. Explain tables 2 and 3 highlighting the  interesting  trends/findings.  You  can  tie  this  to  past  studies/news/general observations. Write in paragraphs.

Estimation Results:

Explain the relevant and interesting results in your analyses (Tables 4 – 7). Say how the main estimate changes in different models. Refer Table#s and model#s in the tables when explaining. Pay careful attention to the units when you interpret. You must  link  your   results  to  past  studies,  wherever  possible.  Results  must  be  in paragraph form. You can choose to focus more on one result but try to mention all tables in your explanations.  In  other words,  don’t just present tables without mentioning them in the text of your paper.

Conclusions:

i)           Reiterate the relationship that you are examining

ii)         Briefly describe how your study is related to other studies

iii)        State your important result(s) again and the broad implications from your results

iv)        Drawbacks/limitations of your study

v)          Interesting extensions from your paper

References:

•   You should include a detailed reference section citing all your work. You can use any of the acceptable formats for academic work (but all references must follow the same format). Please see the following link for APA formatting:

APA Style Introduction // Purdue Writing Lab

•   The references that you cite at the end, should also appear in the main text of your paper. See ‘other considerations related to the paper’ for more information.

•    References must be in alphabetical order.

•   Cite CPS. The citation is as follows. It is also available in this link IPUMS CPS

Sarah Flood, Miriam King, Renae Rodgers, Steven Ruggles, J. Robert

Warren, Daniel Backman, Annie Chen, Grace Cooper, Stephanie Richards, Megan Schouweiler, and Michael Westberry. IPUMS CPS: Version 12.0

[dataset]. Minneapolis, MN: IPUMS, 2024.

https://doi.org/10.18128/D030.V12.0