Running example - Templates for Reproducible Research Projects in Economics

The template_project that is installed with the templates is a simple empirical project. Its abstract might read:

This paper estimates the probability of smoking given age, marital status, and level of education. We use the stats4schools Smoking dataset and run a logistic regression. Results are presented in this paper; you may also want to consult the accompanying slides.

We can translate this into tasks our code needs to perform:

1. Clean the data
2. Estimate a logistic model
3. For each of the categorical variables, predict the smoking propensity over the lifetime
4. Create figures visualizing the results
5. Create tables with the results
6. Include the results in documents for dissemination (paper, presentation)
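
To make the prediction task (task 3) more concrete, the following is a minimal sketch of how a fitted logistic model turns covariates into a smoking probability at different ages. It is not the code shipped with the templates. The coefficient values, the variable names, and the functions `predict_smoking_probability` and `predict_over_lifetime` are hypothetical placeholders chosen only so that the example runs; they are not estimates from the stats4schools data.

```python
import math


def logistic(z):
    """Map a linear index z to a probability in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-z))


# Hypothetical coefficients for illustration only, NOT real estimates.
COEFFICIENTS = {"intercept": 0.5, "age": -0.03, "married": -0.4}


def predict_smoking_probability(age, married, coefficients=COEFFICIENTS):
    """Predicted probability of smoking for one age and marital status."""
    index = (
        coefficients["intercept"]
        + coefficients["age"] * age
        + coefficients["married"] * (1 if married else 0)
    )
    return logistic(index)


def predict_over_lifetime(married, ages=range(20, 81, 10)):
    """Predicted smoking probability at each age for one category."""
    return {age: predict_smoking_probability(age, married) for age in ages}
```

Calling `predict_over_lifetime(True)` and `predict_over_lifetime(False)` gives one age profile per category of the marital status variable, which is the kind of output that the figures and tables in tasks 4 and 5 would then present.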

In these templates, we categorize these tasks into four groups:

Data Management: task 1
Analysis: tasks 2 & 3
Final: tasks 4 & 5
Documents: task 6
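
The grouping above can be written down as a small lookup table. This is only an illustrative sketch: the names `TASKS`, `GROUPS`, `group_of`, and `is_partition` are made up for this example and are not part of the templates.

```python
# The six tasks of the running example, keyed by task number.
TASKS = {
    1: "Clean the data",
    2: "Estimate a logistic model",
    3: "Predict the smoking propensity over the lifetime",
    4: "Create figures visualizing the results",
    5: "Create tables with the results",
    6: "Include the results in documents for dissemination",
}

# The four groups and the task numbers assigned to each.
GROUPS = {
    "Data Management": [1],
    "Analysis": [2, 3],
    "Final": [4, 5],
    "Documents": [6],
}


def group_of(task_number):
    """Return the name of the group that a task belongs to."""
    for group, numbers in GROUPS.items():
        if task_number in numbers:
            return group
    raise KeyError(f"task {task_number} is not assigned to any group")


def is_partition():
    """Check that every task appears in exactly one group."""
    assigned = [n for numbers in GROUPS.values() for n in numbers]
    return sorted(assigned) == sorted(TASKS)
```

For instance, `group_of(3)` returns `"Analysis"`, and `is_partition()` confirms that no task is left out or assigned twice.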

Naturally, different projects have different needs. For a simulation study, for example, you might want to drop the data management part. Doing so is as simple as deleting the respective directory once you no longer need the example. For most economics research projects, however, this basic structure has proven to strike a good balance between keeping related code in one place and dividing it into chunks of manageable size.

The remainder of this section provides much more detail on why we made these choices.