Blog posts
In addition to the general Fragile Families documentation, the following blog posts provide more details about the data and the scientific goals of the project.
- Blog posts about outcomes
- Apply to participate
- Build a model
- Upload your contribution
- Evaluating submissions
- Overview of FF Challenge data files
- Contents of the Stata .dta file containing metadata
- Getting started with Stata
- Using .dta files in R
- Using .dta files in Python
- Helpful ideas
- Reading survey documentation
- Missing data
- How much should I trust the leaderboard?
- Prior research can inform your model
- Compare to the baseline and avoid overfitting
- Participant-generated resources
- Codebook support (.csv file, beta mode)
- Machine-readable codebook
- Constructed variables: Data dictionary
- A data pipeline for the Fragile Families Challenge
- Timeline of challenge
- Progress report from Cos 424 pilot at Princeton
- Progress prizes (May 10, 2017, 2pm Eastern time)
- Final submission deadline (August 1, 2017, 2pm Eastern time)
- Deadline to submit a manuscript to a special issue of Socius (October 1, 2017, 11:59pm Eastern time)
- Weekly office hours
- Scientific goals
- Discover unmeasured and important factors
- Prioritize issues for intervention: Causal inference
- Compare modeling approaches
Olajide Ajayi - April 18, 2017
“background.csv contains 4,242 rows (one per child) and 12,944 columns”. I saw 12,943 columns – missing one??
Ian Lundberg - April 19, 2017
You are correct – there are 12,943. Just updated the “Get the data” blog post. Thanks for catching this!
Ian Lundberg - April 19, 2017
(for those reading this afterward: that blog post is called “Overview of FF Challenge data files above”)