First, thanks FiveThirtyEight for making your data available--this is very cool to see, and interesting to be able to replicate results.
One request I have, which may or may not be tenable, is to make data available from the earlier stages of variable construction. For instance, in Nate's recent piece on airline safety, the data we have access to is the number of incidents, fatalities, etc from 1985 to 1999, and again from 2000-2014. While the data is interesting to see, the same data by year would be even more interesting, as would a list of all incidents and how they are coded.
For example, it's easy to imagine for example analysis that could be done by combining the by-year incidents with airline sales in subsequent (something Nate alludes) to. However, this isn't something we're able to do, given that we can only see the data in 15 year chunks. The earlier the data is available to us, the more flexibility we'll have in using your data to develop our own theories and test them, which makes it of greater use to us.
I can appreciate the reasons why this might not be a good idea for FiveThirtyEight (in particular, it means any assumptions and decisions made in cleaning are potentially open to criticism) but to the extent possible, making the earlier stages of your data publicly available would be very appreciated.