From Debra's email: Matt Rocklin suggested using some data sets in common through the book, so feel free to coordinate with others on the project. The Dask chapter will also be written using the data and projects described in some of the other chapters.
@mrocklin do you have an overview of data sets already in use? For the SciPy chapter we'd be happy to reuse something as well.