From edfd8dfe8910992e4d097d0fb7bf88febcb773e8 Mon Sep 17 00:00:00 2001
From: Maneesha Sane
Date: Mon, 6 Apr 2026 15:32:54 -0400
Subject: [PATCH 1/4] typo (clean up, not cleanup)

---
 episodes/00-before-we-start.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/episodes/00-before-we-start.md b/episodes/00-before-we-start.md
index c499585c4..856df6c50 100644
--- a/episodes/00-before-we-start.md
+++ b/episodes/00-before-we-start.md
@@ -102,7 +102,7 @@ when you have multiple projects. In general, you may wish to create separate dir
 your scripts, data, and documents.

 - **`data/`**: Use this folder to store your raw data. For the sake of transparency and provenance,
- you should always keep a copy of your **raw data**. If you need to cleanup data, do it
+ you should always keep a copy of your **raw data**. If you need to clean up data, do it
  programmatically (*i.e.* with scripts) and make sure to separate cleaned up data from the raw data.
  For example, you can store raw data in files `./data/raw/` and clean data in `./data/clean/`.

From de77feb836806f6f345af2c5d4d0029e49e9a591 Mon Sep 17 00:00:00 2001
From: Maneesha Sane
Date: Mon, 6 Apr 2026 15:33:26 -0400
Subject: [PATCH 2/4] typo (choose/chose)

---
 episodes/04-data-types-and-format.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/episodes/04-data-types-and-format.md b/episodes/04-data-types-and-format.md
index 7a930e9f3..d30b2f769 100644
--- a/episodes/04-data-types-and-format.md
+++ b/episodes/04-data-types-and-format.md
@@ -348,14 +348,14 @@ df1['weight'].mean()
 38.751976145601844
 ```

-We can fill `NaN` values with any value that we chose. The code below fills all
+We can fill `NaN` values with any value that we choose. The code below fills all
 `NaN` values with a mean for all weight values.

 ```python
 df1['weight'] = surveys_df['weight'].fillna(surveys_df['weight'].mean())
 ```

-We could also chose to create a subset of our data, only keeping rows that do
+We could also choose to create a subset of our data, only keeping rows that do
 not contain `NaN` values.

 The point is to make conscious decisions about how to manage missing data. This
From c1dd66816b36b2ab31a4738a86baf931e04d523d Mon Sep 17 00:00:00 2001
From: Maneesha Sane
Date: Mon, 6 Apr 2026 15:33:58 -0400
Subject: [PATCH 3/4] typo (missing word)

---
 episodes/06-loops-and-functions.md | 2 +-
 1 file changed, 1 insertion(+), 1 deletion(-)

diff --git a/episodes/06-loops-and-functions.md b/episodes/06-loops-and-functions.md
index d4a7aaa4b..000a715c2 100644
--- a/episodes/06-loops-and-functions.md
+++ b/episodes/06-loops-and-functions.md
@@ -325,7 +325,7 @@ AND the file name itself.

 2. Let's say you only want to look at data from a given multiple of years. How would you modify your loop in order to generate a data file for only every 5th year, starting from 1977?

-3. Instead of splitting out the data by years, a colleague wants to do analyses each species separately. How would you write a unique CSV file for each species?
+3. Instead of splitting out the data by years, a colleague wants to do analyses of each species separately. How would you write a unique CSV file for each species?

 ::::::::::::::::::::::: solution

From d69b98cccba070d61db2bf1d1900e8b43163943e Mon Sep 17 00:00:00 2001
From: Maneesha Sane
Date: Mon, 6 Apr 2026 15:34:23 -0400
Subject: [PATCH 4/4] typo (question mark)

---
 episodes/09-working-with-sql.md | 4 ++--
 1 file changed, 2 insertions(+), 2 deletions(-)

diff --git a/episodes/09-working-with-sql.md b/episodes/09-working-with-sql.md
index 05afb3db4..c43159259 100644
--- a/episodes/09-working-with-sql.md
+++ b/episodes/09-working-with-sql.md
@@ -23,7 +23,7 @@ exercises: 25
 ## Python and SQL

 When you open a CSV in python, and assign it to a variable name, you are using
-your computers memory to save that variable. Accessing data from a database like
+your computer's memory to save that variable. Accessing data from a database like
 SQL is not only more efficient, but also it allows you to subset and import
 only the parts of the data that you need.

@@ -225,7 +225,7 @@ con.close()
  results to their own tables in the portal database.

 2. What are some of the reasons you might want to save the results of your queries back into the
- database? What are some of the reasons you might avoid doing this.
+ database? What are some of the reasons you might avoid doing this?

 ::::::::::::::::::::::: solution