Course structure
Week | Week beginning | Lecture | Self-study Notebooks | Exercise Notebooks | Workshop Notebooks | Assessment |
---|---|---|---|---|---|---|
1 | 21 Sept. | Python | Python 1 to 4 | Python 3 and 4 | | |
2 | 28 Sept. | | Python 5 to 9 | Python 5 to 9 | Python Workshop 1 | |
3 | 5 Oct. | | Python 10 to 14 | Python 10 to 14 | Python Workshop 2 | Python 1 |
4 | 12 Oct. | | Python 15 to 18 | Python 15 to 18 | Python Workshop 3 | Python 2 |
5 | 19 Oct. | | Python 19 to 23 | Python 19 to 23 | Python Workshop 4 | Python 3 |
6 | 26 Oct. | EDA | EDA 1 to 3 | EDA 1 to 3 | Python Workshop 5 | Python 4 |
7 | 2 Nov. | | EDA 4 to 7 | EDA 4 to 7 | EDA Workshop 1 | EDA 1 |
8 | 9 Nov. | | EDA 8 to 12 | EDA 9, 10 and 12 | EDA Workshop 2 | EDA 2 |
9 | 16 Nov. | | | | EDA Workshop 3 | EDA 3 |
10 | 23 Nov. | Data Analysis | | | | |
EDA self-study and workshop Notebooks
1 - Exploring data with Python
Displaying data
Data analysis in Python
Body mass of Alaskan sockeye salmon
Reading a dataset from file
Examining the dataset
2 - Plotting data - one numerical variable
Plotting data
One numerical variable: histograms
Label your graphs
3 - One categorical variable
Death by tiger
Categories of the categorical variable
Counting categories: Frequency table
Relative frequencies
Rounding numbers in DataFrames
Plotting frequencies: bar plot
A principle of good table and graph design
Never use pie charts!
EDA Workshop 1: Datasets of one variable
4 - Two numerical variables
Guppy ornamentation
5 - Two categorical variables
Avian malaria and reproduction
Contingency table
Displaying two categorical variables in a bar plot
6 - A categorical and one numerical variable
Threespine sticklebacks
Seaborn
Stripplot
Bar plot (a commonly used but poor graph)
Boxplot
Combine strip and boxplots
How to interpret a boxplot
Multiple histogram method
7 - A categorical and two numerical variables
Kenyan finches
EDA Workshop 2: Datasets of two or more variables
8 - Describing data with summary statistics
Summary statistics of categorical variables
Summary statistics of numerical variables
9 - Describing location
Walking in circles
Mean or average
Median
Mode
What do the "mean" and "median" mean?
10 - Describing variability or spread
Range
Interquartile range
Standard deviation
All summary statistics
11 - Normal distribution and standard deviation
What is the standard deviation?
The normal distribution
Heights of college students
The 68-95-99.7 rule
12 - Comparing statistics across categories
Threespine sticklebacks
Grouping data by category
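
As a rough illustration of what notebooks 9, 10 and 12 cover, the sketch below groups a numerical variable by category and computes location and spread statistics for each group. It is not taken from the notebooks: the data values, the `records` list and the `summarise_by_category` helper are invented for this example, and it uses only Python's standard `statistics` module as a stand-in for whatever data-analysis library the notebooks themselves use.

```python
from collections import defaultdict
from statistics import mean, median, stdev

# Hypothetical measurements: (category, value) pairs invented for illustration.
records = [
    ("A", 4.0), ("A", 6.0), ("A", 8.0),
    ("B", 1.0), ("B", 2.0), ("B", 9.0),
]


def summarise_by_category(rows):
    """Group numerical values by category and compute summary statistics."""
    # Collect the values belonging to each category.
    groups = defaultdict(list)
    for category, value in rows:
        groups[category].append(value)

    # Compute location (mean, median) and spread (range, sd) per category.
    summary = {}
    for category, values in groups.items():
        summary[category] = {
            "n": len(values),
            "mean": mean(values),
            "median": median(values),
            "range": max(values) - min(values),
            "sd": stdev(values),  # sample standard deviation (n - 1 denominator)
        }
    return summary


summary = summarise_by_category(records)
for category, stats in summary.items():
    print(category, stats)
```

In this made-up data, category B has a mean of 4.0 but a median of 2.0, because the single large value 9.0 pulls the mean upwards while leaving the median unchanged.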