Links to all tutorials, auxiliary lessons, and lecture notes are available on the schedule page, but they will be listed here for convenient reference.
Tutorials marked with an asterisk are available to Caltech personnel only. This is because they contain unpublished information that the researchers who generously supplied the data would prefer be shared in their own publication. Apologies to non-Caltech people who may otherwise find these tutorials useful.
Similarly, many of the data sets are also only available to Caltech personnel. This is because many of them are unpublished and/or being used for ongoing research.
Tutorials
- Tutorial 0a
- [Before first class] Setting up a Python environment for scientific computing
- Tutorial 0b
- [Before first homework] Introduction to Jupyter notebooks
- Tutorial 0c
- [Before first homework] Introduction to LaTeX
- Tutorial 0d
- [Before first homework] A sample homework problem and solution
- Tutorial 1a
- [09/28] Introduction to Python
- Tutorial 1b
- [09/28] Exploratory data analysis (data set)
- Tutorial 2a
- [10/03] Managing data sets
- Tutorial 2b
- [10/03] Defining parameters and estimating them (data set)
- T2 exercise
- [10/03] solutions (data set)
- Tutorial 3a
- [10/10] Parameter estimation by optimization (data set)
- Tutorial 3b
- [10/10] Probability distributions and their stories
- T3 exercise
- [10/10] solutions
- Tutorial 4a
- [10/17] Mamimum likelihood estimation (data set)
- Tutorial 4b
- [10/17] Parameter estimation with Markov chain Monte Carlo
- T4 exercise
- [10/17] solutions
- Tutorial 5a
- [10/24] Credible regions
- Tutorial 5b
- [10/24] Outliers (data set)
- T5 exercise
- [10/24]
- Tutorial 6a
- [10/31] Model selection
- Tutorial 6b
- [10/31] Parallel tempering Markov chain Monte Carlo (data sets 1, 2)
- T6 exercise
- [10/31]
- Tutorial 7a
- [11/07] Hacker stats and frequentist methods (data set)
- Tutorial 7b
- [11/07] Dancing statistics (data set)
- T7 exercise
- [11/07]
- Tutorial 8a
- [11/14] Time series and data smoothing (data set)
- Tutorial 8b*
- [11/14] Extracting data from images (data set)
- T8 exercise
- [11/14]
- Tutorial 9a*
- [11/21] Basic filtering and thresholding (data set)
- Tutorial 9b*
- [11/21] Segmentation
- T9 exercise
- [11/21]
- Tutorial 10a
- [11/28] Colocalization (data set)
- Tutorial 10b
- [11/28] Data validation (files for in-class instruction)
- T10 exercise
- [11/28]
Lecture notes
- Lec 1
- (W 09/28) Probability, Bayes's theorem and the logic of science
- Lec 2
- (W 10/05) An example of Bayesian parameter estimation: The mean and variance from repeated measurements
- Lec 3
- (W 10/12) Markov chain Monte Carlo
- Lec 4
- (W 10/19) Model selection
- Lec 5
- (W 10/26) Parallel tempering Markov chain Monte Carlo
- Lec 6
- (W 11/02) Frequentist methods
- Lec 7
- (W 11/09) Intro to images
- Lec 8
- (W 11/16) Hierarchical models
Auxiliary lessons
- Aux 1
- (Th 09/29) Review of concepts in probability (HK)
- Aux 2
- (Th 10/06) Introduction to Bokeh for interactive plotting (JB) [data sets: 1, 2, 3, 4, 5]
- Aux 3
- (Th 10/13) Discussion on priors (JB) [data set]
- Aux 4
- (Th 10/20) Principle component analysis (HK) [data set]
- Aux 5
- (Th 10/27) Kernel density estimation (GC) [data sets 1, 2, 3]
- Aux 6
- (Th 11/03) K-means clustering (PQ) [data set 1, data set 2]
- Aux 7
- (Th 11/10) Support vector machines (PQ/JA)
- Aux 8
- (Th 11/17) t-distributed stochastic neighbor embedding (HK) [data set 1, data set 2]
- Aux 9
- (Th 12/08) Watershed algorithm for segmentation (GC) [data set]
Homeworks
- HW 1
- due 1pm, Oct. 2 (data set) [solutions]
- HW 2
- due 1pm, Oct. 9 (data set 1, data set 2) [solutions]
- HW 3
- due 1pm, Oct. 16 (data set 1, data set 2, data set 3, data set 4) [solutions]
- HW 4
- due 1pm, Oct. 23 [solutions]
- HW 5
- due 1pm, Nov. 6 (data set 1, data set 2, data set 3) [solutions]
- HW 6
- due 1pm, Nov. 6 (data set 1, data set 2, data set 3) [solutions]
- HW 7
- due 1pm, Nov. 13 (data set 1, data set 2, data set 3) [solutions]
- HW 8
- due 10am, Nov. 23 (data set 1, data set 2) [solutions]
- HW 9
- due 1pm, Dec. 7 (data set 1, data set 2) [solutions]
- HW 10
- due 1pm, Dec. 9