(c) 2017 Justin Bois. This work is licensed under a Creative Commons Attribution License CC-BY 4.0. All code contained therein is licensed under an MIT license.
This homework was generated from an Jupyter notebook. You can download the notebook here.
In Homework 1, we plotted data of microtubule catastrophe times. In this problem, we return to the data from the Gardner, Zanic, et al. paper We will carefully analyze the data and make some conclusions about the processes underlying microtubule catastrophe.
In the file gardner_mt_catastrophe_only_tubulin.csv
(which you can download here), we have observed catastrophe times of microtubules with different concentrations of tubulin. To start with, we will consider the experiment run with a tubulin concentration of 12 µM. So, our data set $D$ consists of a set of measurements of the amount of time to catastrophe; $D=\{t_i\}$. We will consider three models for microtubule catastrophe. In the first model ($M_1$), our likelihood is Exponential,
\begin{align} f(D\mid \tau, M_1) = \prod_{i\in D} \frac{\mathrm{e}^{-t_i/\tau}}{\tau}. \end{align}
In the second model ($M_2$), the likelihood is Gamma,
\begin{align} f(D\mid \tau, a, M_2) = \prod_{i\in D} \frac{\left(t_i/\tau\right)^a}{t_i\Gamma(a)}\,\mathrm{e}^{-t_i/\tau}, \end{align}
where $\Gamma(a)$ is the gamma function. Finally, in the third model ($M_3$), the likelihood is Weibull,
\begin{align} f(D\mid \tau, \beta, M_3) = \prod_{i\in D} \frac{\beta}{\tau}\left(\frac{t_i}{\tau}\right)^{\beta-1}\mathrm{e}^{-(t_i/\tau)^\beta}. \end{align}
a) Describe the three models in words. Give physical descriptions of the meanings of the parameters $\tau$, $a$, and $\beta$. Tutorial 3b might be useful.
b) Use optimization to estimate the parameter values for the respective models.
c) Plot the ECDF of the time to microtubule catastrophe. Overlay the theoretical CDFs for the three models and comment on what you see.
d) Compute the approximate odds ratios among the models using the results from your parameter estimation calculation done by optimization. You should use the Laplace approximation and comment on the relative contributions of the goodness of fit and Occam factor.
e) Repeat steps (b) through (d) using MCMC. Use the WAIC for your model selection. Be sure to display relevant plots.
f) Using whichever model you found most probable when you computed odds ratios, compute $\tau$ along with $a$ or $\beta$ as appropriate for the other tubulin concentrations. Given that microtubules polymerize faster with higher tubulin concentrations, is there anything you can say about the occurrence of catastrophe by looking at the values of the parameters versus tubulin concentration?
g) (20 pts extra credit) The files gardner_mt_catastrophe_kip3.csv
(download) and gardner_mt_catastrophe_mcak.csv
(download), contain measurements of catastrophe times in the presence of the kinesins Kip3 and MCAK with 12 µM tubulin. Analyze these data and discuss conclusions about their respective roles in microtubule catastrophe. Note: This part of the problem is intentionally open-ended. You should think carefully, and perform a complete analysis to draw your conclusions.