% Intro for Survival Analysis (312f2023) % \documentclass[serif]{beamer} % Serif for Computer Modern math font. \documentclass[serif, handout]{beamer} % Handout mode to ignore pause statements \hypersetup{colorlinks,linkcolor=,urlcolor=red} \usefonttheme{serif} % Looks like Computer Modern for non-math text -- nice! \setbeamertemplate{navigation symbols}{} % Supress navigation symbols \usetheme{AnnArbor} % CambridgeUS Blue and yellow, Shows current section title % \usetheme{Berlin} % Blue: Displays section titles on top % \usetheme{Frankfurt} % Displays section titles on top: Fairly thin but still swallows some material at bottom of crowded slides \usepackage[english]{babel} % \definecolor{links}{HTML}{2A1B81} % \definecolor{links}{red} \setbeamertemplate{footline}[frame number] \mode % \mode{\setbeamercolor{background canvas}{bg=black!5}} \title{Survival Analysis\footnote{See last slide for copyright information.}} \subtitle{STA312 Fall 2023} \date{} % To suppress date \begin{document} \begin{frame} \titlepage \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Survival Time Data} \pause %\framesubtitle{} \begin{itemize} \item Data could be from cancer patients who have undergone some treatment. \item The outcome (dependent variable, response variable) will be length of time until death. \pause \item Does not have to be a bad event, and it does not have to be medical. \item Response variable could be time required for an unemployed person to find work. \item Length of time a prisoner actually spends in jail. \item Time required for a lightbulb to burn out. \pause \item Generally, ``time-to-event" data. \item But much of the terminology and notation assumes it's survival time -- time until death. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{What makes survival analysis special?} %\framesubtitle{} \begin{itemize} \item Time $T$ is a continuous random variable with $P(T>0)=1$. \item That is, it's positive-valued. \item More important is \emph{censoring}. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Censoring} \framesubtitle{Some of the times cannot be observed exactly.} \pause \begin{itemize} \item In a cancer study, it's impractical to wait for all the patients to die. \item Patient 172 was diagnosed 6 years ago, and she's still alive. \item The study's over. \item What do you record for Patient 172? A six? \pause \item That's no good. We know she lived more than 6 years. \item We just don't know how much more. \item Giving her a 6 would bias the results downward, toward shorter average survival times. \pause \item How about discarding her data altogether, because it's incomplete? \item That's even worse. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame}[fragile] \frametitle{What do censored data look like?}\pause %\framesubtitle{} \begin{itemize} \item Let $T^*$ represent time to failure. \item Let $U$ represent censoring time. \item We observe $T=\min(T^*,U)$. \item And an indicator for whether failure occurred: $\delta = 1$ if uncensored, and zero if censored. \pause \item All this is $i=1, \ldots,n$, so we observe $n$ pairs $(T_1,\delta_1), (T_2,\delta_2), \ldots, (T_n,\delta_n)$. \pause \item A simple data file might look like this: \end{itemize} \begin{verbatim} Patient Time Uncensored (delta) 1 5 1 2 6 0 3 8 1 4 3 1 5 22 1 \end{verbatim} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Types of censoring}\pause %\framesubtitle{} \begin{itemize} \item \textbf{Right censoring}: Event happened after some specified time. \item Example: Ex-con is murdered before being re-arrested. \pause \item \textbf{Left censoring}: Event happened before some specified time. \item Example: Person is known to have died before some date because that's when the obituary appeared. \pause \item \textbf{Interval censoring}: Event happened between time $A$ and time $B$. \item Example: Cancer came back between two monthly check-ups. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{More types of censoring}\pause %\framesubtitle{} \begin{itemize} \item \textbf{Type I censoring}: Censoring times are pre-specified. In a smoking cessation study, participants are followed until they relapse or 180 days, whichever comes first. \pause \item \textbf{Type II censoring}: Subjects are followed until a specified fraction fail. Mostly used in industrial settings. \pause \item \textbf{Random censoring}: Drop-out happens according to some probability distribution, usually unknown. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Random censoring} %\framesubtitle{} \begin{itemize} \item In random censoring, it is critical that the cause of drop-out be unrelated to the process being studied. \item For example, if a paient withdraws from the study (before death) because she is too sick to participate, the result can be biased estimation and inference. \pause \item A good design is common in medical studies. Patients are continuously enrolled in the study, and anyone still alive when the study ends is censored. \pause \item[] \item In this course, we will focus on \emph{right censored} data with \emph{random censoring}. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Roadmap} %\framesubtitle{} \begin{itemize} \item Single sample, parametric. \pause % \item Multi-sample, parametric. \item Single sample, non-parametric. \pause % \item Multi-sample, non-parametric. \item With explanatory variables, parametric. \item With explanatory variables, non-parametric. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Tools} %\framesubtitle{} \begin{itemize} \item Maximum likelihood. \item A little large sample theory. \item R. \end{itemize} \end{frame} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{Copyright Information} This slide show was prepared by \href{http://www.utstat.toronto.edu/~brunner}{Jerry Brunner}, Department of Statistics, University of Toronto. It is licensed under a \href{http://creativecommons.org/licenses/by-sa/3.0/deed.en_US} {Creative Commons Attribution - ShareAlike 3.0 Unported License}. Use any part of it as you like and share the result freely. The \LaTeX~source code is available from the course website: \href{http://www.utstat.toronto.edu/brunner/oldclass/312f23} {\footnotesize \texttt{http://www.utstat.toronto.edu/brunner/oldclass/312f23}} \end{frame} \end{document} %%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%%% \begin{frame} \frametitle{}\pause %\framesubtitle{} \begin{itemize} \item \pause \item \pause \item \end{itemize} \end{frame}