The Department of Biostatistics offers courses for three degree programs within the department (PhD, MS, and MPH) as well as courses for students from other departments and program. Students in the Department of Biostatistics also take courses offered by the Department of Statistics and the College of Public Health and Health Professions.
MS and PhD Courses
All courses in the MS and PhD program require three semesters of calculus and one semester of linear algebra.
GMS 6827 – Advanced Clinical Trials (3)
This course covers the statistical principles and methods used in the design and analysis of clinical trials. Topics include group sequential designs, adaptive clinical trials, and Statistical Monitoring of Clinical Trials. Syllabus
PHC 4094 – Introduction to Biostatistics for Health Science and Public Health (3)
This 3-credit course provides an introduction to some concepts and methods of biostatistical data analysis that are widely used in health sciences and public health. The topics include analysis of variance to compare three or more population means, correlation, simple linear regression, multiple linear regression, nonparametric and distribution-free statistical methods, and some basic concepts about survival analysis. Public health examples are used for demonstration. Students will practice preparing and interpreting data analysis reports. Syllabus (M,W) Syllabus (T,R)
PHC 6020 – Clinical Trials Methods (3)
This course will introduce some basic statistical concepts and methods used in Epidemiology and will focus on the statistical principles and methods used in clinical trials, including phase I to IV clinical trials. Although the class will have emphasis on phase III trials, we will also discuss the feature and statistical issues in phase I and II clinical trials. For phase III trials, we will discuss ways of treatment allocation that will ensure valid inference on treatment comparison. Other topics include sample size calculation, survival analysis and early stopping of a clinical trial and noncompliance. Syllabus
PHC 6050c – Biostatistical Methods I (3)
This course is the first in a two-course sequence that provides students with the fundamentals of biostatistical data analysis. The main emphasis of the course is on linear models, focusing on the theory and practice of regression and analysis of variance. Specific topics include simple and multiple regression for quantitative and categorical data, random effects models for correlated data, factorial and block designs, and nonparametric regression. Students will learn to use the statistical package R for data analysis. Syllabus
PHC 6051 – Biostatistical Methods II (3)
Biostatistical data analysis using generalized linear models, generalized linear mixed models, semiparametric and nonparametric regression, and neural networks; theory and practice in the health sciences. Syllabus
PHC 6063 – Biostatistical Consulting (3)
This course covers communication, management, organization, computational and biostatistical thinking skills necessary to consulting in biostatistics. Syllabus
PHC 6068 — Biostatistical Computing (3)
PHC 6084 – Bayesian Biostatistical Methods (3)
To equip students with an understanding of the basics of Bayesian statistics, with special emphasis on practical implementation. Syllabus
PHC 6088 – Statistical Analysis of Genetic Data (3)
An introduction to statistical procedures in human and animal genetics, including Hardy-Weinberg equilibrium, basic linkage analysis, linkage disequilibrium, and association with disease. The goal is to prepare students for potential research in statistical genetics and genomics. Syllabus
PHC 6089 – Public Health Computing (3)
This is a three credit course which covers using SAS and R to manage and analyze public health data. Students will learn how to import, modify, visualize and perform common analyses of public health data using SAS and R. Syllabus
PHC 6092 — Introduction to Biostatistical Theory (3)
Concepts and principles of statistical theory, including probability and random variables, parameter estimation, confidence intervals, hypothesis testing, asymptotic analysis, Bayesian inference, statistical decision theory and linear models. Syllabus
PHC 6097 – Statistical Learning with Applications in Health Sciences (3)
This course should be useful to second-year master students or PhD students in biostatistics or related fields as preparation for research and professional advancement. This course covers a broad range of statistical learning methods that are useful for modern data analysis, specifically in the analysis of high-dimensional data (p<n). Many of these methods go far beyond the classical statistical methods (e.g., linear regression) and are developed for addressing various problems (e.g., non-linearity) we encounter in real situations. Statistical learning methods covered in this class also includes some newly developed methods, such as deep learning, which has achieved great success in many areas (e.g., computer vision and natural language processing). We will demonstrate the use of these methods with applications, especially in the context of health science research. Syllabus
PHC 6790 – Biostatistical Methods Using SAS (3)
The purpose of this course is to introduce and prepare students for biostatistical computing using the SAS statistical software. It builds on the knowledge obtained in the Biostatistical Methods I and II courses by reinforcing the material and focusing on application within the SAS framework. Topics covered include data management, frequency tables, linear and non-linear models, longitudinal data analysis, Matrix programming, simulation, and using SAS macros. Syllabus
PHC 6791 – Biostatistical Graphics & Data Visualization (3)
The world is growing increasingly reliant on collecting and analyzing information to help people make decisions. Because of this, the ability to communicate effectively about data is an important component of future job prospects across nearly all disciplines. In this course, students will learn the foundations of information visualization and sharpen their skills in communicating using health science data. Throughout the semester, we will use R and other software to explore concepts in graphic design, storytelling, data wrangling and plotting, and biostatistics as they apply to data-driven communication. Whether you’re an aspiring data scientist or you just want to learn new ways of presenting health science information, this course will help you build a strong foundation in how to talk to people about data. Syllabus
PHC 6937 – Introduction to Biostatistical Computing (3)
The Introduction to Biostatistical Computing course is intended to develop your programming skills to perform statistical computing. The course will focus on both R programming language (using RStudio interface) and Python programming language (using Anaconda interface), both of which are free and open-source software programs. The R language part will cover programming topics including vectorization, data input and output, data visualization (ggplot2), data manipulation (tidyverse), building R packages, and building R Shiny applications. R markdown will be used for direct integration and dynamic reporting. The Python language part will cover programming topics including objective oriented programming, scientific computing (numpy), data manipulation (pandas), data visualization (matplotlib), and text mining. The Jupitor notebook will be used for direct integration and dynamic reporting. Basic statistical inferences (hypothesis testing and linear regression model) will be included for both R and Python language. In addition, this course will introduce GitHub as the version control system and will also include the use of high-performance computing resources at the University of Florida such as HiPerGator. Syllabus
PHC6937 – Statistical and Computational Analysis of Genomic Data (3)
The course is designed to be master-level elective course but also open to MS/PhD students, who are interested in bioinformatics/computational biology. The course will focus on statistical and computational methods/software on next generation sequencing data analysis. Specific topics include (i) Using R/Bioconductor packages to handle common types of genomic data; (ii) DNA-seq, DNA methylation and metagenomics; (iii) RNA-seq, ChIP-seq, ATAC-seq and Hi-C; (iv) Single-cell genomics (RNA-seq, CITE-seq, spatial transcriptome). The course will include both didactic class time to learn skills, as well as lab time to hone them. Learning in the course is primarily assessed by three homework assignments and a final course project. Syllabus
PHC 6937 – Analysis of Multivariate Data (3)
This course covers linear models methodology including simple and multiple regression and analysis of variance including factorial and block designs. The course covers regression for categorical data, random effects models for correlated data, and nonparametric and semiparametric regression. Syllabus
PHC 6937 – Analytic Methods for Infectious Diseases (3)
This course will introduce concepts of infectious disease epidemiology and study designs and analytic methods for evaluating interventions. Especially the relation between the underlying transmission dynamics and the design and evaluation of interventions will be discussed. Special emphasis will be on the design and evaluation of vaccination and vaccination programs. We will present methods for real-time statistical evaluation of interventions of emerging infectious diseases. Statistical and mathematical methods include survival analysis, likelihood methods, stochastic processes, network theory, and stochastic and deterministic transmission models. Examples include case studies in influenza, Ebola, dengue, Zika, cholera, and others. Presentations are largely statistical and mathematical, but with a focus on concepts. Syllabus
PHC 6937 – Applied Longitudinal Data Analysis (3)
Concepts and methods for longitudinal data analysis based on randomized trials and observational studies. Syllabus
PHC 6937 – Advanced Bayesian Methods (3)
This course introduces students to advanced concepts of Bayesian biostatistics with special emphasis on practical implementation in bioassay, measurement error, survival analysis, longitudinal studies, and spatial statistics. Students will be able to fit Bayesian models to analyze real data sets using freely available software such as WinBUGS and R. Syllabus
PHC 6937 – Causal Inference (3)
Concepts and methods for causal inference from randomized trials and observational studies. Syllabus
PHC 6937 – Deep Learning Research (3)
This course will learn deep learning methods and research skills that are necessary for new deep learning method development and modern data analysis. The methods and tools learned from this course can enhance students’ ability in data analysis, method development, and professional advancement. Syllabus
PHC 6937 – Frontiers in Biostatistics (3)
This course will introduce biostatistics Masters and PhD students to current issues and methods in modern biostatistics research. Current faculty will present selected topics from their current research. Syllabus
PHC6937 – Introduction to Statistical Learning (3)
This course gives a brief introduction to commonly used methods (e.g., trees and deep learning) in statistical/machine learning. These methods have been increasingly used in modern data analysis to address various problems (e.g., biomedical problems) we encounter in real-world situations. Syllabus
PHC 6937 – Stochastic Modeling (3)
The student will learn both the theory and practice of stochastic processes and modeling. This will include the theory of random phenomena that is concerned with the flow of events in time and space, especially those exhibiting highly variable behavior that can be described by probability distributions. Specifically, the student will learn to deal with the branching process, random walks, martingales, Markov processes, Poisson process, counting processes, birth and death processes as applied to the health sciences biology. Many of the examples and illustrations of the methods will be in the area infectious diseases. There will be an emphasis on learning methods of strong scientific importance as opposed to purely mathematical theory. Syllabus
PHC 6937 – Stochastic Epidemic Modeling (1)
The student will learn theory and applications of modelling epidemic outbreaks and statistical inference for such. The focus will however be on methodology. The theory involves deterministic models, usually presented with sets of differential equations, and stochastic models. Large population properties will be derived using probabilistic methods such as central limit theory, branching process theory, theory for population processes, l random graph theory and coupling. Statistical methods will also be presented using e.g. martingales, counting processes and the likelihood theory. Syllabus
PHC 6937 – Infectious Disease Data Analysis (3)
Infectious disease data arises from complex mechanisms, including transmission, surveillance, and the accrual of immunity. The goal of this course is to introduce common statistical approaches to the analysis of infectious disease data, with a particular focus on developing the underlying models describing disease transmission, and on methods for parameter estimation. The focus of this course will be on generally applicable methods, but we will use a variety of diseases as case studies, including COVID-19, dengue virus, measles, and other diseases of humans and wildlife. Syllabus
PHC 7056 – Longitudinal Data Analysis (3)
Likelihood-based and semiparametric methods for longitudinal data and methods to deal with missing data in both settings. Discussion of impact of missing data both theoretically and practically on inference, and approaches to conduct sensitivity analysis for inference. Syllabus
PHC 7066 – Large Sample Theory (3)
Detailed introduction to large sample theory and its application in univariate and multivariate parametric and nonparametric estimation. Syllabus
PHC 7090 – Advanced Biostatistical Methods I (3)
Theory and application for estimation and hypothesis testing for independent data using linear models. Principles of Frequentist and Bayesian estimation and inference. Application using statistical software. Writing data analysis reports. Syllabus
PHC 7091 — Advanced Biostatistical Methods II (3)
Theory and application for independent and dependent data using generalized linear models and generalized linear mixed models. Bayesian and Frequentist inference. Application using statistical software. Writing data analysis reports. Syllabus
PHC 7925 – Biostatistics Journal Club (1)
This class will meet weekly to present, review and discuss current articles in biostatistics or statistics journals or discipline-specific (e.g. medicine, public health, epidemiology) articles with substantive biostatistical content. Syllabus
PHC 7979 – Advanced Research (Variable)
PHC 7980 – Research for Doctoral Dissertation (Variable)
STA 6177 – Applied Survival Analysis (3)
This course covers survival analysis, Kaplan-Meier estimates, proportional hazards model, related tests, phase I, II, and III clinical trials, designs, and protocols. Syllabus (Online) Syllabus (Campus)
This course is intended for students from statistical or other quantitative majors (statistics, biostatistics, etc).
STA 7179 – Advanced Survival Analysis (3)
Theoretical introduction to statistical inferential procedures useful for analyzing randomly right censored failure time data.
Courses for Students Not in MS or PhD Programs in Biostatistics
These courses require no prerequisites:
PHC 6050 – Statistical Methods Health Science 1 (3)
This course covers statistical vocabulary, methods for descriptive data analysis, the fundamentals of probability and sampling distributions, methods for statistical inference and hypothesis testing based on one or two samples, categorical data analysis and linear regression. Data analysis will be conducted in SPSS. Syllabus
PHC 6052 – Introduction to Biostatistical Methods (3)
Introduction to the concepts and methods of biostatistical data analysis. Topics include descriptive statistics, probability, standard probability distributions, sampling distributions, point and confidence interval estimation, hypothesis testing, power and sample size estimation, one- and two-sample parametric and non-parametric methods for analyzing continuous or discrete data, and simple linear regression. SAS statistical software for data management, statistical analysis and power calculations. Syllabus (On-Campus, Fischer) Syllabus (Online, Fischer)
PHC 6075 – Biostatistical Literacy (3)
This course covers concepts and techniques, including survival data, multiple-group comparisons, and non-linear regression, necessary to read, interpret, and critically evaluate statistical results in health science literature relevant to the interests of the student. This course offers no formal training in statistical software. Syllabus
These courses do not require mathematical prerequisites, but they do require prerequisites that are other biostatistical courses for non-biostatistics majors:
PHC 6022 – Design and Conduct of Clinical Trials (3)
This course focuses on various study designs, including phase I-IV, single-arm, crossover, factorial, and sequential multi-stage, plus the means to allocate study participants to appropriate treatment groups using randomization (blocked or stratified) and prognostic factors. In addition, the protection of study participants and the need for equipoise is covered, including regulatory restrictions and the latest patient privacy regulations for the dissemination and use of data associated with the participants in clinical trials. The importance of informed consent and the use of intent-to-treat analysis will also be emphasized. Syllabus
PHC 6053 – Regression Methods for Health and Life Sciences (3)
This course introduces graduate students in fields other than statistics to a wide range of modern regression methods. Emphasis is on modeling driven by actual data from studies in a variety of areas, primarily from health, biology, and ecology. The primary topics are multiple linear regression, logistic regression, and Poisson regression. A main goal is to learn what approach to use among the linear and nonlinear models, and how to determine whether the fit is adequate. By the end of the course, students will achieve competency in carrying out the analyses in SAS. Syllabus
PHC 6059 – Introduction to Applied Survival Analysis (3)
This course covers survival analysis, Kaplan-Meier estimates, proportional hazards model, related tests, phase I, II, and III clinical trials, designs, and protocols. Syllabus
PHC 6064 – Survey of Advanced Biostatistical Methods for the Health Sciences (3)
This course uniquely blends the fundamentals inference with an introduction to advanced statistical techniques critical for the analysis of today’s ever-growing body of health-related data. Topics span the analysis of high-dimensional, categorical, and longitudinal data, and applications are undertaken with the statistical software R and SAS. Syllabus
PHC 6937 – Survey of Biostatistical Methods (3)
This course is a survey of biostatistical methods beyond one and two sample techniques covered in PHC 6052. Advanced topics will be selected from areas such as multiple linear regression, study design and ANOVA, contingency tables, logistic regression, Poisson regression, repeated measures and longitudinal data analysis, missing data methods, model/variable selection, survival analysis, multivariate methods, or non-parametric methods. Focus will be on the application of these techniques to data from the health sciences. Examples will make use of SAS and R for this course. Syllabus
Public Health Courses
The following courses offered by the MPH program in the College of Public Health and Health Professions are taken by MPH, MS, and PhD students in the Department of Biostatistics.
- Archived PH course syllabi: http://mph.ufl.edu/current-students/courses/syllabus-archives/.
PHC 6001—Principles of Epidemiology in Public Health (3)
Overview of epidemiology methods used in research studies that address disease patterns in community and clinic-based populations. Includes distribution and determinants of health-related states or events in specific populations and application to control of health problems.
HSA 6114—Introduction to the U.S. Health Care System (3) – This course is designed to familiarize students with basic concepts and ideas concerning the distribution of health and illness, the organization of the health care system and the relationship of one to the other. Definitions of health and illness, as well as the historical context for developments of our health care system are discussed and debated. The course concludes with a discusssion on trends that could impact the health care system in the future. Students should come to class ready to discuss and debate the major themes related to health and distribution of disease, the ability of the U.S. health care system to meet the needs of the population, as well as the policy environment that influences access to health care services.
PHC 6313—Environmental Health Concepts in Public Health (3 or 2)
Survey of major topics of environmental health. Sources, routes, media, and health outcomes associated with biological, chemical, and physical agents in environment. Effects of agents on disease, water quality, air quality, food safety, and land resources. Current legal framework, policies, and practices associated with environmental health and intended to improve public health.
PHC 6410/formerly 6406—Psychological, Behavioral, and Social Issues in Public Health (3)
Health behavior from an ecological perspective; includes primary, secondary and tertiary prevention across a variety of settings; incorporates behavioral science theory and methods.
PHC 6937 – Introduction to Public Health (3)
The purpose of this course is to provide a broad introduction to public health as well as an understanding about how their PhD specializations contribute to achieving the goals of public health. A full syllabus for this course can be found in the archives here: http://mph.ufl.edu/current-students/courses/syllabus-archives/.
PHC 6089 – Public Health Computing (3) – Formerly PHC6055, PHC6080, PHC6081
This is a three credit course which covers using SAS and R to manage and analyze public health data. Students will learn how to import, modify, visualize and perform common analyses of public health data using SAS and R. Syllabus