SSC Annual Meeting 2022, Biostatistics Workshop
This website contains materials for the Biostatistics Workshop at the 2022 SSC Annual Meeting, held online.
In this workshop, we introduce methods for phenotyping with electronic health record (EHR) data.
Slides and code for an example phenotyping problem can be found below.
We use real EHR data from PheCAP, an R package that implements high-throughput phenotyping using a common automated pipeline.
Please copy and paste the following code into the R console to check for and load the required packages. We are using R version \(\geq\) 3.6.0.
# Specify the packages from CRAN
# (parallel ships with R itself and needs no installation).
packages <- c(
  "PheCAP",
  "PheNorm",
  "MAP",
  "glmnet",
  "tidyverse",
  "ggplot2",
  "data.table",
  "mltools",
  "pROC",
  "parallel",
  "randomForestSRC",
  "SVMMaj"
)
# Load the packages.
# Any missing package is installed first and then loaded.
package.check <- lapply(
  packages,
  FUN = function(x) {
    # require() returns FALSE (with a warning) if the package is not installed.
    if (!require(x, character.only = TRUE)) {
      install.packages(x, dependencies = TRUE)
      library(x, character.only = TRUE)
    }
  }
)
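After running the setup code, it can help to confirm that the session meets the version requirement and that every package is actually available. The following is a minimal sketch and not part of the workshop materials: the helper name `check_setup` is hypothetical, and it relies only on base R (`getRversion()` and `requireNamespace()`).

```r
# Hypothetical helper (not part of the workshop code): report whether the
# R version is recent enough and which packages are still unavailable.
check_setup <- function(pkgs, min_version = "3.6.0") {
  # getRversion() returns a version object that can be compared to a string.
  version_ok <- getRversion() >= min_version
  # requireNamespace() tests availability without attaching the package.
  available <- vapply(
    pkgs,
    FUN = function(p) requireNamespace(p, quietly = TRUE),
    FUN.VALUE = logical(1)
  )
  list(
    version_ok = version_ok,
    missing = names(available)[!available]
  )
}

# Example with packages that ship with R; pass the `packages` vector instead
# to check the workshop dependencies.
status <- check_setup(c("stats", "utils", "parallel"))
status$version_ok  # TRUE when R is at least min_version
status$missing     # names of packages that could not be found
```

An empty `status$missing` together with `status$version_ok` being `TRUE` suggests the session is ready for the workshop code.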
To get started, please clone the repository to your local machine:
git clone git@github.com:jlgrons/EHR-Phenotyping-Workshop.git
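The SSH address requires an SSH key registered with GitHub. If none is set up, the same repository can be reached over HTTPS. The sketch below derives the HTTPS address from the SSH one and wraps the clone in a function that can be called from R. The helper names `ssh_to_https` and `clone_repo` are hypothetical, and the wrapper assumes the `git` command-line client is installed and on the PATH.

```r
# Convert a GitHub SSH remote (git@host:owner/repo.git) to its HTTPS form.
ssh_to_https <- function(ssh_url) {
  sub("^git@([^:]+):", "https://\\1/", ssh_url)
}

ssh_url <- "git@github.com:jlgrons/EHR-Phenotyping-Workshop.git"
https_url <- ssh_to_https(ssh_url)

# Hypothetical wrapper: clone into `dest` by calling the git command-line
# client (assumed to be installed and available on the PATH).
clone_repo <- function(url, dest = "EHR-Phenotyping-Workshop") {
  system2("git", args = c("clone", url, dest))
}

# Run interactively when ready:
# clone_repo(https_url)
```

The clone call is left commented out so that sourcing the snippet does not touch the file system until it is invoked deliberately.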
Publicly available EHR dataset
NLP software
This website was made with the distill package and draws mostly on Silvia Canelón’s Sharing Your Work with xaringan.