Character String Manipulation: Regular Expressions and the R Package stringr

Boriana Pratt is a Statistical Programmer at the Office of Population Research at Princeton University. She earned a Master's degree in Biostatistics from the University of California at Berkeley.

Dawn Koffman is a Statistical Programmer at the Office of Population Research at Princeton University. She earned an MS in Computer Science from the University of Wisconsin-Madison, and an MPH in Epidemiology and Biostatistics from UMDNJ and Rutgers University.

1/14/2019 from 9:30 AM to 11:45 AM ~ Location to be determined

This workshop introduces character string manipulation in R. String data is often unstructured, and regular expressions provide a concise way to describe text patterns that may occur within it. Regular expressions take some practice to get used to, but they are extremely useful. stringr is an R package for string manipulation that covers the common string operations, including pattern matching. It is part of the tidyverse and is built with the same goals in mind: consistency, simplicity, and output that can easily be used as input.
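
As a rough preview of the kind of task the workshop covers, the sketch below uses a regular expression with three stringr functions (str_detect, str_extract, and str_replace). The sample strings and variable names are made up for illustration and are not taken from the workshop materials.

```r
library(stringr)

# Hypothetical sample data, invented for this illustration
notes <- c("visit on 2019-01-14", "no date recorded", "follow-up 2019-02-03")

# Regular expression: four digits, a hyphen, two digits, a hyphen, two digits
date_pattern <- "\\d{4}-\\d{2}-\\d{2}"

# Which strings contain a match for the pattern?
has_date <- str_detect(notes, date_pattern)

# Pull out the first match from each string (NA where there is none)
dates <- str_extract(notes, date_pattern)

# Replace the first match in each string with a placeholder
cleaned <- str_replace(notes, date_pattern, "<date>")

print(has_date)
print(dates)
print(cleaned)
```

Each function takes the character vector first and the pattern second and returns a vector of the same length, so the result of one call can be passed directly into another.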

Lunch will be provided following the workshop.  To register, send an email to from your email address with the subject "stringr".


Attendees should have previous experience using R.


Lecture, discussion and hands-on exercises.


Attendees should bring a laptop with R, RStudio, and the R package stringr already installed.