Character String Manipulation: Regular Expressions and the R Package stringr
Dawn Koffman is a Statistical Programmer at the Office of Population Research at Princeton University. She earned an MS in Computer Science from University of Wisconsin-Madison, and an MPH in Epidemiology and Biostatistics from UMDNJ and Rutgers University.
This workshop introduces character string manipulation in R. String data is often unstructured, and regular expressions provide a concise mechanism to describe text patterns that may be contained within string data. It may take a little while to get accustomed to using regular expressions, but they are extremely useful. Stringr is an R package for string manipulation. It includes all of the common string operations one might need, including pattern matching. Although stringr is not part of the tidyverse core, it is built with similar goals in mind – consistency, simplicity and producing output that can easily be used as input.
Attendees should have previous experience using R.
Lecture, discussion and hands-on exercises.
Attendees should bring a laptop with R, RStudio, and the R package stringr already installed.