
R Workshops, a tidyverse approach, Fall 2019

A bit late posting about this, but my R workshops start tomorrow. This year I am revising my materials to reflect a tidyverse-centric approach. I am not a tidyverse convert or even a particular fan, but I would like to teach this popular and coherent ecosystem as an entry point to R. I hope it does not discourage anyone from exploring the full diversity of the R ecosystem.

These workshops are open to all without registration.

Bring your own laptop to these sessions to get the most out of them!

Later in the semester, there are plans to repeat these as webinars (schedule to come in late September).

R for data analysis: a tidyverse approach

  • Wednesday, September 25 – 12:00-1:20 pm, LSM Conference Room
  • Thursday, October 3 – 2:50-4:10 pm, Alexander Library Room 415

This session introduces the R statistical software environment, basic methods of data analysis, and the “tidyverse”. While R is much more than the “tidyverse”, the development of the “tidyverse” set of packages, led by RStudio, has provided a powerful and connected toolkit for getting started with R. Note that graphics and data manipulation are covered in subsequent sessions.

R graphics with ggplot2 

  • Wednesday, October 2 – 12:00-1:20 pm, LSM Conference Room
  • Thursday, October 10 – 2:50-4:10 pm, Alexander Library Room 415

The ggplot2 package from the tidyverse provides extensive and flexible graphical capabilities within a consistent framework.  This session introduces the main features of ggplot2. Some prior familiarity with R is assumed (packages, structure, syntax), but the presentation can be followed without this background.

R data wrangling with dplyr, tidyr, readr and more

  • Wednesday, October 9 – 12:00-1:20 pm, LSM Conference Room
  • Thursday, October 24 – 2:50-4:10 pm, Alexander Library Room 415

Some of the most powerful features of the tidyverse relate to its ability to import, filter, and otherwise manipulate data. This session reviews major packages within the tidyverse that cover the essential data handling steps required before (and during) data analysis.

R for interactivity: an introduction to Shiny

  • Wednesday, October 23 – 12:00-1:20 pm, LSM Conference Room
  • Thursday, October 31 – 2:50-4:10 pm, Alexander Library Room 415

Shiny is an R package that enables the creation of interactive websites for data visualization. This session provides a brief overview of the Shiny framework and shows how to edit and publish Shiny sites in RStudio (with shinyapps.io). Familiarity with R/RStudio is assumed.

R for reproducible scientific documents: knitr, rmarkdown, and beyond

  • Wednesday, October 30 – 12:00-1:20 pm, LSM Conference Room
  • Thursday, November 7 – 2:50-4:10 pm, Alexander Library Room 415

The RStudio environment enables the easy creation of documents in various formats (HTML, DOC, PDF) using R Markdown, while knitr allows the incorporation of executable R code to produce the tables and figures in those documents. This session introduces these concepts, along with other packages and practices that support reproducibility within the R environment.