The setup

The statistical programming language R is quickly becoming one of the standard tools for data journalists around the world. Its use can help us expand our understanding of data and how to analyze it.

Module objectives

Students will be able to:

  • Recognize basic R vocabulary
  • Utilize essential functions of R, such as importing data and generate simple summary stats
  • Propose and answer questions using essential R operations
  • Perform data analysis to craft basic story ideas for further reporting
  • Create and analyze simple data visualizations using R packages

Module contents

  • Individual technical work outside the classroom
  • Class walkthroughs
  • Class exercises
  • Group work
  • Class discussion

This module is designed to be taught over 1 to 3 class meetings, depending on the skill levels of students. Before getting started, consider assigning the installation steps as out-of-class work.

Total suggested in-class time: 2 hours, 30 minutes

Total suggested out-of-class time: 30 minutes

What you’ll need

  • An Internet connection and Web browser
  • R
  • RStudio

The data

NY COVID data | Data dictionary
The COVID Tracking Project Archive’s full data on testing and outcomes from the state of New York.
NV COVID race and ethnicity data | Data dictionary
Data reported by the state of Nevada on the race and ethnicity of COVID-19 cases, deaths and tests captured by the COVID Tracking Project.
Nevada census data
Data from the 2020 5-year American Community Survey on the Nevada population by Hispanic or Latino origin by race (BO3002).
Statewide COVID testing and outcome data
The COVID Tracking Project’s full dataset of COVID testing and outcomes for all states through March 7, 2021.

Next submodule: