The setup
The statistical programming language R is quickly becoming one of the standard tools for data journalists around the world. Learning it can deepen our understanding of data and how to analyze it.
Module objectives
Students will be able to:
- Recognize basic R vocabulary
- Utilize essential functions of R, such as importing data and generating simple summary stats
- Propose and answer questions using essential R operations
- Perform data analysis to craft basic story ideas for further reporting
- Create and analyze simple data visualizations using R packages
Module contents
- Individual technical work outside the classroom
- Class walkthroughs
- Class exercises
- Group work
- Class discussion
Recommended pacing
This module is designed to be taught over 1 to 3 class meetings, depending on the skill levels of students. Before getting started, consider assigning the installation steps as out-of-class work.
Total suggested in-class time: 2 hours, 30 minutes
Total suggested out-of-class time: 30 minutes
What you’ll need
- An Internet connection and Web browser
- R
- RStudio
The data
- NY COVID data | Data dictionary
  - The COVID Tracking Project Archive’s full data on testing and outcomes from the state of New York.
- NV COVID race and ethnicity data | Data dictionary
  - Data reported by the state of Nevada on the race and ethnicity of COVID-19 cases, deaths and tests captured by the COVID Tracking Project.
- Nevada census data
  - Data from the 2020 5-year American Community Survey on the Nevada population by Hispanic or Latino origin by race (B03002).
- Statewide COVID testing and outcome data
  - The COVID Tracking Project’s full dataset of COVID testing and outcomes for all states through March 7, 2021.