Week 4 Overview
This wraps up the content for week 3. Now we require some practice! You should head back to our Moodle site to check out your homework assignment for this week.
We are now ready to really look at bringing data in and how to handle that data more seamlessly. We’ll look at working within the tidyverse
. This is a suite of packages that all work together and allow you to read in data and do most common data manipulations.
Week 4 Additional Readings/Learning Materials
Manipulating Data
- Chapters 3, 4, and 5 of R 4 Data Science
- (Optional) R Packages book
- (Optional) List of CRAN approved packages
- (Optional) <a href=“https://support.posit.co/hc/en-us/articles/201057987-Quick-list-of-useful-R-packages” target=“_blank”List of useful R packages
Reading Data
Week 4 Learning Objectives
Upon completion of this week, students will be able to:
Reading Data
read delimited data, SAS data files, SPSS files, and other file types into R
- describe the term delimiter
- read comma separated value files into R using the readr package
- explain how the read_ functions determine column types
- describe the readxl package and its functions
- compare and contrast tibbles and data frames
- describe the term delimiter
write a stored data set to a file using different delimiters
Manipulating Data
use logical statements and indexing vectors to subset common data objects using common functions such as
[
, subset, or dplyr::filter functions
list favorable things to look for in an R package
- describe the general purpose of the tidyverse package
- outline the difference between require and library
- explicitly use functions from a particular package using the :: operator
- discuss the idea of masking of R functions and objects
- describe the general purpose of the tidyverse package
describe the uses of and program with functions from the dplyr package
- explain the benefits of using the dplyr package over base R methods
- program with the arrange, filter, select, and rename functions from the dplyr package
- optimize selecting variables from a data frame using the select function’s options (such as starts_with)
- combine functions in the dplry package to subset and summarize a data set in R
- describe the uses of and program with the mutate, group_by, and summarise functions in the dplyr R package
- combine functions in the dplry package to subset and summarize a data set in R
- explain the benefits of using the dplyr package over base R methods
program using the chain of commands or chaining/piping operators
Use the table of contents on the left or the arrows at the bottom of this page to navigate to the next learning material!