Files corresponding to Short Course: Introduction to Data Science Using R
Below you’ll find the presentation files for the course and the exercises we’ll do during the hands on sessions.
Note: You also have access to the source material at the github repo.
We’ve also included links to materials to go further learning R at the bottom of the page!
It may help to have just the R code from the slides so you can easily run the code without typing it out. Below you’ll find .R scripts with just the R code from the notes (notice that there is some extra stuff as we often use R code to bring in images and things like that).
Justin teaches the following three courses for Data Matters, a week-long series of one and two-day courses aimed at students and professionals in business, research, and government.
The course provides a modern introduction to the R through the extremely popular suite of packages called the tidyverse. A rough outline is given below:
Day 1:
Basics of how R stores data
R Packages and the tidyverse
Reading data from common formats into R (readr package)
Using R Markdown for reproducibility (rmarkdown and knitr packages)
Common data manipulations and creating new variables (dplyr package)
Day 2:
Reshaping data for summarizing and modeling (tidyr package)
Types of data and numeric summaries (including across groups)
Creating publication ready graphs (ggplot2 package)
The course provides a brief overview of R data structures followed by the following topics:
Loops in R
Vectorized functions (apply family of functions)
How R functions work
Function writing
Students will learn about the following topics:
The R Markdown language
How to automate reports with R Markdown
Use of git and github for collaboration and version control
Basic creation of websites through Markdown and github
R Shiny web apps
This eBook (link)[https://dtkaplan.github.io/DataComputingEbook/] by
Danny Kaplan and Matt Beckman is a free resource for an introduction to
R course. The book has been used as a primary textbook for undergraduate
courses that teach an introduction to R & tidyverse
with no assumed
prerequisites in statistics or programming. The book includes chapters
devoted to basic RStudio IDE, data wrangling, visualization, and other
topics useful for data processing & exploration.
This books has almost everything you could want to know about using R Markdown.
A commonly used introduction to the tidyverse.
A book on using the ggplot2 package for plotting.