This homework assignment focuses on data analysis and visualization with the tidyverse.

Before starting this assignment, please download R and RStudio Desktop on your computer. Both are open-source and free to use.

Detailed installation instructions can be found here

To complete this assignment, download the R notebook template and open the file in RStudio. Please click the button below to download the template.

After completing the assignment, please upload the completed template (.Rmd file) to Blackboard as your submission.




Load Packages and Data

The R code chunk below will load the tidyverse package and read in the auto_claims data set.

Note: If you get an error running the code below, make sure that you have installed the required packages in your RStudio desktop environment. To install any package, navigate to the bottom right pane of RStudio, select the Packages tab and click the Install button.
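
As an alternative to the Packages tab, packages can also be installed from the R console. The snippet below is a minimal sketch: the helper name install_if_missing is hypothetical (it is not part of the assignment template), and the CRAN mirror URL is an assumption that you can swap for any mirror you prefer.

```r
# Hypothetical helper: install only the packages that are not yet available.
install_if_missing <- function(pkgs) {
  # Packages requested but not found in the installed library
  missing_pkgs <- setdiff(pkgs, rownames(installed.packages()))

  if (length(missing_pkgs) > 0) {
    # The mirror below is an assumption; any CRAN mirror works
    install.packages(missing_pkgs, repos = "https://cloud.r-project.org")
  }

  # Return the names of the packages that had to be installed
  invisible(missing_pkgs)
}

install_if_missing("tidyverse")
```

Running this once before the chunk below should be enough; later calls do nothing if the package is already installed.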



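# Load the tidyverse (includes readr, which provides read_rds())
library(tidyverse)

# Read the auto_claims data set directly from the course website
auto_claims <- read_rds(url('https://gmudatamining.com/data/auto_claims.rds'))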



The auto_claims data contains information on auto accident claims processed by an insurance company in the western part of the United States.

Each row represents a claim made by a customer and includes customer demographics, their policy characteristics, monthly premium, vehicle information, claim amount, and customer lifetime value.


auto_claims
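
Printing the data frame shows only the first rows and as many columns as fit on screen. To see every column with its type, a quick structural summary can help. The sketch below assumes the data loaded successfully from the URL above and uses only glimpse() and dim(); it makes no assumptions about specific column names.

```r
library(tidyverse)

# Reload the data so this chunk can be run on its own
auto_claims <- read_rds(url('https://gmudatamining.com/data/auto_claims.rds'))

# One line per column: name, type, and the first few values
glimpse(auto_claims)

# Number of rows (claims) and columns (variables)
dim(auto_claims)
```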