R – Tidyverse – Dplyr – pipe %>% – Answer needed today

You will need to download the R studio, tidyverse, and associated packages if you don’t already have them.

Q3. 

In this exercise you’ll use R package tidyverse (see chapter 4 of Introduction to Data Science Data Analysis and Prediction Algorithms with R by Rafael A. Irizarry. You need to go through chapter 4 before attempting the following questions. Using dplyr functions (i.e., filter, mutate ,select, summarise, group_by etc. ) and “murder” dataset (available in dslab R package) and write appropriate R syntax to answer the followings: 

a. Calculate regional total murder excluding OH, AL, and AZ (Hint: filter(! abb %in% x) # here x is the exclusion vector) 

b. Display the regional population and regional murder numbers. 

c. How many states are there in each region? (Hint: n ()) 

d. What is Ohio’s murder rank in the Northern Central Region (Hint: use rank(), row_number()) 

e. How many states have murder number greater than its regional average (Hint: nrow() ) 

f. Display 2 least populated states in each region (Hint: slice_min() ) 

Use pipe %>% operator for all the queries. Show all the output results  

Tags: No tags