12.1 Load packages and import data

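Load the tidyverse, skimr, naniar, knitr, ggmosaic, and janitor packages:

library(tidyverse)
library(skimr)
library(naniar)
library(knitr)
library(ggmosaic)
library(janitor)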

We’ll also need a new package called epitools, so install that now if you haven’t done so.
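
The install-then-load step can be sketched as follows. This is a minimal sketch, assuming a CRAN mirror is configured and that installing from CRAN is acceptable in your setup; the `requireNamespace()` check is an addition here so that the package is only installed when it is missing.

```r
# Install epitools only if it is not already available
# (assumption: a CRAN mirror is configured for install.packages()).
if (!requireNamespace("epitools", quietly = TRUE)) {
  install.packages("epitools")
}

# Load the package; R may print a message about masked objects
# if another loaded package exports functions with the same names.
library(epitools)
```
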

## Attaching package: 'epitools'
## The following objects are masked from 'package:binom':
##     binom.exact, binom.wilson

We’ll use two datasets described in the Whitlock & Schluter text:

  • the “cancer.csv” dataset (described in Example 9.2 in the text, page 238)
  • the “worm.csv” dataset (described in Example 9.4 in the text, page 246)

cancer <- read_csv("https://raw.githubusercontent.com/ubco-biology/BIOL202/main/data/cancer.csv")
worm <- read_csv("https://raw.githubusercontent.com/ubco-biology/BIOL202/main/data/worm.csv")

Take a look at the cancer dataset:

cancer %>%
  skim_without_charts()
Data summary

Name                      Piped data
Number of rows            39876
Number of columns         2
Column type frequency:
  character               2
Group variables           None

Variable type: character

skim_variable     n_missing  complete_rate  min  max  empty  n_unique  whitespace
aspirinTreatment          0              1    7    7      0         2           0
response                  0              1    6    9      0         2           0

And the worm dataset:

worm %>%
  skim_without_charts()
Data summary

Name                      Piped data
Number of rows            141
Number of columns         2
Column type frequency:
  character               2
Group variables           None

Variable type: character

skim_variable  n_missing  complete_rate  min  max  empty  n_unique  whitespace
infection              0              1    6   10      0         3           0
fate                   0              1    5    9      0         2           0

Both datasets are in “tidy” format: each row is one observation and each column is one variable. For a refresher on this, review the Biology Procedures and Guidelines document chapter on Tidy data.
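
To illustrate what tidy format looks like for two categorical variables, here is a minimal sketch using a small made-up data frame. The object names `toy` and `toy_counts` and the five rows are hypothetical and are not taken from cancer.csv or worm.csv; only the layout (one row per individual, one column per variable) mirrors the real datasets.

```r
library(tibble)
library(dplyr)

# Hypothetical toy data (NOT from cancer.csv or worm.csv):
# each row is one individual, each column is one variable.
toy <- tribble(
  ~treatment, ~response,
  "Aspirin",  "Cancer",
  "Aspirin",  "No cancer",
  "Placebo",  "Cancer",
  "Placebo",  "No cancer",
  "Placebo",  "No cancer"
)

# Because the data are tidy, tallying each combination of
# categories takes a single call to count().
toy_counts <- toy %>%
  count(treatment, response)

toy_counts
```

The same pattern should apply to the real data, for example by counting `aspirinTreatment` and `response` in `cancer`.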