This package runs audits on datasets to check for errors or typos in values. For instance, a column called MonthsPerYear should contain only integers from 1 to 12. Values that violate this pattern need to be investigated at the raw data source. This is a companion package to the main PROMISE package (see website).
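To make the MonthsPerYear example concrete, here is a minimal base R sketch of that kind of check, written without this package. The data frame `visits` and its `SID` column are hypothetical, and treating missing values as violations is an assumption, not documented behaviour of PROMISE.audit.

```r
# Hypothetical example data; MonthsPerYear should hold integers from 1 to 12.
visits <- data.frame(
    SID = 1:6,
    MonthsPerYear = c(3, 12, 0, 7.5, 14, NA)
)

# A value is valid if it is present, a whole number, and between 1 and 12.
# Assumption: missing values count as violations here.
is_valid <- !is.na(visits$MonthsPerYear) &
    visits$MonthsPerYear %% 1 == 0 &
    visits$MonthsPerYear >= 1 &
    visits$MonthsPerYear <= 12

# Rows that would need to be investigated at the raw data source.
violations <- visits[!is_valid, ]
violations
```

Writing this by hand for every column quickly becomes repetitive, which is the motivation for wrapping such checks in reusable functions.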

In general, this audit package is a thin wrapper around assertr. It simplifies some common uses of assertr and provides an easy interface for viewing the errors from multiple audit checks.
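For comparison, the sketch below shows roughly what similar checks might look like in plain assertr. It is only an approximation: whether PROMISE.audit calls these particular assertr functions internally is an assumption, as is reading the outlier threshold of 3 as three standard deviations. It uses the built-in `swiss` dataset and assertr's `error_append` so that failures are collected rather than stopping the pipeline.

```r
library(assertr)
library(magrittr)

# Assumption: these plain assertr calls approximate range, set, and outlier
# checks; they are not taken from the PROMISE.audit source code.
checked <- swiss %>%
    # Examination should lie between 3 and 20 (bounds inclusive by default).
    assert(within_bounds(3, 20), Examination, error_fun = error_append) %>%
    # Education should be one of the integers 1 to 10.
    assert(in_set(1:10), Education, error_fun = error_append) %>%
    # Fertility should be within 3 standard deviations of its mean.
    insist(within_n_sds(3), Fertility, error_fun = error_append)

# error_append attaches the collected failures to the data as an attribute.
errors <- attr(checked, "assertr_errors")
length(errors)
```

Each call needs its own `error_fun` argument, and the collected errors still have to be pulled out of an attribute and summarised by hand.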

Installation

The package is currently available only on GitLab and is still in development.

# install.packages("pak")
pak::pak("gitlab::promise-cohort/PROMISE.audit")

Usage

library(PROMISE.audit)
library(magrittr)
audit <- swiss %>% 
    chk_in_range(3, 20, "Examination") %>% 
    chk_in_set(1:10, "Education") %>% 
    chk_outliers(3, "Fertility")
aud_report(audit) %>% 
    knitr::kable()
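
| Column      | Fails | Values                                 | RowNum                                                            |
|:------------|------:|:---------------------------------------|:------------------------------------------------------------------|
| Examination |    14 | 21, 22, 26, 31, 25, 29, 35, 37         | 12, 15, 18, 19, 21, 23, 29, 39, 40, 41, 42, 44, 45, 47            |
| Education   |    17 | 12, 15, 13, 28, 20, 19, 11, 32, 53, 29 | 1, 5, 10, 12, 14, 18, 19, 23, 29, 38, 39, 40, 41, 42, 45, 46, 47  |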