When removing duplicates, users can specify the set of columns to consider with
the target_columns argument.
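The effect of restricting the comparison to a subset of columns can be illustrated with a minimal base R sketch. It does not use cleanepi and is not meant to describe how remove_duplicates() is implemented; the toy data frame and its column names (id, date, note) are hypothetical, and keeping the first occurrence of each combination is an assumption made for the illustration.

```r
# Hypothetical toy data: rows 1 and 3 share the same id and date
# but differ in the note column.
df <- data.frame(
  id   = c(1, 2, 1, 3),
  date = c("2023-01-01", "2023-01-02", "2023-01-01", "2023-01-03"),
  note = c("a", "b", "c", "d"),
  stringsAsFactors = FALSE
)

# Comparing on all columns: no row is an exact copy of an earlier row.
dups_all <- duplicated(df)

# Comparing on id and date only: row 3 repeats row 1.
dups_subset <- duplicated(df[, c("id", "date")])

# Keep the first occurrence of each id/date combination
# (an assumption of this sketch, not a statement about cleanepi).
deduped <- df[!dups_subset, ]
```

With all columns considered, nothing is flagged; with only id and date considered, the third row is treated as a duplicate of the first and dropped.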
Value
A data frame or linelist with the duplicated rows removed. Duplicates are identified across all columns or, when target_columns is supplied, across the specified columns only.
Examples
no_dups <- remove_duplicates(
  data = readRDS(
    system.file("extdata", "test_linelist.RDS", package = "cleanepi")
  ),
  target_columns = "linelist_tags"
)
#> Found 57 duplicated rows in the dataset. Please consult the report for more details.