When removing duplicates, users can specify a set of columns to consider with
the target_columns argument.
Value
The input data (a <data.frame> or <linelist>) without the duplicated rows,
which are identified using either all columns or only the specified target columns.
Examples
no_dups <- remove_duplicates(
  data = readRDS(
    system.file("extdata", "test_linelist.RDS", package = "cleanepi")
  ),
  target_columns = "linelist_tags"
)
#> ! Found 57 duplicated rows in the dataset.
#> ℹ Use `attr(dat, "report")[["duplicated_rows"]]` to access them, where "dat" is
#> the object used to store the output from this operation.
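
To make the effect of restricting the comparison to a subset of columns concrete, here is a minimal base R sketch using duplicated(). It is not how cleanepi implements remove_duplicates(); the toy data frame and its column names (id, date, score) are made up for illustration, and keeping the first occurrence of each duplicate is the behaviour of base R's duplicated(), assumed here rather than confirmed for cleanepi.

```r
# Hypothetical toy data: rows 1 and 3 share the same id and date
# but differ in score.
toy <- data.frame(
  id = c(1, 2, 1, 3),
  date = c("2024-01-01", "2024-01-02", "2024-01-01", "2024-01-03"),
  score = c(10, 20, 30, 40)
)

# Comparing on all columns: no row is an exact copy of another,
# so nothing is dropped.
all_cols <- toy[!duplicated(toy), ]

# Comparing on id and date only: row 3 repeats row 1 on those
# columns, so it is dropped and the first occurrence is kept.
subset_cols <- toy[!duplicated(toy[, c("id", "date")]), ]
```

With all columns considered, every row is kept; with only id and date considered, the third row is treated as a duplicate of the first. This is the kind of difference the choice of target columns makes.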