Check whether the order of the sequence of date-events is valid.
Source: R/check_date_sequence.R
Checks whether the dates in a set of specified columns occur in the expected chronological order.
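To make the idea of an "ordered" date sequence concrete, here is a minimal base R sketch of a row-wise order check. It is an illustration only, not the package's implementation: the helper is_ordered() and the toy values are hypothetical, and treating equal or missing dates as valid is an assumption.

```r
# Toy data: three event dates per row, already in ISO 8601 format
# (hypothetical values, for illustration only)
events <- data.frame(
  date_symptoms_onset  = as.Date(c("2024-01-01", "2024-01-10", "2024-02-01")),
  date_hospitalization = as.Date(c("2024-01-03", "2024-01-08", "2024-02-04")),
  date_death           = as.Date(c("2024-01-09", "2024-01-20", "2024-02-02"))
)
target_columns <- c("date_symptoms_onset", "date_hospitalization", "date_death")

# Hypothetical helper: a row is in order when every date is on or before
# the date in the next target column
is_ordered <- function(data, columns) {
  ordered <- rep(TRUE, nrow(data))
  for (i in seq_len(length(columns) - 1)) {
    earlier <- data[[columns[i]]]
    later <- data[[columns[i + 1]]]
    # assumption: a missing date does not count as a violation
    ordered <- ordered & (is.na(earlier) | is.na(later) | earlier <= later)
  }
  ordered
}

# Row numbers whose dates are out of sequence
bad_rows <- which(!is_ordered(events, target_columns))
bad_rows
#> [1] 2 3
```

Row 2 is flagged because hospitalization precedes symptom onset, and row 3 because death precedes hospitalization.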
Arguments
- data
The input data frame or linelist
- target_columns
A vector of column names for events. Users should specify at least 2 column names in the expected order. For example: target_columns = c("date_symptoms_onset", "date_hospitalization", "date_death"). When the input data is a linelist object, this parameter can be set to linelist_tags if you wish to check the date sequence across the tagged columns only. The date values in the target columns should be in the ISO8601 format (e.g. 2024-12-31). Otherwise, use the standardize_dates() function to standardize the target columns.
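Before calling the function, it can help to confirm that the target columns already follow the ISO8601 format. The following is a minimal base R sketch; is_iso8601() is a hypothetical helper and not part of the package, and it only covers the plain YYYY-MM-DD date form.

```r
# Hypothetical helper (not part of cleanepi): TRUE where a value is a valid
# calendar date written as YYYY-MM-DD
is_iso8601 <- function(x) {
  x <- as.character(x)
  # the text must have exactly the YYYY-MM-DD shape
  has_shape <- grepl("^[0-9]{4}-[0-9]{2}-[0-9]{2}$", x)
  # as.Date() returns NA for impossible dates such as 2024-02-30
  is_real_date <- !is.na(as.Date(x, format = "%Y-%m-%d"))
  has_shape & is_real_date
}

values <- c("2024-12-31", "31/12/2024", "2024-02-30", NA)
is_iso8601(values)
#> [1]  TRUE FALSE FALSE FALSE
```

Columns for which this returns FALSE on non-missing values are candidates for standardize_dates() before the sequence check.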
Value
The input dataset. When incorrect date sequences are found, they are stored in the report, which can be accessed using attr(data, "report").
Examples
# import the data
data <- readRDS(system.file("extdata", "test_df.RDS", package = "cleanepi"))
# standardize the date values
data <- data %>%
  standardize_dates(
    target_columns = c("date_first_pcr_positive_test", "date.of.admission"),
    error_tolerance = 0.4,
    format = NULL,
    timeframe = NULL
  )
# check the date sequence in two columns
good_date_sequence <- check_date_sequence(
  data = data,
  target_columns = c("date_first_pcr_positive_test", "date.of.admission")
)
#> Warning: Detected 2 incorrect date sequences at line(s): 6, 8