util_int_duplicate_ids_segment {dataquieR} | R Documentation |
Check for duplicated IDs
Description
This function tests for duplicates entries in identifiers. It is possible to check duplicated identifiers by study segments or to consider only selected segments.
Usage
util_int_duplicate_ids_segment(
level = c("segment"),
id_vars_list,
study_segment,
repetitions,
study_data,
meta_data,
meta_data_segment = "segment_level",
segment_level
)
Arguments
level |
character a character vector indicating whether the assessment should be conducted at the study level (level = "dataframe") or at the segment level (level = "segment"). |
id_vars_list |
list id variable names for each segment or data frame |
study_segment |
vector the segments or data frame names being assessed |
repetitions |
vector an integer vector indicating the number of allowed repetitions in the id_vars. Currently, no repetitions are supported. |
study_data |
data.frame the data frame that contains the measurements, mandatory. |
meta_data |
data.frame the data frame that contains metadata attributes of the study data, mandatory. |
meta_data_segment |
data.frame – optional: Segment level metadata |
segment_level |
data.frame alias for |
Value
a list with
-
SegmentData
: data frame with the results of the quality check for duplicated identifiers -
SegmentTable
: data frame with selected duplicated identifiers check results, used for the data quality report. -
Other
: named list with inner lists of unique cases containing each the row indices of duplicated identifiers separated by "|" , if any. outer names are names of the segments. Useprep_get_study_data_segment()
to get the data frame the indices refer to.
See Also
Other integrity_indicator_functions:
util_int_duplicate_content_dataframe()
,
util_int_duplicate_content_segment()
,
util_int_duplicate_ids_dataframe()
,
util_int_unexp_records_set_dataframe()
,
util_int_unexp_records_set_segment()