match_all_result {sae4health}    R Documentation

Matching Results Between DHS and GitHub Indicators, Imported from surveyPrev

Description

The 'match_all_result' dataset contains the results of matching DHS (Demographic and Health Surveys) indicators against indicators extracted from GitHub. Each row holds a similarity score, recode names, indicator definitions from both sources, and the start and end positions of the matched pattern within the text.

Usage

match_all_result

Format

A data frame with 132 rows and 20 variables:

indicator_ID_DHS

Character. Unique ID of the indicator in DHS.

DHS_label

Character. Label assigned to the indicator in DHS.

DHS_definition

Character. Description or definition of the DHS indicator.

Combined

Character. Combined text representation for similarity matching.

Similarity

Numeric. Similarity score between DHS and GitHub indicators.

indicator_ID_Github

Character. Unique ID of the indicator from GitHub.

recode_name

Character. Initial recoding name for the indicator.

updated_recode_name

Character. Revised recoding name after updates.

indicator_ID_Github_raw

Character. Raw version of the GitHub indicator ID.

indicator_def_github

Character. Processed indicator definition from GitHub.

indicator_def_github_raw

Character. Unprocessed raw definition from GitHub.

indicator_chapter

Character. The chapter or category of the indicator.

indicator_R_bundle

Character. Associated R bundle or package for processing.

start_position

Integer. Start position of the matched pattern in text.

end_position

Integer. End position of the matched pattern in text.

matched_pattern

Character. The exact text pattern matched between sources.

batch_recode_group

Character. Grouping variable for batch recoding.

indicator

Character. Final matched indicator name.

X

Type not documented. The purpose of this variable is unclear; it needs clarification and may be redundant.

ID_first_two_letters

Character. First two letters of the indicator ID, possibly used for grouping or sorting.
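
Examples

The following is a minimal sketch of how the columns described above might be used; it is not taken from the package. To stay runnable without the package installed, it builds a small toy data frame whose values are invented for illustration. It also rests on some assumptions: that Similarity lies on a 0 to 1 scale, that 0.8 is a reasonable cutoff, and that ID_first_two_letters is derived from indicator_ID_DHS. None of these is confirmed by this page.

```r
# Toy stand-in for match_all_result. These rows are hypothetical and are
# NOT values from the real dataset; only the column names come from the docs.
toy <- data.frame(
  indicator_ID_DHS = c("AA_DEMO_W_ONE", "BB_DEMO_C_TWO", "AA_DEMO_W_THR"),
  Similarity       = c(0.95, 0.62, 0.88),
  stringsAsFactors = FALSE
)

# Keep matches at or above a similarity cutoff.
# Assumption: scores are on a 0-1 scale and 0.8 is a sensible threshold.
cutoff <- 0.8
strong <- toy[toy$Similarity >= cutoff, ]

# Derive a two-letter prefix from the indicator ID.
# Assumption: this mirrors how ID_first_two_letters was built.
strong$ID_first_two_letters <- substr(strong$indicator_ID_DHS, 1, 2)

# Count the retained matches per prefix.
prefix_counts <- table(strong$ID_first_two_letters)
print(prefix_counts)
```

With sae4health installed, the same steps should apply to the real data by replacing toy with sae4health::match_all_result, assuming the dataset is lazy-loaded as the Usage section suggests.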


[Package sae4health version 1.2.3 Index]