match_all_result {sae4health} | R Documentation |
Matching Results Between DHS and GitHub Indicators, Imported from surveyPrev
Description
The dataset 'match_all_result' contains the results of matching indicators from DHS (Demographic and Health Surveys) with those extracted from GitHub. The dataset includes similarity scores, recoding names, indicator definitions, and positional information within the matched text.
Usage
match_all_result
Format
A data frame with 132 rows and 20 variables:
- indicator_ID_DHS
Character. Unique ID of the indicator in DHS.
- DHS_label
Character. Label assigned to the indicator in DHS.
- DHS_definition
Character. Description or definition of the DHS indicator.
- Combined
Character. Combined text representation for similarity matching.
- Similarity
Numeric. Similarity score between DHS and GitHub indicators.
- indicator_ID_Github
Character. Unique ID of the indicator from GitHub.
- recode_name
Character. Initial recoding name for the indicator.
- updated_recode_name
Character. Revised recoding name after updates.
- indicator_ID_Github_raw
Character. Raw version of the GitHub indicator ID.
- indicator_def_github
Character. Processed indicator definition from GitHub.
- indicator_def_github_raw
Character. Unprocessed raw definition from GitHub.
- indicator_chapter
Character. The chapter or category of the indicator.
- indicator_R_bundle
Character. Associated R bundle or package for processing.
- start_position
Integer. Start position of the matched pattern in text.
- end_position
Integer. End position of the matched pattern in text.
- matched_pattern
Character. The exact text pattern matched between sources.
- batch_recode_group
Character. Grouping variable for batch recoding.
- indicator
Character. Final matched indicator name.
- X
Unknown. This variable needs clarification or may be redundant.
- ID_first_two_letters
Character. First two letters of the indicator ID, possibly used for grouping or sorting.