Skip to contents

Validation functions

confront_data()
Wrapper for create_rules() and validate::confront()
create_rules() create.rules()
Create rules from the codebook using validate::validator()
find_col_info()
Find column information
validate_colnames()
Validate column names against codebook

Standardize data to codebook functions

standardize_cols_by_cb()
Standardize the columns in the order specified by the codebook

Access codebook functions

get_col_index()
Get column index for codebook variables in a list of data frames
get_variety_db() get.variety_db()
Return cultivar names along with unique internal identifier (no spaces or special characters)
list_db_books() list.db_books()
List the names of the codebooks in the database
list_db_var() list.db_var()
List the variables (column names) in a given database codebook
readin_db() readin.db()
Read in all codebooks and create a combined cultivar file

Data model (dm) functions

create_dm()
Create a data model object from the main codebook

Match variety modules

do_exactmatch()
Do the steps in the exact match
find_fuzzymatch()
Find fuzzymatch for variety using stringdist using methods "lv" and "jw"
process_fuzzymatch()
Module to process fuzzymatches given an auxiliary file
process_std_new_names()
Process standardized new names that were created from process_fuzzymatch()

Match variety functions

collect_final_matches()
Module to collect all the matches from each stage of the variety matching process
create_intid()
Create variety id (var_id) internal id (intid) for variety names
find_entries_raw_names()
Find entries for the cultivar names that need to be checked