Read-in functions

list_sheetnames() list.sheetnames()

List the names of all sheets within given files

read_excelsheet() read.excelsheet()

Read one excel sheet with read_excel

read_header() read.excelheader()

Read headers that span over multiple rows

read_multsheets() read.multsheets()

Read multiple files (csv or excel) and/or multiple sheets (within an excel file) and handle headers that span multiple rows.

read_ods_multsheets()

A wrapper around read_ods to read multiple sheets in a .ods file

read_sheets() read.sheets()

Read in sheets

Split tables functions

split_df()

Split a large table into smaller tables if there are blank columns or rows

split_direction()

Split a large table in one direction if there are blank columns or rows

Clean columns functions

rename_col() rename.col

Rename columns with a data.frame containing old and new column names

rm_dup_col() rm.dup_col()

Remove duplicate columns that may have different column names

Join functions

full_join_compare()

Full join data.frames to compare columns add a suffix before the merge to track where the column originated

Clean column contents

plot_map()

Check field map layout for errors and duplicate range/row locations.

set_values_to_na()

Set -9 and 0s to NA for given columns

Summarize column functions

find_col_info()

Find column information in a data.frame

select_colsfromlist() select.colsfromlist

Select columns from data.frames stored within a list

summarize_colnames() summarize.colnames

Create a list of column names that match a given regex for all data.frames in given list The function returns the proportion of the data.frames that contain the column name

summarize_ncol() summarize.ncol

Summarise the number of columns per data.frame that match a given regex over a list of data.frames This function detects columns with similar names within a dataset

summarize_variables() summarize.variables

Summarize information on variables that match a given regex