High-level convenience wrapper that processes FFIEC Call Report bulk zip files into Parquet format, optionally creating item-level metadata.
Usage
ffiec_process(
  zipfiles = NULL,
  raw_data_dir = NULL,
  data_dir = NULL,
  schema = "ffiec",
  create_item_pqs = TRUE,
  keep_process_data = NULL,
  use_multicore = FALSE
)
Arguments
- zipfiles
Optional character vector of FFIEC bulk zip file paths. If NULL, zip files are discovered automatically from the resolved raw data directory.
- raw_data_dir
Optional parent directory containing FFIEC bulk zip files. If provided and schema is not NULL, files are expected under file.path(raw_data_dir, schema). If NULL, the environment variable RAW_DATA_DIR is used.
- data_dir
Optional parent directory for Parquet output. If provided and schema is not NULL, files are written under file.path(data_dir, schema). If NULL, the environment variable DATA_DIR is used.
- schema
Schema name used to resolve input and output directories (default "ffiec"). If NULL, directories are resolved directly without appending a schema subdirectory.
- create_item_pqs
Logical; if TRUE, create or update FFIEC item metadata Parquet files as part of processing.
- keep_process_data
Logical; whether to write the processing log returned by ffiec_process() to "ffiec_process_data.parquet" in the resolved output directory. If NULL, defaults to TRUE when zipfiles is NULL and FALSE when zipfiles is supplied.
- use_multicore
Logical; whether to attempt parallel execution when reading Parquet metadata. If TRUE and the optional packages future and furrr are installed, operations are parallelized using a multisession plan. Defaults to FALSE.
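The directory and default rules described in the arguments above can be sketched in base R. This is a minimal illustration only, not the package's actual implementation: resolve_dir() and default_keep_process_data() are hypothetical helper names, and the sketch assumes the schema subdirectory is also appended when the directory comes from the environment variable, which the argument descriptions do not state explicitly.

```r
# Hypothetical helper (not part of the package): resolves an input or
# output directory following the documented rules for raw_data_dir and
# data_dir.
resolve_dir <- function(dir = NULL, schema = "ffiec", envvar = "DATA_DIR") {
  # Fall back to the environment variable when no directory is supplied
  if (is.null(dir)) dir <- Sys.getenv(envvar)
  if (!nzchar(dir)) {
    stop("No directory supplied and ", envvar, " is not set")
  }
  # Assumption: the schema subdirectory is appended in both cases,
  # unless schema is NULL
  if (is.null(schema)) dir else file.path(dir, schema)
}

# Hypothetical helper: the documented default for keep_process_data.
# When it is NULL, the log is kept only if zip files were auto-discovered.
default_keep_process_data <- function(zipfiles = NULL,
                                      keep_process_data = NULL) {
  if (is.null(keep_process_data)) is.null(zipfiles) else keep_process_data
}

resolve_dir("/tmp/raw", schema = "ffiec", envvar = "RAW_DATA_DIR")
# "/tmp/raw/ffiec"
resolve_dir("/tmp/raw", schema = NULL, envvar = "RAW_DATA_DIR")
# "/tmp/raw"
```

Under the same reading, calling ffiec_process() with no arguments while RAW_DATA_DIR and DATA_DIR are set would discover zip files under the "ffiec" subdirectory of the raw data directory, write Parquet files under the "ffiec" subdirectory of the output directory, and keep the processing log, because zipfiles is NULL.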