Reads FFIEC bulk XBRL zip files (each containing many *.xbrl.xml inner
files), extracts reported XBRL facts, and writes a Parquet file per zipfile
into the resolved Parquet output directory.
Usage
ffiec_process_xbrls(
zipfiles = NULL,
raw_data_dir = NULL,
data_dir = NULL,
schema = "ffiec",
use_multicore = FALSE,
ns_prefix = "cc",
prefix = ""
)Arguments
- zipfiles
Optional character vector of FFIEC bulk XBRL zip file paths. If
NULL, zip files are discovered automatically in the resolved input directory (usingffiec_list_zips(type = "xbrl")).- raw_data_dir
Optional parent directory containing FFIEC bulk zip files.
- data_dir
Optional parent directory for Parquet output.
- schema
Schema subdirectory name (default
"ffiec"). Set toNULLto useraw_data_dir/data_dirdirectly.- use_multicore
Logical; if
TRUEand packagesfutureandfurrrare installed, process zip files in parallel usingfuture::multisession().- ns_prefix
XML namespace prefix to extract (default
"cc").- prefix
Optional filename prefix for output Parquet file names.
Value
A tibble with one row per processed zipfile, including the output
Parquet basename. The attribute "out_dir" contains the resolved
output directory.
Details
Directory resolution follows the same rules as ffiec_process():
raw_data_dir / RAW_DATA_DIR for inputs and
data_dir / DATA_DIR for outputs, optionally with a schema
subdirectory when schema is non-NULL.