PostgreSQL to Parquet

Functions for creating Parquet data from PostgreSQL sources.

db_to_pq()
Export a PostgreSQL table to Parquet
db_schema_to_pq()
Export all tables in a PostgreSQL schema to Parquet
db_schema_tables()
List PostgreSQL relations in a schema
lazy_tbl_to_pq()
Export a lazy dbplyr table to Parquet
tbl_to_pq()
Export a lazy dbplyr table to Parquet via ADBC

WRDS to Parquet

Functions for creating Parquet data from WRDS.

wrds_update_pq()
Export a WRDS table to Parquet, skipping if already up to date
wrds_schema_to_pq()
Export all tables in a WRDS schema to Parquet
wrds_sql_to_pq()
Export a WRDS SQL query to Parquet
wrds_get_tables()
List WRDS relations in a schema

WRDS authentication and configuration

wrds_get_username()
Resolve a WRDS username
wrds_conninfo()
Build WRDS PostgreSQL connection information
wrds_connect_dbi()
Connect to WRDS PostgreSQL with DBI
wrds_check_credentials()
Check WRDS PostgreSQL credentials

Parquet file utilities

Functions for managing Parquet data repositories.

pq_data_dir()
Resolve the Parquet data repository directory
pq_last_modified()
Get last-modified metadata for Parquet data files
pq_archive()
Archive a Parquet file into the archive subdirectory
pq_restore()
Restore an archived Parquet file to the active schema directory
pq_remove()
Remove a Parquet file from active or archive storage

ADBC development helpers

ADBC support is experimental, as ADBC drivers are still early in their development.

adbc_diagnostics()
Report optional ADBC dependency status
con_to_adbi()
Create an ADBC PostgreSQL connection from an RPostgres connection
tbl_to_pq_debug()
Debug the ADBC chunk fetch path for a lazy dbplyr table