Skip to contents

Exports all tables from a DuckDB connection to parquet files. Optionally strips provenance columns for public releases.

Usage

write_parquet_outputs(
  con,
  output_dir,
  tables = NULL,
  strip_provenance = TRUE,
  compression = "snappy"
)

Arguments

con

DuckDB connection

output_dir

Directory for parquet files

tables

Optional vector of table names to export. If NULL, exports all tables.

strip_provenance

Remove provenance columns (source*, _ingested_at) (default: TRUE)

compression

Parquet compression codec (default: "snappy")

Value

Tibble with export statistics (table, rows, file_size, path)

Examples

if (FALSE) { # \dontrun{
con <- get_duckdb_con("calcofi.duckdb")
stats <- write_parquet_outputs(con, "output/parquet")
} # }