Retrieves metadata (size, md5, path) for all CSV files in a GCS archive.
The md5 hash is converted from GCS base64 encoding to hex for comparison
with tools::md5sum().
Usage
get_archive_manifest(
archive_timestamp,
provider,
dataset,
gcs_bucket = "calcofi-files-public",
archive_prefix = "archive"
)
Arguments
- archive_timestamp
Archive timestamp (e.g., "2026-02-02_121557")
- provider
Data provider (e.g., "swfsc.noaa.gov")
- dataset
Dataset name (e.g., "calcofi-db")
- gcs_bucket
GCS bucket name
- archive_prefix
Archive folder prefix
Value
Tibble with columns: name, size, md5, gcs_path
Examples
if (FALSE) { # \dontrun{
manifest <- get_archive_manifest(
archive_timestamp = "2026-02-02_121557",
provider = "swfsc.noaa.gov",
dataset = "calcofi-db")
} # }