Explore Scripps PIC Matches with CalCOFI Icthyoplankton

Published

2026-02-10

Claude Code Interactions

Used Claude Code by Anthropic from the command line to develop this notebook.

# from working directory of ~/Github/CalCOFI, with Claude CODE in plan mode sent:

Evaluate the PROMPT and expand the notebook @workflows/explore_zooplankton.qmd

PROMPT

The goal is to explore the SIO / CalCOFI zooplankton data (more at Zooplankton – CalCOFI) to see where there are matches to the ichthyoplankton data. The data files from the CalCOFI data manager Linsey include:

plankton_data: @~/My Drive/projects/calcofi/data-public/scripps.ucsd.edu/pelagic-invertebrate-collection/SIOPIC_DB_PNTtable_allRecords_wUUID_9Feb2026.csv
plankton_metadata: @~/My Drive/projects/calcofi/data-public/scripps.ucsd.edu/pelagic-invertebrate-collection/SIOPIC_DB_PNTtable_fieldDetails_9Feb2026.xlsx'

And notes from Linsey:

I included our Parent Net Tow (= pnt suffix you will see in fields) Table with all net tow records and a second “ReadMe” file with field definitions and “tips” for those.

Our ask is to see where PIC net tow samples are a match to SWF ichthyo. samples potentially utilizing: Expedition, Expedition Code, Station Line, Station Number, Latitude/Longitude polygon?, Net Type, Mesh Size?, Tow Type?, Fixative?, Preservative?

I introduced a new field ‘Expedition_Type_pnt’ and was able to easily integrate term = CalCOFI for all cruises we’d want in this first pass that represent the most formal CalCOFI net tow records. I will have to at some point integrate additional terms e.g. ‘NOAA Other’ for cruises that have CalCOFI stations sampled for special purposes (e.g. SaKE, CPS surveys, STARR, MERRP, ORCA WHALE Expeditions, etc.) but this would have taken too much time to integrate universally atm.

So, I believe it would be good to run this first pass for the records that have the Expedition_Type_pnt = CalCOFI and then the complete record set as second priority.

Use the database of ichthyoplankton data now available via @calcofi4r/ ie with:

# remotes::install_github("calcofi/calcofi4r")  # install package (once, and per version)
library(calcofi4r)  # load the package
con <- cc_get_db()# connect to latest frozen release
cc_list_tables() # list available tables
DBI::dbGetQuery(con, "SELECT COUNT(*) FROM ichthyo") # query with SQL

Reference the space-time windowing in the @int-app/app/ workflow (but with older version of database having different schema). Try different approaches to matching the data, including:

single nearest cast (by time)
single nearest cast (by space)

Scatterplot showing relating icthyo dataset to bottle environmental dataset with space-time windows from the CalCOFI integrated app.

Response

⏺ The plan is complete. I explored:                                            
  1. The zooplankton CSV (148K records, 20 columns) and Excel metadata                                                                                                                                          
  2. The calcofi4r database schema (ichthyo → net → tow → site → cruise chain with integer IDs)                                                                                                                 
  3. The prep_splot() space-time matching reference in int-app/app/functions.R                                                                                                                                  
  4. The merge_ichthyo_bottle.qmd notebook for style/pattern reference                                                                                                                                          
                                                                               
  The plan covers 6 notebook sections: Setup, Zooplankton exploration, Ichthyoplankton exploration, Station-based matching (line + station + cruise), Space-time window matching (adapted from prep_splot()),
  and a Comparison summary.

────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────────
 Ready to code?
                                                                                                                                                                                                                
 Here is Claude's plan:
╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌
 Plan: Expand workflows/explore_zooplankton.qmd                                

 Context

 The CalCOFI data manager Linsey provided SIO zooplankton (PIC) net tow data (148K records) and wants to find where these match SWFSC ichthyoplankton samples. This notebook will explore both datasets, try
 multiple matching strategies, and report match rates to guide integration.

 File to modify

 - /Users/bbest/Github/CalCOFI/workflows/explore_zooplankton.qmd

 Key reference files

 - int-app/app/functions.R:716-788 — prep_splot() space-time matching pattern
 - calcofi4r/R/read.R — cc_get_db(), cc_read_*() convenience functions
 - workflows/merge_ichthyo_bottle.qmd — notebook style reference (librarian::shelf, datatable, dir_data pattern)
 - workflows/ingest_swfsc.noaa.gov_calcofi-db.qmd:600-630 — DB schema (cruise_key, site_id, tow_id, net_id)

 Database schema (confirmed)

 cruise (cruise_key PK, ship_key FK, date_ym)
   → site (site_id PK, cruise_key FK, orderocc, latitude, longitude, line, station)
     → tow (tow_id PK, site_id FK, tow_type_key, tow_number, time_start)
       → net (net_id PK, tow_id FK, side, std_haul_factor, vol_sampled_m3, ...)
         → ichthyo (ichthyo_id PK, net_id FK, species_id FK, tally, ...)

 Zooplankton CSV columns

 EXPEDITION_pnt, EXPED_CODE_pnt, Expedition_Type_pnt, SHIP_pnt, SWFOrder_Occ, STATION_LINE_pnt, STATION_NUMBER_pnt, LAT_DECIMAL_pnt, LONG_DECIMAL_pnt, SAMPLE_DATE_pnt, START_TIME_pnt, END_TIME_pnt,
 DEPTH_MIN_pnt, DEPTH_MAX_pnt, MAX_MWO_pnt, NET_TYPE_pnt, MESH_SIZE_pnt, TOW_TYPE_pnt, FIXATIVE_pnt, PRESERVATIVE_pnt

 ---
 Notebook Sections

 1. Setup

 - librarian::shelf(calcofi/calcofi4r, DBI, dplyr, DT, glue, here, leaflet, lubridate, mapview, purrr, readr, readxl, sf, stringr, tibble, tidyr, quiet = T)
 - Define file paths using dir_data pattern from merge_ichthyo_bottle.qmd (Linux vs macOS toggle)
 - Read metadata Excel with readxl::read_excel(), display with DT::datatable()

 2. Read & Explore Zooplankton Data

 - Read CSV with readr::read_csv()
 - Parse datetimes: mdy(SAMPLE_DATE_pnt) for date; str_pad(START_TIME_pnt, 4, pad = "0") for time; combine into datetime_start with ymd_hm() and tz = "America/Los_Angeles"
 - Summary stats: nrow, date range, expedition type breakdown via count() + datatable()
 - Filter: Expedition_Type_pnt == "CalCOFI" → d_zoo_cc (82K records, first pass per Linsey)
 - Attribute tables: net types, tow types, mesh sizes, fixatives for CalCOFI subset
 - Map stations: distinct lat/lon → sf::st_as_sf() → mapview() colored by line

 3. Read & Explore Ichthyoplankton Data

 - Connect: con <- cc_get_db(); cc_list_tables(); cc_describe_table("site")
 - Build tow-level flat view via dbplyr:
 d_ich_tows <- tbl(con, "tow") |>
   left_join(tbl(con, "site"),   by = "site_id") |>
   left_join(tbl(con, "cruise"), by = "cruise_key") |>
   select(tow_id, site_id, cruise_key, ship_key, date_ym,
          orderocc, latitude, longitude, line, station,
          tow_type_key, tow_number, time_start) |>
   collect()
 - Summary stats: nrow, date range, station count
 - Map stations: distinct lat/lon → mapview() colored by line

 4. Match by Station (Line + Station Number)

 - Prepare: Convert STATION_LINE_pnt and STATION_NUMBER_pnt to numeric; inspect EXPED_CODE_pnt format (YYMM vs YYYYMM)
 - Cruise code reconciliation: cruise_key is YYMMKK format. EXPED_CODE_pnt is YYMM or YYYYMM. Extract YY and MM from both for comparison, or derive a cruise_ym from date_ym
 - Match 1 — line + station only: inner_join(by = c("line", "station")) → many-to-many (all cruises)
 - Match 2 — line + station + cruise date: add expedition code matching
 - Report: match counts, percentage of zoo records matched, unmatched records via anti_join()
 - Summary by year: count(year) + datatable()

 5. Match by Space-Time Window

 Adapt the prep_splot() approach from int-app/app/functions.R:

 - Register zoo data in DuckDB: DBI::dbWriteTable(con, "zoo_pnt", d_zoo_db, overwrite = TRUE) after selecting/renaming key columns
 - Load spatial extension: DBI::dbExecute(con, "INSTALL spatial; LOAD spatial;")
 - Create ichthyo tow view: SQL view joining tow → site → cruise with lat/lon and time_start
 - Space-time match function using SQL:
   - Time window: ich_datetime BETWEEN zoo_datetime - INTERVAL N HOUR AND zoo_datetime + INTERVAL N HOUR
   - Distance: ST_Distance_Sphere(ST_Point(zoo_lon, zoo_lat), ST_Point(ich_lon, ich_lat)) <= M
   - Return: both coordinates, time_diff_hrs, dist_m
 - Parameter sweep: try combinations of 6h/12h/24h × 2km/5km, report match rates in a summary table
 - Method 1 — nearest by time: group_by(zoo_row) |> slice_min(time_diff_hrs)
 - Method 2 — nearest by distance: group_by(zoo_row) |> slice_min(dist_m)
 - Map matched pairs: sf linestrings connecting zoo→ichthyo points, colored by distance

 6. Comparison & Summary

 - Summary table: all methods side-by-side (station match, station+cruise, space-time variants) with counts and percentages
 - Gap analysis: zoo records unmatched by any method, by year
 - Recommendations: which approach gives best coverage, optimal thresholds, next steps (full dataset beyond CalCOFI type)

 ---
 Technical notes

 - Timezone: Zoo times are PST. Ichthyo time_start needs inspection — may be UTC. Handle explicitly.
 - Memory: Don't collect full ichthyo table (~830K rows). Work at tow level (~75K) for matching.
 - DuckDB spatial: Must INSTALL spatial; LOAD spatial; before using ST_Distance_Sphere(). The cc_get_db() connection may be read-only views on GCS parquet — need to handle temp table creation carefully (may
 need a separate local DuckDB, or use duckdb() in-memory with both datasets registered).
 - Code style: 2-space indent, |> pipe, snake_case, lowercase comments, librarian::shelf(), datatable() for tables, mapview() for maps.

 Verification

 1. Render with quarto::quarto_render("workflows/explore_zooplankton.qmd")
 2. Check that all sections produce output (tables, maps, match statistics)
 3. Verify the space-time matching function returns reasonable match rates
 4. Confirm the summary comparison table shows all methods
╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌╌

 Claude has written up a plan and is ready to execute. Would you like to proceed?

 ❯ 1. Yes, clear context and auto-accept edits (shift+tab)
   2. Yes, auto-accept edits
   3. Yes, manually approve edits
   4. Type here to tell Claude what to change

⏺ The notebook has been written. Here's a summary of what was implemented in workflows/explore_zooplankton.qmd:                                                                                                                                                                                                  
  6 sections, ~450 lines of R/Quarto code:                                                                                                                        
                                                                                                                                                                  
  1. Setup — librarian::shelf() for all packages, Google Drive file paths, metadata Excel display                                                                 
  2. Read & Explore Zooplankton — CSV read, datetime parsing (PST), expedition type breakdown, CalCOFI filter, attribute tables (net/tow/mesh/fixative), station
  map via mapview()
  3. Read & Explore Ichthyoplankton — cc_get_db() connection, tow-level flat view via dbplyr (tow → site → cruise join), schema inspection, station map
  4. Match by Station — Line+station numeric conversion, EXPED_CODE_pnt → YYMM reconciliation with cruise_key YYMMKK, two match methods (line+station only,
  line+station+cruise YYMM), year-by-year breakdown
  5. Match by Space-Time Window — Separate in-memory DuckDB (since cc_get_db() is read-only), both datasets registered, match_spacetime() SQL function using
  ST_Distance_Sphere() + time interval, parameter sweep (6h/12h/24h × 2km/5km), nearest-by-time and nearest-by-distance methods, matched-pair map with sf
  linestrings
  6. Comparison & Summary — Side-by-side method comparison table, gap analysis by year, recommendations for next steps

  Key technical decisions:
  - Separate duckdb() in-memory connection for space-time matching (the cc_get_db() connection creates read-only views on GCS parquet)
  - Working at tow level (~75K rows) rather than full ichthyo table (~830K rows)
  - Zoo times treated as PST; ichthyo time_start inspected explicitly

The pandoc portion of rendering seems stuck. How to fix or explore, inspect running processes? processing file: explore_zooplankton.qmd


⏺ Pandoc is running at 99.6% CPU and has been going for 5+ minutes. This is almost certainly due to the large HTML widgets (multiple mapview maps with many points
   embedded inline). Let me check the intermediate file size.
                                                                                                                                                                  
⏺ Read 1 file (ctrl+o to expand)                                                                                                                                  
                                                                                                                                                                  
⏺ The intermediate markdown is 21 MB — that's what's making pandoc slow. The mapview widgets with many points generate massive embedded JSON/HTML. Pandoc is still
   working (99% CPU), so it will likely finish — it's just slow with a 21MB input.                                                                            
                                                                                                                                                                  
  You have two options:                                                                                                                                       
                                                                                                                                                                  
  Option 1: Wait it out — Pandoc is actively processing (not actually stuck). With a 21MB file it may take 10-15 minutes total.                                 
                                                                                                                                                                  
  Option 2: Kill and re-render with smaller maps — I can reduce the station point counts to speed things up.                                                      
                                                                                                                                                                  
  To reduce map size, I'd limit mapview() calls to show fewer points (e.g., sample down to ~500 stations instead of all distinct stations). The main culprits are:
  - map_zoo — all distinct CalCOFI zoo stations                                                                                                                   
  - map_ichthyo — all distinct ichthyo stations                                                                                                                   
  - map_matched_pairs — 500 linestrings (already capped)                                                                                                          
                                                                                                                                                                  
  Want me to:
  1. Wait for pandoc to finish (it is progressing, just slowly)?
  2. Kill pandoc and optimize the maps to reduce the HTML size, then re-render?

2

⏺ Rendered successfully! Let me verify the output.                                                                                                                
                                                                                   
⏺ Read 1 file (ctrl+o to expand)                                                                                                                                  
                                                                                                                                                                  
⏺ Rendered successfully. The fix was switching from mapview() to leaflet() with markerClusterOptions() — the HTML dropped from 21 MB (causing pandoc to churn for 
  10+ minutes) down to 2.5 MB, and pandoc completed quickly.                       
                                                                                                                                                                  
  What changed:                                                                    
  - map_zoo — leaflet with clustered circle markers instead of mapview                                                                                            
  - map_ichthyo — same leaflet clustering approach                                 
  - map_matched_pairs — leaflet polylines with colorNumeric palette + legend, capped at 200 pairs instead of 500

✻ Crunched for 1m 13s

Update with files containing UUIDs

Update @workflows/explore_zooplankton.qmd with the following file updates from Linsey, which now include the net tow primary keys and UUIDs.

I realized my first pass at this I did not incorporate our net tow primary keys nor UUIDs. They are now integrated in the updated files:

plankton_data: @~/My Drive/projects/calcofi/data-public/scripps.ucsd.edu/pelagic-invertebrate-collection/SIOPIC_DB_PNTtable_allRecords_wUUID_9Feb2026.csv
plankton_metadata: @~/My Drive/projects/calcofi/data-public/scripps.ucsd.edu/pelagic-invertebrate-collection/SIOPIC_DB_PNTtable_fieldDetails_9Feb2026.xlsx

(Retain the “Claude Code Interactions” section untouched)

⏺ Done. Three changes made (leaving the "Claude Code Interactions" section untouched):                                                                            
  
  1. dir_data — updated from calcofi.org/zooplankton to scripps.ucsd.edu/pelagic-invertebrate-collection                                                          
  2. CSV filename — updated from ...allRecords_9Feb2026.csv to ...allRecords_wUUID_9Feb2026.csv
  3. register_zoo chunk — added zoo_pk = ParentNetTowEventID_pk and zoo_uuid = occurenceID_UUID_pnt to the DuckDB registration so the new primary key and UUID    
  columns flow through to the space-time matching results                                                                                                         

  The xlsx filename stayed the same (just the directory changed). The Overview description was updated to note "22 columns incl. primary keys + UUIDs".

Fix messages

Why are message() statements not outputting to rendered html?

⏺ message() writes to stderr, which knitr captures separately as a "message" output class. In Quarto HTML with code-fold: true, these can be visually collapsed or
   styled in a way that makes them easy to miss — and some configurations suppress them.                                                                        
                                                                                                                                                                  
  The simplest fix: switch to cat() for output you want visible in the rendered HTML. cat() writes to stdout, which always renders as standard console output.
                                                                                                                                                                  
  For example, change:                                                             
  message(glue("Zooplankton PNT records: {format(nrow(d_zoo), big.mark = ',')}"))
  to:
  cat(glue("Zooplankton PNT records: {format(nrow(d_zoo), big.mark = ',')}"), "\n")

  Want me to replace all message(glue(...)) calls with cat(glue(...), "\n") throughout the notebook?

yes

⏺ All 28 message() calls replaced with cat(). The output will now render as standard console text in the HTML instead of going to stderr.

Overview

Goal: Explore the SIO / CalCOFI zooplankton (PIC) net tow data to find where records match SWFSC ichthyoplankton samples. More at Zooplankton – CalCOFI.

Data sources:

Zooplankton: SIO PIC Parent Net Tow table (148K records, 22 columns incl. primary keys + UUIDs) from CalCOFI data manager Linsey
Ichthyoplankton: SWFSC database via calcofi4r::cc_get_db()

Matching strategies:

Station match — line + station number
Station + cruise match — line + station + expedition code
Space-time window — adapted from prep_splot() in the CalCOFI integrated app

Notes from Linsey:

I included our Parent Net Tow (= pnt suffix you will see in fields) Table with all net tow records and a second “ReadMe” file with field definitions and “tips” for those.

Our ask is to see where PIC net tow samples are a match to SWF ichthyo. samples potentially utilizing: Expedition, Expedition Code, Station Line, Station Number, Latitude/Longitude polygon?, Net Type, Mesh Size?, Tow Type?, Fixative?, Preservative?

I introduced a new field ‘Expedition_Type_pnt’ and was able to easily integrate term = CalCOFI for all cruises we’d want in this first pass that represent the most formal CalCOFI net tow records.

So, I believe it would be good to run this first pass for the records that have the Expedition_Type_pnt = CalCOFI and then the complete record set as second priority.

1 Setup

Code

librarian::shelf(
  calcofi / calcofi4r,
  DBI,
  dplyr,
  DT,
  duckdb,
  glue,
  here,
  leaflet,
  lubridate,
  mapview,
  purrr,
  readr,
  readxl,
  sf,
  stringr,
  tibble,
  tidyr,
  quiet = T
)
options(readr.show_col_types = FALSE)
options(DT.options = list(scrollX = TRUE))

# data directory (Google Drive)
dir_data <- "~/My Drive/projects/calcofi/data-public/scripps.ucsd.edu/pelagic-invertebrate-collection"

# file paths (updated with primary keys + UUIDs)
zoo_csv <- file.path(
  dir_data,
  "SIOPIC_DB_PNTtable_allRecords_wUUID_9Feb2026.csv"
)
zoo_xlsx <- file.path(
  dir_data,
  "SIOPIC_DB_PNTtable_fieldDetails_9Feb2026.xlsx"
)

stopifnot(
  "Zooplankton CSV not found" = file.exists(zoo_csv),
  "Zooplankton Excel not found" = file.exists(zoo_xlsx)
)

1.1 Metadata

Code

d_meta <- read_excel(zoo_xlsx)
datatable(
  d_meta,
  caption = "Zooplankton PNT field definitions"
)

2 Read & Explore Zooplankton Data

2.1 Read CSV

Code

d_zoo <- read_csv(zoo_csv)

cat(
  glue(
    "Zooplankton PNT records: {format(nrow(d_zoo), big.mark = ',')}"
  ),
  "\n"
)

Zooplankton PNT records: 148,129

Code

cat(
  glue(
    "Columns: {ncol(d_zoo)}"
  ),
  "\n"
)

Columns: 22

2.2 Parse datetimes

Code

d_zoo <- d_zoo |>
  mutate(
    # parse date (M/D/YYYY format)
    date_sample = mdy(SAMPLE_DATE_pnt),
    # pad time to 4 chars (e.g. 830 → "0830")
    time_str = str_pad(
      as.character(START_TIME_pnt),
      4,
      pad = "0"
    ),
    # combine date + time as PST
    datetime_start = ymd_hm(
      paste(date_sample, time_str),
      tz = "America/Los_Angeles"
    ),
    year = year(date_sample)
  ) |>
  select(-time_str)

cat(
  glue(
    "Date range: {min(d_zoo$date_sample, na.rm = TRUE)} to ",
    "{max(d_zoo$date_sample, na.rm = TRUE)}"
  ),
  "\n"
)

Date range: 1900-03-03 to 2024-04-19

Code

cat(
  glue(
    "Parsed datetime: {sum(!is.na(d_zoo$datetime_start))} of ",
    "{nrow(d_zoo)} records"
  ),
  "\n"
)

Parsed datetime: 144107 of 148129 records

2.3 Summary stats

Code

# expedition type breakdown
d_zoo |>
  count(Expedition_Type_pnt, name = "n_records") |>
  arrange(desc(n_records)) |>
  datatable(caption = "Zooplankton records by expedition type")

2.4 Filter to CalCOFI

Code

d_zoo_cc <- d_zoo |>
  filter(Expedition_Type_pnt == "CalCOFI")

cat(
  glue(
    "CalCOFI subset: {format(nrow(d_zoo_cc), big.mark = ',')} records ",
    "({round(100 * nrow(d_zoo_cc) / nrow(d_zoo), 1)}% of total)"
  ),
  "\n"
)

CalCOFI subset: 82,343 records (55.6% of total)

2.5 Attribute tables

Code

# net types
d_zoo_cc |>
  count(NET_TYPE_pnt, name = "n") |>
  arrange(desc(n)) |>
  datatable(caption = "CalCOFI zooplankton net types")

Code

# tow types
d_zoo_cc |>
  count(TOW_TYPE_pnt, name = "n") |>
  arrange(desc(n)) |>
  datatable(caption = "CalCOFI zooplankton tow types")

Code

# mesh sizes
d_zoo_cc |>
  count(MESH_SIZE_pnt, name = "n") |>
  arrange(desc(n)) |>
  datatable(caption = "CalCOFI zooplankton mesh sizes")

Code

# fixatives
d_zoo_cc |>
  count(FIXATIVE_pnt, name = "n") |>
  arrange(desc(n)) |>
  datatable(caption = "CalCOFI zooplankton fixatives")

2.6 Map stations

Code

# distinct station locations
sf_zoo <- d_zoo_cc |>
  filter(
    !is.na(LAT_DECIMAL_pnt),
    !is.na(LONG_DECIMAL_pnt)
  ) |>
  distinct(
    STATION_LINE_pnt,
    STATION_NUMBER_pnt,
    LAT_DECIMAL_pnt,
    LONG_DECIMAL_pnt
  ) |>
  st_as_sf(
    coords = c("LONG_DECIMAL_pnt", "LAT_DECIMAL_pnt"),
    crs = 4326
  )

cat(
  glue(
    "Distinct CalCOFI zoo stations: {nrow(sf_zoo)}"
  ),
  "\n"
)

Distinct CalCOFI zoo stations: 18686

Code

# use leaflet with clustering for lighter HTML output
leaflet(sf_zoo) |>
  addProviderTiles("CartoDB.Positron") |>
  addCircleMarkers(
    radius = 3,
    stroke = FALSE,
    fillOpacity = 0.6,
    popup = ~ paste0(
      "Line: ",
      STATION_LINE_pnt,
      "<br>Station: ",
      STATION_NUMBER_pnt
    ),
    clusterOptions = markerClusterOptions()
  )

3 Read & Explore Ichthyoplankton Data

3.1 Connect to database

Code

con <- cc_get_db()
tbls <- cc_list_tables()
cat(glue("Tables: {paste(tbls, collapse = ', ')}"), "\n")

Tables: bottle, bottle_measurement, cast_condition, casts, cruise, grid, ichthyo, lookup, measurement_type, net, segment, ship, site, species, taxa_rank, taxon, tow

Code

# show site schema
cc_describe_table("site") |>
  datatable(caption = "Site table schema")

Code

# show tow schema
cc_describe_table("tow") |>
  datatable(caption = "Tow table schema")

Code

# show cruise schema
cc_describe_table("cruise") |>
  datatable(caption = "Cruise table schema")

3.2 Build tow-level flat view

Code

# build tow-level view joining tow → site → cruise
d_ich_tows <- tbl(con, "tow") |>
  left_join(tbl(con, "site"), by = "site_id") |>
  left_join(tbl(con, "cruise"), by = "cruise_key") |>
  select(
    tow_id,
    site_id,
    cruise_key,
    ship_key,
    date_ym,
    orderocc,
    latitude,
    longitude,
    line,
    station,
    tow_type_key,
    tow_number,
    time_start
  ) |>
  collect()

cat(
  glue(
    "Ichthyo tow records: {format(nrow(d_ich_tows), big.mark = ',')}"
  ),
  "\n"
)

Ichthyo tow records: 75,506

Code

cat(
  glue(
    "Date range: {min(d_ich_tows$date_ym, na.rm = TRUE)} to ",
    "{max(d_ich_tows$date_ym, na.rm = TRUE)}"
  ),
  "\n"
)

Date range: 1951-01-01 to 2023-01-01

Code

cat(
  glue(
    "Distinct stations (line-station): ",
    "{d_ich_tows |> distinct(line, station) |> nrow()}"
  ),
  "\n"
)

Distinct stations (line-station): 7529

3.3 Inspect ichthyo time_start

Code

# check what time_start looks like
d_ich_tows |>
  filter(!is.na(time_start)) |>
  slice_sample(n = 10) |>
  select(tow_id, cruise_key, date_ym, time_start, line, station) |>
  datatable(caption = "Sample ichthyo tow records with time_start")

3.4 Map stations

Code

sf_ich <- d_ich_tows |>
  filter(!is.na(latitude), !is.na(longitude)) |>
  distinct(line, station, latitude, longitude) |>
  st_as_sf(
    coords = c("longitude", "latitude"),
    crs = 4326
  )

cat(
  glue(
    "Distinct ichthyo stations: {nrow(sf_ich)}"
  ),
  "\n"
)

Distinct ichthyo stations: 31323

Code

# use leaflet with clustering for lighter HTML output
leaflet(sf_ich) |>
  addProviderTiles("CartoDB.Positron") |>
  addCircleMarkers(
    radius = 3,
    stroke = FALSE,
    fillOpacity = 0.6,
    popup = ~ paste0(
      "Line: ",
      line,
      "<br>Station: ",
      station
    ),
    clusterOptions = markerClusterOptions()
  )

4 Match by Station (Line + Station Number)

4.1 Prepare matching keys

Code

# prepare zoo keys: convert line/station to numeric
d_zoo_match <- d_zoo_cc |>
  mutate(
    line = as.numeric(STATION_LINE_pnt),
    station = as.numeric(STATION_NUMBER_pnt)
  ) |>
  filter(!is.na(line), !is.na(station))

cat(
  glue(
    "Zoo records with valid line+station: ",
    "{format(nrow(d_zoo_match), big.mark = ',')} of ",
    "{format(nrow(d_zoo_cc), big.mark = ',')}"
  ),
  "\n"
)

Zoo records with valid line+station: 81,915 of 82,343

Code

# inspect expedition code format
d_zoo_match |>
  mutate(
    code_len = nchar(as.character(EXPED_CODE_pnt))
  ) |>
  count(code_len, name = "n") |>
  datatable(caption = "Expedition code lengths")

4.2 Cruise code reconciliation

Code

# cruise_key is YYMMKK (6 chars: 2-yr, 2-mo, 2-ship)
# EXPED_CODE_pnt appears to be YYMM (4 chars) or YYYYMM (6 chars)

# extract year-month from zoo expedition code
d_zoo_match <- d_zoo_match |>
  mutate(
    exped_str = as.character(EXPED_CODE_pnt),
    # handle both YYMM and YYYYMM formats
    zoo_ym = case_when(
      nchar(exped_str) == 4 ~ exped_str,
      nchar(exped_str) >= 6 ~ substr(exped_str, 3, 6),
      TRUE ~ NA_character_
    )
  )

# extract year-month from ichthyo cruise_key (first 4 chars = YYMM)
d_ich_match <- d_ich_tows |>
  mutate(
    ich_ym = substr(cruise_key, 1, 4)
  )

# show sample codes
d_zoo_match |>
  distinct(EXPED_CODE_pnt, exped_str, zoo_ym) |>
  slice_head(n = 20) |>
  datatable(caption = "Sample zoo expedition codes → YYMM")

Code

d_ich_match |>
  distinct(cruise_key, ich_ym) |>
  slice_head(n = 20) |>
  datatable(caption = "Sample ichthyo cruise_key → YYMM")

4.3 Match 1 — line + station only

Code

# count zoo records that have a matching ichthyo line+station
n_m1_zoo <- d_zoo_match |>
  semi_join(
    d_ich_match |> distinct(line, station),
    by = c("line", "station")
  ) |>
  nrow()

cat(
  glue(
    "Match 1 (line + station only):"
  ),
  "\n"
)

Match 1 (line + station only):

Code

cat(
  glue(
    "  Zoo records matched: {format(n_m1_zoo, big.mark = ',')} of ",
    "{format(nrow(d_zoo_match), big.mark = ',')} ",
    "({round(100 * n_m1_zoo / nrow(d_zoo_match), 1)}%)"
  ),
  "\n"
)

  Zoo records matched: 66,292 of 81,915 (80.9%)

4.4 Match 2 — line + station + cruise YYMM

Code

n_m2_zoo <- d_zoo_match |>
  semi_join(
    d_ich_match |> distinct(line, station, ich_ym),
    by = c("line", "station", "zoo_ym" = "ich_ym")
  ) |>
  nrow()

cat(
  glue(
    "Match 2 (line + station + cruise YYMM):"
  ),
  "\n"
)

Match 2 (line + station + cruise YYMM):

Code

cat(
  glue(
    "  Zoo records matched: {format(n_m2_zoo, big.mark = ',')} of ",
    "{format(nrow(d_zoo_match), big.mark = ',')} ",
    "({round(100 * n_m2_zoo / nrow(d_zoo_match), 1)}%)"
  ),
  "\n"
)

  Zoo records matched: 54,062 of 81,915 (66%)

Code

# unmatched records
d_m2_unmatched <- d_zoo_match |>
  anti_join(
    d_ich_match |> distinct(line, station, ich_ym),
    by = c("line", "station", "zoo_ym" = "ich_ym")
  )

cat(
  glue(
    "  Unmatched: {format(nrow(d_m2_unmatched), big.mark = ',')}"
  ),
  "\n"
)

  Unmatched: 27,853

4.5 Matched by year

Code

# tag each zoo record as matched or not via semi/anti join
d_m2_matched <- d_zoo_match |>
  semi_join(
    d_ich_match |> distinct(line, station, ich_ym),
    by = c("line", "station", "zoo_ym" = "ich_ym")
  ) |>
  mutate(matched = TRUE)

d_m2_unmatched_tagged <- d_m2_unmatched |>
  mutate(matched = FALSE)

d_m2_year <- bind_rows(d_m2_matched, d_m2_unmatched_tagged) |>
  count(year, matched) |>
  pivot_wider(
    names_from = matched,
    values_from = n,
    values_fill = 0
  ) |>
  rename(
    unmatched = `FALSE`,
    matched = `TRUE`
  ) |>
  mutate(
    total = matched + unmatched,
    pct = round(100 * matched / total, 1)
  )

d_m2_year |>
  datatable(caption = "Match 2 results by year")

5 Match by Space-Time Window

Adapted from prep_splot() in int-app/app/functions.R.

5.1 Create local DuckDB for matching

Code

# cc_get_db() creates read-only views on GCS parquet, so we need a
# separate in-memory DuckDB for creating temp tables with both datasets
con_match <- dbConnect(duckdb())

# install and load spatial extension
dbExecute(con_match, "INSTALL spatial; LOAD spatial;")

[1] 0

5.2 Register zooplankton data

Code

# prepare zoo data for registration
d_zoo_db <- d_zoo_cc |>
  filter(
    !is.na(LAT_DECIMAL_pnt),
    !is.na(LONG_DECIMAL_pnt),
    !is.na(datetime_start)
  ) |>
  transmute(
    zoo_row = row_number(),
    zoo_pk = ParentNetTowEventID_pk,
    zoo_uuid = occurenceID_UUID_pnt,
    zoo_lat = LAT_DECIMAL_pnt,
    zoo_lon = LONG_DECIMAL_pnt,
    zoo_dt = datetime_start,
    zoo_line = as.numeric(STATION_LINE_pnt),
    zoo_stn = as.numeric(STATION_NUMBER_pnt),
    zoo_ym = case_when(
      nchar(as.character(EXPED_CODE_pnt)) == 4 ~
        as.character(EXPED_CODE_pnt),
      nchar(as.character(EXPED_CODE_pnt)) >= 6 ~
        substr(as.character(EXPED_CODE_pnt), 3, 6),
      TRUE ~ NA_character_
    )
  )

dbWriteTable(con_match, "zoo_pnt", d_zoo_db, overwrite = TRUE)
cat(
  glue(
    "Registered {format(nrow(d_zoo_db), big.mark = ',')} zoo records"
  ),
  "\n"
)

Registered 82,242 zoo records

5.3 Register ichthyoplankton data

Code

# prepare ichthyo tow data
d_ich_db <- d_ich_tows |>
  filter(
    !is.na(latitude),
    !is.na(longitude),
    !is.na(time_start)
  ) |>
  transmute(
    ich_row = row_number(),
    ich_lat = latitude,
    ich_lon = longitude,
    ich_dt = time_start,
    ich_line = line,
    ich_stn = station,
    ich_ym = substr(cruise_key, 1, 4),
    tow_id = tow_id
  )

dbWriteTable(con_match, "ich_tow", d_ich_db, overwrite = TRUE)
cat(
  glue(
    "Registered {format(nrow(d_ich_db), big.mark = ',')} ",
    "ichthyo tow records"
  ),
  "\n"
)

Registered 75,506 ichthyo tow records

5.4 Space-time match function

Code

match_spacetime <- function(
  con,
  max_hours = 6,
  max_meters = 2000
) {
  sql <- glue(
    "
    SELECT
      z.zoo_row,
      z.zoo_lat,
      z.zoo_lon,
      z.zoo_dt,
      i.ich_row,
      i.ich_lat,
      i.ich_lon,
      i.ich_dt,
      i.tow_id,
      ABS(
        EPOCH(z.zoo_dt - i.ich_dt)
      ) / 3600.0 AS time_diff_hrs,
      ST_Distance_Sphere(
        ST_Point(z.zoo_lon, z.zoo_lat),
        ST_Point(i.ich_lon, i.ich_lat)
      ) AS dist_m
    FROM zoo_pnt z
    JOIN ich_tow i
      ON i.ich_dt BETWEEN
           z.zoo_dt - INTERVAL '{max_hours}' HOUR AND
           z.zoo_dt + INTERVAL '{max_hours}' HOUR
    WHERE ST_Distance_Sphere(
            ST_Point(z.zoo_lon, z.zoo_lat),
            ST_Point(i.ich_lon, i.ich_lat)
          ) <= {max_meters}"
  )

  dbGetQuery(con, sql)
}

5.5 Parameter sweep

Code

# try combinations of time and distance thresholds
params <- expand_grid(
  max_hours = c(6, 12, 24),
  max_meters = c(2000, 5000)
)

sweep_results <- map_dfr(seq_len(nrow(params)), function(i) {
  h <- params$max_hours[i]
  m <- params$max_meters[i]

  cat(glue("  Matching: {h}h x {m}m ..."), "\n")
  d <- match_spacetime(con_match, max_hours = h, max_meters = m)

  tibble(
    max_hours = h,
    max_meters = m,
    n_pairs = nrow(d),
    n_zoo_matched = n_distinct(d$zoo_row),
    n_ich_matched = n_distinct(d$ich_row),
    pct_zoo_matched = round(
      100 * n_distinct(d$zoo_row) / nrow(d_zoo_db),
      1
    ),
    median_dist_m = round(median(d$dist_m), 0),
    median_time_hrs = round(median(d$time_diff_hrs), 1)
  )
})

  Matching: 6h x 2000m ... 
  Matching: 6h x 5000m ... 
  Matching: 12h x 2000m ... 
  Matching: 12h x 5000m ... 
  Matching: 24h x 2000m ... 
  Matching: 24h x 5000m ...

Code

sweep_results |>
  datatable(caption = "Space-time parameter sweep results")

5.6 Best match: nearest by time

Code

# use moderate thresholds for detailed matching
d_st <- match_spacetime(
  con_match,
  max_hours = 12,
  max_meters = 5000
)

# nearest by time for each zoo record
d_nearest_time <- d_st |>
  group_by(zoo_row) |>
  slice_min(time_diff_hrs, n = 1, with_ties = FALSE) |>
  ungroup()

cat(
  glue(
    "Nearest-time matches: ",
    "{format(nrow(d_nearest_time), big.mark = ',')}"
  ),
  "\n"
)

Nearest-time matches: 74,254

Code

cat(
  glue(
    "  Median dist: ",
    "{round(median(d_nearest_time$dist_m), 0)} m"
  ),
  "\n"
)

  Median dist: 423 m

Code

cat(
  glue(
    "  Median time diff: ",
    "{round(median(d_nearest_time$time_diff_hrs), 1)} hrs"
  ),
  "\n"
)

  Median time diff: 0.3 hrs

5.7 Best match: nearest by distance

Code

# nearest by distance for each zoo record
d_nearest_dist <- d_st |>
  group_by(zoo_row) |>
  slice_min(dist_m, n = 1, with_ties = FALSE) |>
  ungroup()

cat(
  glue(
    "Nearest-distance matches: ",
    "{format(nrow(d_nearest_dist), big.mark = ',')}"
  ),
  "\n"
)

Nearest-distance matches: 74,254

Code

cat(
  glue(
    "  Median dist: ",
    "{round(median(d_nearest_dist$dist_m), 0)} m"
  ),
  "\n"
)

  Median dist: 420 m

Code

cat(
  glue(
    "  Median time diff: ",
    "{round(median(d_nearest_dist$time_diff_hrs), 1)} hrs"
  ),
  "\n"
)

  Median time diff: 0.5 hrs

5.8 Map matched pairs

Code

# create lines connecting zoo → ichthyo for nearest-distance matches
# sample up to 200 pairs for visualization
d_map <- d_nearest_dist |>
  slice_sample(n = min(200, nrow(d_nearest_dist)))

# build sf lines
lines <- map(seq_len(nrow(d_map)), function(i) {
  st_linestring(matrix(
    c(d_map$zoo_lon[i], d_map$zoo_lat[i], d_map$ich_lon[i], d_map$ich_lat[i]),
    ncol = 2,
    byrow = TRUE
  ))
})

sf_pairs <- st_sf(
  dist_m = d_map$dist_m,
  time_diff_hrs = round(d_map$time_diff_hrs, 1),
  geometry = st_sfc(lines, crs = 4326)
)

# use leaflet for lighter HTML
pal <- colorNumeric("YlOrRd", sf_pairs$dist_m)
leaflet(sf_pairs) |>
  addProviderTiles("CartoDB.Positron") |>
  addPolylines(
    color = ~ pal(dist_m),
    weight = 1.5,
    opacity = 0.7,
    popup = ~ paste0(
      "Distance: ",
      round(dist_m),
      " m",
      "<br>Time diff: ",
      time_diff_hrs,
      " hrs"
    )
  ) |>
  addLegend(
    pal = pal,
    values = ~dist_m,
    title = "Distance (m)"
  )

6 Comparison & Summary

6.1 Summary table: all methods

Code

n_zoo_total <- nrow(d_zoo_match)

# space-time nearest time
n_st_time <- nrow(d_nearest_time)

# space-time nearest distance
n_st_dist <- nrow(d_nearest_dist)

d_summary <- tibble(
  method = c(
    "Station (line + station)",
    "Station + cruise YYMM",
    "Space-time 12h/5km nearest-time",
    "Space-time 12h/5km nearest-dist"
  ),
  zoo_matched = c(
    n_m1_zoo,
    n_m2_zoo,
    n_st_time,
    n_st_dist
  ),
  zoo_total = n_zoo_total,
  pct_matched = round(
    100 * zoo_matched / zoo_total,
    1
  )
)

d_summary |>
  datatable(caption = "Matching method comparison (CalCOFI subset)")

6.2 Gap analysis: unmatched by any method

Code

# zoo records not matched by space-time (nearest-dist)
zoo_matched_st <- d_nearest_dist$zoo_row
d_unmatched_st <- d_zoo_db |>
  filter(!zoo_row %in% zoo_matched_st)

cat(
  glue(
    "Unmatched by space-time (12h/5km): ",
    "{format(nrow(d_unmatched_st), big.mark = ',')} zoo records"
  ),
  "\n"
)

Unmatched by space-time (12h/5km): 7,988 zoo records

Code

# gap by year for station+cruise method
d_gap_m2 <- d_m2_unmatched |>
  count(year, name = "n_unmatched") |>
  arrange(year)

d_gap_m2 |>
  datatable(
    caption = "Unmatched zoo records by year (station+cruise method)"
  )

6.3 Recommendations

Code

cat(
  "
### Key findings

- **Station matching** (line + station) gives the broadest coverage
  but includes cross-cruise matches that may not be meaningful.
- **Station + cruise YYMM** is the most precise tabular match and
  should be the primary method for linking records.
- **Space-time matching** captures records where station codes differ
  but the tows were physically close in space and time. This is
  useful for non-standard cruises or slight station numbering
  differences.

### Recommended next steps

1. Use **station + cruise YYMM** as the primary matching key for the
   CalCOFI subset.
2. Use **space-time nearest-distance** (12h / 5km) as a supplementary
   method to catch records missed by station matching.
3. Expand to the full dataset (all expedition types, not just CalCOFI)
   as a second priority.
4. Investigate unmatched records — are they from years where ichthyo
   data is missing, or from non-standard station patterns?
5. Consider building a persistent cross-reference table linking
   zoo_row to tow_id for downstream analysis.
"
)

6.4 Key findings

Station matching (line + station) gives the broadest coverage but includes cross-cruise matches that may not be meaningful.
Station + cruise YYMM is the most precise tabular match and should be the primary method for linking records.
Space-time matching captures records where station codes differ but the tows were physically close in space and time. This is useful for non-standard cruises or slight station numbering differences.

6.5 Recommended next steps

Use station + cruise YYMM as the primary matching key for the CalCOFI subset.
Use space-time nearest-distance (12h / 5km) as a supplementary method to catch records missed by station matching.
Expand to the full dataset (all expedition types, not just CalCOFI) as a second priority.
Investigate unmatched records — are they from years where ichthyo data is missing, or from non-standard station patterns?
Consider building a persistent cross-reference table linking zoo_row to tow_id for downstream analysis.

7 Cleanup

Code

dbDisconnect(con_match, shutdown = TRUE)
cat("Match database closed\n")

Match database closed

Session Info

Code

devtools::session_info()

─ Session info ───────────────────────────────────────────────────────────────
 setting  value
 version  R version 4.5.2 (2025-10-31)
 os       macOS Sequoia 15.7.1
 system   aarch64, darwin20
 ui       X11
 language (EN)
 collate  en_US.UTF-8
 ctype    en_US.UTF-8
 tz       America/Mexico_City
 date     2026-02-10
 pandoc   3.8.3 @ /opt/homebrew/bin/ (via rmarkdown)
 quarto   1.8.25 @ /usr/local/bin/quarto

─ Packages ───────────────────────────────────────────────────────────────────
 package           * version  date (UTC) lib source
 abind               1.4-8    2024-09-12 [1] CRAN (R 4.5.0)
 assertthat          0.2.1    2019-03-21 [1] CRAN (R 4.5.0)
 backports           1.5.0    2024-05-23 [1] CRAN (R 4.5.0)
 base64enc           0.1-6    2026-02-02 [1] CRAN (R 4.5.2)
 bit                 4.6.0    2025-03-06 [1] CRAN (R 4.5.0)
 bit64               4.6.0-1  2025-01-16 [1] CRAN (R 4.5.0)
 blob                1.3.0    2026-01-14 [1] CRAN (R 4.5.2)
 broom               1.0.10   2025-09-13 [1] CRAN (R 4.5.0)
 bslib               0.10.0   2026-01-26 [1] CRAN (R 4.5.2)
 cachem              1.1.0    2024-05-16 [1] CRAN (R 4.5.0)
 calcofi4r         * 1.1.3    2026-02-09 [1] Github (calcofi/calcofi4r@304d271)
 cellranger          1.1.0    2016-07-27 [1] CRAN (R 4.5.0)
 class               7.3-23   2025-01-01 [1] CRAN (R 4.5.2)
 classInt            0.4-11   2025-01-08 [1] CRAN (R 4.5.0)
 cli                 3.6.5    2025-04-23 [1] CRAN (R 4.5.0)
 codetools           0.2-20   2024-03-31 [1] CRAN (R 4.5.2)
 crayon              1.5.3    2024-06-20 [1] CRAN (R 4.5.0)
 crosstalk           1.2.2    2025-08-26 [1] CRAN (R 4.5.0)
 curl                7.0.0    2025-08-19 [1] CRAN (R 4.5.0)
 data.table          1.18.2.1 2026-01-27 [1] CRAN (R 4.5.2)
 DBI               * 1.2.3    2024-06-02 [1] CRAN (R 4.5.0)
 dbplyr              2.5.1    2025-09-10 [1] CRAN (R 4.5.0)
 devtools            2.4.6    2025-10-03 [1] CRAN (R 4.5.0)
 digest              0.6.39   2025-11-19 [1] CRAN (R 4.5.2)
 dplyr             * 1.2.0    2026-02-03 [1] CRAN (R 4.5.2)
 DT                * 0.34.0   2025-09-02 [1] CRAN (R 4.5.0)
 duckdb            * 1.4.4    2026-01-28 [1] CRAN (R 4.5.2)
 dygraphs            1.1.1.6  2018-07-11 [1] CRAN (R 4.5.0)
 e1071               1.7-17   2025-12-18 [1] CRAN (R 4.5.2)
 ellipsis            0.3.2    2021-04-29 [1] CRAN (R 4.5.0)
 evaluate            1.0.5    2025-08-27 [1] CRAN (R 4.5.0)
 farver              2.1.2    2024-05-13 [1] CRAN (R 4.5.0)
 fastmap             1.2.0    2024-05-15 [1] CRAN (R 4.5.0)
 fs                  1.6.6    2025-04-12 [1] CRAN (R 4.5.0)
 fuzzyjoin           0.1.6.1  2025-07-10 [1] CRAN (R 4.5.0)
 generics            0.1.4    2025-05-09 [1] CRAN (R 4.5.0)
 geojsonsf           2.0.5    2025-11-26 [1] CRAN (R 4.5.2)
 ggplot2             4.0.2    2026-02-03 [1] CRAN (R 4.5.2)
 glue              * 1.8.0    2024-09-30 [1] CRAN (R 4.5.0)
 gtable              0.3.6    2024-10-25 [1] CRAN (R 4.5.0)
 here              * 1.0.2    2025-09-15 [1] CRAN (R 4.5.0)
 highcharter         0.9.4    2022-01-03 [1] CRAN (R 4.5.0)
 hms                 1.1.4    2025-10-17 [1] CRAN (R 4.5.0)
 htmltools           0.5.9    2025-12-04 [1] CRAN (R 4.5.2)
 htmlwidgets         1.6.4    2023-12-06 [1] CRAN (R 4.5.0)
 httpuv              1.6.16   2025-04-16 [1] CRAN (R 4.5.0)
 httr                1.4.7    2023-08-15 [1] CRAN (R 4.5.0)
 httr2               1.2.2    2025-12-08 [1] CRAN (R 4.5.2)
 igraph              2.2.1    2025-10-27 [1] CRAN (R 4.5.0)
 isoband             0.3.0    2025-12-07 [1] CRAN (R 4.5.2)
 jquerylib           0.1.4    2021-04-26 [1] CRAN (R 4.5.0)
 jsonlite            2.0.0    2025-03-27 [1] CRAN (R 4.5.0)
 KernSmooth          2.23-26  2025-01-01 [1] CRAN (R 4.5.2)
 knitr               1.51     2025-12-20 [1] CRAN (R 4.5.2)
 later               1.4.5    2026-01-08 [1] CRAN (R 4.5.2)
 lattice             0.22-7   2025-04-02 [1] CRAN (R 4.5.2)
 lazyeval            0.2.2    2019-03-15 [1] CRAN (R 4.5.0)
 leafem              0.2.5    2025-08-28 [1] CRAN (R 4.5.0)
 leaflet           * 2.2.3    2025-09-04 [1] CRAN (R 4.5.0)
 leaflet.providers   2.0.0    2023-10-17 [1] CRAN (R 4.5.0)
 librarian           1.8.1    2021-07-12 [1] CRAN (R 4.5.0)
 lifecycle           1.0.5    2026-01-08 [1] CRAN (R 4.5.2)
 lubridate         * 1.9.5    2026-02-04 [1] CRAN (R 4.5.2)
 magrittr            2.0.4    2025-09-12 [1] CRAN (R 4.5.0)
 mapgl               0.4.4    2026-01-12 [1] CRAN (R 4.5.2)
 mapview           * 2.11.4   2025-09-08 [1] CRAN (R 4.5.0)
 markdown            2.0      2025-03-23 [1] CRAN (R 4.5.0)
 Matrix              1.7-4    2025-08-28 [1] CRAN (R 4.5.2)
 memoise             2.0.1    2021-11-26 [1] CRAN (R 4.5.0)
 mgcv                1.9-3    2025-04-04 [1] CRAN (R 4.5.2)
 mime                0.13     2025-03-17 [1] CRAN (R 4.5.0)
 nlme                3.1-168  2025-03-31 [1] CRAN (R 4.5.2)
 otel                0.2.0    2025-08-29 [1] CRAN (R 4.5.0)
 pillar              1.11.1   2025-09-17 [1] CRAN (R 4.5.0)
 pkgbuild            1.4.8    2025-05-26 [1] CRAN (R 4.5.0)
 pkgconfig           2.0.3    2019-09-22 [1] CRAN (R 4.5.0)
 pkgload             1.4.1    2025-09-23 [1] CRAN (R 4.5.0)
 plotly              4.12.0   2026-01-24 [1] CRAN (R 4.5.2)
 png                 0.1-8    2022-11-29 [1] CRAN (R 4.5.0)
 promises            1.5.0    2025-11-01 [1] CRAN (R 4.5.0)
 proxy               0.4-29   2025-12-29 [1] CRAN (R 4.5.2)
 purrr             * 1.2.1    2026-01-09 [1] CRAN (R 4.5.2)
 quantmod            0.4.28   2025-06-19 [1] CRAN (R 4.5.0)
 R6                  2.6.1    2025-02-15 [1] CRAN (R 4.5.0)
 rappdirs            0.3.4    2026-01-17 [1] CRAN (R 4.5.2)
 raster              3.6-32   2025-03-28 [1] CRAN (R 4.5.0)
 RColorBrewer        1.1-3    2022-04-03 [1] CRAN (R 4.5.0)
 Rcpp                1.1.1    2026-01-10 [1] CRAN (R 4.5.2)
 readr             * 2.1.6    2025-11-14 [1] CRAN (R 4.5.2)
 readxl            * 1.4.5    2025-03-07 [1] CRAN (R 4.5.0)
 remotes             2.5.0    2024-03-17 [1] CRAN (R 4.5.0)
 rlang               1.1.7    2026-01-09 [1] CRAN (R 4.5.2)
 rlist               0.4.6.2  2021-09-03 [1] CRAN (R 4.5.0)
 rmarkdown           2.30     2025-09-28 [1] CRAN (R 4.5.0)
 RPostgres           1.4.8    2025-02-25 [1] CRAN (R 4.5.0)
 rprojroot           2.1.1    2025-08-26 [1] CRAN (R 4.5.0)
 rstudioapi          0.18.0   2026-01-16 [1] CRAN (R 4.5.2)
 S7                  0.2.1    2025-11-14 [1] CRAN (R 4.5.2)
 sass                0.4.10   2025-04-11 [1] CRAN (R 4.5.0)
 satellite           1.0.6    2025-08-21 [1] CRAN (R 4.5.0)
 scales              1.4.0    2025-04-24 [1] CRAN (R 4.5.0)
 sessioninfo         1.2.3    2025-02-05 [1] CRAN (R 4.5.0)
 sf                * 1.0-24   2026-01-13 [1] CRAN (R 4.5.2)
 shiny               1.11.1   2025-07-03 [1] CRAN (R 4.5.0)
 shinyWidgets        0.9.0    2025-02-21 [1] CRAN (R 4.5.0)
 sp                  2.2-0    2025-02-01 [1] CRAN (R 4.5.0)
 stars               0.7-0    2025-12-14 [1] CRAN (R 4.5.2)
 stringi             1.8.7    2025-03-27 [1] CRAN (R 4.5.0)
 stringr           * 1.6.0    2025-11-04 [1] CRAN (R 4.5.0)
 terra               1.8-93   2026-01-12 [1] CRAN (R 4.5.2)
 tibble            * 3.3.1    2026-01-11 [1] CRAN (R 4.5.2)
 tidyr             * 1.3.2    2025-12-19 [1] CRAN (R 4.5.2)
 tidyselect          1.2.1    2024-03-11 [1] CRAN (R 4.5.0)
 timechange          0.4.0    2026-01-29 [1] CRAN (R 4.5.2)
 TTR                 0.24.4   2023-11-28 [1] CRAN (R 4.5.0)
 tzdb                0.5.0    2025-03-15 [1] CRAN (R 4.5.0)
 units               1.0-0    2025-10-09 [1] CRAN (R 4.5.0)
 usethis             3.2.1    2025-09-06 [1] CRAN (R 4.5.0)
 vctrs               0.7.1    2026-01-23 [1] CRAN (R 4.5.2)
 viridisLite         0.4.3    2026-02-04 [1] CRAN (R 4.5.2)
 vroom               1.7.0    2026-01-27 [1] CRAN (R 4.5.2)
 withr               3.0.2    2024-10-28 [1] CRAN (R 4.5.0)
 xfun                0.56     2026-01-18 [1] CRAN (R 4.5.2)
 xtable              1.8-4    2019-04-21 [1] CRAN (R 4.5.0)
 xts                 0.14.1   2024-10-15 [1] CRAN (R 4.5.0)
 yaml                2.3.12   2025-12-10 [1] CRAN (R 4.5.2)
 zoo                 1.8-15   2025-12-15 [1] CRAN (R 4.5.2)

 [1] /Library/Frameworks/R.framework/Versions/4.5-arm64/Resources/library
 * ── Packages attached to the search path.

──────────────────────────────────────────────────────────────────────────────