Scrapes the EPA AirData website for links to downloadable ZIP files, parses metadata from each filename (such as year, unit of analysis, and analyte), and returns the results as a tidy tibble.

Usage

get_epa_airdata_zip_links(archive = FALSE, archive_id = "20250126115248")

Arguments

archive

Logical. If TRUE, the function retrieves data from the Wayback Machine (Internet Archive) rather than the live EPA AirData website. Defaults to FALSE.

archive_id

Character. The Wayback Machine timestamp (14 digits, YYYYMMDDhhmmss) identifying which archived snapshot of the EPA AirData website to use. Only used if archive = TRUE. Defaults to "20250126115248".
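
As a rough illustration of how archive and archive_id could combine into the page address that gets scraped, the sketch below builds a Wayback Machine URL from a timestamp. The helper build_airdata_url() is hypothetical and not part of this package, and the live EPA URL it uses is an assumption, since this page does not state which address the function requests.

```r
# Hypothetical helper, not part of the package: returns the page URL to scrape.
# ASSUMPTION: the live download page address below is not given in this
# documentation and is used here for illustration only.
build_airdata_url <- function(archive = FALSE, archive_id = "20250126115248") {
  live_url <- "https://aqs.epa.gov/aqsweb/airdata/download_files.html"
  if (!archive) {
    return(live_url)
  }
  # Wayback Machine snapshots are addressed as /web/<timestamp>/<original URL>
  paste0("https://web.archive.org/web/", archive_id, "/", live_url)
}

live_page     <- build_airdata_url()
archived_page <- build_airdata_url(archive = TRUE)

# The default archive_id decodes to a UTC date-time
snapshot_time <- as.POSIXct("20250126115248", format = "%Y%m%d%H%M%S", tz = "UTC")
```

Pinning archive_id to a fixed snapshot keeps results reproducible even if the live page changes or becomes unavailable.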

Value

A tibble describing the available ZIP files, with the metadata extracted from their filenames (such as year, unit of analysis, and analyte).
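
To give a feel for the kind of filename parsing described above, here is a minimal base R sketch. It is not the package's implementation: the function parse_airdata_filename(), the column names, and the example filenames are all assumptions made for illustration, and it returns a plain data frame rather than a tibble to stay dependency-free.

```r
# Illustrative filenames following a <unit>_<analyte>_<year>.zip pattern.
# ASSUMPTION: actual link names on the site may differ.
files <- c(
  "daily_44201_2020.zip",
  "hourly_88101_2021.zip",
  "annual_conc_by_monitor_2019.zip"
)

# Hypothetical parser, not part of the package
parse_airdata_filename <- function(x) {
  stem <- sub("\\.zip$", "", basename(x))
  # Trailing four digits are taken as the year
  year <- as.integer(sub("^.*_([0-9]{4})$", "\\1", stem))
  # Leading token (before the first underscore) is taken as the unit of analysis
  unit <- sub("_.*$", "", stem)
  # Everything between the unit and the year is taken as the analyte label
  analyte <- sub("^[^_]+_(.*)_[0-9]{4}$", "\\1", stem)
  data.frame(
    file = x, year = year, unit = unit, analyte = analyte,
    stringsAsFactors = FALSE
  )
}

parsed <- parse_airdata_filename(files)
```

A result shaped like this can be filtered with ordinary subsetting, for example parsed[parsed$year == 2020, ], before deciding which ZIP files to download.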