extract_pdf_data {Tivy}R Documentation

Extract data from PDF announcements

Description

Processes PDF files containing official fishing announcements and extracts relevant information such as dates, coordinates, and nautical miles. Handles both local files and URLs.

Usage

extract_pdf_data(
  pdf_sources = NULL,
  temp_dir = NULL,
  verbose = TRUE,
  max_retries = 3
)

Arguments

pdf_sources

Character vector of PDF file paths or URLs.

temp_dir

Temporary directory for downloaded files. If NULL, uses tempdir().

verbose

Show processing messages.

max_retries

Maximum download retries for URLs.

Value

Data frame with extracted announcement information including coordinates, dates, and nautical mile distances.

Examples

## Not run: 
pdf_files <- c("announcement1.pdf", "announcement2.pdf")
results <- extract_pdf_data(pdf_sources = pdf_files)

pdf_urls <- c(
  "https://example.com/announcement1.pdf",
  "https://example.com/announcement2.pdf"
)
results <- extract_pdf_data(pdf_sources = pdf_urls)

## End(Not run)


[Package Tivy version 0.1.0 Index]