Wraps the Python catpol.summarize() function. Generates summaries from
policy text or from a registered political data source. Adds a tone
parameter for policy-specific framing.
Usage
summarize(
  input_data = NULL,
  source = NULL,
  doc_type = NULL,
  since = NULL,
  until = NULL,
  n = NULL,
  format = "paragraph",
  tone = "eli5",
  api_key = NULL,
  description = "",
  instructions = "",
  max_length = NULL,
  focus = NULL,
  user_model = "gpt-4o",
  model_source = "auto",
  mode = "image",
  input_mode = NULL,
  input_type = "auto",
  pdf_dpi = 150L,
  creativity = NULL,
  thinking_budget = 0L,
  chain_of_thought = TRUE,
  context_prompt = FALSE,
  step_back_prompt = FALSE,
  filename = NULL,
  save_directory = NULL,
  models = NULL,
  max_workers = NULL,
  parallel = NULL,
  auto_download = FALSE,
  safety = FALSE,
  max_retries = 5L,
  batch_retries = 2L,
  retry_delay = 1,
  row_delay = 0,
  fail_strategy = "partial",
  batch_mode = FALSE,
  batch_poll_interval = 30,
  batch_timeout = 86400
)
Arguments
- input_data
  A character vector, list, or PDF/URL paths; NULL to fetch from a registered source.
- source
  Character or NULL. Registered source name.
- doc_type
  Character or NULL. Filter source by document type.
- since
  Character or NULL. Earliest source row date (YYYY-MM-DD).
- until
  Character or NULL. Latest source row date (YYYY-MM-DD).
- n
  Integer or NULL. Max number of source rows.
- format
  Character. Output format. Default "paragraph".
- tone
  Character. Policy-specific tone, e.g. "eli5", "neutral", "academic". Default "eli5".
- api_key
  Character or NULL. API key for the LLM provider.
- description
  Character. Default "".
- instructions
  Character. Specific instructions for the summary. Default "".
- max_length
  Integer or NULL. Default NULL.
- focus
  Character or NULL. Default NULL.
- user_model
  Character. Default "gpt-4o".
- model_source
  Character. Default "auto".
- mode
  Character. Default "image".
- input_mode
  Character or NULL. Default NULL.
- input_type
  Character. Default "auto".
- pdf_dpi
  Integer. Default 150L.
- creativity
  Numeric or NULL. Default NULL.
- thinking_budget
  Integer. Default 0L.
- chain_of_thought
  Logical. Default TRUE.
- context_prompt
  Logical. Default FALSE.
- step_back_prompt
  Logical. Default FALSE.
- filename
  Character or NULL.
- save_directory
  Character or NULL.
- models
  List of model specs for ensemble mode. Default NULL.
- max_workers
  Integer or NULL. Default NULL.
- parallel
  Logical or NULL. Default NULL.
- auto_download
  Logical. Default FALSE.
- safety
  Logical. Default FALSE.
- max_retries
  Integer. Default 5L.
- batch_retries
  Integer. Default 2L.
- retry_delay
  Numeric. Default 1.0.
- row_delay
  Numeric. Default 0.0.
- fail_strategy
  Character. Default "partial".
- batch_mode
  Logical. Default FALSE.
- batch_poll_interval
  Numeric. Default 30.0.
- batch_timeout
  Numeric. Default 86400.0.
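The since and until arguments expect dates written as YYYY-MM-DD. A minimal base R sketch of checking that format before making a call is shown here. The helper is_iso_date() is hypothetical and not part of this package, and whether summarize() performs its own validation is not stated in this page.

```r
# Hypothetical helper (not part of the package): TRUE when x is NULL or a
# single "YYYY-MM-DD" string that names a real calendar date.
is_iso_date <- function(x) {
  if (is.null(x)) return(TRUE)
  if (!is.character(x) || length(x) != 1L || is.na(x)) return(FALSE)
  # Shape check first: four digits, two digits, two digits.
  if (!grepl("^[0-9]{4}-[0-9]{2}-[0-9]{2}$", x)) return(FALSE)
  # as.Date() with an explicit format yields NA for impossible dates.
  !is.na(as.Date(x, format = "%Y-%m-%d"))
}

since <- "2025-01-01"
until <- "2025-03-31"

# Stop early with a clear error instead of sending a malformed filter.
stopifnot(is_iso_date(since), is_iso_date(until))
```

Checking the string shape before parsing keeps inputs such as "01/01/2025" or a bare number from being silently reinterpreted.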
Examples
if (FALSE) { # \dontrun{
results <- summarize(
  source = "federal_executive_orders",
  since = "2025-01-01",
  n = 20L,
  format = "paragraph",
  tone = "eli5",
  api_key = Sys.getenv("OPENAI_API_KEY"),
  user_model = "gpt-4o-mini"
)
} # }
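
The arguments list also describes a second input path: passing text directly through input_data instead of fetching from a registered source. The sketch below illustrates that path under a few assumptions. The sample sentences are invented, the OPENAI_API_KEY environment variable is carried over from the example above, and the structure of the returned object is not documented in this page, so it is not inspected.

```r
# Illustrative input only; these sentences are invented for the example.
policy_text <- c(
  "The agency shall publish updated emissions guidance within 90 days.",
  "Grants under this section are limited to rural school districts."
)

if (FALSE) { # not run: requires the package and a valid API key
  results <- summarize(
    input_data = policy_text,
    format = "paragraph",
    tone = "neutral",
    api_key = Sys.getenv("OPENAI_API_KEY"),
    user_model = "gpt-4o-mini"
  )
}
```

Leaving source, doc_type, since, until, and n at their NULL defaults seems the natural choice here, since those arguments are described as applying to source rows.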