paper_firehose.core

Modules

abstract_source

Abstract source interface using Python Protocol for structural subtyping.

apis

API client modules for fetching abstracts from various sources.

command_context

Command context for shared initialization across CLI commands.

command_utils

Shared utilities for command implementations.

config

Configuration management for YAML-based config files.

database

Database management for the three-database approach:

- all_feed_entries.db: All RSS entries for deduplication
- matched_entries_history.db: Historical matches across all topics
- papers.db: Current run processing data

doi_utils

Unified DOI extraction utilities.

exit_codes

Process exit codes for the Paper Firehose CLI.

http_client

Shared HTTP client with retry logic and rate limiting.

model_manager

Model management utilities for Sentence-Transformers models.

paths

Utilities for locating runtime data and built-in system assets.

text_utils

Shared text processing utilities.
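
The abstract_source entry above refers to Python's Protocol for structural subtyping. The following is a minimal sketch of that general technique only, not the module's actual interface: the names AbstractSource, DummySource, fetch_abstract, and first_abstract are hypothetical and chosen purely for illustration.

```python
from typing import List, Optional, Protocol, runtime_checkable


@runtime_checkable
class AbstractSource(Protocol):
    """Hypothetical interface: any object with a matching fetch_abstract method."""

    def fetch_abstract(self, doi: str) -> Optional[str]:
        ...


class DummySource:
    """Does not inherit from AbstractSource; it only matches its shape."""

    def fetch_abstract(self, doi: str) -> Optional[str]:
        # Assumption: an empty DOI means "nothing to look up".
        return f"Abstract for {doi}" if doi else None


def first_abstract(sources: List[AbstractSource], doi: str) -> Optional[str]:
    """Return the first non-None abstract produced by any source."""
    for source in sources:
        text = source.fetch_abstract(doi)
        if text is not None:
            return text
    return None
```

Because the check is structural, DummySource satisfies the protocol without subclassing it. Note that runtime_checkable makes isinstance verify only that the method exists, not that its signature matches.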