A small agent/service that converts PDF files to structured JSON (metadata, pages, simple headings, tables) via CLI or HTTP API. Headings are detected heuristically by relatively large font sizes and ...