Roadmap

Implemented MVP

Classic xref parsing with incremental update chain support (follows Prev pointers)
Page tree traversal
Content parsing for common text, path, image, clipping, color, graphics-state, and marked-content operators (including inline images and dictionary operands)
Simple-font text extraction and search geometry (including fonts set via the ExtGState gs operator)
Type0 / Identity-H composite font extraction, search, and redaction when ToUnicode is available
Geometry target normalization for rects, quads, and quad groups
Three redaction modes: strip (remove bytes), redact (blank space + overlay), erase (blank space, no overlay)
Tighter glyph bounding boxes (80% em-square height) to reduce adjacent-line false positives
True redaction for a constrained subset of PDFs
Deterministic full-save rewrite
WASM bindings and a browser demo
Demo UI with zoom controls, collapsible pages, search-driven redaction, and in-app error reporting
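
The three redaction modes listed above differ only in what happens to the targeted bytes and whether an overlay is drawn. The following is a minimal sketch of that distinction, not the project's actual implementation: the names `Mode`, `Outcome`, and `apply_mode` are hypothetical, and "blank space" is assumed here to mean equal-length ASCII spaces, which may not match how the real rewriter blanks glyphs.

```rust
use std::ops::Range;

// Hypothetical names throughout: `Mode`, `Outcome`, and `apply_mode` are
// illustrative only and are not taken from the project's API.

/// The three redaction modes named in the roadmap.
#[derive(Debug, Clone, Copy, PartialEq, Eq)]
pub enum Mode {
    /// Remove the targeted bytes entirely.
    Strip,
    /// Blank the targeted bytes and request an overlay.
    Redact,
    /// Blank the targeted bytes without an overlay.
    Erase,
}

/// Result of applying a mode to one text run.
#[derive(Debug, PartialEq, Eq)]
pub struct Outcome {
    /// The rewritten bytes of the run.
    pub bytes: Vec<u8>,
    /// Whether the caller should draw an overlay over the target area.
    pub overlay: bool,
}

/// Applies `mode` to the byte range `target` inside `run`.
///
/// Assumption: blanking is modelled as ASCII spaces of the same length.
/// Returns `None` when the range is inverted or out of bounds.
pub fn apply_mode(run: &[u8], target: Range<usize>, mode: Mode) -> Option<Outcome> {
    if target.start > target.end || target.end > run.len() {
        return None;
    }
    let mut bytes = Vec::with_capacity(run.len());
    // Bytes before the target are always kept.
    bytes.extend_from_slice(&run[..target.start]);
    // Strip drops the target; Redact and Erase keep its width as spaces.
    if mode != Mode::Strip {
        bytes.extend(std::iter::repeat(b' ').take(target.len()));
    }
    // Bytes after the target are always kept.
    bytes.extend_from_slice(&run[target.end..]);
    Some(Outcome {
        bytes,
        overlay: mode == Mode::Redact,
    })
}
```

Under these assumptions, calling `apply_mode(b"name: Alice", 6..11, Mode::Strip)` shortens the run, while `Mode::Redact` and `Mode::Erase` preserve its length and differ only in the `overlay` flag.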

Next priorities

Broader CID and composite font support beyond Identity-H + ToUnicode
Form XObject traversal and redaction
Better vector-path bounds
Partial image rewriting
Optional-content and hidden-layer sanitization
Overlay text stamping
Incremental-save preservation (reading incremental updates is supported; output is currently always a full rewrite)

When one of these priorities lands, the following docs should be updated in the same change: