codex-pdf
Structured PDF extraction API that turns complex files into consistent JSON.
Open-source benchmark kit for PDF preflight engines. AssayPDF generates a deterministic GWG 2022 test corpus, runs it through callas pdfToolbox, Enfocus PitStop Server, and lintPDF under one harness, and produces reproducible accuracy reports.
AssayPDF gives the print industry the public, reproducible GWG 2022 benchmark it never had — a deterministic test corpus, a uniform harness, and per-rule scoring across the major preflight engines.
39 rules across 23 variants of the Ghent Workgroup 2022 specification, codified into a single test matrix. The first public corpus for the new spec.
~175 PDFs generated byte-identically from a seed: 23 positive baselines plus 152 negative failure-mode files, each targeting exactly one rule.
One CLI runs the same corpus through callas pdfToolbox, Enfocus PitStop Server, and lintPDF — so the only variable in the comparison is the engine.
Per rule, per variant, per engine. Surface where each preflight tool over-flags, under-flags, or gets it right — with the receipts to back the score.
Markdown and HTML accuracy reports rendered from the run artifacts. Diff two engine runs or two engine versions side by side, in CI or locally.
Vendor inputs are fetched from GWG canonical URLs with SHA-256 verification, and every generated PDF is validated against PDF/X-4 (ISO 15930-7) by verapdf before it enters the corpus.
MIT-licensed corpus, harness, and reports. The GWG 2015 Compliancy Test Suite is gated to vendor members — AssayPDF closes the gap for GWG 2022.
Open source · managed hosting
A toolkit of focused, standalone PDF utilities — extraction, preflight, viewing, assembly, imposition planning, and an asset store. Each one plugs into the prepress workflow you already run. Use the open source yourself, or let us host any single tool for you on work.withsynergy.io.
Structured PDF extraction API that turns complex files into consistent JSON.
Programmatic PDF assembly — a deterministic API build step for rewriting and generating print-ready PDFs.
Detection-only PDF preflight engine — 500+ checks plus the PDF/X-4 conformance suite.
Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.
PDF assay and metadata reporting — surface what's actually inside the file.
WYSIWYG canvas editor for label and packaging artwork — PDF/X-4 output, flexo support, and a full create-to-RIP workflow.
Stateless imposition-planning solver — step-and-repeat, gang, and true-shape nesting.
Content-addressed digital-asset plane — versioned blobs, a presigned data plane, and on-prem agent recall.