codex-pdf
Structured PDF extraction API that turns complex files into consistent JSON.
Trove versions, organizes, and serves every artwork, die, font, profile, and proof in one content-addressed store: a single source of assets for the prepress tools you already run. API-first, dedup by default, hosted or self-hosted.
AGPL-3.0 · content-addressed dedup · immutable versions · feeds your prepress tools
How it works
Trove keeps every asset content-addressed and immutable, then serves it to your prepress tools by hash — so extraction, preflight, imposition, and production all read the exact same bytes.
Push artwork, dies, fonts, ICC profiles, and proofs over REST. Trove hashes every byte on the way in — identical assets collapse to one stored object.
Each upload is an immutable, content-addressed version. Nothing is overwritten; every revision is reachable by its SHA-256 hash, so history is the default, not an add-on.
Tag, group by job or SKU, and attach metadata. Search across the whole store by kind, fingerprint, or attribute — find the right die or font in seconds.
codex extracts, lint preflights, sift plans, compile produces, lens reviews — each reads its inputs straight from Trove by content hash. One source of assets.
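The push, hash, and version steps above can be sketched in a few lines. This is a minimal in-memory illustration, not Trove's API: the `ContentStore` class, its `put` and `get` methods, and the asset names are hypothetical, and a real store would sit behind REST and an object bucket.

```python
import hashlib
from dataclasses import dataclass, field


@dataclass
class ContentStore:
    """Toy content-addressed store (hypothetical names, illustration only)."""

    blobs: dict = field(default_factory=dict)     # sha256 hex -> stored bytes
    versions: dict = field(default_factory=dict)  # asset name -> ordered list of hashes

    def put(self, name: str, data: bytes) -> str:
        digest = hashlib.sha256(data).hexdigest()
        # Identical bytes map to the same key, so they are stored once.
        self.blobs.setdefault(digest, data)
        # Every upload appends a version; earlier versions are never overwritten.
        self.versions.setdefault(name, []).append(digest)
        return digest

    def get(self, digest: str) -> bytes:
        # Readers fetch by hash, so every tool sees exactly the same bytes.
        return self.blobs[digest]


store = ContentStore()
first = store.put("job-1/label.pdf", b"artwork bytes")
same = store.put("job-2/label.pdf", b"artwork bytes")             # dedups against first
revised = store.put("job-1/label.pdf", b"artwork bytes, revised")  # new version, old one kept
```

Here `first` and `same` are the same hash and occupy one stored object, while `job-1/label.pdf` keeps both of its revisions in order.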
Built for web-to-print platforms, packaging houses, and prepress teams that need one trustworthy home for every production asset — without lock-in.
Every asset is keyed by SHA-256. Re-upload the same artwork from ten jobs and Trove stores it once — storage stays lean and identity is exact, not filename-guessed.
Uploads never overwrite. Each revision is its own content-addressed object, so full history is always reachable and an approved proof can never silently change.
Tag by job, SKU, customer, or asset kind; attach arbitrary metadata. Query the whole store by fingerprint, kind, or attribute to find the right die, font, or profile fast.
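As a rough sketch of that kind of lookup, the snippet below filters asset records by kind, tag, and metadata attribute. The `AssetRecord` shape and the `find` helper are assumptions made for illustration; Trove's actual query interface is not described here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class AssetRecord:
    """Hypothetical index entry for one stored asset."""

    sha256: str
    kind: str                      # e.g. "die", "font", "icc", "artwork", "proof"
    tags: frozenset = frozenset()  # e.g. job, SKU, or customer labels
    metadata: tuple = ()           # (key, value) pairs of arbitrary attributes


def find(records, kind=None, tag=None, **attrs):
    """Return the records that match every filter that was supplied."""
    matches = []
    for record in records:
        meta = dict(record.metadata)
        if kind is not None and record.kind != kind:
            continue
        if tag is not None and tag not in record.tags:
            continue
        if any(meta.get(key) != value for key, value in attrs.items()):
            continue
        matches.append(record)
    return matches


index = [
    AssetRecord("aa11", "die", frozenset({"sku-42"}), (("material", "board"),)),
    AssetRecord("bb22", "font", frozenset({"sku-42"})),
    AssetRecord("cc33", "die", frozenset({"sku-77"}), (("material", "film"),)),
]
dies_for_sku = find(index, kind="die", tag="sku-42")
```

A linear scan is enough to show the idea; a production index would normally be backed by a database.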
codex, lint, sift, compile, and lens can each read inputs from Trove by content hash — one canonical store of artwork, dies, fonts, ICC profiles, and proofs.
Assets live behind per-tenant isolation with role-based access. The hosted platform adds auth, quotas, and audit — your store, your boundaries.
Run the AGPL-3.0 open source on Docker or Railway with your own S3/R2 bucket, or use managed Print With Synergy hosting: the same store, managed and metered.
Hybrid by design
Packaging houses and regulated prepress can't always put masters in the cloud. Trove lets the bytes of record stay on-site while the hosted service still gives you versioning, search, and a store that reads by hash.
Keep masters on the plant floor and let Trove cache hot copies in the cloud for speed. Per-asset placement policy — cloud-only, on-prem origin + cloud cache, mirror, or stub-only — decides where the bytes of record live.
A single static daemon dials out to Trove over one persistent control channel — nothing to expose. File bytes move agent↔R2 over short-lived presigned URLs, never over the control socket, and every transfer is hash-verified end to end.
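End-to-end hash verification of a transfer can be sketched as follows, assuming the receiver already knows the expected SHA-256 of the asset. The `verify_stream` helper is hypothetical, and an in-memory stream stands in for a download from a presigned URL.

```python
import hashlib
import hmac
import io


def verify_stream(stream, expected_sha256: str, chunk_size: int = 1 << 20) -> int:
    """Read a stream in chunks and return its size if the SHA-256 matches."""
    hasher = hashlib.sha256()
    total = 0
    while chunk := stream.read(chunk_size):
        hasher.update(chunk)   # hash incrementally, so large files never sit fully in memory
        total += len(chunk)
    if not hmac.compare_digest(hasher.hexdigest(), expected_sha256.lower()):
        raise ValueError("content hash mismatch: transfer rejected")
    return total


payload = b"ICC profile bytes"
expected = hashlib.sha256(payload).hexdigest()
size = verify_stream(io.BytesIO(payload), expected)
```

Because the expected hash is also the asset's identity, a corrupted or substituted transfer fails the check instead of being stored under the wrong key.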
GDPR erase is a first-class operation: purge a blob's stored bytes and its cached derived facts on request. Content-addressed dedup means erase only ever removes truly unreferenced content — never bytes another job still relies on.
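One way to picture reference-aware erase is a store that tracks which jobs still point at each blob. The sketch below is an assumption about how such bookkeeping could work, not a description of Trove's internals; `RefCountedStore` and its methods are hypothetical.

```python
import hashlib


class RefCountedStore:
    """Toy store that records which jobs reference each blob (illustration only)."""

    def __init__(self):
        self.blobs = {}    # sha256 hex -> stored bytes
        self.derived = {}  # sha256 hex -> cached derived facts
        self.refs = {}     # sha256 hex -> set of job ids still using the blob

    def put(self, job: str, data: bytes) -> str:
        digest = hashlib.sha256(data).hexdigest()
        self.blobs.setdefault(digest, data)
        self.derived.setdefault(digest, {"size": len(data)})
        self.refs.setdefault(digest, set()).add(job)
        return digest

    def erase(self, job: str, digest: str) -> bool:
        """Drop one job's reference; purge bytes and derived facts only if none remain."""
        holders = self.refs.get(digest, set())
        holders.discard(job)
        if holders:
            return False  # another job still relies on these bytes, so keep them
        self.blobs.pop(digest, None)
        self.derived.pop(digest, None)
        self.refs.pop(digest, None)
        return True       # nothing references the content any more


store = RefCountedStore()
digest = store.put("job-a", b"shared artwork")
store.put("job-b", b"shared artwork")
kept = store.erase("job-a", digest)    # job-b still holds a reference
purged = store.erase("job-b", digest)  # last reference gone, bytes removed
```

The first erase only drops a reference; the stored bytes and cached facts are purged once the last job lets go.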
Optional per-tenant client-side encryption: the on-prem agent envelope-encrypts before bytes ever leave your network, so the cloud holds only ciphertext and Trove never sees plaintext or your keys. Dedup still works — it's keyed on the plaintext hash.
Pricing
Run the asset store as a managed hosted service, or self-host the open source — same store, you pick who runs it.
AGPL-3.0 · your infrastructure
Run the whole asset plane yourself on your own object store. No quotas, no per-asset fees — ever.
or $990/year (2 months free)
Add the managed asset store to your workspace. We run the storage, the database, and the data plane for you.
Plan details and any metered-overage rates are shown at checkout. The open-source edition is AGPL-3.0 and free forever.
Open source · managed hosting
A toolkit of focused, standalone PDF utilities — extraction, preflight, viewing, assembly, imposition planning, and an asset store. Each one plugs into the prepress workflow you already run. Use the open source yourself, or let us host any single tool for you on work.withsynergy.io.
Structured PDF extraction API that turns complex files into consistent JSON.
Programmatic PDF assembly — a deterministic API build step for rewriting and generating print-ready PDFs.
Detection-only PDF preflight engine — 500+ checks plus the PDF/X-4 conformance suite.
Embeddable PDF viewer with separations, TAC, layers, and annotation overlays.
PDF assay and metadata reporting — surface what's actually inside the file.
WYSIWYG canvas editor for label and packaging artwork — PDF/X-4 output, flexo support, and a full create-to-RIP workflow.
Stateless imposition-planning solver — step-and-repeat, gang, and true-shape nesting.
Content-addressed digital-asset plane — versioned blobs, a presigned data plane, and on-prem agent recall.