PDF/A
ConceptsPDF/A is an ISO standard (PDF for Archive) for creating PDFs suitable for long-term preservation. It restricts certain features (e.g., external fonts, JavaScript, encryption) so that documents remain viewable and reproducible over time. PDF/A is often required in legal, government, and archival contexts. DocLD can parse PDF/A documents like other native PDFs; text extraction and layout analysis work the same way.
Variants
- PDF/A-1 — Based on PDF 1.4; widely supported.
- PDF/A-2 — Based on PDF 1.7; supports more features (e.g., transparency, layers).
- PDF/A-3 — Allows embedded files (e.g., Office docs); useful for compound archives.
If you receive or produce PDF/A for compliance, DocLD’s parsing and extraction can still process the content. Generating PDF/A output (if needed) is typically handled by your authoring or conversion tools.
Related Concepts
PDF/A is a file format variant of PDF. Parsing and native PDF handling apply; compliance needs may drive use of PDF/A.