Bounding Box
A bounding box is a set of coordinates that defines a rectangular region on a document page, often a page index plus x, y, width, and height (or an equivalent form). In DocLD, bounding boxes can be returned with extraction citations or parsing output so you can highlight "where" in the document a value or passage came from. They support visual citation and overlays in viewers.
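As a rough illustration of the "page index plus x, y, width, height (or equivalent)" description, the sketch below models a box and converts it to and from corner form. The class name, field names, zero-based page index, and top-left origin are all assumptions made for this example; they are not taken from DocLD's actual response schema.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class BoundingBox:
    """Hypothetical bounding box shape; not DocLD's actual schema."""

    page: int      # page index (zero-based here, by assumption)
    x: float       # left edge
    y: float       # top edge (top-left origin, by assumption)
    width: float
    height: float

    def corners(self):
        """Return the equivalent (x0, y0, x1, y1) corner form."""
        return (self.x, self.y, self.x + self.width, self.y + self.height)

    @classmethod
    def from_corners(cls, page, x0, y0, x1, y1):
        """Build a box from corner coordinates on the given page."""
        return cls(page, x0, y0, x1 - x0, y1 - y0)


box = BoundingBox(page=0, x=72.0, y=100.0, width=200.0, height=24.0)
print(box.corners())  # (72.0, 100.0, 272.0, 124.0)
```

Both forms carry the same information, so which one a viewer or drawing library expects is mostly a matter of convention.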
Use Cases
- Citation — Show the exact region that supported an extracted field or a RAG answer.
- Overlay — Draw highlights or boxes on a PDF or image viewer.
- Validation — Inspect the source region when confidence score is low.
Bounding boxes are derived from parsing and layout analysis; accuracy depends on document structure and, for scanned documents, on OCR quality.
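For the overlay use case, a viewer usually has to scale a box from page units to the pixel size of the rendered page before drawing. The following is a minimal sketch of that scaling. The function name is hypothetical, and it assumes the box and the page size share the same units and a top-left origin, which may not match what a given API returns.

```python
def to_pixel_rect(box, page_size, image_size):
    """Scale a box from page units to integer pixel corners.

    box: (x, y, width, height) in page units, top-left origin (assumed).
    page_size: (page_width, page_height) in the same units as box.
    image_size: (image_width, image_height) of the rendered page in pixels.
    """
    x, y, w, h = box
    sx = image_size[0] / page_size[0]  # horizontal scale factor
    sy = image_size[1] / page_size[1]  # vertical scale factor
    return (
        round(x * sx),
        round(y * sy),
        round((x + w) * sx),
        round((y + h) * sy),
    )


# A 612 x 792 page rendered at twice its size.
rect = to_pixel_rect((72.0, 100.0, 200.0, 24.0), (612.0, 792.0), (1224, 1584))
print(rect)  # (144, 200, 544, 248)
```

The resulting corner tuple is in the form most drawing routines accept for a rectangle, so it can be passed on to whatever rendering layer the viewer uses.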
Related Concepts
Bounding boxes accompany citation and extraction results. They come from parsing and layout analysis. Confidence score and confidence threshold can be used together with bounding boxes to flag regions for review.
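One way to combine a confidence threshold with bounding boxes is to collect the regions of low-confidence fields so a reviewer can inspect them. The sketch below assumes each extracted field is a dictionary with `name`, `confidence`, and `bbox` keys. Those keys, the sample values, and the 0.8 default are illustrative assumptions, not DocLD's actual output format or defaults.

```python
def regions_to_review(fields, threshold=0.8):
    """Return (name, bbox) pairs for fields whose confidence is below threshold."""
    return [
        (field["name"], field["bbox"])
        for field in fields
        if field["confidence"] < threshold
    ]


# Illustrative data only; field names and values are made up for this example.
fields = [
    {
        "name": "invoice_number",
        "confidence": 0.97,
        "bbox": {"page": 0, "x": 72, "y": 100, "width": 200, "height": 24},
    },
    {
        "name": "total",
        "confidence": 0.62,
        "bbox": {"page": 1, "x": 400, "y": 650, "width": 120, "height": 20},
    },
]

flagged = regions_to_review(fields)
for name, bbox in flagged:
    print(name, bbox["page"])
```

Here only the `total` field falls below the threshold, so only its region would be highlighted for review.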