Vector Database
StorageA vector database is a database designed to store vectors (embeddings) and answer similarity or nearest-neighbor queries efficiently. DocLD uses Pinecone as its vector database for knowledge base chunks: vector search runs against Pinecone to power RAG retrieval.
Capabilities
- Upsert — Store embeddings with metadata (e.g., document-id, page, chunk-id).
- Query — Given a query embedding, return the most similar vectors (top-k) and optional metadata filtering.
- Namespace — Organize vectors by namespace (e.g., per knowledge base).
DocLD’s vector index is backed by Pinecone; ingestion and reindex update the vector database.
Related Concepts
Vector database backs the vector index and vector search. Pinecone is DocLD’s vector database. Embedding and retrieval depend on it.