Vector Database
Tool · Definition
A database optimized for storing and querying high-dimensional embedding vectors, usually via approximate nearest neighbor (ANN) search, which trades a small amount of recall for much faster lookups than an exact scan. Core infrastructure for RAG systems. Examples: Pinecone, Weaviate, Qdrant, pgvector (PostgreSQL extension).
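To make the query model concrete, here is a minimal sketch of the store-and-query interface such a database exposes. It is not how any of the products above are implemented: it performs an exact brute-force scan, which is the baseline that ANN indexes approximate, and the class name `TinyVectorStore`, its methods, and the three-dimensional vectors are all hypothetical.

```python
import heapq
import math


class TinyVectorStore:
    """Hypothetical in-memory store that uses an exact (not approximate) scan."""

    def __init__(self):
        self._items = []  # list of (item_id, vector) pairs

    def add(self, item_id, vector):
        self._items.append((item_id, vector))

    def query(self, vector, k=2):
        # Compare the query against every stored vector and keep the k
        # closest by Euclidean distance. An ANN index avoids this full scan.
        nearest = heapq.nsmallest(
            k, self._items, key=lambda item: math.dist(vector, item[1])
        )
        return [item_id for item_id, _ in nearest]


store = TinyVectorStore()
store.add("doc-cats", [0.9, 0.1, 0.0])
store.add("doc-dogs", [0.8, 0.3, 0.1])
store.add("doc-tax", [0.0, 0.2, 0.9])

top = store.query([1.0, 0.0, 0.0], k=2)
print(top)
```

The full scan costs time proportional to the number of stored vectors on every query, which is the cost that ANN indexing is meant to avoid at scale.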
Related Terms
Embedding
A dense numerical vector that represents text, images, or other data in a high-dimensional space where semantic similarity maps to geometric closeness. Foundation of semantic search, RAG systems, and recommendation engines.
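As an illustration of "semantic similarity maps to geometric closeness", the sketch below compares hand-written toy vectors with cosine similarity. The three-dimensional values are invented for the example; real embeddings come from a trained model and typically have hundreds or thousands of dimensions.

```python
import math


def cosine_similarity(a, b):
    """Cosine of the angle between two vectors; values near 1.0 mean similar direction."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    return dot / (norm_a * norm_b)


# Made-up vectors standing in for model output (assumption, not real embeddings).
embeddings = {
    "kitten": [0.9, 0.8, 0.1],
    "cat": [0.8, 0.9, 0.2],
    "invoice": [0.1, 0.2, 0.9],
}

sim_related = cosine_similarity(embeddings["kitten"], embeddings["cat"])
sim_unrelated = cosine_similarity(embeddings["kitten"], embeddings["invoice"])
print(f"kitten vs cat:     {sim_related:.3f}")
print(f"kitten vs invoice: {sim_unrelated:.3f}")
```

The related pair scores much closer to 1.0 than the unrelated pair, which is the property that semantic search relies on.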
RAG (Retrieval-Augmented Generation)
An architecture that enhances LLM outputs by first retrieving relevant documents from a knowledge base (typically via vector search) and injecting them into the prompt. Grounds the model in external, up-to-date facts without requiring retraining.
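The retrieve-then-inject flow can be sketched as follows. Everything here is a stand-in chosen for illustration: `toy_embed` is a hypothetical word-count function over a tiny fixed vocabulary rather than a real embedding model, the documents are invented, and the sketch stops at building the prompt instead of calling an LLM.

```python
import math
import re

# Hypothetical vocabulary for the toy embedding (assumption, not a real model).
VOCAB = ["refund", "shipping", "password", "days", "reset"]


def toy_embed(text):
    """Count vocabulary terms in the text; a crude stand-in for an embedding model."""
    words = re.findall(r"[a-z]+", text.lower())
    return [float(words.count(term)) for term in VOCAB]


def cosine_similarity(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return dot / norm if norm else 0.0  # treat all-zero vectors as dissimilar


def retrieve(question, documents, k=1):
    """Step 1: rank documents by similarity to the question and keep the top k."""
    q_vec = toy_embed(question)
    ranked = sorted(
        documents,
        key=lambda doc: cosine_similarity(q_vec, toy_embed(doc)),
        reverse=True,
    )
    return ranked[:k]


def build_prompt(question, documents, k=1):
    """Step 2: inject the retrieved text into the prompt sent to the model."""
    context = "\n".join(f"- {doc}" for doc in retrieve(question, documents, k))
    return (
        "Answer using only the context below.\n\n"
        f"Context:\n{context}\n\n"
        f"Question: {question}"
    )


documents = [
    "Refund requests are accepted within 30 days of purchase.",
    "Standard shipping takes 5 business days.",
    "Use the reset link to change your password.",
]

prompt = build_prompt("How do I reset my password?", documents, k=1)
print(prompt)
```

Because the knowledge lives in `documents` rather than in model weights, updating the facts only requires changing the stored text, not retraining.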
Inference
The process of running a trained model to generate predictions or outputs, as opposed to training, which updates the model's weights. Inference efficiency (latency, throughput, cost) is a primary concern for production deployments.