PaPoo
cover

What is a vector database?

A vector database, also called a vector store, is a system for storing and searching data by its numerical embeddings so you can find items that are “most similar” rather than exactly matching text or IDs.

Why it matters

Regular databases are great when you know the exact thing you want: a user ID, a product SKU, a timestamp range. But many AI features need semantic search or nearest-neighbor retrieval: “find questions like this one,” “retrieve passages relevant to this prompt,” or “recommend items similar to this image.”

That is where a vector database helps. It lets you store embedding vectors for text, images, audio, or other content and quickly retrieve the closest ones to a query vector. In practice, teams often use one when building:

How it works

  1. Convert content into vectors.
    A model turns each item into an embedding: a list of numbers that captures meaning or features.

  2. Store vectors plus metadata.
    The database keeps the vector and often the original text, document ID, tags, timestamps, or permissions.

  3. Search by similarity.
    When you query with a new embedding, the system finds nearby vectors using a distance or similarity measure such as cosine similarity or Euclidean distance.

  4. Filter and rank results.
    Most systems also let you combine vector search with metadata filters, so you can search “similar passages” inside only one tenant, language, or document type.

Under the hood, many vector databases use approximate nearest-neighbor indexing to make similarity search fast at scale. The exact indexing method varies by product, but the core idea is the same: trade a little exactness for much faster retrieval.

Tiny concrete example

Suppose you have support articles:

A user asks: “I can’t sign in.”

You embed that question, search the vector database, and get back the most similar articles, even if none of them contains the exact words “can’t sign in.”

That makes it useful for a chatbot that first retrieves relevant help docs before generating an answer.

Common pitfalls / when NOT to use it

Related terms

同じ著者の記事