What's the best vector database for building AI products?
Vector databases are the backbone of retrieval-augmented generation (RAG), a key technique enabling modern AI products to deliver accurate, context-aware answers from private data. This is our comprehensive comparison of leading vector databases, including Turbopuffer, Pinecone, Qdrant, pgvector, and many more.
Choosing the right vector database is critical for any AI product that must
ground responses in private data—customer records, team documentation, internal
metrics, and more. The best choice ensures that your AI can quickly find
accurate information using retrieval-augmented generation (RAG), while scaling
seamlessly and staying affordable.
When we set out to launch AI Copilots, our customizable AI chat
product for React, we faced the challenge of selecting a vector database
firsthand. Because our product manages the entire conversation loop, including
message persistence for each user, we needed a vector database that could serve
proprietary knowledge with multi-tenant isolation, real-time streaming,
scalability, and cost-effectiveness.
It’s a crowded market with many competing solutions, so we spent months testing
different approaches. In the end, we chose a hybrid approach where we run both
BM25 (keyword) and vector similarity searches, optionally followed by a
rerank step.
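The merge step can be sketched with reciprocal rank fusion (RRF), one common way to combine a keyword result list with a vector result list. This is an illustrative sketch rather than any provider's API; the constant k = 60 comes from the original RRF paper and should be tuned for your data.

```typescript
// Reciprocal rank fusion: each result contributes 1 / (k + rank) to its
// document's combined score, so items ranked highly by either search rise
// to the top without needing comparable raw scores.
type Ranked = { id: string };

function rrfMerge(bm25: Ranked[], vector: Ranked[], k = 60): string[] {
  const scores = new Map<string, number>();
  for (const list of [bm25, vector]) {
    list.forEach(({ id }, rank) => {
      scores.set(id, (scores.get(id) ?? 0) + 1 / (k + rank + 1));
    });
  }
  // Highest combined score first; ids that appear in both lists win
  return [...scores.entries()]
    .sort((a, b) => b[1] - a[1])
    .map(([id]) => id);
}
```

A rerank model can then be applied to just the top few fused results, which keeps the rerank step cheap.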
In this post we'll outline the criteria we used and the tradeoffs we found, so
you can pick the best vector database for your AI in 2025.
During our research, we discovered that vector databases vary greatly in terms
of features, limitations, and performance. Comparing benchmark speeds alone
isn't enough, and a number of factors helped us make the right decision:
Performance & scalability: Performance is crucial for us, as we need to
provide responsive AI agents for our customers. While we weren't able to
benchmark every solution, we'll discuss available third-party benchmarks.
Features: We focused on indexing strategies and namespace support, which give
us the ability to split data by type and tenant. We also quickly identified
that hybrid search is essential for robust RAG solutions, and because our
agents run close to the user on edge runtimes, an HTTP API or edge-compatible
SDK was a must.
Limitations: Each option varies greatly in terms of limitations,
particularly when it comes to indexes and namespaces.
Enterprise compatibility: As with other enterprise service providers,
compliance and security are key. HIPAA, SOC2, single sign-on, and similar
enterprise features are non-negotiable requirements.
Cost: As a provider, cost of goods directly affects what we pass on to
customers, so pricing is a major factor—especially for systems with large data
ceilings. For consistency we've compared providers with a standard formula*.
Extension vs dedicated database: Building a vector search solution into
your existing database (e.g. Postgres) can be tempting, as it will simplify
looking up data, but may lead to resource contention and scalability issues if
not planned well. Using a separate, dedicated, vector database avoids these
issues, but requires ongoing data synchronization between sources.
* 1536 dimensions, 1 million reads, 1 million writes, and 10 namespaces
(where supported).
Thanks to its performance, low cost, extremely high limits, and enterprise
features without enterprise costs, Turbopuffer became the obvious choice for us
when building AI Copilots. We experienced firsthand the reason
why they're the choice of some of our favorite tools like Cursor, Notion, and
Linear.
Turbopuffer supports both vector and BM25 indexes, making it a great fit for
both search and RAG use cases. It's serverless, and you only pay for what you
use: storage, writes, and queries. You can pre-warm a namespace via API, which
ensures our Copilots respond instantly.
Multi-tenancy is simple and scalable. Each customer and project gets its own
namespace, and there are no hard limits. Since performance can degrade as vector
stores grow, isolating tenants like this actually improves performance.
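As a sketch, per-tenant namespace ids can be derived from customer and project identifiers. The naming scheme below is our own illustration, not a Turbopuffer requirement — any unique string works as a namespace id.

```typescript
// Build a namespace id per tenant and data kind, so each customer's
// vectors are physically isolated and each index stays small.
function namespaceFor(
  orgId: string,
  projectId: string,
  kind: "docs" | "chat",
): string {
  // Keep ids URL-safe, since namespace names end up in API paths
  const safe = (s: string) => s.replace(/[^a-zA-Z0-9_-]/g, "-");
  return [safe(orgId), safe(projectId), kind].join("_");
}
```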
SDKs are available in TypeScript, Python, and Go, and when we ran into an issue
with the TypeScript client, their team fixed it in hours.
Turbopuffer also includes enterprise-grade compliance features like HIPAA BAA,
SOC 2, and CMEK, even on the non-enterprise plan. Enterprise plans add BYOC
(bring your own cloud) and native multi-tenant support.
One of the most compelling reasons to use Turbopuffer is its cost. It came in an
order of magnitude cheaper than some other solutions, even when considering
open-source, self-hosted options. Using the standard pricing test, the cost
comes in
at under $10/month, with a minimum spend of $64/month. Their pricing calculator
is clear and predictable, with no hidden fees.
Pinecone supports vector similarity search with metadata filtering and offers
built-in embeddings at an extra cost. It's available on AWS, GCP, and Azure, and
scales to billions of vectors with solid reliability.
Pinecone's limits include up to 100k namespaces in their standard plan but only
20 indexes, with higher limits available on enterprise plans.
Pinecone pricing can be confusing, as there are many options: pod-based
pricing, serverless pricing, and extra add-ons for rerank, embedding, support,
and "assistant" features. There is a
pricing calculator available to
help you estimate cost. Based on our standard pricing test, the total cost comes
in at $41. They do offer a free tier and paid plans start at $50/month minimum
usage.
Pinecone's built-in inference covers embedding and re-ranking. We found the
available embedding models somewhat limiting and would prefer to use an external
embedding model anyway.
Qdrant supports filtering, clustering, and hybrid scoring, and works well with
high-cardinality metadata. You can self-host via Docker or Kubernetes, or use
their managed service.
The API is well-documented, with SDKs in several languages, including Rust,
which is somewhat rare.
Multi-tenancy is extremely
flexible with a multitude of sharding options.
Their cloud pricing is based on storage and compute use, with a small free tier
available. A pricing calculator is
available, and based on our standard test the price is $102 on AWS us-east
without quantization. With disk caching and quantization (which reduces memory
usage) enabled, this can be reduced to $27.
pgvector is ideal for teams already using Postgres that want to unify structured
data with vector search. You get full SQL support, transactional guarantees, and
the benefits of a mature ecosystem.
It's open source and free to use. Costs come down to whatever infrastructure
you're running Postgres on. For teams already running Postgres in production,
this is a low-friction entry point, although studying the different indexing
options will be prudent. Managing a vector workload on pgvector is not as easy
as with a dedicated option built specifically for vector search. It also
comes preinstalled on many popular vendors such as
Supabase,
AWS,
and Neon.
While pgvector can be a great choice if you're already familiar and comfortable
with Postgres, you should be aware of some risks. Having vector data that lives
next to your main content is very convenient, but vector indexes can use a lot
of memory, which can negatively affect both the performance and the cost of your
database. Depending on your usage scenario, you also must pick between IVFFlat
and HNSW, a tradeoff between query performance and memory usage.
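As a sketch, the DDL for the two index types looks roughly like this. The table and column names are hypothetical; the operator classes follow the pgvector README.

```typescript
// HNSW: better query performance, but slower builds and more memory.
const hnswIndex = `
  CREATE INDEX ON documents USING hnsw (embedding vector_cosine_ops);
`;

// IVFFlat: cheaper to build and store; recall depends on the number of
// lists and on how many of them are probed at query time.
const ivfflatIndex = `
  CREATE INDEX ON documents USING ivfflat (embedding vector_cosine_ops)
  WITH (lists = 100);
`;
```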
To model your data efficiently, you may want to use
partitioning to
reduce the size of your indexes, especially in a multi-tenant situation. If
you're using an ORM such as Prisma, note that as of September 2025 it still
doesn't fully support pgvector and partitioning without workarounds.
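A minimal sketch of that partitioning idea, with hypothetical table and column names — LIST-partitioning by tenant keeps each partition's vector index small:

```typescript
// On PostgreSQL 11+, an index created on the partitioned parent is
// automatically created on every partition.
const tenantPartitioning = `
  CREATE TABLE documents (
    tenant_id text NOT NULL,
    body      text,
    embedding vector(1536)
  ) PARTITION BY LIST (tenant_id);

  CREATE TABLE documents_acme PARTITION OF documents
    FOR VALUES IN ('acme');

  CREATE INDEX ON documents USING hnsw (embedding vector_cosine_ops);
`;
```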
Vectorize supports 50k namespaces and indexes per account and up to 5M vectors
per index.
The serverless model makes it easy to use and it integrates well with other
Cloudflare products. It's one of the easiest solutions to get up and running
with if you're already on the Workers platform. However, Vectorize does not yet
appear in the compatibility matrix for their Data Localization Suite, which can
make data residency compliance difficult or impossible.
Cloudflare also offers an AutoRAG feature built on top of Vectorize, R2, and
Workflows, which can work well for simple implementations. However, we found it
a little slow, and it's difficult to integrate from a SaaS provider perspective,
where we need to communicate indexing status to a multi-tenant dashboard.
Vectorize has an HTTP API, but the native SDK is only available inside Workers
itself.
Unfortunately, Vectorize does not support full-text indexes and allows only a
limited amount of metadata (attributes). This makes a hybrid approach very
difficult, as you'd need a different database for FTS.
Pricing is usage-based, and for our standard test of 1 million documents with 1
million reads and 1 million writes, the cost is $47. Embeddings can be generated
manually or by using Cloudflare's AI models.
Weaviate provides semantic search, hybrid scoring, gRPC, and GraphQL support. It
supports multi-modal inputs (text, image, video) and offers built-in embedding
options provided by third party integrations. You can self-host or use their
cloud service.
Weaviate has two pricing models: a classic cloud-deployable model where you pay
for "AIUs", and a more transparent serverless model. Serverless pricing is based
on stored vector dimensions and query usage, with a starting plan around
$25/month. Our test of 1536 dimensions with 1 million reads and writes works out
to $153, but if you choose the less performant compressed version, it's only
$25.
Milvus supports distributed deployments on Kubernetes and includes more indexing
strategies than any other competitor we could find, such as IVF, HNSW, and
DiskANN. It's best suited for enterprise-scale use cases where infrastructure is
not a bottleneck. Costs come down to your infrastructure and operations
complexity. Milvus uses collections rather than namespaces.
Zilliz, the company behind Milvus, offers a managed cloud with a pricing
calculator. Pricing for a serverless 1536-dimension vector with 1 million reads
and 1 million writes is $89. There is also a dedicated version, which estimates
a cost of $114. They also offer a free plan with up to 5 GB of storage.
SQLite is the world's most deployed database because it's fast and embeddable.
sqlite-vec is an extension to SQLite and also the successor to sqlite-vss, an
earlier and less performant solution by the same author. sqlite-vec is
particularly appealing in situations where each customer or user has their very
own database instance, leading to nearly unlimited horizontal scalability.
If you'd rather not deal with the infrastructure behind using SQLite in the
cloud, Turso.tech offers their own hosted SQLite solution with built-in vector
support.
Turso's solution requires no extension to install, and the API is simple since
vectors are just a native type in the database. However, it's important to note
that the libSQL implementation is not the same as sqlite-vec, so you won't be
able to migrate between the two.
If you want to accomplish a hybrid search with both full text search and vector
search, you'll need a separate full text search extension such as FTS5, which
also comes preloaded when using Turso.
Rather than using namespaces, you can use a separate database for each client to
achieve true multi-tenancy, or even per-user tenancy. With Turso, reads are
also done from local replicas, which makes latency extremely low.
Turso has extremely low pricing, including a free tier and up to 25 million
queries for only $5/month. Enterprise features such as SOC 2 and HIPAA will bump
you to their enterprise plan, which starts around $400/month.
If you're running SQLite in an embedded manner, performance will be limited to
whatever hardware is available on the client. This may be fine for local chat
memory or limited documentation, but it won't scale to millions of documents.
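At that small, embedded scale, even a brute-force scan is often fine. Conceptually (sqlite-vec does this far more efficiently in C), a nearest-neighbor lookup is just:

```typescript
// Cosine similarity between two vectors of equal length.
function cosine(a: number[], b: number[]): number {
  let dot = 0, na = 0, nb = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    na += a[i] * a[i];
    nb += b[i] * b[i];
  }
  return dot / (Math.sqrt(na) * Math.sqrt(nb));
}

// Score every stored vector against the query and keep the best k.
function topK(
  query: number[],
  docs: { id: string; vec: number[] }[],
  k = 3,
): { id: string; score: number }[] {
  return docs
    .map((d) => ({ id: d.id, score: cosine(query, d.vec) }))
    .sort((a, b) => b.score - a.score)
    .slice(0, k);
}
```

This is O(n) per query, which is exactly why the approach stops being viable at millions of documents.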
Much like pgvector and sqlite-vec, MongoDB's solution exists within a database
you may already be familiar with. As with those solutions, be prepared for
increased memory usage for vector indexes.
Pricing is difficult to calculate with Atlas, as their calculator does not have
vector-specific pricing; it's usage-based on instance size. MongoDB does have
a free community edition under the
Server Side Public License.
MongoDB has many options for clients, supports
hybrid searching,
and is only limited by scaling strategy and hardware.
Chroma has full-text, metadata, and vector search available. Rather than
namespaces, it uses collections, with databases and tenants above that.
Interestingly, Chroma internally uses SQLite and object storage for much of its
functionality.
Chroma has many SDK clients as well as an HTTP API available for use with any
language. Chroma's documentation is lacking in some areas, especially the
open-source clients, but it's simple enough that we found getting started was no
problem.
Chroma's cloud offering has simple usage-based pricing with a nice pricing
calculator, which works out to $81 for a 1536-dimension vector with 1 million
writes and 1 million queries.
Redis 8.0 introduced a new native vector type that makes it one of the fastest
options in terms of raw speed. Redis also recently switched back to open source
under an AGPL license, after a controversial re-licensing that led to the rise
of forks such as Valkey.
If you're already familiar with Redis, then it's a solid choice, but as with
other solutions, you need to consider the size and shape of documents you wish
to store. Redis achieves its performance by keeping everything in memory, and
while this is super fast, it also means you need the hardware to support it. It
can also use SSD, but it will suffer some performance loss.
Redis offers up to 30 MB for free and 1 GB for $5/month. They also have flexible
options for hosting on AWS, Azure, and GCP.