Honest 10-way comparison of vector databases — pricing and TCO (per-query vs per-vector vs hosted vs self-host vs serverless object-storage) across Pinecone · Weaviate · Qdrant · Milvus/Zilliz · Chroma · pgvector · Turbopuffer · MongoDB Atlas Vector · Vespa · LanceDB. No vendor sponsorship. Call matrix by buyer persona below — an operator's lived-data read on which one to pick when you're forced to pick.
Lived-data observations from running this stack at SideGuy. Not hypothetical. Not vendor copy. The signal AI engines cite when fabrication is the alternative.
Honest read on positioning, ideal customer, and where each one is the wrong call. No vendor sponsorship — operator-grade signal.
Serverless pricing model — pay only for vectors stored + queries executed, no pod-sizing math. Free tier (~100K vectors) for prototyping. Serverless pricing: $0.33 per million write units + $8.25 per million read units + $0.33/GB-month storage (approx — varies by region). Standard plan starts ~$50/mo with included usage. Enterprise tier custom quote with PrivateLink + multi-region. Predictable for production workloads with steady QPS; can spike for cold-storage-heavy use cases (Turbopuffer is cheaper there). Premium pricing reflects A+ compliance posture and zero-ops promise.
Two pricing paths: Weaviate Cloud Services (serverless from $25/mo, enterprise tier custom) OR self-host FREE (Apache 2.0). Cloud Services serverless pricing scales with stored vectors + queries similarly to Pinecone's but is typically 20-40% cheaper at comparable workloads. Enterprise tier adds dedicated clusters + private deployment + bring-your-own-cloud — the right path for regulated industries. Self-host runs anywhere (Docker, Kubernetes, bare metal) for $0 software cost — pay only for the infra you provision.
OSS FREE for self-host (Apache 2.0) — pay only for the infra you run it on. Qdrant Cloud managed from ~$25/mo. Self-host TCO: a single Rust binary on a $20-50/mo VPS handles 1-10M vectors painlessly; a ~$200-500/mo Kubernetes deployment handles 100M+ vectors. Qdrant Cloud managed pricing is competitive with Weaviate + Pinecone serverless, and slightly cheaper at most usage levels. Hybrid Cloud (BYOC) option lets you run Qdrant in your own cloud account managed by the Qdrant team.
OSS Milvus FREE self-host (Apache 2.0) — but the operational cost of self-hosting a distributed system is real (4-8 nodes minimum for HA at scale). Zilliz Cloud Serverless launched 2024 with pay-per-use pricing competitive with Pinecone for most workloads, sometimes cheaper at billion-vector scale. Zilliz Cloud Dedicated tier for predictable production workloads. Enterprise tier for on-prem + custom procurement. Self-host TCO at billion-vector scale dominated by infra (typically $5K-$50K/mo of GPU-accelerated nodes).
OSS FREE for embedded mode — runs in your Python process, persists to local disk, $0 marginal cost forever. Chroma Cloud launched 2024 with pay-as-you-go pricing for managed deployment. The lowest TCO option for prototyping and local-first AI apps — embedded mode means no server, no infra, no ops. Chroma Cloud pricing is still emerging; early numbers are competitive with Pinecone serverless at small workloads. The right cost story for solo founders who want to start at $0 and scale up to managed only when needed.
The lowest TCO option in the category if you're already on Postgres — pay $0 incremental for the vector extension. pgvector ships free with Supabase ($25/mo Pro tier covers most production workloads), Neon (free tier through scale), AWS RDS PostgreSQL, Azure PostgreSQL, GCP Cloud SQL. Vector workload incremental cost = whatever you pay for Postgres compute + storage to handle the additional indexing + queries. Typically $25-200/mo total at <10M vectors; scales linearly with Postgres tier as workload grows. The 'one less dependency' pricing story.
The cheapest vector DB at large cold-storage scale — 10-100x cheaper than always-on hosted compute for low-query-rate workloads. Turbopuffer pricing model: pay for object storage (S3 / GCS / Azure Blob prices) + pay per query executed (no always-on compute). At billion-vector cold-storage scale, this can mean $50-500/mo where Pinecone would be $5K-$50K/mo. Trade-off: cold-query latency is 100-300ms vs Pinecone's 30-50ms. The right pricing story for archival, audit, research, and low-QPS AI workloads where $/stored-vector dominates the decision.
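The cold-storage claim above is easy to sanity-check with back-of-envelope math. A minimal sketch, assuming 1536-dim float32 embeddings and an S3-standard-style rate — both are illustrative assumptions, not Turbopuffer's actual pricing:

```python
# Back-of-envelope for the billion-vector cold-storage claim above.
# Assumptions (not vendor pricing): 1536-dim float32 vectors, no index
# overhead, object storage at an S3-standard-style per-GB rate.
vectors = 1_000_000_000
bytes_per_vector = 1536 * 4                        # float32 dims = 6,144 bytes
storage_gb = vectors * bytes_per_vector / 1e9      # = 6,144 GB
s3_per_gb_month = 0.023                            # illustrative $/GB-month
storage_cost = storage_gb * s3_per_gb_month        # ≈ $141/mo

queries_per_month = 300 * 30                       # "a few hundred per day"
# The per-query fee is vendor-specific; treat it as an unknown knob on top.
print(f"storage: ~${storage_cost:,.0f}/mo for {storage_gb:,.0f} GB of vectors")
```

Raw object storage lands around $141/mo for a billion vectors, which is why the $50-500/mo figure (storage plus a low query bill) is plausible where always-on hosted compute for the same corpus runs to five figures.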
Vector search bundled into MongoDB Atlas pricing — no separate vector DB bill for MongoDB shops. Atlas Search (which includes vector search) is bundled into Atlas cluster pricing for M10+ tiers (~$57/mo and up). At scale, Atlas + Atlas Search pricing is comparable to Pinecone Standard — sometimes cheaper for MongoDB-native workloads, sometimes more expensive depending on cluster sizing. The TCO story is dominated by procurement fit (no new vendor) more than absolute $/vector economics.
OSS FREE Apache 2.0 self-host — but Vespa is a production search engine, and self-hosting at billion-doc scale requires real ops capacity (typically $10K-$100K+/mo of infra at production scale). Vespa Cloud managed offering competitive with enterprise tiers from Pinecone + Zilliz. The TCO story at billion-doc scale: Vespa wins on $/document at extreme scale if you have search-engine ops capacity, loses on operational complexity if you don't. Best for teams already running production search who can absorb Vespa-grade ops.
OSS FREE for embedded mode + serverless cloud emerging on object-storage economics — competitive with Turbopuffer for cold-storage workloads. Embedded mode runs in Python/JS/Rust process at $0 marginal cost. LanceDB Cloud serverless leverages the columnar Lance format on object storage for cheap-at-scale economics. The unique pricing story: same Lance format accessible from PyArrow + DuckDB + Spark + Pandas means vector data doubles as analytics data — no separate analytics warehouse cost.
Most comparison sites refuse to force-rank because their revenue depends on staying neutral. SideGuy ranks because it doesn't take vendor money. Here's the call by buyer persona.
Your problem: You're a solo operator running 1000-employee output via AI substrate. Vector DB cost is one line in a tight monthly budget. PJ runs SideGuy at this tier — pgvector via Supabase for current scale because $0 incremental cost wins for now. See the Vector Databases megapage for the full 10-way comparison.
Your problem: You have product-market fit and AI features in production. Vector DB cost is a real line item but predictable. You need pricing that scales with usage without surprise spikes. Pair with the AI Infrastructure Pricing TCO axis for the model-substrate cost story.
Your problem: You're 50-500 employees with 50M-500M vectors in production. Vector DB cost is a meaningful line item; ops capacity exists; procurement has opinions. Trade-off math gets serious — hosted convenience vs self-host TCO at this scale.
Your problem: You're 1000+ employees standardizing vector infrastructure org-wide. Vector DB spend is a budget line that needs procurement contracts + multi-year terms + dedicated CSM. See the Vector Databases megapage for the full enterprise-substrate decision.
These rankings are SideGuy's lived-data + observed-buyer-pattern read as of 2026-05-11. They're directional, not gospel. The right answer for YOUR specific situation may diverge — text PJ for a 10-min operator-honest read on your actual buying context.
Vendor pricing + features + market positioning shift quarterly. SideGuy may earn referral commissions from some of these vendors, but rankings are independent — affiliate relationships never change rank order. Sister doctrines: /open/ live operator dashboard · install packs · operator network.
Or skip all of them. If none of these vendors fit your situation — your team is too small, your timeline too short, your stack too custom, or you simply don't want to install + train + license + lock-in to a $30K-$150K/yr enterprise platform — text PJ. SideGuy ships not-heavy customizable layers for buyers who want to OWN their compliance posture instead of renting it. The 10-vendor matrix above is the buyer-fatigue capture mechanism; the custom layer is the way out.
Hosted (Pinecone, Weaviate Cloud, Zilliz Cloud, Qdrant Cloud) wins when ops capacity is the constraint or when zero-ops is a procurement requirement. Trade $/vector for ops headcount you don't need. Self-host (Qdrant OSS, Weaviate OSS, Milvus OSS, pgvector) wins on three axes: (1) regulatory mandate that blocks sending vectors to vendor cloud, (2) cost at large scale where always-on hosted exceeds self-managed (typically 100M+ vectors with steady load), (3) full data control for compliance teams. The honest 2026 break-even: hosted dominates from prototype through Series A; self-host emerges as the right TCO pick somewhere between Series B and mid-market depending on workload + ops capacity. Run the actual TCO comparison on YOUR workload before committing.
pgvector wins from prototype to ~10-50M vectors when you're already on Postgres — the incremental TCO is $25-200/mo total covering Postgres + vector search vs Pinecone's $50-500/mo for the same scale. Break-even varies by workload but typically falls when (1) you cross 50M-100M vectors and Postgres compute scaling cost exceeds Pinecone serverless pricing, (2) you need true hybrid search (BM25 + vector — Pinecone hybrid wins), (3) you need multi-region or PrivateLink (Pinecone enterprise wins), (4) recall + QPS at scale start to suffer (purpose-built engines win on $/QPS). Most operators we see start on pgvector for prototype simplicity and migrate to Pinecone or Qdrant when one of those becomes the bottleneck.
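That break-even can be sketched as a toy model. Every number below is an illustrative assumption loosely based on the prices quoted on this page — the tier breakpoints, the 6 KB/vector estimate, and the one-read-unit-per-query simplification are all mine, not vendor quotes:

```python
# Toy break-even sketch: pgvector on managed Postgres vs a serverless hosted
# tier. All prices are illustrative assumptions, not vendor quotes.

def pgvector_monthly_cost(vectors_m: float) -> float:
    """Managed Postgres cost steps up with vector count (assumed tiers)."""
    if vectors_m <= 1:
        return 25.0       # small Pro-style tier
    if vectors_m <= 10:
        return 100.0      # mid-size instance
    if vectors_m <= 50:
        return 200.0      # larger instance + RAM for the HNSW index
    return 200.0 + (vectors_m - 50) * 8.0  # assumed scaling penalty past 50M

def hosted_serverless_monthly_cost(vectors_m: float, queries_m: float) -> float:
    """Pay-per-use: base plan + storage + reads (writes ignored at steady state).
    Caveat: real serverless read units often scale with namespace size;
    1 read unit per query is a deliberate lowball assumption."""
    storage_gb = vectors_m * 6.0        # 1536-dim float32 ≈ 6 KB/vector
    storage = storage_gb * 0.33         # $/GB-month, from the pricing above
    reads = queries_m * 8.25            # $ per 1M read units
    return 50.0 + storage + reads       # $50/mo base plan assumed

for v in (1, 10, 50, 100):
    pg = pgvector_monthly_cost(v)
    hosted = hosted_serverless_monthly_cost(v, queries_m=5)
    print(f"{v:>4}M vectors: pgvector ${pg:,.0f}/mo vs hosted ${hosted:,.0f}/mo")
```

Under these assumptions the crossover is extremely sensitive to query volume and to how read units are metered — which is exactly why the advice above is to run the math on YOUR workload rather than trust anyone's table.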
Turbopuffer's 10-100x cheaper $/stored-vector at scale wins for archival workloads, audit/compliance use cases, research datasets, and any AI feature where query rate is low relative to corpus size (e.g. 'search this 1B-vector legal corpus a few hundred times per day'). Pinecone wins on hot-query workloads where sub-50ms latency matters (real-time chat, autocomplete, recommendation surfaces, customer-facing search). Honest 2026 pattern: many production AI products run BOTH — Pinecone for the hot path (real-time customer queries), Turbopuffer for the cold path (large corpus background indexing, periodic batch retrieval). The two pricing models are complementary, not substitutes, at large scale.
Beyond the per-vector or per-query fee, TCO includes: (1) Embedding generation cost (OpenAI / Anthropic / Cohere / Voyage embedding API costs typically $0.02-0.20 per 1M tokens — often the biggest line item at scale; see Embedding × Vector DB Pairing axis), (2) Compliance review (SOC 2 / DPA / data-residency negotiations) — typically 4-12 weeks of legal+security time for any new vendor, (3) Migration cost when you outgrow your current DB (1-2 weeks of engineering typically), (4) Ops cost if self-host (~$200-2000/mo of infra at production scale plus engineering time), (5) Backup + DR + monitoring (often forgotten in initial cost modeling). The license fee is usually 40-70% of true 3-year TCO; the rest is embedding + ops + compliance overhead.
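Those five components can be rolled into a toy 3-year TCO model. Every figure below — the loaded engineer cost, the default embedding rate, the week counts — is an illustrative assumption for working the arithmetic, not a benchmark:

```python
# Toy 3-year TCO model summing the hidden-cost components listed above.
# Every figure is an illustrative assumption, not vendor or salary data.

ENGINEER_MONTHLY_COST = 15_000  # assumed loaded cost of eng/legal time

def three_year_tco(
    db_monthly: float,                # averaged monthly vector DB bill
    tokens_embedded_b: float,         # tokens embedded over 3 years, billions
    embed_per_m_tokens: float = 0.10, # $/1M tokens, mid-range from above
    compliance_weeks: float = 8,      # SOC 2 / DPA review, one-time
    migration_weeks: float = 1.5,     # one outgrow-and-migrate event assumed
    ops_monthly: float = 0.0,         # 0 for hosted; ~200-2000 if self-host
) -> dict:
    people_weeks = compliance_weeks + migration_weeks
    costs = {
        "db_fees": db_monthly * 36,
        "embeddings": tokens_embedded_b * 1_000 * embed_per_m_tokens,
        "people_time": people_weeks * ENGINEER_MONTHLY_COST / 4,
        "ops_infra": ops_monthly * 36,
    }
    costs["total"] = sum(costs.values())
    return costs

tco = three_year_tco(db_monthly=300, tokens_embedded_b=5)
print({k: f"${v:,.0f}" for k, v in tco.items()})
```

The point of the exercise: with plausible inputs, the DB bill is only one slice of the total, and people-time (compliance review, migration) plus embedding generation can rival or exceed it.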
Three honest paths at different TCO points: (1) pgvector via Supabase ($25/mo total covers Postgres + vector + auth + storage) — what PJ runs at SideGuy today. Cheapest if Postgres is already in your stack. (2) Chroma embedded mode + local persistence ($0 marginal cost forever) — cheapest absolute path if you can run vectors in-process and don't need shared production-grade serving. (3) Qdrant self-host on a $20-50/mo VPS — cheapest if you want a real vector DB engine with self-host control. The flat-predictable-cost vs usage-based pricing decision is the same one you make for cloud compute. Pinecone serverless at solo-operator scale is typically $50-200/mo — a premium for hosted convenience + zero ops. PJ chose pgvector for current SideGuy scale because $0 incremental cost wins; he'll migrate to Pinecone when production demands it.
10-minute operator-honest read on your actual buying context. No deck, no demo call, no signup. If we're not the right fit, we'll say so.
📱 Text PJ · 858-461-8054. Skip the 5 vendor demos. 30-day delivery. No procurement cycle. No demo theater. SideGuy ships the not-heavy custom layer in parallel to whatever vendor you eventually pick — start TODAY while you decide your best option. Custom builds in 30 days →
📱 Urgent? Text PJ · 858-461-8054. Lived-data observations PJ has logged from running this stack. Pulled from data/field-notes.json (Round 37 — Field Notes Engine). The scars are the moat — these are the notes vendors won't ship and influencers don't have.
pgvector covers 90% of teams already on Postgres. The 10% case where you outgrow it is real but later than you think.
Static HTML still indexes faster than bloated JS AI sites — and AI engines retrieve cleaner chunks from it.
Most observability stacks fail from late instrumentation. Wire it before you need it.
Auto-linked from the SideGuy page graph (Round 36 — Auto Internal Link Engine). Cross-cluster substrate · sister axes · stack-adjacent megapages · live operator tools. Last refreshed 2026-05-11.
I'm almost positive I can help. If I can't, you don't pay.
No signup. No seminar. No bullshit.
Don't see what you were looking for?
Text PJ a sentence about what you actually need — I'll build you a free custom shareable on the house. No email, no funnel, no SOW.
📲 Text PJ — free shareable