Question 1

What is managed pgvector?

Accepted Answer

Managed pgvector means PostgreSQL with the pgvector extension pre-installed, pre-tuned, and operated for you on dedicated infrastructure. You get the standard pgvector SQL surface (the vector type, HNSW and IVFFlat indexes, distance operators) on top of a database where someone else handles backups, monitoring, replication, failover, and version upgrades. On Rivestack specifically, that infrastructure is local NVMe rather than cloud SSD, which is what HNSW random-read workloads care about most.

Question 2

Can pgvector handle production workloads?

Accepted Answer

Yes. pgvector is used in production for RAG, semantic search, recommendation engines, and embedding search APIs with millions of vectors. The constraints to plan for are memory headroom for the HNSW index, storage latency on cache misses, sensible ef_search and m values, and proper filtering, usually B-tree indexes on the columns used in WHERE alongside the HNSW index. PostgreSQL handles the rest with the same backup, replication, monitoring, and tuning patterns it has used for years.

Question 3

How many vectors can pgvector handle?

Accepted Answer

The practical limits are memory available for the HNSW index, storage IOPS under random reads, and how strict your latency targets are, so size by index size, not just row count. Our measured in-RAM fast-search capacity ladder for 1536-dimension vectors: Free (shared, 2 GB) ~100K; Solo (2 GB) ~250K; Starter (4 GB) ~350K; Growth (8 GB) ~600K; Scale (16 GB) ~1M. On a Starter node we measure ~1,600 QPS at recall@10 0.93 (ef_search=80) at 16 clients on 250k × 1536; Growth reaches ~2,950 and Scale ~4,465 (full tables at /pgvector-benchmarks-measured). A 1M × 1536 index (~6 GB) builds and serves hot on Scale (~3,600 QPS at recall@10 0.74 (ef=80) / p50 4.2ms at 16 clients) but is RAM-limited on smaller nodes. Pick the node by the index size you need to keep hot.

Question 4

What is HNSW in pgvector?

Accepted Answer

HNSW (Hierarchical Navigable Small World) is the graph-based approximate nearest neighbor index pgvector uses for fast vector search. Build-time parameters m and ef_construction control graph density; the query-time parameter hnsw.ef_search controls how aggressively the graph is traversed. Higher values mean better recall and slightly higher latency. HNSW search is dominated by random reads, which is why storage latency matters more than CPU once the index no longer fits in memory.

Question 5

Does pgvector replace Pinecone or a dedicated vector database?

Accepted Answer

For most teams, yes. pgvector keeps vectors next to the relational data, so filters, joins, and tenant scoping happen in standard SQL, without a second system to keep in sync. Dedicated vector databases still make sense at very large scale (hundreds of millions to billions of vectors) or when the workload is purely vector search with no relational context. For typical RAG and semantic search systems, pgvector on managed PostgreSQL is simpler and cheaper.

Question 6

How do I choose a managed pgvector provider?

Accepted Answer

Start with the workload, not the provider list. The questions that actually decide it: what storage latency the plan delivers under random reads, how much memory is available for the HNSW index plus PostgreSQL cache, which pgvector version is supported and how upgrades happen, what backup and PITR coverage is included, what the HA topology looks like, how pricing changes when query volume grows, and how the provider helps you migrate from where you are today. A provider that can answer those in workload language is a real candidate.

Question 7

How fast is pgvector on NVMe vs cloud SSDs?

Accepted Answer

It scales with the node, and more cores buy throughput without a latency cliff. All numbers below are 1536-dim, HNSW m=16 / ef_construction=64 / cosine, measured from a same-region app over PgBouncer and TLS, with recall and client count attached because QPS and p50 are a concurrency tradeoff, not simultaneous best-cases. Every region runs the AMD cpx line. On a Starter node (2 vCPU / 4 GB) with 250k × 1536 we see ~1,185 QPS at recall@10 0.93 (ef_search=80) with p50 3.2ms at 4 clients, up to ~1,600 QPS at 16 clients. A Growth node (4 vCPU / 8 GB) does ~2,950 QPS at recall 0.94 (16 clients) and stays above ~2,570 QPS even at recall 0.95. A Scale node (8 vCPU / 16 GB) reaches ~4,465 QPS at recall 0.95 (16 clients). Scale also builds and serves a full 1M × 1536 index hot at ~3,600 QPS at recall 0.74 (ef=80) / p50 4.2ms. The same-region network floor is ~0.4ms, so pooling and TLS add little over raw Postgres. Every operating point and one-command reproduce steps are at /pgvector-benchmarks-measured.

Question 8

Can I migrate from Supabase, Neon, or Pinecone?

Accepted Answer

Yes. Supabase and Neon are standard PostgreSQL with pgvector, so migration is pg_dump / pg_restore for smaller databases or logical replication for always-on workloads, typically 30 to 60 minutes of effort, no application changes. Pinecone is not Postgres, so you re-insert your embeddings into a vector column once; we have a migration helper and we will look at your workload shape before you start.

Question 9

What pgvector version do you support?

Accepted Answer

Rivestack tracks the pgvector 0.8.x release line and rolls extension updates through standard maintenance windows after compatibility testing. New clusters are provisioned with the supported 0.8.x build; existing clusters are upgraded deliberately rather than silently.

Question 10

Does pgvector work with PostgreSQL 18?

Accepted Answer

Yes. pgvector 0.8.x supports PostgreSQL 13 through 18. Rivestack provisions every cluster on PostgreSQL 18 with pgvector pre-installed, so you can use the latest planner, partitioning, and observability improvements alongside vector search without managing extension compatibility yourself.

Question 11

Do you support HNSW and IVFFlat indexes?

Accepted Answer

Both. HNSW is the default and what we tune for, because it is the right choice for almost every production workload at the scale our customers run. IVFFlat is fully supported when you have a static dataset and want lower index build time and lower memory. You can mix both index types within the same database.

	Rivestack	Exoscale	Elestio	Supabase
Best fit	Postgres AI workloads	European cloud buyers	Marketplace-style hosting	Full app platform
Starting price	$0 free · $29/mo	Hourly DBaaS	$14/mo+	$25 + compute add-ons
Storage default	Dedicated local NVMe	Managed DBaaS storage	Provider-dependent VM disk	Cloud block storage
pgvector support	0.8.x line	Extension included	Extension service	Extension included
HNSW tuning	Workload tuned · ef_search guidance	General DB tuning	Managed service defaults	Extension defaults
Vector workload review	Included	Support plan	Support plan	Community / paid
Pricing model	Fixed per node	Hourly usage	Monthly service	Plan + usage/add-ons
Migration path	pg_dump, restore, or logical replication help	Standard PostgreSQL tools	DB migration service	Import/export tooling

Managed pgvector hostingon dedicated NVMe.

Managed pgvector hosting
on dedicated NVMe.