PgBouncer connection pooling

A session-mode PgBouncer fronts Postgres on both staging and production as a Kamal accessory (heatwave-pgbouncer). This is the architecture + decision record; for operations (rebuild/update, bootstrap, failover) see the pgbouncer skill, and for exact bootstrap commands see config/pgbouncer/README.md.

Topology

app (web ./bin/rails, consolidated sidekiq)
   │  DATABASE_HOST / DATABASE_HOST_VERSIONS = heatwave-haproxy:6433
   ▼
heatwave-haproxy    (Kamal accessory, write-VIP failover router, :6433, mode tcp passthrough)
   │  routes every connection to the CURRENT primary's pgbouncer (health-gated; see below)
   ▼
heatwave-pgbouncer  (Kamal accessory, kamal network, :6432, session mode)
   │  pools: heatwave + heatwave_versions
   ▼
heatwave-postgres   (local Postgres accessory, :5432)

Since 2026-06-13 the app no longer points at the bouncer directly — it points at the heatwave-haproxy:6433 write-VIP, a TCP-passthrough failover router that forwards to whichever node is the live primary, so a pg_promote reroutes writes with no databases.ini edit and no app redeploy. PgBouncer is unchanged underneath; HAProxy just sits in front. See HAPROXY.md for the routing layer. (To revert to direct pooling, set DATABASE_HOST / DATABASE_HOST_VERSIONS back to heatwave-pgbouncer:6432.)

The pooler is co-located with the app on each host: the app always talks to a local bouncer (now via the local HAProxy), and the bouncer’s backend is the local Postgres accessory. The heatwave→heatwave_versions FDW is internal to Postgres (loopback) and is unaffected — the bouncer just needs a pool per database.

Why it exists

Cap the post-consolidation connection demand. The Kamal cutover collapsed the four Sidekiq processes into one (SIDEKIQ_CONSOLIDATED=1) reserving 49 worker threads; PR #1072 sized its pool at DB_POOL=55 (after the default-5 pool caused ActiveRecord::ConnectionTimeoutError across workers — AppSignal #5951–#5961). Steady-state demand (~150–177 conns) fits under the 197 usable (max_connections=200 − superuser_reserved_connections=3), but a rolling deploy briefly runs old + new containers together — sidekiq alone then needs 2 × (55+55) = 220 to the two DBs, over the limit → the new container boots into FATAL: sorry, too many clients already → failed deploy. The pooler caps server connections (max_db_connections=90/DB) and queues the surplus instead of letting Postgres reject it. This cliff did not exist at DB_POOL=5; #1072’s bump created it, and PgBouncer is its structural fix.
A repointable connection layer for the PG16→18 ping-pong. During the Dallas↔Chicago migration, failover became a backend repoint on the bouncer rather than an app redeploy. (That migration is complete — prod cut over to PG18.4 in Dallas on 2026-06-10.) See the ping-pong runbook.

Session mode is mandatory

The app relies on three session-scoped behaviours that transaction pooling would silently break (each transaction would land on a different backend):

Advisory locks — 28 files use with_advisory_lock/pg_advisory* (plus the Rails migration advisory lock).
LISTEN/NOTIFY — app/models/liquid/order_drop.rb.
Per-connection SET statement_timeout — config/database.yml variables:.

So pool_mode = session. The trade-off: session mode does not multiplex at steady state (one server connection is bound to a client for its whole session), so the pooler here is a failover-indirection + connection-ceiling layer, not a connection multiplier. default_pool_size is therefore set to cover real demand (transparent passthrough) and max_db_connections is the hard ceiling that only bites during the deploy-overlap storm. A future optimization could route the read-only reading role through a second, transaction-mode port once the Dallas PG18 replica exists.

Sizing (from PR #1072)

param	value	why
`pool_mode`	`session`	advisory locks + LISTEN/NOTIFY + `SET` (above)
`default_pool_size`	80	≥ heatwave peak (sidekiq `DB_POOL=55` + web 4×5) → transparent
`max_db_connections`	90 / DB	hard ceiling; 2 DBs × 90 = 180 < 197 usable
`max_client_conn`	2000	absorbs old+new container overlap during a deploy
`min_pool_size`	10	warm servers for the deploy handoff
`query_wait_timeout`	30 s	deploy-overlap clients queue rather than error
`server_idle_timeout`	600 s	reap idle servers so steady state tracks the active set

Per-DB demand, peak: sidekiq 1×55, web 4×5, plus the lazy reading-role pools (ApplicationViewRecord→primary_replica) and a little monitoring/console overhead.

Auth (no app password stored)

auth_type = scram-sha-256 + auth_query. A low-privilege pgbouncer login role owns nothing but EXECUTE on pgbouncer.get_auth(text) — a SECURITY DEFINER function (owned by a superuser) that returns a user’s verifier from pg_shadow. PgBouncer authenticates an incoming deploy client against that verifier and relays to the backend via SCRAM pass-through, so the app deploy password is never in the pooler’s files. The only secret stored is the plaintext pgbouncer-role password in /data/pgbouncer/userlist.txt — required because PgBouncer must authenticate as the auth-user to run auth_query, and a stored SCRAM verifier (StoredKey) is one-way and can’t produce a client proof for that login. The pgbouncer role is low-privilege; its password lives in 1Password as Heatwave-PgBouncer-{staging,production}.

The image — our own build

ghcr.io/warmlyyours/heatwave-pgbouncer:1.25.2, built from the upstream release tarball (pinned by sha256) via docker/pgbouncer.Dockerfile, Alpine multi-stage, non-root. We build our own rather than depend on a wrapper (edoburu lags upstream) so the supply chain is ours and rebuildable on the next CVE — 1.25.2 carries the SCRAM (CVE-2026-6665) + auth_query search_path (CVE-2025-12819) fixes that land in our auth path. Rebuild/bump procedure: the pgbouncer skill.

Current state (2026-06-10)

Staging (dal-latitude-heatwave-01) — live through the bouncer since commit 2c7401da8e; backend Postgres is PG18.
Production — fronted by the bouncer since the 2026-06-08 Chicago cutover (verified with real traffic: 30+ active heatwave connections, session mode, zero auth errors, maxwait=0, no recurrence of #5951–#5961). The PG16→18 ping-pong then moved prod to Dallas on PG18.4 (2026-06-10); the bouncer is the repointable layer that made that a backend repoint rather than an app redeploy.

The two hosts differ in DB bootstrap mechanics (staging trust + deploy-owned /data; prod peer auth → connect as the postgres superuser + root-owned /data → sudo) — captured in the pgbouncer skill + config/pgbouncer/README.md.

Operations (build/update, bootstrap, failover, gotchas): pgbouncer skill
Exact bootstrap commands: config/pgbouncer/README.md
Deploys / accessories / 1Password: kamal-deploy skill
PG16→18 migration that motivated the repointable layer: ping-pong runbook