A production retrieval pipeline that answers domain-specific clinical queries across three distinct reimbursement models — the kind of problem generic RAG quietly fails at.
Pillbox Health operates an RTM (Remote Therapeutic Monitoring) platform reimbursed under three separate CPT billing models. Providers needed fast, accurate answers about [...]
Off-the-shelf RAG returned plausible-sounding text that was often wrong on the fine print. Fine print is the whole business.
I built a hybrid pipeline that combines lexical (BM25) and semantic (vector) retrieval, reranks with a cross-encoder, and boosts chunks whose CPT-code metadata matches the query. A local [...]
The system ships clinical answers grounded in the right billing model, with a citation trail providers can inspect. Document scrubbing and audit logging satisfied internal compliance rev[...]
Most "RAG in production" war stories are really retrieval stories. The LLM is the last 10% — the first 90% is getting the right three chunks in front of it. For a domain like this, met[...]
As a PM, the interesting work was defining what "grounded" meant for this product, and getting clinicians to agree on acceptance criteria before we wrote the retrieval code.