RAG copilot for compliance analysts
Challenge. Analysts spent 30–60 minutes on each case review: searching 2.4M internal documents, checking regulatory requirements. Open-source search gave irrelevant results, naive RAG hallucinated.
What we did. Hybrid search (BM25 + embeddings) with cross-encoder re-ranking. Chunking aware of document structure. Separate fact-checker pass on Claude. Citation-aware UI: every claim linked to source. Eval on golden set of 1200 pairs.
Outcome. Faithfulness 0.94 (vs 0.71 baseline), case review time dropped from 40 to 6 minutes, token cost 72% lower via smart prefix caching and routing simple questions to cheaper models.