Dr. Rebecca Bilbro PyData Global 2025

Dr. Rebecca Bilbro

Session

How exactly does one validate the factuality of answers from a Retrieval-Augmented Generation (RAG) system? Or measure the impact of the new system prompt for your customer service agent? What do you do when stakeholders keep asking for "accuracy" metrics that you simply don't have? In this talk, we’ll learn how to define (and measure) what “good” looks like when traditional model metrics don’t apply.

Live from PyData Boston
