Bernardo Dionisi
Hi, I’m Bernardo. I earned my PhD at Duke University, where I studied the economics of innovation. That work drew me into the practical challenges of data: how to make pipelines reliable, how to integrate validation naturally, and, more recently, how to combine these tools with AI.
Session
Modern data pipelines are fast and expressive, but ensuring data quality is often less straightforward. This talk introduces Paguro, an open-source, feature-rich validation and metadata library built on top of the Polars DataFrame library. Paguro lets users validate single Data(Lazy)Frames as well as collections of Data(Lazy)Frames together, and it provides beautifully formatted terminal diagnostics that explain why and where validation failed. Attendees will learn how to integrate this lightweight, fast, and composable validation toolkit into their workflows, from exploration to production, using a familiar Polars-native syntax.