Realtime Financial Fraud Detection with Modern Python PyData Global 2025

Realtime Financial Fraud Detection with Modern Python
.ical
2025-12-10 12:30–13:00, Machine Learning & AI

Building ML models for financial fraud detection sounds straightforward, until you have to evaluate, validate, and deploy them in real-world pipelines. This talk walks through the practical stack, metrics, and mindsets needed to build fraud detection systems with modern Python. We'll cover key challenges like concept drift, extreme class imbalance, false-positive overload, and why the usual ML workflows fall short. Along the way, we’ll explore a real-world architecture using classical ML, deep learning, and GNNs, plus the validation techniques and production patterns that make or break fraud systems. If you're tired of toy problems and want patterns that survive real money and real latency, this talk’s for you.

This talk distills a production‑tested path for real‑time financial fraud detection in Python (inc. choosing the right objective, validating in time, and shipping with guardrails).

Core idea:

Optimize the business decision (alerts under cost/latency constraints), not just the ML score.

Outline (30 minutes):

Problem framing: Adversaries, label delay, extreme imbalance, and why “accuracy” lies.
Metrics that matter: Precision and recall, AUC‑PR vs ROC, cost‑weighted utility, calibration for decisions.
Validation done right: Temporal splits, rolling/blocked CV with gap, prequential test‑then‑train, leak and drift traps.
Modeling under latency budgets: Where XGBoost shines, when to add tabular DL, injecting graph signals without blowing latency (simple handcrafted graph stats + GNNs).
From notebook to service: Small, testable core, FastAPI endpoint, thresholds and shadow mode, alert quotas, analyst feedback loops.
Operations & monitoring: Drift indicators, calibration checks, label‑delay dashboards, canaries/rollbacks.
Wrap‑up/Q&A: Failure modes and a 1‑page runbook.

Attendee outcomes:

A copy‑and‑adapt roadmap for deploying financial fraud detection services with Python.
A latency‑aware model selection heuristic.
A minimal deployment pattern (service, thresholds, monitoring) that scales from pilot to production.

Prior knowledge expected:

Basic Python and DataFrames, ML classification basics, HTTP/JSON.

Prior Knowledge Expected: Yes

César Soto Valero

César is currently a Data Scientist at SEB Group, where he develops AI models to enhance the security of financial transactions on a global scale. He completed an M.Sc. in Machine Learning and moved to Sweden in 2018 to pursue a Ph.D. in Computer Science at KTH Royal Institute of Technology. During his five years at KTH, he pioneered open-source tools and techniques to mitigate software bloat, contributing to the efficiency and security of modern software systems. César is deeply passionate about AI, science, and technology, with a strong focus on bridging cutting-edge research with real-world applications. He is dedicated to advancing AI’s role in building smarter, more resilient systems that drive innovation.

This speaker also appears in:

From Ideas to APIs: Delivering Fast with Modern Python

Realtime Financial Fraud Detection with Modern Python .ical 2025-12-10 12:30–13:00, Machine Learning & AI

Realtime Financial Fraud Detection with Modern Python
.ical
2025-12-10 12:30–13:00, Machine Learning & AI