PyData Berlin 2025

Jérôme Petazzoni

Jérôme was part of the team that built, scaled, and operated the dotCloud PaaS, before that company became Docker. He's now an independent consultant, and since he loves to share what he has learned, he continues to give many talks and demos on containers, Docker, and Kubernetes. He values diversity, and strives to be a good ally, or at least a decent social justice sidekick. He also collects musical instruments and can arguably play the theme of Zelda on a dozen of them.


Session

09-02
13:40
30min
Data science in containers: the good, the bad, and the ugly
Jérôme Petazzoni

If we want to run data science workloads (e.g. using TensorFlow or PyTorch) in containers (for local development or production on Kubernetes), we need to build container images. Doing that with a Dockerfile is fairly straightforward, but is it the best method?
In this talk, we'll take a well-known speech-to-text model (Whisper) and show various ways to run it in containers, comparing the outcomes in terms of image size and build time.

Infrastructure - Hardware & Cloud
B05-B06