PyData Global 2025

CLINTON OYOGO DAVID

Clinton Oyogo David is a Data Scientist at Oxford Policy Management, specializing in geospatial analytics, data engineering, dashboard development, and automation. He has led data-intensive projects across Africa and Asia, developing data pipelines, dashboards, and data analysis for various organisations. Clinton combines a background in statistics with a deep interest in scalable data solutions that inform policy and drive impact. His recent work focuses on harmonizing large raster datasets using tools like xarray and Dask to support small area estimation of poverty and sustainable development research.


Session

12-11
12:30
30min
Engineering Large-scale geospatial raster processing with xarray and dask
CLINTON OYOGO DAVID

Geospatial analysis often involves harmonizing and processing raster datasets from diverse sources with varying resolutions, coordinate systems, and data formats. This talk demonstrates how you can build efficient, scalable pipelines for zonal statistics extraction using Python’s scientific computing stack, xarray, and dask to handle rasters that would otherwise overwhelm traditional processing approaches.
Through a real-world case study of processing multi-source geospatial data for small-area estimation of poverty, we’ll explore practical strategies for memory-efficient raster harmonization, parallel computing workflows, and automated statistical aggregation across administrative boundaries.

Data Engineering & Infrastructure
Data Engineering & Infrastructure