IELE 2025 1

Prof: Leo Ferres

Overview

Curious how scientists predict outbreaks before they happen? Wonder what it takes to build models that inform real-world public health decisions? Want to use Bayesian statistics, geospatial tools, and real data to tackle an actual crisis? Like the idea of working in teams to solve a fast-moving, high-stakes epidemiological puzzle?

In this course, you'll take on the role of data scientists racing to model the spread of dengue in Chile, a disease on the move thanks to climate change, urbanization, and global travel. You'll collect messy, real-world data, learn modern tools like Python, Stan, and QGIS, and build predictive models that aim to forecast where the next outbreak will hit. This is not a toy problem. It's the kind of challenge public health agencies face every day. By the end, you'll not only understand the science of prediction, you’ll have practiced it.

Teaching philosophy

This is not a typical course and that's by design. Rather than traditional lectures, this is a project-driven, skill-based course designed to immerse you in the actual practice of scientific modeling and data analysis. You won't be sitting through lectures while I walk you through material. Instead, you'll learn by doing, tackling real-world problems using tools and methods drawn from contemporary data science and epidemiology. Every week (or sometimes every two weeks, depending on the complexity), you'll receive an assignment that introduces a specific type of skill or analytical technique. These are not isolated exercises—they build toward a larger goal: constructing a functioning predictive model of dengue spread in Chile. Think of each assignment as a piece of that puzzle. The structure is intentionally inverted: I won't be lecturing on each topic in advance. Instead, I'll guide you to high-quality external tutorials, documentation, and video content (e.g., from YouTube or official project pages). You'll study the material independently before class. Then, during class, we'll use our time together to dive deeper, discuss the content, troubleshoot your work, and collaboratively address any roadblocks. To make this work, your engagement is essential. You're expected to come to class having reviewed the assigned materials, having tried the assignment, and critically having real questions. The learning will happen through interaction, collaboration, and problem-solving. My role is not to lecture, but to help you think, guide you through complexity, and learn alongside you as a team. This course is designed to feel closer to a research lab or collaborative project environment than a lecture hall. If you're curious, self-directed, and ready to take on challenging, meaningful problems, you'll thrive here.

Topics

This is a data-driven course on modeling spatial, environmental, and epidemiological phenomena using Bayesian methods. Emphasis is on integrating diverse datasets and implementing models in Stan and Python. Topics include:

Assignments

General Description: This course is structured around five cumulative assignments that progressively build technical and analytical skills in geospatial analysis, environmental modeling, and Bayesian inference, with a focus on public health and socio-environmental data. Each task introduces new tools and concepts while reinforcing prior work, culminating in a final project involving Bayesian disease modeling with real-world data.