Slides & Code

This tutorial will make use of the ImputeGAP for its code examples. ImputeGAP is a comprehensive Python library for imputation of missing values in time series data. It implements user-friendly APIs to easily visualize, analyze, and repair time series datasets. The library supports a diverse range of imputation algorithms and modular missing data simulation catering to datasets with varying characteristics. ImputeGAP includes extensive customization options, such as automated hyperparameter tuning, benchmarking, explainability, and downstream evaluation.

The code examples will be provided in Jupyter and Colab notebooks. All the notebooks and slides will be available on this GitHub repository.

Tutorial Materials

Paper and slides

▸ [PDF] ImputeGAP: A Comprehensive Library for Time Series Imputation.

▸ [Slides] https://github.com/eXascaleInfolab/ImputeGAP/blob/main/imputegap/slides/2025_imputegap_kdd.pdf


Notebooks

▸ [ImputeGAP] Notebook 1 - Data Loading and Contamination: Colab Notebook, Jupyter Notebook

▸ [ImputeGAP] Notebook 2 - Imputation: Colab Notebook, Jupyter Notebook

▸ [ImputeGAP] Notebook 3 - Benchmark: Colab Notebook, Jupyter Notebook

▸ [ImputeGAP] Notebook 4 - Downstream: Colab Notebook, Jupyter Notebook

▸ [ImputeGAP] Notebook 5 - Explainer: Colab Notebook, Jupyter Notebook

▸ [ImputeGAP] Full Hands-on: Colab Notebook, Jupyter Notebook