Add automated notebook testing with Papermill #602

drbenvincent · 2025-12-20T21:52:30Z

Introduces a GitHub Actions workflow to run and validate Jupyter notebooks in CI using a new runner script. Adds scripts to mock PyMC sampling for faster execution, updates test dependencies to include papermill, and documents the notebook runner usage. Also updates the interrogate badge to reflect new coverage.

📚 Documentation preview 📚: https://causalpy--602.org.readthedocs.build/en/602/

Introduces a GitHub Actions workflow to run and validate Jupyter notebooks in CI using a new runner script. Adds scripts to mock PyMC sampling for faster execution, updates test dependencies to include papermill, and documents the notebook runner usage. Also updates the interrogate badge to reflect new coverage.

Copilot

Pull request overview

This PR introduces automated testing for Jupyter notebooks in CI using Papermill. The implementation includes a runner script that mocks PyMC's MCMC sampling with faster prior predictive sampling to validate notebooks execute without errors.

Key changes:

New notebook runner script with filtering capabilities for different notebook types
Mock PyMC sampling implementation that replaces expensive MCMC with prior predictive sampling (10 draws)
GitHub Actions workflow that runs notebooks in parallel across three categories (PyMC, sklearn, and other notebooks)

Reviewed changes

Copilot reviewed 5 out of 6 changed files in this pull request and generated 5 comments.

Show a summary per file

File	Description
scripts/run_notebooks/runner.py	Main script for executing notebooks with Papermill, includes filtering and logging
scripts/run_notebooks/injected.py	Mock implementation of pm.sample that uses prior predictive sampling
scripts/run_notebooks/README.md	Documentation for the notebook runner usage and CI integration
.github/workflows/test_notebook.yml	GitHub Actions workflow for parallel notebook testing
pyproject.toml	Adds papermill to test dependencies
docs/source/_static/interrogate_badge.svg	Updates documentation coverage badge from 96.3% to 96.0%

💡 Add Copilot custom instructions for smarter, more guided reviews. Learn how to get started.

scripts/run_notebooks/runner.py

scripts/run_notebooks/injected.py

scripts/run_notebooks/README.md

codecov · 2025-12-20T22:00:54Z

Codecov Report

✅ All modified and coverable lines are covered by tests.
✅ Project coverage is 93.77%. Comparing base (9ddf58c) to head (5e8005c).

Additional details and impacted files

@@            Coverage Diff             @@
##             main     #602      +/-   ##
==========================================
+ Coverage   93.74%   93.77%   +0.02%     
==========================================
  Files          41       41              
  Lines        6827     6827              
  Branches      458      458              
==========================================
+ Hits         6400     6402       +2     
+ Misses        267      266       -1     
+ Partials      160      159       -1

☔ View full report in Codecov by Sentry.
📢 Have feedback on the report? Share it here.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

Co-authored-by: Copilot <[email protected]>

Updated the mock for pm.sample() to use 100 draws instead of 50 for prior predictive sampling, as reflected in both the injected script and documentation. This change aims to provide more robust validation during notebook execution.

Updated the mock for pm.sample to use 500 draws instead of 100 to ensure compatibility with notebook code that iterates over posterior samples, such as plot_ate which defaults to 500 draws. Adjusted documentation and injected.py accordingly.

Introduces skip_notebooks.yml to specify notebooks incompatible with prior predictive sampling mock. Updates runner.py to filter out these notebooks and reduces MIN_DRAWS from 500 to 100 for faster execution.

Replaces import of LinearRegression from causalpy.skl_models with sklearn's LinearRegression and removes execution count from the first code cell.

review-notebook-app · 2025-12-24T08:04:24Z

Check out this pull request on

See visual diffs & provide feedback on Jupyter Notebooks.

Powered by ReviewNB

Replaces hardcoded sample size with dynamic calculation based on the length of 'uncertainty' to prevent errors when fewer than 500 samples are available. Also resets execution count to null for the affected notebook cell.

Expanded the skip_notebooks.yml file to include iv_pymc.ipynb, which requires JAX not available in the CI test environment. Updated comments to clarify reasons for skipping each notebook.

Installs Graphviz as a system dependency in the test_notebook GitHub Actions workflow to support notebooks or tests that require it.

drbenvincent · 2025-12-24T09:13:56Z

passing!

@NathanielF See skip_notebooks.yaml. I had to bypass testing some of yours for reasons explained in the file. Those reasons are probably fixable in a follow up PR.

drbenvincent · 2025-12-24T09:14:47Z

bugbot review

cursor · 2025-12-24T09:14:51Z

PR Summary

Introduces CI to validate docs notebooks execute without errors.

Adds .github/workflows/test_notebook.yml to run notebooks in parallel splits on Python 3.12
New scripts/run_notebooks/ utilities: runner.py (Papermill execution with temporary notebooks), injected.py (mocks pm.sample with prior draws and minimal sample_stats), skip_notebooks.yml (notebooks excluded from CI), and a brief README.md
Updates docs/source/_static/interrogate_badge.svg from 96.3% to 96.0%

^{Written by Cursor Bugbot for commit c15a229. This will update automatically on new commits. Configure here.}

cursor

✅ Bugbot reviewed your changes and found no bugs!

drbenvincent requested review from Copilot and williambdean December 20, 2025 21:52

drbenvincent added documentation Improvements or additions to documentation devops DevOps related labels Dec 20, 2025

Copilot started reviewing on behalf of drbenvincent December 20, 2025 21:52 View session

Copilot AI reviewed Dec 20, 2025

View reviewed changes

drbenvincent and others added 10 commits December 24, 2025 04:13

Update scripts/run_notebooks/README.md

8340526

Co-authored-by: Copilot <[email protected]>

Update scripts/run_notebooks/injected.py

a58bd62

Co-authored-by: Copilot <[email protected]>

Update scripts/run_notebooks/injected.py

73481eb

Co-authored-by: Copilot <[email protected]>

Update scripts/run_notebooks/runner.py

ecd690a

Co-authored-by: Copilot <[email protected]>

attempt to fix failing remote notebook execution test

555b4ad

fix pre-commit checks

702089e

Add skip list for incompatible notebooks

2d68170

Introduces skip_notebooks.yml to specify notebooks incompatible with prior predictive sampling mock. Updates runner.py to filter out these notebooks and reduces MIN_DRAWS from 500 to 100 for faster execution.

Update imports in iv_pymc notebook

4bf53ae

Replaces import of LinearRegression from causalpy.skl_models with sklearn's LinearRegression and removes execution count from the first code cell.

drbenvincent added 4 commits December 24, 2025 08:08

Fix sampling bug in uncertainty plot

d05002d

Replaces hardcoded sample size with dynamic calculation based on the length of 'uncertainty' to prevent errors when fewer than 500 samples are available. Also resets execution count to null for the affected notebook cell.

Update skipped notebooks list for CI environment

c91c5d8

Expanded the skip_notebooks.yml file to include iv_pymc.ipynb, which requires JAX not available in the CI test environment. Updated comments to clarify reasons for skipping each notebook.

Add iv_weak_instruments.ipynb to skipped notebooks list

79323dc

Add Graphviz installation to CI workflow

c15a229

Installs Graphviz as a system dependency in the test_notebook GitHub Actions workflow to support notebooks or tests that require it.

drbenvincent requested review from NathanielF and juanitorduz December 24, 2025 09:14

cursor bot reviewed Dec 24, 2025

View reviewed changes

Merge branch 'main' into notebook-testing

f2e7b46

update pre-commit checks

5e8005c

drbenvincent mentioned this pull request Jan 7, 2026

iv_vs_priors.ipynb fails: az.plot_energy() incompatible with numpyro sampler #634

Open

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Add automated notebook testing with Papermill #602

Add automated notebook testing with Papermill #602

Uh oh!

drbenvincent commented Dec 20, 2025 •

edited by github-actions bot

Loading

Uh oh!

Copilot AI left a comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Dec 20, 2025 •

edited

Loading

Uh oh!

review-notebook-app bot commented Dec 24, 2025

Uh oh!

drbenvincent commented Dec 24, 2025

Uh oh!

drbenvincent commented Dec 24, 2025

Uh oh!

cursor bot commented Dec 24, 2025 •

edited

Loading

Uh oh!

cursor bot left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Add automated notebook testing with Papermill #602

Are you sure you want to change the base?

Add automated notebook testing with Papermill #602

Uh oh!

Conversation

drbenvincent commented Dec 20, 2025 • edited by github-actions bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Uh oh!

codecov bot commented Dec 20, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

review-notebook-app bot commented Dec 24, 2025

Uh oh!

drbenvincent commented Dec 24, 2025

Uh oh!

drbenvincent commented Dec 24, 2025

Uh oh!

cursor bot commented Dec 24, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

PR Summary

Uh oh!

cursor bot left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

drbenvincent commented Dec 20, 2025 •

edited by github-actions bot

Loading

codecov bot commented Dec 20, 2025 •

edited

Loading

cursor bot commented Dec 24, 2025 •

edited

Loading