Why reproducibility?
By reproducibility, we refer to the main principle of the scientific method: research findings must be replicable through independent experimentation or data analysis. The results should be consistent with those of the original study, within an acceptable error margin.
In modern scientific research, software plays a central role in data processing, analysis, and experimentation, which introduces additional challenges to reproducibility. It is no longer sufficient to merely describe the methods used in an experiment. Access to the source code is essential, but not sufficient on its own.
Geoscientific Model Development (GMD) Guidelines
The GMD journal explicitly emphasizes the importance of reproducibility. In GMD executive editors [2019], it introduced editorial guidelines regarding its code and data policies. For example:
(…) it is not sufficient that the source code is provided. It is also necessary to have access to all the input data (…) and all model configuration files are provided.
Additionally:
(…) challenge (…) occurs where model inputs or outputs have been manually processed by an author. (…) nobody, not even the author, can definitively know (…) how the results came about.
All figures and tables must be scientifically reproducible from the scripts.
Why is reproducibility essential? Because it is a core principle of the scientific method and a requirement enforced by scientific journals.
Attention
GMD Guidelines: Journals enforce reproducibility against archived releases (which is great!)
Why notebooks?
In this paper, we present a utility package and practical recommendations to support reproducible scientific research within the context of research software engineering (RSE). These tools are intended for use with scientific notebooks contained in code repositories. In this section, we explore the growing use of notebooks in research, based on statements from articles published in Nature.
Why Jupyter is data scientists’ computational notebook of choice
Nature 563 (toolbox): Perkel [2018]
We read:
We went from Jupyter notebooks not existing some six years ago to in essence everybody using them today.
However, the same paper also highlights challenges:
(…) difficult to organize code logically, break it into reusable modules and develop tests to ensure the code is working properly
Reactive, reproducible, collaborative: computational notebooks evolve
Nature 593: Perkel [2021]
Three years later, they published:
A 2019 study found that just 24% of 863,878 publicly available Jupyter notebooks on GitHub could be successfully re-executed, and only 4% produced the same results.[^1]
Given the wide presence of notebooks in scientific research, it is important to address the challenges they present.
[^1]: Pimentel et al. [2019]
Can we do even better?
Journals require that results be reproducible using the code and data as they were at the time of publication. In practice, this necessitates pinning specific package versions to ensure consistent behavior. Hence, we propose going further.
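To make the pinning requirement concrete, here is a minimal, hypothetical sketch of a pinned environment that a notebook repository could ship alongside its code; the file name, packages, and version numbers are illustrative assumptions, not part of any journal guideline.

```text
# requirements.txt -- illustrative pinned environment for a published notebook.
# Versions are frozen at publication time so the environment can be recreated
# later, e.g. with: pip install -r requirements.txt
numpy==1.26.4
pandas==2.2.2
matplotlib==3.8.4
```

Recreating the environment from such a file reproduces the published behavior, but it also freezes the analysis at a single point in time; keeping the notebook working as its dependencies evolve is the gap addressed next.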
Even better!
Reproducibility maintained alongside ongoing development
In the next sections, you can find the solutions developed in our packages.