Your Research Code Works.. Until It Doesn’t: Rethinking Reproducibility

You’ve shared your code. You’ve shared your data.…so why can’t anyone reproduce your results?

In real-world research projects, reproducibility often breaks in subtle ways, missing dependencies, unclear workflows, undocumented steps, or environments that no longer work. These issues don’t appear all at once; they emerge gradually as research evolves. In this talk, I introduce the idea of Reproducibility Debt as a way to understand how these issues accumulate over time. Importantly, this debt is not simply the result of poor practice, it reflects real-world constraints such as time pressure, experimentation, and shifting project goals. While reproducibility is often framed as a technical problem, solved through tools like Docker, Conda, or better documentation, it is in reality a multi-faceted challenge shaped by the interaction between technical, organisational, and human factors. Rather than focusing on technical fixes alone, this talk takes a socio-technical perspective: how decisions, trade-offs, and team practices influence reproducibility outcomes. I will show how reproducibility can be approached as something to understand, assess, and manage throughout a project, rather than something to fix at the end.

The talk introduces a lightweight way of identifying and tracking reproducibility risks based on a structured view of common contributing factors. If you have ever struggled to rerun your own code, or someone else’s, this talk offers a different way to think about why, and what to do about it.

Reproducibility is widely recognised as important, yet in practice it is often treated as a final step, something to address once development is complete. In fast-moving research environments, this approach rarely works. This talk reframes reproducibility as a continuous, socio-technical challenge shaped by the interaction between code, environments, workflows, and human decision-making. Rather than presenting a checklist of tools, this talk emphasises that reproducibility is not only about using Docker, Conda, or improving code structure. These are useful, but they address only part of the problem. Reproducibility challenges arise from a broader set of interacting factors that need to be understood and managed over time.

We will cover:

how reproducibility issues emerge from everyday development decisions
why “perfect reproducibility” is often unrealistic in practice
how reproducibility debt accumulates as a natural part of research work
a structured view (taxonomy) of common factors contributing to reproducibility challenges
how to identify, assess, and track reproducibility risks throughout a project
when to fix issues, when to defer them, and how to make them visible for others

The session includes a practical “before vs after” example and highlights how reproducibility can be treated as an ongoing, manageable concern rather than a one-time fix. This perspective is particularly relevant for research software engineers and developers working in dynamic, exploratory environments.

Audience

Research Software Engineers
PhD students and academic researchers
Python developers working in data-intensive or exploratory projects

Key Takeaways

Reframe reproducibility as a socio-technical and multi-faceted challenge
Understand the different factors that contribute to reproducibility issues
Learn how to identify and track reproducibility risks over time
Move from “fixing at the end” to managing throughout the project
Apply practical strategies that balance development speed and reproducibility

Your Research Code Works.. Until It Doesn’t: Rethinking Reproducibility

Zara Hassan

Share this session