Pynblint: A quality assurance tool to improve the quality of Python Jupyter notebooks

Autor: Luigi Quaranta, Fabio Calefato, Filippo Lanubile
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: SoftwareX, Vol 28, Iss , Pp 101959- (2024)
Druh dokumentu: article
ISSN: 2352-7110
DOI: 10.1016/j.softx.2024.101959
Popis: Jupyter Notebook is widely recognized as a crucial tool for data science professionals and students. Its interactive and self-documenting nature makes it particularly suitable for data-driven programming tasks. Nonetheless, it faces criticism for its limited support for software engineering best practices and its tendency to encourage bad programming habits, such as non-linear code execution. These issues often result in non-reproducible, poorly documented, and low-quality notebook code. In this paper, we introduce Pynblint, a static analyzer for Python Jupyter notebooks. Pynblint is designed to help data scientists write better notebooks, easy to understand and reproduce. We report on how we validated Pynblint with both professional data scientists and students, receiving overall positive feedback. Additionally, we discuss the potential of Pynblint to facilitate research inquiries into computational notebooks.
Databáze: Directory of Open Access Journals