Landscape of High-performance Python to Develop Data Science and Machine Learning Applications
Autor: | Castro, Oscar, Bruneau, Pierrick, Sottet, Jean-Sébastien, Torregrossa, Dario |
---|---|
Rok vydání: | 2023 |
Předmět: | |
Druh dokumentu: | Working Paper |
Popis: | Python has become the prime language for application development in the Data Science and Machine Learning domains. However, data scientists are not necessarily experienced programmers. While Python lets them quickly implement their algorithms, when moving at scale, computation efficiency becomes inevitable. Thus, harnessing high-performance devices such as multicore processors and Graphical Processing Units (GPUs) to their potential is generally not trivial. The present narrative survey was thought as a reference document for such practitioners to help them make their way in the wealth of tools and techniques available for the Python language. Our document revolves around user scenarios, which are meant to cover most situations they may face. We believe that this document may also be of practical use to tool developers, who may use our work to identify potential lacks in existing tools and help them motivate their contributions. Comment: 30 pages, accepted for publication in ACM Computing Surveys on 21/08/2023 |
Databáze: | arXiv |
Externí odkaz: |