Popis: |
Exposing the runtime behavior of long running, resource-burning scientific applications on HPC platforms is a must if the platforms are going to be used efficiently and wisely. This paper presents a small work in progress that aims to automatically instrument scientific applications in order to produce a heartbeat event that indicates the application is still progressing. Preliminary results show the feasibility of an automatic approach that will not require developer or user intervention. |