Publications
Anomaly detection in scientific workflows using end-to-end execution gantt charts and convolutional neural networks
Abstract
Fundamental progress towards reliable modern science depends on accurate anomaly detection during application execution. In this paper, we suggest a novel approach to tackle this problem by applying Convolutional Neural Network (CNN) classification methods to high-resolution visualizations that capture the end-to-end workflow execution timeline. Subtle differences in the timeline reveal information about the performance of the application and infrastructure’s components. We collect 1000 traces of a scientific workflow’s executions. We explore and evaluate the performance of CNNs trained from scratch and pre-trained on ImageNet [7]. Our initial results are promising with over 90% accuracy.
Metadata
- publication
- Practice and Experience in Advanced Research Computing, 1-5, 2021
- year
- 2021
- publication date
- 2021/7/17
- authors
- Patrycja Krawczuk, George Papadimitriou, Shubham Nagarkar, Mariam Kiran, Anirban Mandal, Ewa Deelman
- link
- https://dl.acm.org/doi/abs/10.1145/3437359.3465597
- resource_link
- https://dl.acm.org/doi/pdf/10.1145/3437359.3465597
- book
- Practice and Experience in Advanced Research Computing 2021: Evolution Across All Dimensions
- pages
- 1-5