SC19 Proceedings

The International Conference for High Performance Computing, Networking, Storage, and Analysis

Poster 24: Fingerprinting Anomalous Computation with RNN for GPU-Accelerated HPC Machines

Student: Pengfei Zou (Clemson University)
Supervisor: Rong Ge (Clemson University)

Abstract: This paper presents a workload classification framework that discriminates illicit computation from authorized workloads on GPU-accelerated HPC systems. As such systems become more and more powerful, they are exploited by attackers to run malicious and for-profit programs that typically require extremely high computing ability to be successful. Our classification framework leverages the distinctive signatures between illicit and authorized workloads, and explore machine learning methods to learn the workloads and classify them. The framework uses lightweight, non-intrusive workload profiling to collect model input data, and explores multiple machine learning methods, particularly recurrent neural network (RNN) that is suitable for online anomalous workload detection. Evaluation results on four generations of GPU machines demonstrate that the workload classification framework can tell apart the illicit authorized workloads with high accuracy of over 95%. The collected dataset, detection framework, and neural network models will be made available on GitHub.

ACM-SRC Semi-Finalist: yes

Poster: PDF
Poster Summary: PDF

Back to Poster Archive Listing