논문 상세보기

병렬 프로그램 로그 군집화 기반 작업 실행 시간 예측모형 연구 KCI 등재

Runtime Prediction Based on Workload-Aware Clustering

  • 언어KOR
  • URLhttps://db.koreascholar.com/Article/Detail/319924
구독 기관 인증 시 무료 이용이 가능합니다. 4,000원
한국산업경영시스템학회지 (Journal of Society of Korea Industrial and Systems Engineering)
한국산업경영시스템학회 (Society of Korea Industrial and Systems Engineering)
초록

Several fields of science have demanded large-scale workflow support, which requires thousands of CPU cores or more. In order to support such large-scale scientific workflows, high capacity parallel systems such as supercomputers are widely used. In order to increase the utilization of these systems, most schedulers use backfilling policy: Small jobs are moved ahead to fill in holes in the schedule when large jobs do not delay. Since an estimate of the runtime is necessary for backfilling, most parallel systems use user’s estimated runtime. However, it is found to be extremely inaccurate because users overestimate their jobs. Therefore, in this paper, we propose a novel system for the runtime prediction based on workload-aware clustering with the goal of improving prediction performance. The proposed method for runtime prediction of parallel applications consists of three main phases. First, a feature selection based on factor analysis is performed to identify important input features. Then, it performs a clustering analysis of history data based on self-organizing map which is followed by hierarchical clustering for finding the clustering boundaries from the weight vectors. Finally, prediction models are constructed using support vector regression with the clustered workload data. Multiple prediction models for each clustered data pattern can reduce the error rate compared with a single model for the whole data pattern. In the experiments, we use workload logs on parallel systems (i.e., iPSC, LANL-CM5, SDSC-Par95, SDSC-Par96, and CTC-SP2) to evaluate the effectiveness of our approach. Comparing with other techniques, experimental results show that the proposed method improves the accuracy up to 69.08%.

목차
1. 서 론
 2. 관련 연구
 3. 제안 기법
  3.1 Phase 1 : Feature Selection
  3.2 Phase 2 : Clustering of Workload Data
  3.3 Phase 3 : SVR Modeling by Clusters
 4. 실험 결과
  4.1 실험 환경
  4.2 실험 결과
 5. 결 론
 References
저자
  • 김은혜(한국전자통신연구원 융합기술연구소) | Eunhye Kim
  • 박주원(한국과학기술정보연구원 슈퍼컴퓨팅본부) | Ju-Won Park Corresponding Author