Analyzing the Power Consumption Behavior of a Large Scale Data Center

Similar documents
Performance & Energy

Classifier Evaluation and Selection. Review and Overview of Methods

File Systems: Fundamentals

Cloud Tutorial: AWS IoT. TA for class CSE 521S, Fall, Jan/18/2018 Haoran Li

Welfare State and Local Government: the Impact of Decentralization on Well-Being

SADA. South African Data Archive. The public s reaction to the Crime problem [South Africa], Esterhuysen, HM and Schurink, WJ SADA 0020

Cluster Analysis. (see also: Segmentation)

Coercion, Capacity, and Coordination: A Risk Assessment M

THE PRIMITIVES OF LEGAL PROTECTION AGAINST DATA TOTALITARIANISMS

Statistical Analysis of Corruption Perception Index across countries

Random Forests. Gradient Boosting. and. Bagging and Boosting

Real-Time Scheduling Single Processor. Chenyang Lu

CS 5523: Operating Systems

CS 5523 Operating Systems: Intro to Distributed Systems

Instructors: Tengyu Ma and Chris Re

Computational challenges in analyzing and moderating online social discussions

Split Decisions: Household Finance when a Policy Discontinuity allocates Overseas Work

Supporting Information for Signaling and Counter-Signaling in the Judicial Hierarchy: An Empirical Analysis of En Banc Review

Servilla: Service Provisioning in Wireless Sensor Networks. Chenyang Lu

John Parman Introduction. Trevon Logan. William & Mary. Ohio State University. Measuring Historical Residential Segregation. Trevon Logan.

An Integrated Tag Recommendation Algorithm Towards Weibo User Profiling

Scaling Administration 15

Probabilistic Latent Semantic Analysis Hofmann (1999)

National Labor Relations Board

Introduction to data on ethnicity

Return on Investment from Inbound Marketing through Implementing HubSpot Software

Can Politicians Police Themselves? Natural Experimental Evidence from Brazil s Audit Courts Supplementary Appendix

SADA. South African Data Archive. Political Regimes and Regime Transitions in Africa,

Big Data, information and political campaigns: an application to the 2016 US Presidential Election

Immigrant Legalization

Final Review. Chenyang Lu. CSE 467S Embedded Compu5ng Systems

Clinton vs. Trump 2016: Analyzing and Visualizing Tweets and Sentiments of Hillary Clinton and Donald Trump

RECOMMENDED CITATION: Pew Research Center, May, 2017, Partisan Identification Is Sticky, but About 10% Switched Parties Over the Past Year

Mapping Policy Preferences with Uncertainty: Measuring and Correcting Error in Comparative Manifesto Project Estimates *

Chapter. Sampling Distributions Pearson Prentice Hall. All rights reserved

Recommendations For Reddit Users Avideh Taalimanesh and Mohammad Aleagha Stanford University, December 2012

Support Vector Machines

Educated Preferences: Explaining Attitudes Toward Immigration In Europe. Jens Hainmueller and Michael J. Hiscox. Last revised: December 2005

2017 Citizen Satisfaction Survey City of Shawnee, Kansas

ANNUAL SURVEY REPORT: BELARUS

Case Evidence: Blacks, Hispanics, and Immigrants

My Health Online 2017 Website Update Online Appointments User Guide

FM Legacy Converter User Guide

The System Made Me Stop Doing It. The Indirect Origins of Commercial Peace

Results of L Année philologique online OpenURL Quality Investigation

Flash Eurobarometer 431. Summary. Electoral Rights

Strengthen Stewardship With Electronic Giving

ANNUAL SURVEY REPORT: REGIONAL OVERVIEW

Surviving Elections: Election Violence, Incumbent Victory, and Post-Election Repercussions January 11, 2016

Preliminary Effects of Oversampling on the National Crime Victimization Survey

Real-Time Wireless Control Networks for Cyber-Physical Systems

Migrant-specific use of the Labour Force Survey - Emigrants

Citizen engagement and compliance with the legal, technical and operational measures in ivoting

Processes. Criteria for Comparing Scheduling Algorithms

Schedule of Fees - Patents

Congruence in Political Parties

Classification of posts on Reddit

INCREASED OPPORTUNITY TO MOVE UP THE ECONOMIC LADDER? EARNINGS MOBILITY IN EU:

NISO s IOTA Working Group

Household Income and Expenditure Survey Methodology 2013 Workers Camps

Academic Dishonesty in Online Coursework

Content Analysis of Network TV News Coverage

Hoboken Public Schools. AP Statistics Curriculum

Benchmarks for text analysis: A response to Budge and Pennings

Digital research data in the Sigma2 prospective

Modeling Spray Drift: A Dispersion Model Case Study

Special Eurobarometer 474. Summary. Europeans perceptions of the Schengen Area

Description of Supplementary Data Files

CS 5523 Operating Systems: Synchronization in Distributed Systems

SIMPLE LINEAR REGRESSION OF CPS DATA

JUDGE, JURY AND CLASSIFIER

Real- Time Wireless Control Networks for Cyber- Physical Systems

Identifying Factors in Congressional Bill Success

The Diffusion of ICT and its Effects on Democracy

Use of the Delphi methodology to identify indicators of trafficking in human beings Process and results

National Homelessness Data

Learning from Small Subsamples without Cherry Picking: The Case of Non-Citizen Registration and Voting

GENDER EQUALITY IN THE LABOUR MARKET AND FOREIGN DIRECT INVESTMENT

UK Data Archive Study Number International Passenger Survey, 2016

Networks and grand corruption in Hungary. Exploratory analysis

WOMEN IN DECISION-MAKING POSITIONS

Migrant Wages, Human Capital Accumulation and Return Migration

IN THE UNITED STATES DISTRICT COURT FOR THE EASTERN DISTRICT OF PENNSYLVANIA

Happiness and economic freedom: Are they related?

EasyChair Preprint. (Anti-)Echo Chamber Participation: Examing Contributor Activity Beyond the Chamber

The CDB-based Poverty and Select CMDGs Maps and Charts

College Voting in the 2018 Midterms: A Survey of US College Students. (Medium)

Predicting Information Diffusion Initiated from Multiple Sources in Online Social Networks

Political Posts on Facebook: An Examination of Voting, Perceived Intelligence, and Motivations

Cadac SoundGrid I/O. User Guide

Networks and Innovation: Accounting for Structural and Institutional Sources of Recombination in Brokerage Triads

The Effect of ICT Investment on the Relative Compensation of High-, Medium-, and Low-Skilled Workers: Industry versus Country Analysis

Georgia Democratic Presidential Primary Poll 2/23/16. Fox 5 Atlanta

2018 MINNESOTA COUNTIES ELECTIONS CALENDAR

Abstract for: Population Association of America 2005 Annual Meeting Philadelphia PA March 31 to April 2

2018 MINNESOTA TOWNS WITH MARCH ELECTIONS CALENDAR

Most Believe Kinder Morgan Pipeline will have a Positive Economic Effect, But a Negative Environmental One

HCDNNJ Under One Roof Annual Conference: Utilizing Online Media to Publicize Your Organization

Subreddit Recommendations within Reddit Communities

Execution of Warrants of Arrest (Bench Warrants)

Transcription:

Analyzing the Power Consumption Behavior of a Large Scale Data Center KASHIF NIZAM KHAN, AALTO UNIVERSITY, FINLAND. SANJA S., TAPIO N., JUKKA K. N., SEBASTIAN V. A. & OLLI-PEKKA L. 1

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 2

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 3

Motivation Ø Data center energy spending is ever increasing Ø System power draw is increasing substantially without a breakthrough in energy efficiency Ø Increased economic, social and environmental pressure to decrease the energy cost Ø Performance of future HPC systems will be constrained by power cost 4

Motivation Ø Data center energy spending is ever increasing Ø Data center power consumption log analysis is relatively less studied 5

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 6

Contributions Ø Investigate the impact/relation of OS counters and RAPL on total power consumption Ø Analyse unsuccessful jobs and their influence in energy spending Ø Cluster the nodes based on the OS counter and RAPL values Ø Model/estimate the total power consumption using OS counters and RAPL value. 7

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 8

Dataset Description Ø 900 nodes Taito computing cluster 460 Sandybridge, 397 Haswell Ø Approximately 2 days of production data captured in June 2016 Ø vmstat, RAPL, plug power and job info. Ø Sampled at 0.5Hz https://research.csc.fi/taito-supercluster 9

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 10

Power Consumption of Computing Nodes 11

Power Consumption of Computing Nodes 12

Power Consumption of Computing Nodes NODE C581 13

Power Consumption of Computing Nodes NODE C749 14

Power Consumption of Computing Nodes NODE C836 15

Power Consumption of Computing Nodes NODE C585 16

Power Consumption of Computing Nodes NODE C626 17

Power Consumption of Computing Nodes NODE C775 18

Power Consumption of Computing Nodes NODE C819 19

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 20

Analysis of Unsuccessful Jobs ØCompleted - jobs that ran to completion ØFailed - jobs that failed to complete successfully ØCancelled- jobs that are cancelled by their users ØTimeout- jobs that did not run to successful completion within a given time limit. 21

Analysis of Unsuccessful Jobs 16% 43.5% 22

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 23

Power Consumption Estimation Ø Sample 2% of data from all the nodes (251,244 data samples) Ø First 2/3 rd of the data is used as historical data and train ML models Ø Last 1/3 rd of the data is used to validate Ø Random Forest gives the best result 24

Outline Ø Motivation Ø Contributions Ø Dataset Description Ø Power Consumption of Computing Nodes Ø Analysis of Unsuccessful Jobs Ø Power Consumption Estimation Ø Plug Power Modeling 25

Ø Aim - Model the plug power using OS counters and RAPL measurements Ø 30,000 measurements from 'Haswell' type computing nodes. Plug Power Modeling 1 0.8 0.6 0.4 0.2 0 0.2 0.4 0.6 0.8 1 r b swpd free buff cache si so bi bo in1 cs us sy id wa CPU1 DRAM1 CPU2 DRAM2 plug plug.lag5 r b swpd free buff cache si so bi bo in1 cs us sy id wa CPU1 DRAM1 CPU2 DRAM2 plug plug.lag5 Plug power Frequency 50 100 150 200 250 300 350 0 2000 4000 6000 8000 26

Plug Power Modeling MAPE: 2.10% 27

Plug Power Modeling MAPE: 1.97% 28

Clustering 29

Conclusion Ø Estimating plug power from utilization metrics is promising Ø RAPL add to the accuracy of the models by providing real time power consumption data Ø Considering interactions among RAPL variables the error reduces to 1.87% Ø Unsuccessful jobs can consume significant resources and power Ø In future, we aim to utilize such data center logs to produce job specific power consumption models 30

Thank You! 31