Publications Repository - Gdańsk University of Technology

Influence of YARN Schedulers on Power Consumption and Processing Time for Various Big Data Benchmarks

Climate change caused by human activities can influence the lives of everybody on the planet. Environmental concerns must be taken into consideration by all fields of study, including ICT. Green Computing aims to reduce the negative effects of IT on the environment while, at the same time, maintaining all of the possible benefits it provides. Several Big Data platforms, like Apache Spark or YARN, have become widely used in analytics and High-Performance Computing systems due to the reliability and usability of MapReduce implementations. The authors research the power consumption and energy efficiency of Hadoop YARN schedulers using Apache Spark under three different workloads. The test cases include sorting large binary files, counting unique words in large text files, and processing satellite imagery from the Sentinel-2 mission. The presented results show small (2%–11%) but distinct differences in the power consumption of the FIFO and FAIR schedulers.

Authors