Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

Department of Computer Science Faculty Scholarship and Creative Works

Series

Scheduling

Articles 1 - 1 of 1

Full-Text Articles in Physical Sciences and Mathematics

Fresh: Fair And Efficient Slot Configuration And Scheduling For Hadoop Clusters, Jiayin Wang, Yi Yao, Ying Mao, Bo Sheng, Ningfang Mi Dec 2014

Fresh: Fair And Efficient Slot Configuration And Scheduling For Hadoop Clusters, Jiayin Wang, Yi Yao, Ying Mao, Bo Sheng, Ningfang Mi

Department of Computer Science Faculty Scholarship and Creative Works

Hadoop is an emerging framework for parallel big data processing. While becoming popular, Hadoop is too complex for regular users to fully understand all the system parameters and tune them appropriately. Especially when processing a batch of jobs, default Hadoop setting may cause inefficient resource utilization and unnecessarily prolong the execution time. This paper considers an extremely important setting of slot configuration which by default is fixed and static. We proposed an enhanced Hadoop system called FRESH which can derive the best slot setting, dynamically configure slots, and appropriately assign tasks to the available slots. The experimental results show that …