Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Computer Sciences

University of Nevada, Las Vegas

Theses/Dissertations

2018

Data Locality

Full-Text Articles in Physical Sciences and Mathematics

Deep Data Locality On Apache Hadoop, Sungchul Lee May 2018

UNLV Theses, Dissertations, Professional Papers, and Capstones

The amount of data collected from sources such as social media, networks, scientific instruments, mobile devices, and sensors is growing continuously, and the technology to process it is advancing rapidly. One of the fundamental technologies for processing big data is Apache Hadoop, which has been adopted by many commercial products, such as InfoSphere by IBM or Spark by Cloudera. MapReduce on Hadoop has been widely used in many data science applications. Because Hadoop is a dominant big data processing platform, the performance of MapReduce on a Hadoop system has a significant impact on big data processing capability across multiple …