Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Open Access. Powered by Scholars. Published by Universities.®

PDF

Wayne State University

Theses/Dissertations

2015

Big Data

Articles 1 - 2 of 2

Full-Text Articles in Physical Sciences and Mathematics

Performance Comparison Of Two Data Mining Algorithms On Big Data Platforms, Md Rajiur Rahman Raju Jan 2015

Performance Comparison Of Two Data Mining Algorithms On Big Data Platforms, Md Rajiur Rahman Raju

Wayne State University Theses

In this Big data era, the need for performing large-scale computations is evident. A better understanding of the most suitable platforms which can efficiently run these computations is needed. In this thesis, we attempt to compare four such big data platforms, namely Hadoop, Spark, GPU, and Multicore CPU. We compare these platforms using two prominent data mining algorithms, namely, K-means clustering and K-nearest neighbour classification and discuss specific implementation-level details. We provide several insights into the best possible implementations of these algorithms and systematically compare the benefits and drawbacks of each of these platforms. We conduct experiments by varying data …


Unsupervised Learning And Image Classification In High Performance Computing Cluster, Itauma Itauma Jan 2015

Unsupervised Learning And Image Classification In High Performance Computing Cluster, Itauma Itauma

Wayne State University Theses

Feature learning and object classification in machine learning have become very active research areas in recent decades. Identifying good features has various benefits for object classification in respect to reducing the computational cost and increasing the classification accuracy. In addition, many research studies have focused on the use of Graphics Processing Units (GPUs) to improve the training time for machine learning algorithms. In this study, the use of an alternative platform, called High Performance Computing Cluster (HPCC), to handle unsupervised feature learning, image and speech classification and improve the computational cost is proposed.

HPCC is a Big Data processing and …