Increase the Performance of K-Means Clustering Algorithm Using Apache Spark

Chang Xie

International Journal of Internet of Things and its Applications

Volume 1, No. 1, 2017, pp 13-28 http://dx.doi.org/10.21742/ijiota.2017.1.1.02

Harbin University of Commerce, China

Abstract

Big data refers to data sets that are too large or complex for traditional data processing. The term often refers to both the size of the data and the data itself. Big data poses a major challenge for database and data analytics research. It is used to derive predictive analyses from large data sets, which supports better decisions based on the available data. This paper compares Hadoop MapReduce and Apache Spark, two frameworks used for analyzing big data. Although both frameworks target big data, their performance differs from level to level and with the implementation. We compare the performance of the two frameworks using the k-means clustering algorithm.

