A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis

AUTHORS

Srikanth Meda,Associate Professor, Dept. of CSE, R.V.R.& J.C. College of Engineering, Guntur

ABSTRACT

Since a decade, emergence of interdisciplinary computer technologies changed the pace of medical diagnosis systems by insisting up-to-date intelligence and supervision. These intellectual systems predict the future health problems by processing the current health information of patients, which helps in prevention of diseases rather than cure. Although the medical diagnosis systems are adequate intelligent in disease diagnosis, but they are still suffering in pre-diagnosis of diseases due to the complexity in processing of huge medical datasets. Recently introduced Data Mining techniques with Big Data processing environment expanded the horizons of medical diagnosis systems to process the high velocity medical data sets to diagnose the occurrence of diseases early. Today’s medical diagnosis systems, which are utilizing different data mining techniques like Decision Trees (DT), Support Vector Machines (SVM), Naïve Bayes (NB), Fuzzy Logics and K-Nearest Neighbor (KNN), are suffering from uncertainty, imprecision and complexity in processing. In this paper we are proposing a cross reference methodology to improvise the reliability and precision of diagnostic results and utilizing big data tools to diminish the complexity in processing huge sets of medical data. Most popular data mining techniques, which are participating in medical data processing with high accuracy are selected and cross referenced by our proposed framework to overcome uncertainty and imprecision. In order to process the high velocity medical datasets with several data mining techniques, this frame work outsources the data processing business to Apache Hadoop environment. Experiments on Cleveland medical dataset proved that the proposed cross reference methodology framework recorded high precision, recall and accuracy in results than its counterparts.

 

KEYWORDS

Data Mining, Machine Learning classifiers, Decision making systems, Disease diagnosis.

REFERENCES

[1]     S. Palaniappan and R. Awang, “Intelligent heart disease prediction system using data mining techniques,” in IEEE/ACS International Conference on Computer Systems and Applications. IEEE, (2008), pp.108-115, [doi:10.1109/AICCSA.2008.4493524].(CrossRef)(Google Scholar)
[2]     Azuaje, F., Dubitzky, W., Lopes, P., Black, N., & Adamsom, K. “Predicting coronary disease risk based on short-term RR interval measurements: A neural network approach”. Artificial Intelligence in Medicine, Vol.15, No.3, pp.275-297, (1999). [DOI:10.1371/journal.pone.0210103](CrossRef)(Google Scholar)
[3]     Randa El Bialy, Mostafa, Omar and Khalifa “Feature analysis of coronary heart disease data sets” ICCMIT – 2015, Vol.65, by Elsiever’s science direct, procedia comp science, pp.459-468. (2015) [DOI: 10.1016/j.procs.2015.09.132](CrossRef)(Google Scholar)
[4]     Indira S. Fal Dessai “Intelligent Heart Disease Prediction System Using Probabilistic Neural Network”, International Journal on Advanced Computer Theory and Engineering (IJACTE), ISSN (Print) : pp.2319-2526, Vol.2, No.3, (2013)
[5]     Ali.Adeli, Mehdi.Neshat “A Fuzzy Expert System for Heart Disease Diagnosis” Proceedings of the international multi conference of engineers and computer scientists, vol.1, March-2010, hongkong.(2010)
[6]     University of California, Irvine (UCI) “Online accessible public medical data set: Cleveland Heart Disease Dataset,”. [Online] Available at : http://archive.ics.uci.edu/ml/datasets/Heart+Disease.
[7]     Robert Detrano & M.D & PhD, V.A. Medical Center, Long Each and Cleveland Clinic Foundation. Available: www.archive.ics.uci.edu/ml/datasets/Heart+Disease.
[8]     Azuaje, F., Dubitzky, W., Lopes, P., Black, N., & Adamsom, K. “Predicting coronary disease risk based on short-term RR interval measurements: A neural network approach”. Artificial Intelligence in Medicine, 15, pp.275-297, (1999). [DOI:10.1023/B:JOMS.0000041169.28544.fd].(CrossRef)(Google Scholar)
[9]     M. Gudadhe, K. Wankhade, and S. Dongre, “Decision support system for heart disease based on support vector machine and artificial neural network,” in Computer and Communication Technology (ICCCT), 2010 International Conference on, (2010), pp. 741-745, [DOI: 10.1109/ISCC.2017.8024530].(CrossRef)(Google Scholar)
[10]  H. Kahramanli and N. Allahverdi, “Design of a hybrid system for the diabetes and heart diseases,” Expert systems with applications, vol. 35, no. 1, pp. 82-89, (2008), DOI: 10.1016/j.eswa.2007.06.004.(CrossRef)(Google Scholar)
[11]  Samiya Khan, Kashish Ara Shakil and Mansaf Alam “Cloud-Based Big Data Analytics - A Survey Of Current Research And Future Directions” Big Data Analytics. Advances in Intelligent Systems and Computing, vol.654. Springer, Singapore,(2017) [DOI: 10.1007/978-981-10-6620-7_57](CrossRef)(Google Scholar)
[12]  Google Cloud Platform and Big Query. Retrieved from: https://cloud.google.com/bigquery/
[13]  K. Shvachko, H. Kuang, S. Radia and R. Chansler, "The Hadoop Distributed File System," 2010 IEEE 26th Symposium on Mass Storage Systems and Technologies (MSST), Incline Village, NV, (2010), pp.1-10. [doi: 10.1109/MSST.2010.5496972](CrossRef)(Google Scholar)

CITATION

  • APA:
    Meda,S.(2019). A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis. International Journal of Advanced Research in Big Data Management System, 3(2), 21-26. http://dx.doi.org/10.21742/IJARBMS.2019.3.2.04
  • Harvard:
    Meda,S.(2019). "A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis". International Journal of Advanced Research in Big Data Management System, 3(2), pp.21-26. doi:http://dx.doi.org/10.21742/IJARBMS.2019.3.2.04
  • IEEE:
    [1]S.Meda, "A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis". International Journal of Advanced Research in Big Data Management System, vol.3, no.2, pp.21-26, Nov. 2019
  • MLA:
    Meda Srikanth. "A survey on Machine Learning Classifiers and Big data for Accurate and Reliable Heart Disease Pre-diagnosis". International Journal of Advanced Research in Big Data Management System, vol.3, no.2, Nov. 2019, pp.21-26, doi:http://dx.doi.org/10.21742/IJARBMS.2019.3.2.04

ISSUE INFO

  • Volume 3, No. 2, 2019
  • ISSN(p):2208-1674
  • ISSN(o):2208-1682
  • Published:Nov. 2019

DOWNLOAD