Prediction of Heart Disease by Clustering and Classification Techniques
Reetu Singh1 , E. Rajesh2
Section:Research Paper, Product Type: Journal Paper
Volume-7 ,
Issue-5 , Page no. 861-866, May-2019
CrossRef-DOI: https://doi.org/10.26438/ijcse/v7i5.861866
Online published on May 31, 2019
Copyright © Reetu Singh, E. Rajesh . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: Reetu Singh, E. Rajesh, “Prediction of Heart Disease by Clustering and Classification Techniques,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.861-866, 2019.
MLA Style Citation: Reetu Singh, E. Rajesh "Prediction of Heart Disease by Clustering and Classification Techniques." International Journal of Computer Sciences and Engineering 7.5 (2019): 861-866.
APA Style Citation: Reetu Singh, E. Rajesh, (2019). Prediction of Heart Disease by Clustering and Classification Techniques. International Journal of Computer Sciences and Engineering, 7(5), 861-866.
BibTex Style Citation:
@article{Singh_2019,
author = {Reetu Singh, E. Rajesh},
title = {Prediction of Heart Disease by Clustering and Classification Techniques},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {861-866},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4327},
doi = {https://doi.org/10.26438/ijcse/v7i5.861866}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.861866}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4327
TI - Prediction of Heart Disease by Clustering and Classification Techniques
T2 - International Journal of Computer Sciences and Engineering
AU - Reetu Singh, E. Rajesh
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 861-866
IS - 5
VL - 7
SN - 2347-2693
ER -
VIEWS | XML | |
932 | 311 downloads | 173 downloads |
Abstract
Every year 19 million people approximately die from heart disease worldwide. A heart patient shows several symptoms and it is very tough to attribute them to the heart disease in so many steps of disease progression. Data mining, as an answer to extract a hidden pattern from the clinical dataset, are applied to a database in this analysis. All available algorithms in classification technique are compared to each other to achieve the highest accuracy. To further increase the correctness of the solution, the dataset is preprocessed by different unsupervised and supervised algorithms. The two important tasks which are needed for the development of classifier come under data mining and they are clustering and classification. In K-means clustering the initial point selection effects on the results of the algorithm, both in the number of clusters found and their centroids. Methods to enhance the k-means clustering algorithm are discussed. With the help of these methods efficiency, accuracy and performance are improved. So, to improve the performance of clusters the Normalization which is a pre-processing stage is used to enhance the Euclidean distance by calculating more nearer centers, which result in a reduced number of iterations which will reduce the computational time as compared to k-means clustering. Finally, the classifiers are developed with Logistic regression by using the data extracted by K-Means Clustering. The techniques adopted in the design of classifier perform relatively well in terms of classification results better compared to clustering techniques.
Key-Words / Index Term
Data mining, Classification techniques, K-means clustering, Neural Networks, Logistic Regression
References
[1] de Carvalho Junior, Helton Hugo, et al. "A heart disease recognition embedded system with fuzzy cluster algorithm." Computer methods and programs in biomedicine 110.3 (2013): 447-454.
[2] Mirmozaffari, Mirpouya, Alireza Alinezhad, and Azadeh Gilanpour. "Data Mining Apriori Algorithm for Heart Disease Prediction." Int`l Journal of Computing, Communications & Instrumentation Engg (IJCCIE) 4.1 (2017).
[3] Khaing, Hnin Wint. "Data mining based fragmentation and prediction of medical data." Computer Research and Development (ICCRD), 2011 3rd International Conference on. Vol. 2. IEEE, 2011.
[4] Patel, Ajad, Sonali Gandhi, Swetha Shetty, and Bhanu Tekwani. "Heart Disease Prediction Using Data Mining." (2017).
[5] Wghmode, Mr Amol A., Mr Darpan Sawant, and Deven D. Ketkar. "Heart Disease Prediction Using Data mining Techniques." Heart Disease (2017).
[6] Vijayashree, J., and N. Ch SrimanNarayanaIyengar. "Heart disease prediction system using data mining and hybrid intelligent techniques: A review." Int. J. Bio-Sci. Biotechnol 8 (2016): 139-148.
[7] Singla, Meenu, and Kawaljeet Singh. "Heart Disease Prediction System using Data Mining Clustering Techniques."
[8] Cp, Prathibhamol, Anjana Suresh, and Gopika Suresh. "Prediction of cardiac arrhythmia type using clustering and regression approach (P-CA-CRA)." Advances in Computing, Communications and Informatics (ICACCI), 2017 International Conference on. IEEE, 2017.
[9] Banu, NK Salma, and Suma Swamy. "Prediction of heart disease at early stage using data mining and big data analytics: A survey." Electrical, Electronics, Communication, Computer and Optimization Techniques (ICEECCOT), 016 International Conference on. IEEE, 2016.
[10] Mane, Tejaswini U. "Smart heart disease prediction system using Improved K-means and ID3 on big data." Data Management, Analytics and Innovation (ICDMAI), 2017 International Conference on. IEEE, 2017.
[11] Senthil Kumar, B., and Dr Gunavathi R. "A Survey on Data Mining Approaches to Diabetes Disease Diagnosis and Prognosis." IJARCCE 5 (2016): 463-467.
[12] Kumar, B. Senthil, and R. Gunavathi. "Comparative and Analysis of Classification Problems." Journal of Network Communications and Emerging Technologies (JNCET) www. jncet. org 7.8 (2017).
[13] G. Kesavaraj, Dr. S.Sukumaran,”A Study on Classification Techniques in Data Mining”,IEEE-31661.
[14] F.U.Siddiqui, N.A.Mat Isa, “Optimized k-means clustering algorithm for image segmentation”, School of Electrical and electronic engineering, university Sains Malaysia, 14300, Nibong Tebel, Penang, Malaysia, 2012.
[15] Shital A. Raut and S. R. Sathe, “A Modified Fastmap KMeans Clustering Algorithm for Large Scale Gene Expression Datasets”, International Journal of Bioscience, Biochemistry and Bioinformatics, Vol. 1, No. 4, page 120- 124, November 2011.
[16] Tapas kanungo, David M. Mount, Nathan S. Netanyahu, Christine D. Piatko, Ruth Silverman, Angela Y.Wu,”An Efficient k-Means Clustering Algorithm: Analysis and Implememtation”, IEEE Transaction on Pattern Analysis and Machine Intelligence, Vol.24, No.7,2002.
[17] Kapil Joshi, Himanshu Gupta, Prashant Chaudhary, Punit Sharma,” Survey on Different enhanced K-means Clustering Algorithm”, International Journal Of Engineering Trends And Technology, Vol. 27 ,No. 4-September 2015.
[18] Adil Fahad, Najlaa Alshatri, Zahir Tari, Addullah Alamri, Ibrahim Khalil, albert Y.Zomaya, Sebti Foufou, Abdelaziz Bouras, “A survey of Clustering Algorithms for Big Data: Taxonomy And empirical Analysis”, IEEE TTANSACTION ON Emerging Topics in Computing.
[19] Jiawei Han, Micheline kamber, Jian pei, “Data Mining Concepts and Techniques”, Third Edition, © 2012, Elsevier Inc.
[20] K.Rajalakshmi,Dr.S.S.Dhenakaran,N.Roobin“Comparative Analysis of K-Means Algorithm in Disease Prediction”, International Journal of Science, Engineering and Technology Research (IJSETR), Vol. 4, Issue 7, July 2015. [
[21] BV Sumana, T.Santhanam,”Prediction of diseases by Cascading Clustering and Classification”,International Conference on Advances in Electronics , Computers and Communication(ICAECC),2014.
[22] K. R. Lakshmi, M. V. Krishna and S. P. Kumar, “Performance Comparison of Data Mining Techniques for Predicting of Heart Disease Survivability,” International Journal of Scientific and Research Publications, ISSN 2250-3153, Vol.3, Issue.6, June 2013.
[23] M. Kumari, R. Vohra and A. Arora, “Prediction of Diabetes Using Bayesian Network,” International Journal of Computer Science and Information Technologies, Vol. 5 (4), 5174-5178, 2014. M. A. Banu, B. Gomathy, “Disease forecasting system using data mining methods,” in IEEE International Conference on Intelligent Computing Applications (ICICA`14), pp. 130-133, 2014. https://doi.org/10.1109/icica.2014.36
[24] B. Bahrami, and M. H. Shirvani, “Prediction and Diagnosis of Heart Disease by Data Mining Techniques,” Journal of Multidisciplinary Engineering Science and Technology (JMEST), ISSN: 3159-0040, Vol. 2, Issue 2, February 2015.
[25] I. H. Witten and E. Frank, “Data Mining Practical Machine Learning Tools and Techniques,” Morgan Kaufman Publishers, 2005
[26] R. Jyoti, G. Preeti, “Analysis of Data Mining Techniques for Diagnosing Heart Disease,” International Journal of Advanced Research in Computer Science and Software Engineering (IJARCSSE), Vol. 5, ISSUE. 7, July 2015.
[27] K. Manimekalai, “prediction of heart disease using data mining techniques,” IJIRCCE, Vol.4, Issue 2, February 2016.
[28] A. Durgadevi and K. Saravanapriya, “comparative study of data mining classification algorithm in heart disease prediction,” international journal of recent research in mathematics computer science and information technology, Vol.2, Issue 2, March 2016.
[29] O. Y. Atkov, “Coronary heart disease diagnosis by artificial neural networks including genetic polymorphisms and clinical parameters,” Elsevier, 2012.
[30] R. Alizadehsani, J. Habibi, M. Hosseini, R. Boghrati, A. Ghandeharioun, B. Bahadorian, Z. Alizadehsani, “A Data Mining Approach for Diagnosis of Coronary Artery Disease, ” Elsevier, 2013.
[31] S. U. Amin, K. Agarwal, and R. Beg, “Genetic neural network based data mining in prediction of heart disease using risk factors,” presented at the IEEE Conference on Information & Communication Technologies, 2013.
[32] Priyanka, Sana Khan, Tulsi Kour “Investigation on Smart Health Care Using Data Mining Methods” International Journal Of Scientific Research in Computer Sciences and Engineering 2016.