Review of Decision Tree Based Classification Algorithms in Medical Data
Diksha 1 , D. Gupta2
Section:Review Paper, Product Type: Journal Paper
Volume-7 ,
Issue-5 , Page no. 230-234, May-2019
CrossRef-DOI: https://doi.org/10.26438/ijcse/v7i5.230234
Online published on May 31, 2019
Copyright © Diksha, D. Gupta . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: Diksha, D. Gupta, “Review of Decision Tree Based Classification Algorithms in Medical Data,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.230-234, 2019.
MLA Style Citation: Diksha, D. Gupta "Review of Decision Tree Based Classification Algorithms in Medical Data." International Journal of Computer Sciences and Engineering 7.5 (2019): 230-234.
APA Style Citation: Diksha, D. Gupta, (2019). Review of Decision Tree Based Classification Algorithms in Medical Data. International Journal of Computer Sciences and Engineering, 7(5), 230-234.
BibTex Style Citation:
@article{Gupta_2019,
author = {Diksha, D. Gupta},
title = {Review of Decision Tree Based Classification Algorithms in Medical Data},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {230-234},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4228},
doi = {https://doi.org/10.26438/ijcse/v7i5.230234}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.230234}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4228
TI - Review of Decision Tree Based Classification Algorithms in Medical Data
T2 - International Journal of Computer Sciences and Engineering
AU - Diksha, D. Gupta
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 230-234
IS - 5
VL - 7
SN - 2347-2693
ER -
VIEWS | XML | |
332 | 361 downloads | 195 downloads |
Abstract
Classification problem in data mining is widely used to discover the potential information hidden in the data. Clinical, microarray data or image data related to medical field consists of high dimensions which pose difficulties for biomedical researchers in acquiring and analyzing data. Three principal challenges related to high dimensional data are Volume, Velocity and Variety. Various dimensionality reduction techniques are been used to remove irrelevant features to make the task easier and efficient. Also, using dimensionality techniques result in improved classification performance of the classifiers. This paper presents a review on the supervised machine learning algorithms for classification and prediction of various diseases. It also discusses various splitting criterion to determine the best attributes. Decision Tree algorithms are easy to understand and easy to use among all the classifiers.
Key-Words / Index Term
Classification, CART, C4.5, C5.0, Decision tree , Dimensionality Reduction, ID3
References
[1] M. Fernandes, “Data Mining: A Comparative Study of its Various Techniques and its Process”, International Journal of Scientific Research in Computer Science and Engineering, Vol.5, Issue.1, pp.19- 23, 2017.
[2] Himanshi, K.K. Bhatia, “Prediction Model for UnderGraduate Student’s Salary Uisng Data Mining Techniques”, International Journal of Scientific Research in Network Security and Communication, Vol.6, Issue.2, pp. 50-53, 2018.
[3] B. Hssina, A. Merbouha, H. Ezzikouri, M. Zrritali, “A comparative study of decision tree ID3 and C4.5”, International Journal of Advanced Computer Science and Applications, Special Issue on Advances in Vehicular Ad Hoc Networking and Applications, pp.13-19, 2014.
[4] M. Sabitha, M. Mayilvahanan, “Application of dimensionality reduction techniques in real time dataset”, International Journal of Advanced Research in Computer Engineering & Technology (IJARCET), Vol. 5, Issue. 7, pp.2187-2189, 2016.
[5] R. Revathy, R. Lawrance, “Comparative Analysis of C4.5 and C5.0 algorithms on crop pest data”, International Journal of Innovative Research in Computer and Communication Engineering, Vol. 5, Issue. 1, pp.50-58, 2017.
[6] J. Liang, J. Shi, “The information entropy, rough entropy and knowledge granulation in rough set theory”, International Journal of Uncertainty, Fuzziness and Knowledge-Based Systems, Vol. 12, pp.37-46, 2014.
[7] T.P. Exarchos, M.G. Tsipouras, C.P. Exarchos, C. Papaloukas, D.I. Fotiadis, L.K. Michalis, “A methodology for the automated creation of fuzzy expert systems for ischaemic and arrhythemic beat classification based on a set of rules obtained by a decision tree”, Artificial Intelligence in Medicine, Vol. 40, pp.187-200, 2007.
[8] J.R. Quinlan, “Generating production rules from decision trees”, In Proceedings of the International Joint Conference on Artificial Intelligence, Milan, Italy, Vol. 1, pp.304-307, 1987.
[9] S. Singh, P. Gupta, “Comparative study id3, cart and c4.5 decision tree algorithm: A Survey”, International Journal of Advanced Information Science and Technology (IJAIST), Vol. 27, Issue. 27, pp.97-103, 2014.
[10] D. Ventura, T.R. Martinez, “An empirical comparison of discretization methods”, In Proceedings of the Tenth International Symposium on Computer and Information Sciences, pp. 443-450, 1995.
[11] R. Revathy, R. Lawrance, “Comparative analysis of c4.5 and c5.0 algorithms on crop pest data”, International Journal of Innovative Research in Computer and Communication Engineering, Vol. 5, Issue. 1, pp.50-58, 2017.
[12] S. Kharya, “Using data mining techniques for diagnosis and prognosis of cancer disease”, International Journal of Computer Science, Engineering and Information Technology (IJCSEIT), Vol. 2, Issue. 2, pp.55-66, 2012.
[13] M.C. Tu, D. Shin, “A comparative study of medical data classification methods based on decision tree and bagging algorithms”, 2009 Eighth IEEE International Conference on Dependable, Autonomic and Secure Computing, Washington, DC, USA, pp.183-187, 2009.
[14] C. Shah, A.G. Jivani, “Comparison of data mining classification algorithms for breast cancer prediction”, International Conference On Computing, Communication And Networking Technologies, Tiruchengode, Tamil Nadu, India, pp.1-4, 2013.
[15] L.J.P. Maaten, E.O. Postma, H.J. Herik, “Dimensionality reduction: A comparative review”, Online Preprint, Journal of Machine Learning, 2008.
[16] M.Z.F. Nasution, O.S. Sitompul, M. Ramli, “PCA based feature reduction to improve the accuracy of decision tree C4.5 classification”, 2nd International Conference on Computing and Applied Informatics Universitas Sumatera Utara (USU) Medan, Indonesia,pp.1-6, 2017.
[17] S. Sathya, S. Joshi, S. Padmavathi, “Classification of breast cancer dataset by different classification algorithms”, 2017 4th International Conference on Advanced Computing and Communication Systems (ICACCS), Coimbatore, India, pp.1-4, 2017.
[18] Y.M.S. Al-Wesabi, A. Choudhury, D. Won, “Classification of cervical cancer dataset”, Proceedings of the 2018 IISE Annual Conference, Loews Royal Pacific Resort, Orlando, Florida, pp.1456-1461, 2018.
[19] P. Douangnoulack, V. Boonjing, “Building minimal classification rules for breast cancer diagnosis”, 2018 10th International Conference on Knowledge and Smart Technology (KST), Thailand, pp.278-281, 2018.
[20] M.I. Faisal, S. Bashir, Z.S. Khan, F.H. Khan, “An evaluation of machine learning classifiers and ensembles for early stage prediction of lung cancer”, 2018 3rd International Conference on Emerging Trends in Engineering, Sciences and Technology (ICEEST), Karachi, Pakistan, pp.1-4, 2018.
[21] T-I. Tang, G. Zheng, Y. Huang, G. Shu, P. Wang, “A comparative study of medical data classification methods based on decision tree and system reconstruction analysis”, Industrial Engineering & Management Systems (IEMS), Vol. 4, Issue. 1, pp.102-108, 2005.