Early-Stage Diabetes Risk Detection Using Data Mining Techniques With Particle Swarm Optimization
Research Paper | Journal Paper
Vol.9 , Issue.9 , pp.63-65, Sep-2021
CrossRef-DOI: https://doi.org/10.26438/ijcse/v9i9.6365
Abstract
In this study feature Selection technique (FST) namely Particle Swarm Optimization (PSO) is used to optimize the features of diabetes datasets. There are different types of classifiers that give low performances. So we need an FST to combined classifier may be required for best results. We used FST to improve the overall performance of the classification model. Classification of diabetes dataset classifier C4.5 and Support Vector Machine (SVM) is applied. The selected feature of diabetes is applied to classifiers and a comparative study was conducted. The experimental outcome reveals that the C4.5 is performed better with selected features compared to other models.
Key-Words / Index Term
Classification, C4.5, feature Selection technique, Particle Swarm Optimization, Support Vector Machine
References
[1] S. K. Sahu and P. K. Chandrakar, “Classification of Chronic Kidney Disease with Genetic Search Intersection Based Feature,” in Advances in Intelligent Systems and Computing 1122, vol. 1, pp. 11–21, 2020.
[2] S. M. Alzahani, A. Althopity, A. Alghamdi, B. Alshehri, and S. Aljuaid, “An Overview of Data Mining Techniques Applied for Heart Disease Diagnosis and Prediction,” Eng. Technol. Publ., vol. 2, no. 4, pp. 310–315, 2014.
[3] B. O. Eriksen and O. C. Ingebretsen, “In chronic kidney disease staging the use of the chronicity criterion affects prognosis and the rate of progression,” Kidney Int., vol. 72, no. 10, pp. 1242–1248, 2007.
[4] A. M. Altamimi, “Performance Analysis of Supervised Classifying Algorithms to Predict Diabetes in Children,” J. Xi’an Univ. Archit. Technol., vol. XII, no. III, pp. 2010–2017, 2020.
[5] G. A. Pethunachiyar, “Classification of diabetes patients using kernel based support vector machines,” in 2020 International Conference on Computer Communication and Informatics, ICCCI 2020, pp. 22–25, 2020.
[6] H. Kaur and G. Kaur, “Prediction of Diabetes Using Support Vector Machine,” Int. J. Res. Eng. Appl. Manag., vol. 05, no. 02, pp. 470–473, 2019.
[7] A. Kumari and R. Chitra, “Classification Of Diabetes Disease Using Support Vector Machine,” Int. J. Eng. Res. Appl., vol. 3, no. 2, pp. 1797–1801, 2013.
[8] O. S.Soliman and E. AboElhamd, “Classification of Diabetes Mellitus using Modified Particle Swarm Optimization and Least Squares Support Vector Machine,” Int. J. Comput. Trends Technol., vol. 8, no. 1, pp. 38–44, 2014.
[9] S. Sathyanarayana and S. Amarappa, “Data classification using Support vector Machine (SVM), a simplified approaCH,” Int. J. Electron. Comput. S cience Eng. Vol. 3, Number 4, ISSN- 2277-1956, pp. 435–445, 2014.
[10] J. R. Quinlan, “Improved Use of Continuous Attributes in C4 . 5,” vol. 4, no. 1996, pp. 77–90, 2006.
[11] A. K. Shrivas and S. K. Sahu, “Classification of Chronic Kidney Disease using Combination Feature Selection Techniques and Classifiers,” vol. 7, no. 3, pp. 114–117, 2019.
[12] N. Boodhun and M. Jayabalan, “Risk prediction in life insurance industry using supervised learning algorithms,” Complex Intell. Syst., no. March, 2018.
[13] M. Dash and H. Liu, “Feature selection for classification,” Intell. Data Anal., vol. 1, no. 3, pp. 131–156, 1997.
[14] P. Verma, V. K. Awasthi, and S. K. Sahu, “An Ensemble Model With Genetic Algorithm for Classification of Coronary Artery Disease,” Int. J. Comput. Vis. Image Process., vol. 11, no. 3, pp. 70–83, 2021.
[15] L. Bianchi, M. Dorigo, L. Maria, and W. J. Gutjahr, “A survey on metaheuristics for stochastic combinatorial optimization,” pp. 239–287, 2009.
[16] J. Kennedy and R. Eberhart, “Particle Swarm Optimization,” pp. 1942–1948, 1995.
[17] Y. Shi and R. Eberhart, “A Modified Particle Swarm Optimizer,” pp. 69–73, 1998.
Citation
Sanat Kumar Sahu, "Early-Stage Diabetes Risk Detection Using Data Mining Techniques With Particle Swarm Optimization," International Journal of Computer Sciences and Engineering, Vol.9, Issue.9, pp.63-65, 2021.
A Novel Ensemble Model for Classification of Chronic Kidney Disease With Selected Features and Components
Research Paper | Journal Paper
Vol.9 , Issue.9 , pp.66-69, Sep-2021
CrossRef-DOI: https://doi.org/10.26438/ijcse/v9i9.6669
Abstract
Diagnosis of health conditions is an incredibly difficult and significant issue in the field of medical science. Classification, dimension reduction technique (DRT), feature selection techniques (FST) play a very important role in the quick and accurate identification of disease. The chronic kidneys disease (CKD) dataset is connected into three classification methods like RF, J48 and C5.0. The proposed ensemble model (RF, J48 and C5.0) gives better accuracy i.e. 99.75% contrast with all classifiers with selected feature subset. All classification models give a better outcome with proposed PC-DRT and GA-FST when contrasted with without FST. The outcomes showed that utilizing GA-FST has computationally enhanced the classification accuracy.
Key-Words / Index Term
Classification, chronic kidneys disease, dimension reduction technique, ensemble model, feature selection techniques, genetic algorithm, principal component analysis
References
[1] A. Subasi, E. Alickovic, and J. Kevric, “Diagnosis of Chronic Kidney Disease by Using Random Forest,” C. 2017 Proc. Int. Conf. Med. Biol. Eng. 2017, vol. 7, no. 1, pp. 589–594, 2017.
[2] H. Polat, H. Danaei Mehr, and A. Cetin, “Diagnosis of Chronic Kidney Disease Based on Support Vector Machine by Feature Selection Methods,” J. Med. Syst., vol. 41, no. 4, 2017.
[3] N. Boodhun and M. Jayabalan, “Risk prediction in life insurance industry using supervised learning algorithms,” Complex Intell. Syst., no. March, 2018.
[4] M. Dash and H. Liu, “Feature selection for classification,” Intell. Data Anal., vol. 1, no. 3, pp. 131–156, 1997.
[5] C. W. Tsai, S. J. Liu, and Y. C. Wang, “A parallel metaheuristic data clustering framework for cloud,” J. Parallel Distrib. Comput., vol. 116, pp. 39–49, 2018.
[6] M. C. Thomas, W. Zhu, and J. A. Romagnoli, “Data mining and clustering in chemical process databases for monitoring and knowledge discovery,” J. Process Control, vol. 67, pp. 160–175, 2018.
[7] M. K. Najafabadi, M. N. ri Mahrin, S. Chuprat, and H. M. Sarkan, “Improving the accuracy of collaborative filtering recommendations using clustering and association rules mining on implicit data,” Comput. Human Behav., vol. 67, pp. 113–128, 2017.
[8] J. Hou and W. Liu, “Parameter independent clustering based on dominant sets and cluster merging,” Inf. Sci. (Ny)., vol. 405, pp. 1–17, 2017.
[9] A. Subasi, E. Alickovic, and J. Kevric, “Diagnosis of Chronic Kidney Disease by Using Random Forest,” vol. 7, no. 1, pp. 589–594, 2017.
[10] V. Kunwar, K. Chandel, A. S. Sabitha, and A. Bansal, “Chronic Kidney Disease Analysis Using Data Mining Classification,” Cloud Syst. Big Data Eng. (Confluence), 2016 6th Int. Conf. IEEE, pp. 300–305, 2016.
[11] P. Sinha, “Comparative Study of Chronic Kidney Disease Prediction using KNN and SVM,” vol. 4, no. 12, pp. 608–612, 2015.
[12] A. Pujari, Data mining techniques, Third. University press, 2013.
[13] J. Ross, Q. Morgan, and K. Publishers, “Book Review?: C4 . 5?: Programs for Machine Learning,” vol. 240, pp. 235–240, 1994.
[14] Y. Zhang and L. Wu, “Classification of fruits using computer vision and a multiclass support vector machine,” Sensors (Switzerland), vol. 12, no. 9, pp. 12489–12505, 2012.
[15] L. C. Paul, A. Al Suman, and N. Sultan, “Methodological Analysis of Principal Component Analysis (PCA) Method,” IJCEM Int. J. Comput. Eng. Manag. ISSN, vol. 16, no. 2, pp. 2230–7893, 2013.
[16] P. Yildirim, “Filter Based Feature Selection Methods for Prediction of Risks in Hepatitis Disease,” Int. J. Mach. Learn. Comput., vol. 5, no. 4, pp. 258–263, 2015.
[17] M. Einasto et al., “SDSS DR7 superclusters. Principal component analysis,” pp. 1–13, 2011.
[18] P. Verma, V. K. Awasthi, and S. K. Sahu, “A Novel Design of Classification of Coronary Artery Disease Using Deep Learning and Data Mining Algorithms,” Rev. d ’ Intell. Artif., vol. 35, no. 3, pp. 209–215, 2021.
[19] D. E. Goldberg, “Genetic Algorithms in Search Optimization & Machine Learning.” p. 412, 1989.
[20] M. Hall, “Correlation-based Feature Selection for Machine Learning,” Methodology, vol. 21i195-i20, no. April, pp. 1–5, 1999.
[21] P. J. Fleming and C. M. Fonseca, “Genetic Algorithms in Control Systems Engineering,” IFAC Proc. Vol., vol. 26, no. 2, pp. 605–612, 1993.
[22] S. K. Sahu and A. K. Shrivas, “Analysis and Comparison of Clustering Techniques for Chronic Kidney Disease With Genetic Algorithm,” Int. J. Comput. Vis. Image Process., vol. 8, no. 4, pp. 16–25, 2018.
[23] “CKD,” 2015. [Online]. Available: https://archive.ics.uci.edu/ml/datasets/Chronic_Kidney_Disease. [Accessed: 01-Jan-2016].
Citation
Sanat Kumar Sahu, "A Novel Ensemble Model for Classification of Chronic Kidney Disease With Selected Features and Components," International Journal of Computer Sciences and Engineering, Vol.9, Issue.9, pp.66-69, 2021.
Blockchain Based Secure Online Voting System
Research Paper | Journal Paper
Vol.9 , Issue.9 , pp.70-74, Sep-2021
CrossRef-DOI: https://doi.org/10.26438/ijcse/v9i9.7074
Abstract
“Large sections of society today do not trust their government or election transparency. The issue with the current EVM system is that it can be easily manipulated by power-hungry organizations. Blockchain is a disruptive technology of current era and promises to improve the overall resilience of e-voting systems. This project presents an effort to leverage benefits of blockchain such as cryptographic foundations and transparency to achieve an effective scheme for e-voting. The proposed scheme conforms to the fundamental requirements for e-voting schemes and achieves end-to-end verifiability. We can achieve this e-voting scheme along with its implementation using Multichain platform. This project presents an effort to leverage benefits of blockchain such as cryptographic foundations and transparency to achieve an effective scheme for e-voting. This requires minimum requirements needed by a voter is a smartphone or a computer with a webcam and an internet connection. This private key public key wallet for users can completely eliminate chances of double voting. This will also lead to minimum involvement of human engagement and we can rely on a trusted software architecture. More number of voters would be able to participate in elections.
Key-Words / Index Term
Block chain, EVM, Power Hungry, Resilence, Cryptography, Multichain, Decentrolised
References
[1] Khan, K. M., Arshad, J., & Khan, M. M. Investigating performance constraints for blockchain based secure e-voting system. Future Generation Computer Systems. Pawade, D. 105, 13-26, 2020.
[2] Sakhapara, A., Badgujar, A., Adepu, D., & Andrade, M. Secure online voting system using biometric and blockchain. In Data Management, Analytics and Innovation. Springer, Singapore. pp. 93-110, 2020.
[3] Krishnan, R., Thangavelu, A., Prabhavathy, P., Sudheer, D., Putrevu, D., & Misra, A. Web-based remote sensing image retrieval using multiscale and multidirectional analysis based on Contourlet and Haralick texture features. International Journal of Intelligent Computing and Cybernetics, 2021.
[4] Elisa, N., Yang, L., Chao, F., & Cao, Y. A framework of blockchain-based secure and privacy-preserving E-government system. Wireless Networks, 1-11, 2018.
[5] Ayed, A. B. A conceptual secure blockchain-based electronic voting system. International Journal of Network Security & Its Applications, 9(3), 01-09, 2017.
[6] Daniel, M. Blockchain Technology: The Key to Secure Online Voting. Regulation, 2017.
[7] Devulapalli, S., Potti, A., Krishnan, R., & Khan, M. S. Experimental evaluation of unsupervised image retrieval application using hybrid feature extraction by integrating deep learning and handcrafted techniques. Materials Today: Proceedings, 2021.
[8] Daniel, M. Blockchain Technology: The Key to Secure Online Voting. Regulation, 2017.
[9] Yu, B., Liu, J. K., Sakzad, A., Nepal, S., Steinfeld, R., Rimba, P., & Au, M. H. Platform-independent secure blockchain-based voting system. In International Conference on Information Security. Springer, Cham. pp. 369-386, 2018, September.
[10] Sudheer, D., & Krishnan, R. Multiscale Texture Analysis and Color Coherence Vector Based Fea-ture Descriptor for Multispectral Image Retrieval. ASTES J., 4(6), 270-279, 2019.
[11] Chaieb, M., Yousfi, S., Lafourcade, P., & Robbana, R. Verify-your-vote: A verifiable blockchain-based online voting protocol. In European, Mediterranean, and Middle Eastern Conference on Information Systems, pp. 16-30, 2018, October. Springer, Cham.
[12] Devulapalli, S., & Krishnan, R. Synthesized pansharpening using curvelet transform and adaptive neuro-fuzzy inference system. Journal of Applied Remote Sensing, 13(3), 034519, 2019.
Citation
C. Gouthami, G. Vidyulatha, K. Bhavani, G. Akhila, "Blockchain Based Secure Online Voting System," International Journal of Computer Sciences and Engineering, Vol.9, Issue.9, pp.70-74, 2021.
Diabetic Disease Prediction System using Supervised Machine Learning Approaches
Research Paper | Journal Paper
Vol.9 , Issue.9 , pp.75-82, Sep-2021
CrossRef-DOI: https://doi.org/10.26438/ijcse/v9i9.7582
Abstract
In the present study Diabetics is one of the critical diseases which can fall at any group of age and gender. The major causes lead to diabetics is mostly inheritance, in a proper healthy lifestyle, Irregular food habits, stress, and no physical exercise. Prediction of Diabetics is a very important study since it is one of the leading causes of sudden kidney failures, heart attacks, and brain stroke etc. The diabetic patient treatment can be done through patient health history. The Doctor can find hidden information about the patient through healthcare applications and it will be used for effective decision-making for the patient’s health condition. The healthcare industry is also collecting a large amounts of patient health information from different data warehouses. Using these healthcare databases researchers used to extract information for predicting the diabetics of the patient. Researchers are focused on developing software with the help of machine learning methods that can help clinicians to make better decisions about a patient`s health based on their prediction and diagnosis. The main purpose of this program is to diagnose a patient`s diabetes using machine learning methods. A relative study of the various competences of machine learning approaches will be done through a graphical representation of the results. The goal and objective of this project is to predict the chances of diabetics then provide early treatment to patients, which will reduce the life-risk and cost of treatment. For this purpose a probability modeling and machine learning approach like Support Vector Machine algorithm Decision tree algorithm, Naive Bayes algorithm, Logistic regression algorithm are used to predict diabetics.
Key-Words / Index Term
SVM (Support Vector Machine), Decision Tree, Naïve Bayes, Linear Regression, accuracy comparison, machine learning techniques, predicting data values, analysis and results.
References
[1] S.Deepti, S.Dilip Singh, “Prediction of Diabetes using Classification Algorithms”, Procedia Computer Science, Vol.132, pp. 1578-1585, 2018.
[2] Larabi-Marie-Sainte, S, Aburahmah, L, Almohaini, R, & Saba, T, “Current techniques for diabetes prediction review and case study”, Applied Sciences, Vol.9, Issue.21, pp.460, 2019.
[3] Rodríguez-Rodríguez, I Rodríguez, J.V.Woo, W. L, Wei, B, & Pardo-Quiles, D. J, “A Comparison of Feature Selection and Forecasting Machine Learning Algorithms for Predicting Glycaemia in Type 1 Diabetes Mellitus”, Applied Sciences, Vol.11, Issue.4, pp.1742, 2021.
[4] Nedyalkova, M., Madurga, S., & Simeonov, V. Combinatorial “k-means clustering as a machine learning tool applied to diabetes mellitus type 2”, International Journal of Environmental Research and Public Health, Vol.18, Issue 4, pp.1919, 2021.
[5] Daniel, P. “An application of the free moment for the diabetic patients` classification-a pilot study”, E-Health and Bioengineering Conference (EHB), pp. 1-4, IEEE, 2015.
[6] Taser, P.Y, “Application of Bagging and Boosting Approaches Using Decision Tree-Based Algorithms in Diabetes Risk Prediction”, Multidisciplinary Digital Publishing Institute Proceedings Vol. 74, No. 1, p. 6, 2021.
[7] Kanchan, B. D, & Kishor, M.M, “Study of machine learning algorithms for special disease prediction using principal of component analysis”, International conference on global trends in signal processing, information computing and communication (ICGTSPICC), pp. 5-10, IEEE 2016.
[8] Mohanty, K. K, Barik, P. K, Barik, R. C, & Bhuyan, K. C, “An efficient prediction of diabetic from retinopathy using machine learning and signal processing approach”, International Conference on Information Technology (ICIT) pp. 103-108, IEEE, 2019.
[9] Shi, G, Zou, S, & Huang, A, “Glucose-tracking: A postprandial glucose prediction system for diabetic self-management”, 2nd International Symposium on Future Information and Communication Technologies for Ubiquitous HealthCare Ubi-HealthTech pp. 1-9, IEEE 2015.
[10] www.kaggle.com/uciml/pima-indians-diabetes-database.
[11] Godi, B, Viswanadham, S, Muttipati A. S, Samantray O. P, & Gadiraju S. R, “E-healthcare monitoring system using IoT with machine learning approaches”, International Conference on Computer Science, Engineering and Applications (ICCSEA), pp. 1-5, IEEE 2020.
Citation
Ommi Ramu, Brahmaji Godi, Om Prakash Samantray, "Diabetic Disease Prediction System using Supervised Machine Learning Approaches," International Journal of Computer Sciences and Engineering, Vol.9, Issue.9, pp.75-82, 2021.
Quantile Regression Models for Rainfall Data
Research Paper | Journal Paper
Vol.9 , Issue.9 , pp.83-85, Sep-2021
CrossRef-DOI: https://doi.org/10.26438/ijcse/v9i9.8385
Abstract
Rainfall is important for human beings, animals and plants for their survival. Rainfall depends on many variables such as wind speed, temperature, humidity etc. Mathematical modelling of rainfall data is a stochastic process. Several mathematical models based on the probability concept are available. These models help in knowing the probable weekly, monthly or annually rainfall. Over the past decade or so, a number of models have been developed to generate rainfall and runoff. Monthly rainfall and temperature were analyzed using time series analysis. In this paper we are fitted linear regression model and quartile regression model at various values of tau 0.25, 0.5 and 0.75 for North west India (NWI), West Central India (WCI), North East India(NEI), Central North East India (CNEI) and Peninsular India (PI). Best model among fitted four models is choosing by using root mean square error (RMSE) criteria.
Key-Words / Index Term
Rainfall, Quantile Regression, Linear regression, RMSE
References
[1] P.N.M. Momani, “Time Series Analysis Model for Rainfall Data in Jordan: Case Study for Using Time Series Analysis” American Journal of Environmental Sciences, Vol 5, No5, pp.599-604, 2009.
[2] Mostata Dastorani, Mohammad Mirzawad, Mohammad Taghi Dastorani, and Syed Javad Sadatinejad, “Comparative study among different time series models applied to monthly rainfall forecasting in semi-arid climate condition”, Natural Hazards, Vol. 81, No.3, pp. 1811–1827, 2016.
[3] N. Hasan, N. C. Nath and R. I. Rasel, “A support vector regression model for forecasting rainfall,” 2015 2nd International Conference on Electrical Information and Communication Technologies (EICT), pp. 554-559, 2015, doi: 10.1109/EICT.2015.7392014.
[4] Kin C. Luk, J.E. Ball and A. Sharma, “An Application of Artificial Neural Networks for Rainfall Forecasting”, Mathematical and Computer Modelling, Vol.33, No.6-7, pp. 683-693, 2001.
[5] N.Q.Hung, M.S. Babel, S. Weejakul and N.K. Tripathi, “An Artificial Neural Network model for rainfall forecasting in Bangkok, Thailand”, Hydrology and earth sciences, Vol.13, No.8, pp1413-1425, 2009.
[6] M. P. Darji, V. K. Dabhi and H. B. Prajapati, "Rainfall forecasting using neural network: A survey," 2015 International Conference on Advances in Computer Engineering and Applications, pp. 706-713, 2015. doi: 10.1109/ICACEA.2015.7164782.
[7] Kartik Jawanjal, Mayank Modi, Aditya Panchwagh, Nagesh Jadhav, “Weather Prediction Using Indian Almanac Rules,” International Journal of Computer Sciences and Engineering, Vol.9, No.6, pp.19-24, 2021.
[8] M.Varsha, B. Poornima, “A Comparative Study of Ensemble Classifiers for Paddy Blast Disease Prediction Model”, International Journal of Computer Sciences and Engineering, Vol.8, No.9, pp.91-100, 2020.
Citation
S. Damodharan, S. Venkatramana Reddy, B. Sarojamma, "Quantile Regression Models for Rainfall Data," International Journal of Computer Sciences and Engineering, Vol.9, Issue.9, pp.83-85, 2021.