Open Access   Article Go Back

Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models

Harshavardhan Patil1 , Priti Malkhede2 , Shreyash Madake3 , Ashutosh Kokate4 , Yash Bhandure5

  1. Dept. of Artificial Intelligence and Data Science, PES’s Modern College of Engineering, India.
  2. Dept. of Artificial Intelligence and Data Science, PES’s Modern College of Engineering, India.
  3. Dept. of Artificial Intelligence and Data Science, PES’s Modern College of Engineering, India.
  4. Dept. of Artificial Intelligence and Data Science, PES’s Modern College of Engineering, India.
  5. Dept. of Artificial Intelligence and Data Science, PES’s Modern College of Engineering, India.

Section:Research Paper, Product Type: Journal Paper
Volume-12 , Issue-7 , Page no. 24-32, Jul-2024

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v12i7.2432

Online published on Jul 31, 2024

Copyright © Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure, “Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models,” International Journal of Computer Sciences and Engineering, Vol.12, Issue.7, pp.24-32, 2024.

MLA Style Citation: Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure "Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models." International Journal of Computer Sciences and Engineering 12.7 (2024): 24-32.

APA Style Citation: Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure, (2024). Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models. International Journal of Computer Sciences and Engineering, 12(7), 24-32.

BibTex Style Citation:
@article{Patil_2024,
author = {Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure},
title = {Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {7 2024},
volume = {12},
Issue = {7},
month = {7},
year = {2024},
issn = {2347-2693},
pages = {24-32},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=5707},
doi = {https://doi.org/10.26438/ijcse/v12i7.2432}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v12i7.2432}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=5707
TI - Precise Human Activity Recognition using Convolutional Neural Network and Deep Learning Models
T2 - International Journal of Computer Sciences and Engineering
AU - Harshavardhan Patil, Priti Malkhede, Shreyash Madake, Ashutosh Kokate, Yash Bhandure
PY - 2024
DA - 2024/07/31
PB - IJCSE, Indore, INDIA
SP - 24-32
IS - 7
VL - 12
SN - 2347-2693
ER -

VIEWS PDF XML
135 183 downloads 57 downloads
  
  
           

Abstract

Human Activity Recognition (HAR) plays a pivotal role in various domains, ranging from healthcare to surveillance and robotics. This paper offers a comprehensive detail of Convolutional Neural Network (CNN)-based methodologies in HAR, emphasizing their efficiency in accurately recognizing human activities from video data. We used the UCF50 dataset, which contains videos of 50 different human activities, making it a suitable benchmark for evaluating CNN-based HAR models. The study investigates the utilization of CNNs for feature extraction and classification in HAR, focusing on techniques such as frame extraction, data preprocessing, and model architectures. Detailed analysis of convolutional layers, pooling layers, and activation functions within CNNs showcases their ability to capture intricate spatial and temporal features. The research also delves into the benefits of data augmentation and normalization in enhancing model performance and generalization. The findings highlight the significant advantages of CNNs in capturing spatial information and improving accuracy in HAR tasks, making them highly effective for real-world applications across various domains.

Key-Words / Index Term

Convolutional Neural Network, Human Activity Recognition, UCF50 Dataset, Data Preprocessing, Frame Extraction, Model Architecture, Feature Extraction

References

[1] X. Liang, H. -B. Zhang, Y. -X. Zhang and J. -L. Huang, "JTCR: Joint Trajectory Character Recognition for human action recognition," 2019 IEEE Eurasia Conference on IOT, Communication and Engineering (ECICE), Yunlin, Taiwan, pp.350-353, 2019.
[2] R. R. Koli and T. I. Bagban, "Human Action Recognition Using Deep Neural Networks," 2020 Fourth World Conference on Smart Trends in Systems, Security and Sustainability (WorldS4), London, UK, pp.376-380, 2020.
[3] W. Ma and S. Liang, "Human-Object Relation Network For Action Recognition In Still Images," 2020 IEEE International Conference on Multimedia and Expo (ICME), London, UK, pp.1-6, 2020.
[4] M. M. Hossain Shuvo, N. Ahmed, K. Nouduri and K. Palaniappan, "A Hybrid Approach for Human Activity Recognition with Support Vector Machine and 1D Convolutional Neural Network," 2020 IEEE Applied Imagery Pattern Recognition Workshop (AIPR), Washington DC, DC, USA, pp.1-5, 2020.
[5] D. R. Beddiar and B. Nini, "Vision based abnormal human activities recognition: An overview," 2017 8th International Conference on Information Technology (ICIT), Amman, Jordan, pp.548-553, 2017.
[6] A. M. F and S. Singh, "Computer Vision-based Survey on Human Activity Recognition System, Challenges and Applications," 2021 3rd International Conference on Signal Processing and Communication (ICPSC), Coimbatore, India, pp.110-114, 2021.
[7] K. Raja, I. Laptev, P. Pérez and L. Oisel, "Joint pose estimation and action recognition in image graphs," 2011 18th IEEE International Conference on Image Processing, Brussels, Belgium, 2011.
[8] H. Nishimura, Y. Ozasa, Y. Ariki and M. Nakano, "Object Recognition by Integrated Information Using Web Images," 2013 2nd IAPR Asian Conference on Pattern Recognition, Naha, Japan, pp.657-661, 2013.
[9] C. Wang, "A learning-based human facial image quality evaluation method in video-based face recognition systems," 2017 3rd IEEE International Conference on Computer and Communications (ICCC), Chengdu, China, pp.1632-1636, 2017.
[10] D. Meng, X. Peng, K. Wang and Y. Qiao, "Frame Attention Networks for Facial Expression Recognition in Videos," 2019 IEEE International Conference on Image Processing (ICIP), Taipei, Taiwan, pp.3866-3870, 2019.
[11] Souradeep Ghosh, "Proposal of a Real-time American Sign Language Detector using MediaPipe and Recurrent Neural Network", International Journal of Computer Sciences and Engineering, Vol.9, Issue.7, pp.46-52, 2021.
[12] Chalavadi Sravanth, Gadde Harshavardhan, Kamineni. Kavya, Shaik Mohammad Akbar, Ch.M.H. Sai Baba, "Real-Time Human Detection in Video Surveillance", International Journal of Computer Sciences and Engineering, Vol.9, Issue.1, pp.44-50, 2021.