Open Access   Article Go Back

Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents

Shaina 1 , Naresh Kumar2

Section:Research Paper, Product Type: Journal Paper
Volume-7 , Issue-6 , Page no. 314-318, Jun-2019

CrossRef-DOI:   https://doi.org/10.26438/ijcse/v7i6.314318

Online published on Jun 30, 2019

Copyright © Shaina, Naresh Kumar . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.

View this paper at   Google Scholar | DPI Digital Library

How to Cite this Paper

  • IEEE Citation
  • MLA Citation
  • APA Citation
  • BibTex Citation
  • RIS Citation

IEEE Style Citation: Shaina, Naresh Kumar, “Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.6, pp.314-318, 2019.

MLA Style Citation: Shaina, Naresh Kumar "Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents." International Journal of Computer Sciences and Engineering 7.6 (2019): 314-318.

APA Style Citation: Shaina, Naresh Kumar, (2019). Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents. International Journal of Computer Sciences and Engineering, 7(6), 314-318.

BibTex Style Citation:
@article{Kumar_2019,
author = {Shaina, Naresh Kumar},
title = {Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {6 2019},
volume = {7},
Issue = {6},
month = {6},
year = {2019},
issn = {2347-2693},
pages = {314-318},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4550},
doi = {https://doi.org/10.26438/ijcse/v7i6.314318}
publisher = {IJCSE, Indore, INDIA},
}

RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i6.314318}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4550
TI - Detection and correcting the wrong words from Hindi, English and Punjabi Text Documents
T2 - International Journal of Computer Sciences and Engineering
AU - Shaina, Naresh Kumar
PY - 2019
DA - 2019/06/30
PB - IJCSE, Indore, INDIA
SP - 314-318
IS - 6
VL - 7
SN - 2347-2693
ER -

VIEWS PDF XML
329 318 downloads 178 downloads
  
  
           

Abstract

Spell checking is a very important phase of any document processing system and Natural Language Processing. Spell Checking is a process to find the incorrect spells in a text document and to correct that particular incorrect spelling. There are various spell checking systems for various languages Like Hindi, Punjabi, English, French, Germen that can detect and correct the spell from a particular document. In this paper, we proposed a hybrid algorithm to detect and correct misspelled words from a text document written in three languages Hindi, English and Punjabi. Hybrid approach is a combination of various approaches like Dictionary lookup approach, Edit Distance Approach, Rule based approach and N-Gram approach. Proposed system can detect and correct the misspelled words from three given languages. A collision detection and correction system for alternates for misspell words has been also provided. Performance of proposed system is checked on various inputs collected from various books, websites etc. Results of the proposed system are evaluated on these outputs which have accuracy values higher than that of existing system.

Key-Words / Index Term

Spell Checking; Hybrid approach for Spell Checking; N-Gram Approach; Rule Based Approach; Edit distance approach

References

[1] Ritika Mishra, Navjot Kaur, Design and Implementation of Online Punjabi Spell Checker Based on Dynamic Programming, Volume 3, Issue 8, August 2013, ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering
[2] Neha Gupta, Pratistha Mathur, Spell Checking Techniques in NLP: A Survey, Volume 2, Issue 12, December 2012 , ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering
[3] Baljeet Kaur, Review On Error Detection and Error Correction Techniques in NLP: Volume 4, Issue 6, June 2014 ISSN: 2277 128X, International Journal of Advanced Research in Computer Science and Software Engineering.
[4] Rupinderdeep Kaur and Parteek Bhatia, “Design and Implementation of SUDHAAR-Punjabi Spell Checker,” International Journal of Information and Telecommunication Technology, Vol. 1, Issue 15 May, 2010.
[5] S. Dasgupta, C.H. Papadimitriou, and U.V. Vazirani, `Algorithms`, p173, available at http:/ / www.cs.berkeley.edu/ ~vazirani/ algorithms.html.
[6] Neha Gupta &PratisthaMathur,“Spell Checking Techniques in NLP: A Survey,” International Journal of Advanced Research in Computer Science and Software Engineering, Vol. 2, Issue 12, December 2012.
[7] Gurpreet Singh Lehal, “Design and Implementation of Punjabi Spell Checker”, International Journal of Systemics, Cybemetics and Infomatics, 2007.
[8] Amit Sharma & Pulkit Jain, “Hindi Spell Checker”, Indian Institute of Technology Kanpur, April 17, 2013.
[9] MeenuBhagat, (2007), “Spelling Error Pattern Analysis of Punjabi Typed Text”, Thesis Report, Thapar University, Patiala.
[10] F.J. Damerau (1964), “A Technique for Error Detection and Correction of Spelling Errors”, Communication ACM, pp. 171-176.
[11] Monisha Das, S. Borgohain, JuliGogoi, S. B. Nair (2002), “Design and Implementation of a Spell Checker for Assamese”,lec, pp. 156, Language Engineering Conference (LEC’02).
[12] Morris, Robert & Cherry, Lorinda L, “Computer Detection of typographic errors”, IEEE Trans Professional Communications, vol. PC-18, no. 1, pp 54-64, March 1975.
[13] R.E. Gorin (1971), “SPELL: A spelling checking and correction program”, Online documentation for the DEC-10 computer.
[14] K. Kukich (1992) “Techniques for automatically correcting words in text”. ACM Computing Surveys. 24(4): 377-439.
[15] Peterson James (1980), “Computer Programs for Detecting and Correcting Spelling Errors”, Computing Practices, Communications of the ACM.
[16] G S Lehal & MeenuBhagat, “Spelling Error Pattern Analysis of Punjabi Typed Text”, In Proceedings of International Symposum on Machine Translation, NLP and TSS, pp. 128-141, 2007.
[17] Jesus Vilares& Manuel Vilares, “Managing Misspelled Queries in IR Application,” Issue 8, October 2010.
[18] Youssef Bassil& Mohammad Alwani, “Context-sensitive Spelling Correction using Google Web IT 5-Gram Information,” Department of Computer and Information Science, Vol. 5,No.3, May 2012.G. Eason, B. Noble, and I.N. Sneddon, “On certain integrals of Lipschitz-Hankel type involving products of Bessel functions,” Phil. Trans. Roy. Soc. London, vol. A247, pp. 529-551, April 1955.