Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network
Namrata Choudhary1 , Kirti Jain2
Section:Research Paper, Product Type: Journal Paper
Volume-7 ,
Issue-5 , Page no. 718-723, May-2019
CrossRef-DOI: https://doi.org/10.26438/ijcse/v7i5.718723
Online published on May 31, 2019
Copyright © Namrata Choudhary, Kirti Jain . This is an open access article distributed under the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.
View this paper at Google Scholar | DPI Digital Library
How to Cite this Paper
- IEEE Citation
- MLA Citation
- APA Citation
- BibTex Citation
- RIS Citation
IEEE Style Citation: Namrata Choudhary, Kirti Jain, “Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network,” International Journal of Computer Sciences and Engineering, Vol.7, Issue.5, pp.718-723, 2019.
MLA Style Citation: Namrata Choudhary, Kirti Jain "Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network." International Journal of Computer Sciences and Engineering 7.5 (2019): 718-723.
APA Style Citation: Namrata Choudhary, Kirti Jain, (2019). Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network. International Journal of Computer Sciences and Engineering, 7(5), 718-723.
BibTex Style Citation:
@article{Choudhary_2019,
author = {Namrata Choudhary, Kirti Jain},
title = {Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network},
journal = {International Journal of Computer Sciences and Engineering},
issue_date = {5 2019},
volume = {7},
Issue = {5},
month = {5},
year = {2019},
issn = {2347-2693},
pages = {718-723},
url = {https://www.ijcseonline.org/full_paper_view.php?paper_id=4305},
doi = {https://doi.org/10.26438/ijcse/v7i5.718723}
publisher = {IJCSE, Indore, INDIA},
}
RIS Style Citation:
TY - JOUR
DO = {https://doi.org/10.26438/ijcse/v7i5.718723}
UR - https://www.ijcseonline.org/full_paper_view.php?paper_id=4305
TI - Video Text Detection and Recognition Based on a Transferred Deep Convolutional Neural Network
T2 - International Journal of Computer Sciences and Engineering
AU - Namrata Choudhary, Kirti Jain
PY - 2019
DA - 2019/05/31
PB - IJCSE, Indore, INDIA
SP - 718-723
IS - 5
VL - 7
SN - 2347-2693
ER -
VIEWS | XML | |
275 | 175 downloads | 130 downloads |
Abstract
The text presented in videos contains important information for content analysis, indexing, and retrieval of videos. The key technique for extracting this information is to find, verify, and recognize video text in various languages and fonts against complex backgrounds. In this paper, we propose a novel method that transferred deep convolutional neural networks for detecting and recognizing video text. We partition the candidate text regions into candidate text lines by projection analysis using two alternative methods. We develop a novel fuzzy c-means clustering-based separation algorithm to obtain a clean text layer from complex backgrounds so that the text is correctly recognized by commercial optical character recognition software. The proposed method is robust and has good performance on video text detection and recognition, which was evaluated on three publicly available test data sets and on the high-resolution test data set we constructed.
Key-Words / Index Term
Video Text Detection, Recognition, Transferred convolutional neural network, Fuzzy c-means Clustering.
References
[1] Y. A. Aslandogan and C. T. Yu, ‘‘Techniques and systems for image and video retrieval,’’ IEEE Trans. Knowl. Data Eng., vol. 11, no. 1, pp. 56–63, Jan. 1999.
[2] H. Bhaskar and L. Mihaylova, ‘‘Combined feature-level video indexing using block-based motion estimation,’’ in Proc. Conf. Inf. Fusion, Jul. 2010, pp. 1–8.
[3] W. Hu, N. Xie, L. Li, X. Zeng, and S. Maybank, ‘‘A survey on visual content-based video indexing and retrieval,’’ IEEE Trans. Syst., Man, Cybern. C, Appl. Rev., vol. 41, no. 6, pp. 797–819, Nov. 2011.
[4] K. Jung, K. I. Kim, and A. K. Jain, ‘‘Text information extraction in images and video: A survey,’’ Pattern Recognit., vol. 37, no. 5, pp. 977–997, 2004.
[5] M. Khodadadi and A. Behrad, ‘‘Text localization, extraction and inpainting in color images,’’ in Proc. Iranian Conf. Elect. Eng., May 2012, pp. 1035–1040.
[6] A. Mosleh, N. Bouguila, and A. B. Hamza, ‘‘Automatic inpainting scheme for video text detection and removal,’’ IEEE Trans. Image Process., vol. 22, no. 11, pp. 4460–4472, Nov. 2013.
[7] M. Cai, J. Song, and M. R. Lyu, ‘‘A new approach for video text detection,’’ in Proc. Int. Conf. Image Process., vol. 1, Sep. 2002, pp. I-117–I-120.
[8] T. Yusufu, Y. Wang, and X. Fang, ‘‘A video text detection and tracking system,’’ in Proc. IEEE Int. Symp. Multimedia, Dec. 2013, pp. 522–529.
[9] X. Huang, ‘‘A novel video text extraction approach based on Log–Gabor filters,’’ in Proc. Int. Congr. Image Signal Process., vol. 1, Oct. 2011, pp. 474–478.
[10] P. Shivakumara, W. Huang, and C. L. Tan, ‘‘Efficient video text detection using edge features,’’ in Proc. Int. Conf. Pattern Recognit., Dec. 2008, pp. 1–4.
[11] X. Zhao, K.-H. Lin, Y. Fu, Y. Hu, Y. Liu, and T. S. Huang, ‘‘Text from corners: A novel approach to detect text and caption in videos,’’ IEEE Image Process., vol. 20, no. 3, pp. 790–799, Mar. 2011.