Multi-script-oriented text detection and recognition in video/scene/born digital images

Raghunandan, K. S. and Shivakumara, P. and Sangheeta Roy and Hemantha Kumar, G. and Pal, Umapada and Lu, Tong (2019) Multi-script-oriented text detection and recognition in video/scene/born digital images. IEEE Transactions on Circuits and Systems for Video Technology, 29 (4). pp. 1145-1162. ISSN 1558-2205

Full text not available from this repository. (Request a copy)

Abstract

Achieving good text detection and recognition results for multi-script-oriented images is a challenging task. First, we explore bit plane slicing in order to utilize the advantage of the most significant bit information to identify text components. A new iterative nearest neighbor symmetry is then proposed based on shapes of convex and concave deficiencies of text components in bit planes to identify candidate planes. Further, we introduce a new concept called mutual nearest neighbor pair components based on gradient direction to identify representative pairs of texts in each candidate bit plane. The representative pairs are used to restore words with the help of edge image of the input one, which results in text detection results (words). Second, we propose a new idea by fixing window for character components of arbitrary oriented words based on angular relationship between sub-bands and a fused band. For each window, we extract features in contourlet wavelet domain to detect characters with the help of an SVM classifier. Further, we propose to explore HMM for recognizing characters and words of any orientation using the same feature vector. The proposed method is evaluated on standard databases such as ICDAR, YVT video, ICDAR, SVT, MSRA scene data, ICDAR born digital data, and multi-lingual data to show its superiority to the state of the art methods.

Item Type: Article
Uncontrolled Keywords: feature extraction;hidden Markov models;image classification;support vector machines;text detection;video signal processing;multiscript-oriented text detection;good text detection;multiscript-oriented images;bit plane slicing;text components;iterative nearest neighbor symmetry;concave deficiencies;bit planes;mutual nearest neighbor pair components;representative pairs;edge image;text detection results;character components;arbitrary oriented words;ICDAR born digital data;multilingual data;convex deficiencies;Text recognition;Digital images;Feature extraction;Image edge detection;Shape;Character recognition;Bit plane slicing;convex and concave deficiencies;wavelet sub-bands;arbitrarily-oriented text detection and recognition;hidden Markov model;multi-lingual text detection and recognition
Subjects: D Physical Science > Computer Science
Divisions: Department of > Computer Science
Depositing User: C Swapna Library Assistant
Date Deposited: 07 Mar 2020 09:19
Last Modified: 11 Mar 2020 05:49
URI: http://eprints.uni-mysore.ac.in/id/eprint/11618

Actions (login required)

View Item View Item