ABSTRACT
Text extraction plays a major role in finding vital and valuable information. Text extraction involves detection, localization, tracking, binarization, extraction, enhancement and recognition of the text in a given image. Text characters are difficult to detect and recognize because they vary in size, font, style, orientation, alignment and contrast, and because they may appear over complex colored or textured backgrounds. Owing to the rapid growth of available multimedia documents and the growing requirement for information identification, indexing and retrieval, much research has been done on text extraction from images. Several techniques have been developed for extracting text from an image; the proposed methods are based on morphological operators, wavelet transforms, artificial neural networks, skeletonization operations, edge detection algorithms, histogram techniques, etc. All these techniques have their benefits and restrictions. This article discusses various schemes proposed earlier for extracting text from an image, and provides a performance comparison of several existing methods proposed by researchers.
KEYWORDS
Text Extraction, Document Text Images, Caption Text Images, Scene Text, Heterogeneous Images.
1. INTRODUCTION
Text extraction from images is concerned with extracting relevant text data from a collection of images. The rapid development of digital technology has resulted in the digitization of all categories of material, and a wealth of resources is now available in electronic form. Many existing paper-based collections, such as historical manuscripts, records, books, journals, scanned documents, book covers, video images, maps, pamphlets, posters, broadsides, newspapers, micro facsimiles, microfilms, university archives, slides and films, book plates, pictures, paintings, graphic materials, coins and currency, stamps, magazines, clipping files, educational TV programs, business cards, advertisements, web pages, mixed text-picture-graphics regions, etc., are converted to images. These images present many challenging research issues in text extraction and recognition. Text extraction from images has many useful applications: document analysis, detection of vehicle license plates, analysis of articles with tables, maps, charts and diagrams, keyword-based image search, identification of parts in industrial automation, content-based retrieval, name plates, object identification, street signs, text-based video indexing, video content analysis, page segmentation, document retrieval, address block location, etc.
Images can be broadly classified into document images, caption text images and scene text images. Figures 1-3 show some examples of text in images. A document image (Figures 1a, 1b) usually contains text and a few graphics components. Document images are acquired by scanning journals, printed documents, degraded documents, handwritten historical documents, book covers, etc. The text may appear in a virtually unlimited number of fonts, styles, alignments, sizes, shapes, colors, etc. Extracting text from documents with text on a complex color background is difficult because of the complexity of the background and the mixing of foreground text colors with background colors.
Figure 1a: Document Text Image (courierexpressandpostal.blogspot.com); Figure 1b: Colored Text Image (athleticaid.com)
Caption text is also known as overlay text or cut-line text. Caption text (Figure 2) is artificially superimposed on the video/image at the time of editing, and it usually describes or identifies the subject of the image/video content. The superimposed text is a powerful source of high-level semantics: these text occurrences can be detected, segmented and recognized automatically for indexing, retrieval and summarization. Extracting the superimposed text in sports video is very useful for creating sports summaries, highlights, etc. Caption text may be moving, rotating, growing or shrinking, and may have arbitrary orientation and size.
Scene text (Figure 3) appears within the scene that is captured by the recording device, i.e. text that is present in the scene when the image or video is shot. Scene text occurs naturally as a part of the scene and carries important semantic information, such as advertisements with artistic fonts, names of streets, institutes and shops, road signs, traffic information, board signs, nameplates, food containers, cloth, billboards, banners, text on vehicles, etc. Scene text extraction can be used for detecting text-based landmarks, vehicle license detection/recognition and object identification, rather than for general indexing and retrieval. Scene text is difficult to detect and extract because it may appear in a virtually unlimited number of poses, sizes, shapes and colors, at low resolution, against complex backgrounds, under non-uniform or uneven lighting with shadowing and blurring, with complex movement and transformation, unknown layout, and variation in font style, size, orientation and alignment.
Due to the very fast growth of available multimedia documents and growing requirements, studies in the field of pattern recognition show a great amount of interest in efficient extraction of text, indexing and retrieval from digital video/document images, and intensive research projects on text extraction from images have been carried out by many scholars. Text extraction involves detection, localization, tracking, binarization, extraction, enhancement and recognition of the text in a given image. Several techniques have been developed for extracting text from an image; the proposed methods are based on morphological operators, wavelet transforms, artificial neural networks, skeletonization operations, edge detection algorithms, histogram techniques, etc. The methods cited in this paper on text extraction from images are classified according to the different types of images.
2. REVIEW
A large number of approaches have been proposed for text extraction from images. The existing work on text extraction from images can be classified according to different criteria. This article classifies the approaches according to the different types of image, analyzes the algorithms and discusses their performance evaluation. The performance measures are presented in Tables 1-4. The purpose of the survey is to document the remarkable growth of text extraction techniques.
Zhan et.al [1] proposed a robust split-and-merge approach for text segmentation in images, in which the image was first processed to smooth the text blocks and remove noise. These image blocks were split into connected components, and non-text connected components were eliminated by a component filtering procedure. The remaining connected components were merged into several text layers using the K-means clustering algorithm, and a set of appropriate constraints was applied to find the real text layer. Finally, the text layer was refined through a post-processing step.
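As an illustration of the layer-merging step, a minimal sketch is given below. It is not the authors' code: it assumes OpenCV and scikit-learn, uses the mean color of each component as the clustering feature, and the area filter and k = 3 are illustrative choices.

```python
import cv2
import numpy as np
from sklearn.cluster import KMeans

def cluster_components_into_layers(image_bgr, binary_mask, k=3):
    """Merge connected components into k candidate text layers (sketch)."""
    # Label connected components in the binary mask (8-connectivity).
    num, labels, stats, _ = cv2.connectedComponentsWithStats(binary_mask,
                                                             connectivity=8)
    feats, ids = [], []
    for i in range(1, num):                     # label 0 is the background
        if stats[i, cv2.CC_STAT_AREA] < 10:     # crude non-text filtering
            continue
        feats.append(image_bgr[labels == i].mean(axis=0))  # mean BGR color
        ids.append(i)
    # K-means merges the remaining components into k color layers.
    layer_of = KMeans(n_clusters=k, n_init=10).fit_predict(np.array(feats))
    return dict(zip(ids, layer_of))             # component id -> layer index
```

A real implementation would then test each layer against the paper's constraints to select the true text layer.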
Thai et.al [2] described an approach for effective text extraction from graphical document images. The algorithm used the Morphological Component Analysis (MCA) algorithm, an advancement of the sparse representation framework, with two appropriately chosen discriminative overcomplete dictionaries, one based on the undecimated wavelet transform and the other on the curvelet transform. This method overcame the problem of touching between text and graphics, and was also insensitive to different font styles, sizes and orientations.
S. Audithan et.al [3] formulated an efficient and computationally fast method to extract text regions from documents. They proposed the Haar discrete wavelet transform to detect edges of candidate text regions; non-text edges were removed using a thresholding technique. They used a morphological dilation operator to connect the isolated candidate text edges, and a line feature vector graph was then generated based on the edge map. The method exploited an improved Canny edge detector to detect text pixels, and stroke information was extracted from the spatial distribution of edge pixels. Finally, text regions were generated and filtered according to line features.
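The Haar-based edge detection step can be sketched as follows, assuming PyWavelets and OpenCV; the combination of the detail sub-bands, the fixed threshold and the 5 x 5 structuring element are illustrative guesses rather than the parameters of [3].

```python
import cv2
import numpy as np
import pywt

def haar_text_edge_map(gray, thresh=30.0):
    """Candidate text edges from Haar detail sub-bands (illustrative sketch)."""
    # One-level Haar DWT: cH, cV, cD hold horizontal/vertical/diagonal detail.
    _, (cH, cV, cD) = pywt.dwt2(gray.astype(np.float32), 'haar')
    detail = np.sqrt(cH ** 2 + cV ** 2 + cD ** 2)     # combined edge strength
    edges = (detail > thresh).astype(np.uint8) * 255  # drop weak non-text edges
    edges = cv2.resize(edges, (gray.shape[1], gray.shape[0]))
    # Dilation connects isolated candidate text edges, as in [3].
    kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (5, 5))
    return cv2.dilate(edges, kernel)
```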
Grover et.al [4] described an approach to detect text in documents in which the text was embedded in complex colored backgrounds. They proposed a simple edge-based feature to perform this task. The image was converted to gray scale by forming a weighted sum of the R, G and B components. Edge detection was then performed on the gray-scale image by convolving it with Sobel masks, separately for horizontal and vertical edges. Convolution was followed by elimination of non-maxima and thresholding of weak edges. Next, the edge image was divided into small non-overlapping blocks of m x m pixels, where m depends on the image resolution, and each block was classified as text or non-text using a pre-defined threshold.
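A hedged sketch of this block classification idea follows; the percentile-based edge threshold and the density threshold stand in for the pre-defined thresholds of [4], which are not specified here.

```python
import cv2
import numpy as np

def classify_text_blocks(gray, m=16, density_thresh=0.12):
    """Label each m x m block as text/non-text by Sobel edge density (sketch)."""
    gx = cv2.Sobel(gray, cv2.CV_64F, 1, 0, ksize=3)   # horizontal edges
    gy = cv2.Sobel(gray, cv2.CV_64F, 0, 1, ksize=3)   # vertical edges
    mag = np.hypot(gx, gy)
    strong = mag > np.percentile(mag, 90)             # threshold weak edges
    h, w = gray.shape
    text_mask = np.zeros((h // m, w // m), dtype=bool)
    for by in range(h // m):
        for bx in range(w // m):
            block = strong[by * m:(by + 1) * m, bx * m:(bx + 1) * m]
            text_mask[by, bx] = block.mean() > density_thresh  # edge-dense = text
    return text_mask
```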
P. Nagabhushan et.al [5] proposed a novel approach to extract text from color document images with complex backgrounds. The method used the Canny edge detector to detect edges. When a dilation operation was performed on the edge image, it created holes in most of the connected components that correspond to character strings; connected components without holes were eliminated. Other non-text components were eliminated by computing and analyzing the standard deviation of each connected component. An unsupervised local thresholding was devised to perform foreground segmentation in the detected text regions. Finally, noisy text regions were identified and reprocessed to further enhance the quality of the retrieved foreground.
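The hole test on dilated edge components can be illustrated with OpenCV's contour hierarchy. This is a plausible reconstruction, not the code of [5], and it assumes OpenCV 4's findContours signature.

```python
import cv2
import numpy as np

def keep_components_with_holes(edge_img):
    """Keep dilated edge components that enclose holes (illustrative sketch)."""
    dilated = cv2.dilate(edge_img, np.ones((3, 3), np.uint8))
    # RETR_CCOMP gives a two-level hierarchy: outer contours and their holes.
    contours, hierarchy = cv2.findContours(dilated, cv2.RETR_CCOMP,
                                           cv2.CHAIN_APPROX_SIMPLE)
    out = np.zeros_like(dilated)
    if hierarchy is None:
        return out
    for i in range(len(contours)):
        nxt, prev, child, parent = hierarchy[0][i]
        # A top-level contour with a child contour encloses a hole,
        # which is typical of character strings after dilation.
        if parent == -1 and child != -1:
            cv2.drawContours(out, contours, i, 255, thickness=-1)
    return out
```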
A robust and efficient algorithm for automatic text extraction from colored book and journal cover sheets, based on the wavelet transform, was proposed by Davod et.al [6]. A dynamic threshold was used to detect edges from the detail wavelet coefficients, and further effective edges were obtained by blurring the approximation coefficients with an alternative heuristic thresholding. A Region of Interest (ROI) technique was applied and finally the text was extracted. They evaluated the performance of their algorithm on 80 pictures collected from the internet.
Another algorithm, for automatic text location and identification on colored book and journal covers, was proposed by Karin et.al [7]. The number of colors was reduced by applying a clustering algorithm. Text candidates were located using a top-down analysis based on successive splitting in the horizontal and vertical directions. A bottom-up analysis detected homogeneous regions using a region growing method, and a grouping step was applied to find subsets of regions. Finally, text regions and non-text regions were distinguished.
The method also detected nonlinear text regions, and could be extended to text extraction from images of other languages with little modification.
Pan et.al [12] proposed a novel hybrid method in which a text region detector was designed to generate a text confidence map. A local binarization approach was used to segment the text components using the text confidence map. A Conditional Random Field (CRF) model, solved by minimum classification error (MCE) learning and a graph cuts inference algorithm, was used to label components as text or non-text. A learning-based method then grouped the text components into text lines by building neighbouring components into a minimum spanning tree (MST) and cutting off interline edges with an energy minimization model.
Fabrizio et.al [13] offered a region-based approach that first isolated letters and then grouped them to restore words. The process was based on a new segmentation method built on a morphological operator, called Toggle Mapping Morphological Segmentation (TMMS), and a classification step based on a combination of multiple SVM classifiers. The training database was composed of 32,400 examples extracted from various urban images, and different configurations of classifiers were tested to obtain the highest classification accuracy.
Kohei et.al [14] introduced a new approach to detect and extract text from commercial screenshot images. Their approach implemented an edge-based method and a connected component labeling method known as blob extraction. A combination of a homogeneity edge detection filter and an appropriate threshold separated the text from the image.
A method for localizing text regions within scene images was introduced by Luz et.al [15]. A set of potential text regions was extracted from the input image using morphological filters. Connected Components (CCs) were identified using ultimate attribute openings and closings, and a subset of text regions was selected after combining some of the CCs. A decision tree classifier was used to distinguish text regions from non-text regions.
Shivakumara et.al [16] proposed a new method based on Maximum Color Difference (MCD) and a Boundary Growing Method (BGM) for the detection of multi-oriented handwritten scene text in video. The average of the RGB channels of the original frame was calculated to sharpen the text edges and increase the contrast of text pixels, and the maximum color difference was computed to increase the gap between text and non-text pixels. Text clusters were obtained by the K-means clustering algorithm; these clusters were used to obtain the text candidates and also helped in eliminating false positives. To fix the boundary of handwritten text, a Boundary Growing Method (BGM) based on the nearest-neighbour concept was used; it assumes that characters and words appear with regular spacing in one direction, and it can grow along the orientation of the text. The concept of intrinsic and extrinsic edges was used to eliminate false positives.
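One plausible reading of the sharpening and MCD computation is written below as a short NumPy sketch; the exact formulas of [16] may differ.

```python
import numpy as np

def average_and_mcd(frame_bgr):
    """Averaged frame and per-pixel Maximum Color Difference (illustrative)."""
    img = frame_bgr.astype(np.float32)
    avg = img.mean(axis=2)                    # channel average sharpens text edges
    mcd = img.max(axis=2) - img.min(axis=2)   # widens the text/non-text gap
    return avg, mcd
```

K-means clustering of the MCD values would then separate the text clusters from the background, as described above.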
Shyama et.al [17] projected a text segmentation technique to extract text from any type of camera-grabbed frame image or video. A colour-based segmentation methodology was used to link consecutive pixels in the same direction by exploiting general text properties. Light Edge Enhancement (LEE) was used to find a set of consecutive candidate points and enhance the edges between them. Next, Heavy Edge Enhancement (HEE) was applied to remove or reduce motion blur from camera image sequences. This helped to treat camera images and video frames in the same manner.
The goal of the approach of Min et.al [19] was to detect both low-contrast and high-contrast artificial text, invariant to language and font size, in complex-background video images. The Sobel color edge detector was applied to detect edges. Non-text points were eliminated by applying a low threshold determined from the histogram of edge strength, followed by selective local thresholding. Further enhancement was done using an Edge-Strength Smoothing (ESS) operator and an Edge-Clustering-Power (ECP) operator. To locate the text regions, coarse-to-fine (horizontal and vertical) projection was used.
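The coarse-to-fine projection used for locating text regions can be sketched as follows; the row-density threshold is an illustrative assumption.

```python
import numpy as np

def horizontal_projection_bands(edge_map, row_thresh=0.05):
    """Coarse horizontal projection: group edge-dense rows into text bands."""
    profile = edge_map.astype(bool).mean(axis=1)    # per-row edge density
    rows = profile > row_thresh
    bands, start = [], None
    for y, on in enumerate(rows):
        if on and start is None:
            start = y                               # band begins
        elif not on and start is not None:
            bands.append((start, y - 1))            # band ends
            start = None
    if start is not None:
        bands.append((start, len(rows) - 1))
    # The fine step repeats this vertically inside each band.
    return bands
```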
Yih-Ming et.al [20] proposed a scheme to extract caption text from various sports videos. An iterative temporal averaging approach was used in the caption extraction process, and spatial-image analysis was performed to improve image quality and reduce noise. The threshold value for binarization was determined from the global mean and standard deviation of the gray levels of the averaged video image. Binarization may lead to holes and disconnectivity in video captions with blurred backgrounds; this was cured by morphological processing. Geometrical features were extracted from each connected component to identify the captions, and a model-based segmentation approach was applied to accurately extract the caption contents.
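The temporal averaging and mean/std binarization steps reduce to a few lines; the weight k on the standard deviation is an illustrative parameter, as [20]'s exact rule is not reproduced here.

```python
import numpy as np

def temporal_average(frames):
    """Average a list of gray-scale frames: static captions stay crisp,
    while the moving background blurs out."""
    return np.mean(np.stack(frames), axis=0)

def binarize_caption(avg_frame, k=1.0):
    """Global threshold from the mean and std of the averaged frame."""
    t = avg_frame.mean() + k * avg_frame.std()
    return (avg_frame > t).astype(np.uint8) * 255
```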
A technique for detecting caption text in videos for global indexing purposes, based on a hierarchical region-based image model, was proposed by Leon et.al [21]. A Binary Partition Tree (BPT) was created by combining color and contour homogeneity criteria. Texture descriptors were estimated on the full image by means of a multi-resolution analysis using a Haar wavelet decomposition, to highlight the candidate regions in the BPT. The largest connected component was selected as the area of support for computing geometric descriptors. Region evaluation was carried out by combining region-based texture information and geometric features, and the final caption text nodes were selected by analyzing the various subtrees of the BPT.
Zhong et.al [22] introduced a method to automatically localize captions in JPEG compressed
images and the I-frames of MPEG compressed video. They proposed a texture-based caption
text localization method that operates directly in the Discrete Cosine Transform (DCT) domain
for MPEG video or JPEG images. The DCT coefficients which capture the directionality and
periodicity of local image blocks were used as texture measures to identify text regions.
Morphological operations and connected component analysis were performed to remove noisy
blocks and merge disconnected text blocks.
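The DCT-domain texture measure can be illustrated on decoded pixel blocks. Note that [22] reads the coefficients directly from the compressed stream, whereas this sketch recomputes them with OpenCV; the block size and energy measure are illustrative.

```python
import cv2
import numpy as np

def dct_texture_energy(gray, b=8):
    """AC-coefficient energy of each b x b DCT block; caption regions are
    high-energy because of their dense strokes (illustrative sketch)."""
    h, w = gray.shape
    energy = np.zeros((h // b, w // b), dtype=np.float32)
    for by in range(h // b):
        for bx in range(w // b):
            block = gray[by * b:(by + 1) * b, bx * b:(bx + 1) * b]
            coeffs = cv2.dct(block.astype(np.float32))
            coeffs[0, 0] = 0.0                     # discard the DC term
            energy[by, bx] = np.abs(coeffs).sum()  # AC energy ~ local texture
    return energy
```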
An effective approach to extracting captions from videos was projected by Liu et.al [23], using both spatial and temporal localization. Candidate caption regions were detected by exploiting the distribution of corners in the spatial localization step. The temporal localization of the different captions in a video was performed by identifying changes of stroke direction and decomposing the video into a sequence of clips, each containing the same caption.
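The corner-distribution cue can be approximated with a Harris corner density map; the detector, block size and thresholds here are illustrative stand-ins for the corner features of [23].

```python
import cv2
import numpy as np

def corner_density_map(gray, b=16):
    """Per-block density of strong Harris corners; caption regions are
    corner-rich (illustrative sketch)."""
    response = cv2.cornerHarris(np.float32(gray), blockSize=2, ksize=3, k=0.04)
    strong = response > 0.01 * response.max()      # keep only strong corners
    h, w = (gray.shape[0] // b) * b, (gray.shape[1] // b) * b
    blocks = strong[:h, :w].reshape(h // b, b, w // b, b)
    return blocks.mean(axis=(1, 3))                # fraction of corner pixels
```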
Table 3: Performance Analysis of Text Extraction in Caption Text Images

3. Leon et.al [26], 2010. Method: generic indexing system, wavelet transform, hierarchical image model. Accuracy: 85.78%. Merits: insensitive to different size, color, complex background.
4. Leon et.al [21], 2009. Method: Haar wavelet decomposition and geometric information through hierarchical image model. Accuracy: 86.35%. Merits: insensitive to different size, complex background.
5. Yih-Ming et.al [20], 2006. Method: temporal averaging technique, spatial-image analysis, binarization process, morphological operations, model-based segmentation approach. Accuracy: 92.18%. Merits: independent of caption size, color, location, shape, layout.
6. Luo et.al [24], 2003. Method: supervised classification of the temporal feature vector. Accuracy: 94.2%. Merits: independent of size, font, alignment.
7. Min Cai et.al [19], 2002. Method: Sobel color edge detector, histogram technique, edge-strength smoothing (ESS) operator, edge-clustering-power (ECP) operator, coarse-to-fine projection. Accuracy: 93.6%. Merits: robust to contrast, font size, language, background complexity.
8. Tang et.al [25], 2002. Method: fuzzy clustering, neural network classifier, minimum pixel search method, frame averaging methods, quantized spatial difference density, morphological operations. Accuracy: 99%. Merits: independent of size, shape, alignment.
Luo et.al [24] proposed a technique to extract the text information in video to create a summary
of the video segment. The text information in video was extracted by using brightness values of
a pixel to form a vector, called Temporal Feature Vector (TFV). The vector was formed by
tracing the gray-level of each pixel in time over a sequence of consecutive frames. By
analyzing the pixel changes in the sequence, they located the appearing frames of captions.
Finally they extracted the captions to create a summary of the video segment.
Tang et.al [25] presented a video caption detection and recognition system based on a fuzzy-clustering neural network (FCNN) classifier. A self-organizing neural network and a fuzzy clustering classifier were used to segment the video sequence into basic frame units representing continuous action. Frame difference metrics, the histogram difference metric (HDM) and the spatial difference metric (SDM), were used to detect boundaries. They proposed a new metric, the quantized spatial difference density (QSDD), to detect the caption transition frames and locate the caption image region.
Leon et.al [26] projected a technique for caption text detection that combines texture information and geometric information. Texture features were estimated through a Haar wavelet decomposition, and geometric information was estimated through the analysis of the regions proposed by the hierarchical image model.
Khelifi et.al [29] proposed a fractal-based approach for the unsupervised categorization of heterogeneous text images that included a font classification step. Experiments were evaluated on both maps taken from a geological department database and ancient documents. The detection of text zones was done for every direction, but font recognition was well achieved only for horizontal zones.
Sunil et.al [30] proposed a scheme for extracting textual areas from an image using Globally Matched Wavelet (GMW) filters with Fisher classifiers. The GMW filters were estimated using a clustering-based technique. They used these filters to segment document images and classify regions into text, background and picture components. To improve the results, Markov random field (MRF) based post-processing was applied.
The method of Liu et.al [31] dealt with printed document images as well as scene text. They proposed an edge-based method that uses edge strength, density and orientation variance as the distinguishing characteristics of text embedded in images to build a feature map. The method used a multiscale edge detector for the text detection stage and a morphological dilation operator for the text localization stage.
A new text extraction method that was insensitive to variations in font, color or size of the text in mixed-type color documents was presented by C. Strouthopoulos et.al [32]. The method was based on a combination of an adaptive color reduction (ACR) technique and a page layout analysis (PLA) approach. An adaptive tree clustering procedure using a principal component analyzer (PCA) and a self-organized feature map (SOFM) was used to achieve the color reduction. On each individual color plane, the PLA technique, based on a run-length smoothing algorithm (RLSA) and a neural network block classifier fed with suitable spatial texture features, was applied to identify text regions. The text regions were then merged to determine the final text regions.
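RLSA itself is a classic algorithm and easy to state precisely. The sketch below shows the horizontal pass; the smoothing constant c is application-dependent, and the vertical pass is obtained by transposing the image.

```python
import numpy as np

def rlsa_horizontal(binary, c=20):
    """Horizontal Run-Length Smoothing Algorithm: background runs shorter
    than c pixels between two foreground pixels are filled, so characters
    on the same line merge into solid blocks."""
    out = binary.astype(np.uint8).copy()
    for row in out:                       # each row is a view into `out`
        ones = np.flatnonzero(row)        # indices of foreground pixels
        for a, b in zip(ones[:-1], ones[1:]):
            gap = b - a - 1               # background run between two pixels
            if 0 < gap <= c:
                row[a + 1:b] = 1          # smear the short run
    return out
```

A vertical pass is rlsa_horizontal(binary.T, c).T, and ANDing the two passes gives the usual block segmentation.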
G. Sahoo et.al [33] projected a set of sequential algorithms for text extraction and enhancement of images using cellular automata. A luminance-based algorithm was used to convert the image into a gray-scale image, so that the converted image has only a luminosity attribute. Edge detection was performed using a 3 × 3 Sobel operator and was followed by elimination of non-maxima and thresholding of weak edges. Edge-bounded averaging over the Moore neighborhood was performed to obtain smooth non-edge regions. The image was then classified into text and non-text regions using a constant threshold.
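The edge-bounded averaging step behaves like one update of a cellular automaton. Below is a minimal sketch (a direct Python loop, not optimized), where the edge mask would come from the Sobel stage.

```python
import numpy as np

def edge_bounded_average(gray, edge_mask):
    """One cellular-automaton step: each non-edge pixel is replaced by the
    mean of its Moore (8-connected) neighbours; edge pixels are kept."""
    g = gray.astype(np.float32)
    out = g.copy()
    h, w = g.shape
    for y in range(1, h - 1):
        for x in range(1, w - 1):
            if not edge_mask[y, x]:
                nb = g[y - 1:y + 2, x - 1:x + 2]
                out[y, x] = (nb.sum() - g[y, x]) / 8.0  # mean of 8 neighbours
    return out.astype(np.uint8)
```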
A combination of two learning mechanisms, an artificial neural network (ANN)-based approach and Non-negative Matrix Factorization (NMF)-based filtering, was proposed by Keechul et.al [34] for text extraction in complex images. A multilayer perceptron (MLP) ANN classifier, together with NMF-filtering-based Connected Component (CC) analysis, increased both the recall rate and the precision rate. Text detection was performed using neural networks without any explicit feature extraction stage: the MLPs automatically generated a texture classifier that discriminates between text regions and non-text regions on three color bands, and a bootstrap method was used to learn a precise boundary between the text and non-text classes. To overcome the locality of the texture-based method, they used CC-based filtering with the NMF technique. Processing time was improved using CAMShift for video images and an X-Y recursive cut algorithm for document images.
Table 4 : Performance Analysis of Text Extraction in Heterogeneous Text Images
Phan et.al [35] proposed an approach based on the skeletonization operation for multi-oriented graphics text and scene text in video images. The method used the Laplacian operator to highlight the transitions between text and background, and K-means was used to classify text and non-text regions. The morphological open operation was used to remove small artifacts from the text cluster. Each region was classified as either a simple or a complex connected component depending on the number of intersection points in its skeleton; complex connected components were then segmented into essential parts based on the skeleton segments, in order to separate the text strings from each other. Finally, text string straightness and edge density were used for false positive elimination.
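The simple/complex classification hinges on counting skeleton intersection points, which can be sketched with scikit-image and SciPy; the neighbour-counting rule is a standard one, and the exact criterion of [35] may differ.

```python
import numpy as np
from scipy.ndimage import convolve
from skimage.morphology import skeletonize

def skeleton_intersection_count(component_mask):
    """Count branch (intersection) points in a component's skeleton;
    components with intersections are treated as 'complex' (sketch)."""
    skel = skeletonize(component_mask.astype(bool))
    ring = np.ones((3, 3), dtype=int)
    ring[1, 1] = 0                                   # count the 8 neighbours
    neighbours = convolve(skel.astype(int), ring, mode='constant')
    return int(np.sum(skel & (neighbours >= 3)))     # >= 3 neighbours = branch
```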
2.2.5 F-score
The F-score is the harmonic mean of the recall and precision rates: F = (2 × Precision × Recall) / (Precision + Recall).
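In code, with counts of true positives (tp), false positives (fp) and false negatives (fn):

```python
def f_score(tp, fp, fn):
    """F-score from detection counts: harmonic mean of precision and recall."""
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)
```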
3. CONCLUSION & FUTURE WORK
There are many applications of text extraction, such as keyword-based image search, text-based image indexing and retrieval, document analysis, vehicle license detection and recognition, page segmentation, technical paper analysis, street signs, name plates, document coding, object identification, text-based video indexing, video content analysis, etc. A number of methods have been proposed in the past for the extraction of text in images. These approaches considered different attributes related to text in an image, such as size, font, style, orientation, alignment, contrast, color, intensity, connected components, edges, etc. These attributes are used to distinguish text regions from their background or from other regions within the image. This paper provides a broad study of the various text extraction techniques and algorithms proposed earlier, along with a performance comparison of the different techniques. Every approach has its own benefits and restrictions, and even though many algorithms exist, there is no single unified approach that fits all applications. Future work will mainly concentrate on developing an algorithm for exact and fast text extraction from an image.
ACKNOWLEDGEMENTS
This work is a part of a UGC Minor Research Project, Reference Number No. F. MRP-3725/11 (MRP/UGC-SERO), dated 08/09/2011. The authors thank the Management and Principal for extending their support for this project.
REFERENCES
[1] Y. Zhan, W. Wang, W. Gao (2006), "A Robust Split-And-Merge Text Segmentation Approach For Images", International Conference On Pattern Recognition, 06(2), pp 1002-1005.
[2] Thai V. Hoang, S. Tabbone (2010), "Text Extraction From Graphical Document Images Using Sparse Representation", in Proc. DAS, pp 143-150.
[20] Yih-Ming Su, Chaur-Heh Hsieh (2006), "A Novel Model-Based Segmentation Approach To Extract Caption Contents On Sports Videos", IEEE International Conference On Multimedia And Expo, pp 1829-1832.
[21] Miriam Leon, Veronica Vilaplana, Antoni Gasull, Ferran Marques (2009), "Caption Text Extraction For Indexing Purposes Using A Hierarchical Region-Based Image Model", Proceedings Of The 16th IEEE International Conference On Image Processing, pp 1869-1872.
[22] Yu Zhong, Hongjiang Zhang, Anil K. Jain (1999), "Automatic Caption Localization In Compressed Video", International Conference On Image Processing, Vol. 2, pp 96-100.
[23] Xiaoqian Liu, Weiqiang Wang (2010), "Extracting Captions From Videos Using Temporal Feature", Proceedings Of The International Conference On ACM Multimedia, pp 843-846.
[24] Bo Luo, Xiaoou Tang, Jianzhuang Liu, Hongjiang Zhang (2003), "Video Caption Detection And Extraction Using Temporal Information", International Conference On Image Processing, Vol. 1, pp I-297-300.
[25] X. Tang, X. Gao, J. Liu, H. Zhang (2002), "A Spatial-Temporal Approach For Video Caption Detection And Recognition", IEEE Transactions On Neural Networks, Vol. 13, No. 4.
[26] Miriam Leon, Veronica Vilaplana, Antoni Gasull, Ferran Marques (2010), "Region-Based Caption Text Extraction", 11th International Workshop On Image Analysis For Multimedia Interactive Services (WIAMIS).
[27] G. Rama Mohan Babu, P. Srimaiyee, A. Srikrishna (2010), "Text Extraction From Heterogeneous Images Using Mathematical Morphology", Journal Of Theoretical And Applied Information Technology, Vol. 16, No. 1, pp 39-47.
[28] Chitrakala Gopalan, Manjula (2008), "Text Region Segmentation From Heterogeneous Images", International Journal Of Computer Science And Network Security, Vol. 8, No. 10, pp 108-113.
[29] Badreddine Khelifi, Nizar Zaghden, Adel M. Alimi, Rémy Mullot (2008), "Unsupervised Categorization Of Heterogeneous Text Images Based On Fractals", Proceedings of ICPR, pp 1-4.
[30] Sunil Kumar, Rajat Gupta, Nitin Khanna, Santanu Chaudhury, Shiv Dutt Joshi (2007), "Text Extraction And Document Image Segmentation Using Matched Wavelets And MRF Model", IEEE Transactions On Image Processing, Vol. 16, No. 8.
[31] X. Liu, J. Samarabandu (2006), "Multiscale Edge-Based Text Extraction From Complex Images", Proc. International Conference On Multimedia And Expo, pp 1721-1724.
[32] C. Strouthopoulos, N. Papamarkos, A. E. Atsalakis (2002), "Text Extraction In Complex Color Documents", Pattern Recognition, Vol. 35, pp 1743-1758.
[33] G. Sahoo, T. Kumar, B. L. Raina, C. M. Bhatia (2009), "Text Extraction And Enhancement Of Binary Images Using Cellular Automata", International Journal Of Automation And Computing, Vol. 6, No. 3, pp 254-260.
[34] Keechul Jung, Eun Yi Kim (2004), "Automatic Text Extraction For Content-Based Image Indexing", Proceedings of PAKDD, pp 497-507.
[35] Trung Quy Phan, Palaiahnakote Shivakumara, Chew Lim Tan (2010), "A Skeleton-Based Method For Multi-Oriented Video Text Detection", DAS '10: Proceedings Of The 9th IAPR International Workshop On Document Analysis Systems, pp 271-278.