The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 1
ISSN 2229-5518
Modified Method of Document Text Extraction from Document Images Using Haar DWT
Navjot Kaur
—————————— ——————————
arge amounts of information are embedded in im-
recognized and processed. Texts in images contain
raw input to document analysis.
Some examples of Document Images are as shown in the fig-
ages which are often required to be automatically
useful information which can be useful to fully understand images. Text recognition from document images receives a growing attention because of potential applications in content based indexing, archiving documents.
The term document in no longer confined to scanned pages
and any camera based image can be subject to operations like text information extraction (TIE) for applications such as opti- cal character recognition (OCR), image/video indexing, mobile reading system for visually challenged persons etc.
Data capture of documents by optical scanning or by digi- tal video yields a file of picture elements, or pixels, that is the
————————————————
Navjot Kaur has completedmasters degree program in computer engineer- ing from Punjabi University,Patiala(Punjab),India. E-mail: jyo-
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 2
ISSN 2229-5518
Y=0.299 0.587 0.114 Y R G (1)
Image Y is then processed with discrete wavelet transform and the whole extraction algorithm afterward. If the input image itself is stored in the DWT compressed form, DWT operation
can be omitted in the proposed algorithm.
Text extraction is a critical and essential step as it sets up the quality of the final recognition result. It aims at segmenting text from background, meaning isolated text pixels from those of background. A text extraction system usually assumes that text is the major input contributor, but it also has to be robust against variations in the detected text's bounding box size. A very efficient text extraction method could enable the use of commercial OCR without any other modifications.
To extract text from Document images using 2-D Haar Wavelet and by eliminating large size areas in the image. In case we have larger area components in the image, we can get better result.
The edges detection is accomplished by using 2-D Haar DWT and some of the non-text edges are removed using threshold- ing. Afterward, we connect the isolated candidate text edges in each detail component sub-band of the binary image. Al- though the color component may differ in a text region, the information about colors does not help extracting texts from images. If the input image is a gray-level image, the image is processed directly starting at discrete wavelet transform. If the input image is colored, its RGB components are combined to give an intensity image Y as follows:
To remove the large size area in the image, the outline of the suggested idea (MATLAB code) is as follows:
[CC NOB] = bwlabel(final, 8); S = regionprops(CC, 'Area'); Binary=final;
Stats = regionprops(CC, 'Area', 'pixelidxlist');
Cleaned = binary; For p=1:length(stats) Ara(p)=stats(p).Area; End Area=sort(ara,'descend'); MIN_AREA=area(2)+10;
For region = 1 : length(stats)
If stats(region).Area > MIN_AREA Cleaned(stats(region).pixelidxlist) = 0; End
End
Cl=imresize(cleaned,[512 512]); B=[0 1 0; 1 1 1;0 1 0]; K=imdilate(IM2,B);
J = immultiply(cl,k); B=[1 1 1; 1 1 1;1 1 1]; K=imdilate(J,B);
J = immultiply(IM2,k);
Final = bwareaopen(final,20);
where
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 3
ISSN 2229-5518
- bwlabel(final,8) returns matrix CC, of the same size as final, containing labels for the connected objects in fi- nal having 8-connected objects.
- regionprops measures a set of properties for each connected component (object) in CC, which is a struc- ture returned by bwconncomp.
- length(stats) finds number of elememts along the largest dimension of an array.
- B is the dilation operator.
- Imdilate(IM2, B) dilates the gray-scale, binary, or packed binary image IM2.
- Immultiply(cl, k) multiplies each element in array cl by the corresponding element in array k.
- bwareaopen(final, 20) removes from a binary image
all CCs that have fewer than 20 pixels, providing another binary image.
The experimentation of the proposed algorithm was carried out on a data set consisting of different document images. Currently the data set consists of 10 images (All images are given in the Appendix A).
We tried the implemented technique on a set of test images
and get the results as follows:
We will check the performance of the implemented technique using the following Statistical measures of the performance.
1) Sensitivity/ Recall rate: Sensitivity relates to the test's ability to identify positive results.
2) Specificity: Specificity relates to the ability of the test to identify negative results.
3) Precision: Precision is defined as the proportion of the true positives against all the positive results (both true positives and false positives)
4) F-measure: The F-measure can be used as a single measure of performance of the test. The F-measure is the harmonic mean of precision and recall.
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 4
ISSN 2229-5518
5) Accuracy: The accuracy is the proportion of true re- sults (both true positives and true negatives).
where
True positive: wrong is correctly diagnosed as wrong False positive: right incorrectly identified as wrong True negative: right is correctly identified as right
False negative: wrong incorrectly identified as right
Measures are obtained in case of the test images (given in the
Appendix-A). The average of those measures is as follows.
Taking all test images into consideration (All images are given
in Appendix-A),
Average Recall Rate = 54.87% Average Specificity= 78.80%
Average Precision Rate= 53.327% Average Accuracy= 70.886%
Average F-measure=54.090%
For the procedure to be effective, a priori knowledge about the structure of the page is necessary. This technique is there- fore particularly useful when the layout is constrained, such as is often the case when considering pages from scientific jour- nals.
We have implemented an effective document text extraction method based on the fact that in text regions, horizontal edges, vertical edges and diagonal edges are mingled together while they are distributed separately in non-text regions. Larger areas are detected to ease the method and try removing the non-text regions which are left even after the above process- ings.
Bottom-up technique merge evidence at increasing scales to
form, e.g., words from characters, lines from words. Actually the processing of document image segmentation and classifi- cation comes under an OCR pre-processor step. The text blocks that are detected by this technique are used as an input to the OCR system.
Despite the many efforts spent on the subject, there is still much room for improvement in document segmentation tech-
niques, which is the key factor to improve the overall perfor-
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 5
ISSN 2229-5518
mance of an automatic reading/processing system.
fig5
fig6
fig7
IJSER © 2012
http://www.ijser.org
fig8
fig9
fig10
fig11
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 6
ISSN 2229-5518
fig12
fig13
fig14
fig15. Result of fig.5
fig16. Result of fig.6
fig17. Result of fig.7
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 7
ISSN 2229-5518
fig18. Result of fig.8
fig19. Result of fig.9
fig20. Result of fig.10
fig21. Result of fig.11
fig 22. Result of fig.12
fig23. Result of fig.13
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 8
ISSN 2229-5518
pp.502-512.
[2] Shyama Prosad Chowdhury, Soumyadeep Dhar, Amit Ku- mar Das, Bhabatosh Chanda, Karen mcmenemy (2009),”ROBUST EXTRACTION OF TEXT FROM CAMERA IMAGES”, ICDAR ’09 Proceedings of the 2009 10th Interna- tional Conference on Documant Analysis and Recognition.
[3] Ujjwal Bhattacharya, Swapan Kumar Parui, Srikanta Mon-
fig24. Result of fig.14
First of all, I would like to express my deep sense of respect and gratitude towards my guide Dr. Rajesh Kumar Bawa, Pro- fessor, Department of Computer Science, Punjabi University, Patiala, who has been the guiding force behind this work. I am greatly indebted to him for his constant guidance, useful sug- gestion and sustained encouragement throughout the work.
I also wish to acknowledge valuable interaction i’ve had with my other teachers of the department. Thanks are also due to all of my lab mates, from whom I learned a lot.
Finally, my parents... I am endlessly grateful to my parents, for giving me the opportunity to open my eyes in one of the most beautiful planets I have ever known. I would like to express my sincere thanks to the almighty who kept me motivated to do some purposeful work.
[1] S.Audithan, RM. Chandrasekaran (2009), “DOCUMENT TEXT EXTRACTION FROM DOCUMENT IMAGES USING HAAR DISCRETE WAVELET TRANSFORM”, European Jour- nal of Scientific Research ISSN 1450-216X Vol.36 No.4 (2009),
dal (2009), "DEVANAGARI AND BANGLA TEXT EXTRAC- TION FROM NATURAL SCENE IMAGES," icdar, pp.171-175,
2009 10th International Conference on Document Analysis and
Recognition.
[4] Keechul Jung, Kwang In Kim and Anil K. Jain(2004), “TEXT INFORMATION EXTRACTION IN IMAGES AND VIDEOS: A SURVEY”, The journal of the Pattern Recognition society.
[5] G. Rama Mohan Babu, P. Srimaiyee, 3A. Srikrishna(2005-
2010), “TEXT EXTRACTION FROMHETROGENOUS IM- AGES USING MATHEMATICAL MORPHOLOGY”, Journal of Theoretical and Applied Information Technology.
[6] S. A. Angadi, M. M. Kodabagi,” A TEXTURE BASED ME- THODOLOGY FOR TEXT REGION EXTRACTION FROM LOW RESOLUTION NATURAL SCENE IMAGES” Interna- tional Journal of Image Processing (IJIP) Volume(3), Issue(5)
[7] H. Tran, A lux, H.L. Nguyen T. And A. Boucher(2005),” A NOVEL APPROACH FOR TEXT DETECTION IN IMAGES USING STRUCTURAL FEATURES”, The 3rd International Con- ference on Advances in Pattern Recognition, LNCS Vol. 3686, pp.
627-63.
[8] X. Liu, H. Fu and Y. Jia.(2008),” GAUSSIAN MIXTURE MODELING AND LEARNNG OF NEIGHBOR CHARAC- TERS FOR MULTILINGUAL TEXT EXTRACTION IN IM- AGES”, Pattern Recognition, Vol. 41, pp. 484-493.
[9] P. Dubey(2006),” EDGE BASED TEXT DETECTION FOR MULTI-PURPOSE APPLICATION”, Proceedings of International Conference Signal Processing, IEEE, Vol. 4.
[10] K. Subramanian, P. Natajajan, M. Decerbo, and D. Casta-
IJSER © 2012
The research paper published by IJSER journal is about Modified Method Of Document Text Extraction From Document Images Using Haar DWT 9
ISSN 2229-5518
non(2007),” CHARACTER-STROKE DETECTION FOR TEXT-
LOCALIZATION AND EXTRACTION”, Proceedings of Ninth International Conference on Document Analysis and Recognition, IEEE, pp. 33-37.
[11] C. Mancas-Thilou, B. Gosselin(2006),” SPATIAL AND COLOR SPACES COMBINATION FOR NATURAL SCENE TEXT EXTRACTION”, Proceedings of IEEE International Confe- rence on Iimage Processing, pp. 985-988.
[12] W. M. Pan, T. D. Bui, and C. Y. Suen(2007),”TEXT SEG- MENTATION FROM COMPLEX BACKGROUND USING SPARSE REPRESENTATIONS”, Proceedings of Ninth Interna- tional Conference on Document Analysis and Recognition, IEEE, pp. 412-416.
[13] J. Liang, D. Doermann, and H. P. Li.(2005),” CAMERA- BASED ANALYSIS OF TEXT AND DOCUMENTS: A SUR- VEY”. Int’l J. Document Analysis and Recognition, 7(2-3):84–104. [14] D. F. Dunn and N. E. Mathew(2000), “EXTRACTING CO- LOUR HALFTONES FROM PRINTED DOCUMENTS USING TEXTURE ANALYSIS,” Pattern Recognition, vol. 33, no. 3, pp.
445–463.
[15] M. I. C. Murguiu(1998), “DOCUMENT SEGMENTATION USING TEXTURE VARIANCE AND LOW RESOLUTION IMAGES,” in Proceedings of IEEE Southwest Syniposium on Im- age Analysis and Interpretation, Tucson, Arizona, USA, pp.164–
167.
[16] L. Clique, L. Lombardi, and G. Mazini(1998), “A MULTI- RESTORATION APPROACH FOR PAGE SEGMENTATION,” Pattern Recognition Letters, vol. 19, no. 2, pp. 217–225,.
[17] K. Etemad, D. S. Doermann, and R. Chellappa(1998), “MULTISCALE SEGMENTATION OF UNSTRUCTURED DOCUMENT PAGES USING SOFT DECISION INTEGRA- TION,” IEEE Transactions on Pattern Analysis and Machine Intel-
ligence, vol. 19, no. 1, pp. 92–96.
2, pp. 743–770.
[19] Y. K. Ham, M. S. Kang, H. K. Chung, and R. H. Park(1995),” RECOGNITION OF RAISED CHARACTERS FOR AUTOMATIC CLASSIFICATION OF RUBBER TIRES”, Opt. Eng., Vol. 34, pp.102-108.
[20] T. Sato, T. Kanade, E. K. Hughes, and M. A. Smith(1998),” VIDEO OCR FOR DIGITAL NEWS ARCHIVE”, Proc. Of IEEE Workshop on Content based Access of Image and Video Data- bases, pp. 52-60.
[21] B. Shahraray and D. C. Gibbon(1995),” AUTOMATIC GENERATION OF PICTORIAL TRANSCRIPTS OF VIDEO PROGRAMS”, Proc. Of SPIE, Vol. 2417.
[22] Julinda Gllavata, Ralph Ewerth and Bernd Freisleben (2003), “A ROBUST ALGORITM FOR TEXT DETECTION IN IMAGES”, Proceedings of the 3rd international symposium on Image and Signal Processing and Analysis.
[18] A. K. Jain and Y. Zhong(1996), “PAGE SEGMENTATION
USING TEXTURE ANALYSIS,” Pattern Recognition, vol. 23, no.
IJSER © 2012