detected Image and text in document.
Mostra commenti meno recenti
0 down vote favorite
I have a binary image (image1). Now I want to detection where is the figure ( may be include big text) in original image. I use haar wavelet transform and detec a image B include some position may be the figure of A. (image 2). If I use image A - image B = image C (image 3) it may be not good be cause we have some boundary. Now I want remove the boundary or detect exactly the figure in image A? how to do that ?. I try use conected component but it run over time.
There is my image: Image A:

Image B:

Image C:

Image A- imageB =Image C ( that mean if A(i,j)==1 and B(i,j)==1 then C(i,j)=0;)
Please help me. Thank you so much
1 Commento
Tran Tuan Anh
il 23 Mag 2014
Risposte (2)
Image Analyst
il 23 Mag 2014
Modificato: Image Analyst
il 23 Mag 2014
0 voti
Take the image, call imfill(), then erode it enough to make the letters disappear. Then use imreconstruct. See attached demos.
6 Commenti
Tran Tuan Anh
il 24 Mag 2014
Tran Tuan Anh
il 24 Mag 2014
Modificato: Tran Tuan Anh
il 24 Mag 2014
Image Analyst
il 24 Mag 2014
OK, that's fine. You don't have to use my algorithm. If you have some algorithm from a paper that's working will for you, then that's fine.
Tran Tuan Anh
il 24 Mag 2014
Image Analyst
il 24 Mag 2014
It looks like B gets all the large blobs. There are a few small scattered dots around the big blobs and it's not picking those up. As far as it's concerned if it's small it could be text. If you want to capture the small surrounding dots, call imclose(). It will dilate the large blobs to engulf the small blobs or connect to nearby blobs, then it will erode to shrink it back down to the original size but without breaking any connections that were made during dialation.
closedImage = imclose(binaryImage, true(9)); % Use whatever window size you want.
Tran Tuan Anh
il 26 Mag 2014
Modificato: Tran Tuan Anh
il 26 Mag 2014
Tran Tuan Anh
il 26 Mag 2014
Modificato: Tran Tuan Anh
il 26 Mag 2014
6 Commenti
Image Analyst
il 26 Mag 2014
You may have to have some sort of first pass to detect what kind of figure might be present and then use a different algorithm for each kind of figure.
Tran Tuan Anh
il 26 Mag 2014
Image Analyst
il 26 Mag 2014
Let's say you have algorithm1 that does a good job at spotting gray scale images on the page, and algorithm2 that does a good job handling line art. You might have some algorithm that recognized, just roughly and approximately, what kind of figure is there, and then apply algorithm1 or algorithm2 for better extraction of the figure, depending on what was found in the first pass.
Tran Tuan Anh
il 26 Mag 2014
Modificato: Tran Tuan Anh
il 26 Mag 2014
Image Analyst
il 26 Mag 2014
Why don't you just threshold and find the areas of all the blobs? All the text will be in a narrow range. Any outliers (bigger or smaller) will be non-letters and might be considered as noise (if smaller) or part of a figure (if bigger).
Tran Tuan Anh
il 26 Mag 2014
Categorie
Scopri di più su Signal Analysis in Centro assistenza e File Exchange
Prodotti
Community Treasure Hunt
Find the treasures in MATLAB Central and discover how the community can help you!
Start Hunting!



