Thursday, August 6, 2009

A10 Preprocessing Text

Int this activity we tried to extract handwritten text from an imaged document with lines. This is often used in in handwriting recognition where individual examples of letters must be extracted.










Fig. 5. Used the enhanced image in Figure 4. So, from the left the image, its shifted Fourier tranform, its mask to remove the horizontal lines and the final image with only the text and the horizontal lines gone.


Figure 6. The histogram of the masked or cleaned image with handwriting. After thresholding , it was converted to a binary image to extract the text, but the text is still dirty.

In Figure 5 the desired text is already achieved and the image cleaned , but trying to extract the text requires more cleaning. But I think results inFigure 5 is already ok. I am still working on the threshoding and cleaning Figure 5 image... to get better results than what is shown in Figure 6.

No comments:

Post a Comment