TY - GEN
T1 - Reference line extraction from form documents with complicated backgrounds
AU - Xi, Dihua
AU - Lee, Seong Whan
N1 - Publisher Copyright:
© 2003 IEEE.
PY - 2003
Y1 - 2003
N2 - Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference lines which are contained in almost all form documents. This paper presents an efficient methodology for the complicated grey-level form image processing. We construct a non-orthogonal wavelet with adjustable rectangle supports and offer algorithms for the extraction of the reference lines based on the strip growth method using the multiresolution wavelet sub images. We have compared this system with the popular Hough transform (HT) based and the novel orthogonal wavelet based methods. As shown in the experiments, the proposed algorithmdemonstrates high performance and fast speed for the complicated form images. This system is also effective for the form images with slight skew.
AB - Form document analysis is one of the most essential tasks in document analysis and recognition. One of the most fundamental and crucial tasks is the extraction of the reference lines which are contained in almost all form documents. This paper presents an efficient methodology for the complicated grey-level form image processing. We construct a non-orthogonal wavelet with adjustable rectangle supports and offer algorithms for the extraction of the reference lines based on the strip growth method using the multiresolution wavelet sub images. We have compared this system with the popular Hough transform (HT) based and the novel orthogonal wavelet based methods. As shown in the experiments, the proposed algorithmdemonstrates high performance and fast speed for the complicated form images. This system is also effective for the form images with slight skew.
UR - http://www.scopus.com/inward/record.url?scp=9244223005&partnerID=8YFLogxK
U2 - 10.1109/ICDAR.2003.1227823
DO - 10.1109/ICDAR.2003.1227823
M3 - Conference contribution
AN - SCOPUS:9244223005
T3 - Proceedings of the International Conference on Document Analysis and Recognition, ICDAR
SP - 1080
EP - 1084
BT - Proceedings - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
PB - IEEE Computer Society
T2 - 7th International Conference on Document Analysis and Recognition, ICDAR 2003
Y2 - 3 August 2003 through 6 August 2003
ER -