Abstract
This paper presents a new method that uses a modified fractal dimension theory to segment a document image and to discriminate between Oriental and European languages. Two types of techniques have usually been adopted in language discrimination: token matching and statistical analysis. In this method, a modified fractal feature is used to distinguish the differing structural complexity of Oriental and European text. Experiments show that the method is effective and reliable for processing a document image even if it is skewed or contains noise that cannot be removed cleanly.
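The abstract does not specify the modified fractal feature, so the following is only a minimal sketch of the standard box-counting estimate of fractal dimension on a binarized image, not the authors' method. The function names, the box sizes, and the list-of-lists image representation are assumptions made for illustration.

```python
import math

def box_count(image, size):
    """Count size-by-size boxes that contain at least one foreground (truthy) pixel."""
    rows, cols = len(image), len(image[0])
    count = 0
    for top in range(0, rows, size):
        for left in range(0, cols, size):
            if any(image[r][c]
                   for r in range(top, min(top + size, rows))
                   for c in range(left, min(left + size, cols))):
                count += 1
    return count

def box_counting_dimension(image, sizes=(1, 2, 4, 8)):
    """Least-squares slope of log N(s) against log(1/s).

    Assumes the image has at least one foreground pixel, so every
    count is positive and the logarithm is defined.
    """
    xs = [math.log(1.0 / s) for s in sizes]
    ys = [math.log(box_count(image, s)) for s in sizes]
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    num = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    den = sum((x - mean_x) ** 2 for x in xs)
    return num / den

# Hypothetical test patterns: a filled 16x16 block and a single horizontal stroke.
filled = [[1] * 16 for _ in range(16)]
line = [[1 if r == 8 else 0 for _ in range(16)] for r in range(16)]
```

On these toy inputs the filled block yields a dimension close to 2 and the single stroke close to 1. The intuition is that text regions with denser, more intricate strokes would fall nearer the upper end of that range, which is the kind of structural-complexity difference a fractal feature can expose.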
Original language | English |
---|---|
Title of host publication | Proceedings of the 5th International Conference on Document Analysis and Recognition, ICDAR 1999 |
Publisher | IEEE Computer Society |
Pages | 345-348 |
Number of pages | 4 |
ISBN (Electronic) | 0769503187 |
Publication status | Published - 1999 |
Event | 5th International Conference on Document Analysis and Recognition, ICDAR 1999 - Bangalore, India; Duration: 1999 Sept 20 → 1999 Sept 22 |
Publication series
Name | Proceedings of the International Conference on Document Analysis and Recognition, ICDAR |
---|---|
ISSN (Print) | 1520-5363 |
Other
Other | 5th International Conference on Document Analysis and Recognition, ICDAR 1999 |
---|---|
Country/Territory | India |
City | Bangalore |
Period | 99/9/20 → 99/9/22 |
Bibliographical note
Publisher Copyright: © 1999 IEEE.
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition