Abstract
In this paper, we propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents(ToC) page based on the logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that HTML documents corresponding to the paper documents can be generated in a visual layout, and that their structured table of contents page, with the hierarchically ordered section titles hyperlinked to the contents, can be also produced by the proposed methods.
| Original language | English |
|---|---|
| Pages (from-to) | 422-425 |
| Number of pages | 4 |
| Journal | Proceedings - International Conference on Pattern Recognition |
| Volume | 15 |
| Issue number | 4 |
| Publication status | Published - 2000 |
ASJC Scopus subject areas
- Computer Vision and Pattern Recognition
Fingerprint
Dive into the research topics of 'Automatic generation of structured hyperdocuments from multi-column document images'. Together they form a unique fingerprint.Cite this
- APA
- Standard
- Harvard
- Vancouver
- Author
- BIBTEX
- RIS