Automatic generation of structured hyperdocuments from multi-column document images

Ji Yeon Lee, Song Ha Choi, Seong Whan Lee

    Research output: Contribution to journal › Article › peer-review

    Abstract

    In this paper, we propose two methods for converting complex multi-column document images into HTML documents, and a method for generating a structured table of contents (ToC) page based on logical structure analysis of the document image. Experiments with various kinds of multi-column document images show that the proposed methods can generate HTML documents that reproduce the visual layout of the paper documents, and can also produce a structured ToC page in which the hierarchically ordered section titles are hyperlinked to the corresponding contents.
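
    The abstract does not describe how the ToC page is built, so the following is only a minimal sketch of the final step, assuming that logical structure analysis has already produced a list of (level, title) pairs for the section titles. The function name `build_toc`, the `sec-N` anchor scheme, and the sample titles are hypothetical and are not taken from the paper; the sketch also assumes that heading levels start at 1 and never deepen by more than one level at a time.

```python
from html import escape


def build_toc(headings):
    """Build a nested HTML list from (level, title) pairs.

    Assumption (not from the paper): levels start at 1 and increase by
    at most one between consecutive headings.
    """
    parts = []
    depth = 0  # number of <ul> elements currently open
    for index, (level, title) in enumerate(headings):
        if level > depth:
            # Deeper heading: open a sub-list inside the current item.
            while depth < level:
                parts.append("<ul>")
                depth += 1
        else:
            # Same or shallower level: close the previous item first,
            # then close any sub-lists that are deeper than this heading.
            parts.append("</li>")
            while depth > level:
                parts.append("</ul></li>")
                depth -= 1
        # Hypothetical anchor scheme: each section body carries id="sec-N".
        parts.append(f'<li><a href="#sec-{index}">{escape(title)}</a>')
    # Close whatever is still open at the end of the document.
    while depth > 0:
        parts.append("</li></ul>")
        depth -= 1
    return "".join(parts)


toc = build_toc([(1, "Introduction"), (2, "Background"), (1, "Method")])
print(toc)
```

    Each entry links to an anchor that the generated HTML body would need to define on the matching section title, which is how the hierarchical titles end up hyperlinked to the contents.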

    Original language: English
    Pages (from-to): 422-425
    Number of pages: 4
    Journal: Proceedings - International Conference on Pattern Recognition
    Volume: 15
    Issue number: 4
    Publication status: Published - 2000

    ASJC Scopus subject areas

    • Computer Vision and Pattern Recognition
