Parameter-independent geometric document layout analysis

Dae Seok Ryu, Sun Mee Kang, Seong Whan Lee

    Research output: Contribution to journalArticlepeer-review

    9 Citations (Scopus)

    Abstract

    We propose a new method independent of parameters for segmenting the document images into maximal homogeneous regions and identifying them as texts, images, tables and lines. A pyramidal quadtree structure is constructed for multiscale analysis and top-down approach, and a periodicity measure is suggested to find a periodical attribute of text regions. To obtain robust page segmentation results, a confirmation procedure using texture analysis is applied to only ambiguous regions. Experimental results with the document database from the University of Washington show that the proposed method works better than the previous ones.

    Original languageEnglish
    Pages (from-to)397-400
    Number of pages4
    JournalProceedings - International Conference on Pattern Recognition
    Volume15
    Issue number4
    Publication statusPublished - 2000

    ASJC Scopus subject areas

    • Computer Vision and Pattern Recognition

    Fingerprint

    Dive into the research topics of 'Parameter-independent geometric document layout analysis'. Together they form a unique fingerprint.

    Cite this