Adaptive Methods for Robust Document Image Understanding

Konya, Iuliu

Author

Konya, Iuliu

Type of Scholarly Publication

Dissertation

Date of Exam

13.03.2013

Date of Publication

09.04.2013

Advisor

Bauckhage, Christian

Co-Referee

Klein, Reinhard

Degree Granting Institutions

Rheinische Friedrich-Wilhelms-Universität Bonn

Citable Links

Handle: https://hdl.handle.net/20.500.11811/5655
URN: https://nbn-resolving.org/urn:nbn:de:hbz:5n-31696

Abstract

A vast amount of digital document material is continuously being produced as part of major digitization efforts around the world. In this context, generic and efficient automatic solutions for document image understanding are urgently needed. We propose a generic framework for document image understanding systems, usable for practically any document type available in digital form. Following the introduced workflow, we examine each of the following processing stages in turn: quality assurance, image enhancement, color reduction and binarization, skew and orientation detection, page segmentation and logical layout analysis. We review the state of the art in each area, identify current deficiencies, point out promising directions and give specific guidelines for future investigation. We address some of the identified issues by means of novel algorithmic solutions, putting special focus on generality, computational efficiency and the exploitation of all available sources of information. More specifically, we introduce the following original methods: a fully automatic detection of color reference targets in digitized material, accurate foreground extraction from color historical documents, font enhancement for hot metal typeset prints, a theoretically optimal solution for the document binarization problem with respect to both computational complexity and threshold selection, a layout-independent skew and orientation detection, a robust and versatile page segmentation method, a semi-automatic front page detection algorithm and a complete framework for article segmentation in periodical publications. The proposed methods are experimentally evaluated on large datasets consisting of real-life heterogeneous document scans. The obtained results show that a document understanding system combining these modules is able to robustly process a wide variety of documents with good overall accuracy.

Subjects

document image analysis, document image understanding, image enhancement, character enhancement, document binarization, color reduction, skew detection, orientation detection, page segmentation, geometric layout analysis, article segmentation, logical layout analysis

Classification (DDC)

004 Computer science

Suggested citation
BibTeX

Konya, Iuliu: Adaptive Methods for Robust Document Image Understanding. - Bonn, 2013. - Dissertation, Rheinische Friedrich-Wilhelms-Universität Bonn.
Online edition in bonndoc: https://nbn-resolving.org/urn:nbn:de:hbz:5n-31696

@phdthesis{handle:20.500.11811/5655,
urn = {https://nbn-resolving.org/urn:nbn:de:hbz:5n-31696},
author = {Konya, Iuliu},
title = {Adaptive Methods for Robust Document Image Understanding},
school = {Rheinische Friedrich-Wilhelms-Universität Bonn},
year = 2013,
month = apr,
note = {A vast amount of digital document material is continuously being produced as part of major digitization efforts around the world. In this context, generic and efficient automatic solutions for document image understanding are urgently needed. We propose a generic framework for document image understanding systems, usable for practically any document type available in digital form. Following the introduced workflow, we examine each of the following processing stages in turn: quality assurance, image enhancement, color reduction and binarization, skew and orientation detection, page segmentation and logical layout analysis. We review the state of the art in each area, identify current deficiencies, point out promising directions and give specific guidelines for future investigation. We address some of the identified issues by means of novel algorithmic solutions, putting special focus on generality, computational efficiency and the exploitation of all available sources of information. More specifically, we introduce the following original methods: a fully automatic detection of color reference targets in digitized material, accurate foreground extraction from color historical documents, font enhancement for hot metal typeset prints, a theoretically optimal solution for the document binarization problem with respect to both computational complexity and threshold selection, a layout-independent skew and orientation detection, a robust and versatile page segmentation method, a semi-automatic front page detection algorithm and a complete framework for article segmentation in periodical publications. The proposed methods are experimentally evaluated on large datasets consisting of real-life heterogeneous document scans. The obtained results show that a document understanding system combining these modules is able to robustly process a wide variety of documents with good overall accuracy.},
url = {https://hdl.handle.net/20.500.11811/5655}
}
