Statistical model-based binary document image coding, reconstruction, and analysis

Yandong Guo, Purdue University

Abstract

Binary document image is still one of the most important information carriers in this era of data. In this final exam, we will present two novel technologies to learn and understand low-level features from document images, and we also apply these technologies in the applications including compression, reconstruction, registration, and searching. The first learning technology is the entropy-based dictionary learning, which is a method to learn a strong prior for document images. The information in this prior is used to encode the image effectively. If there are more than one page to be encoded, we impose hierarchical structure onto the dictionary, and dynamically update the dictionary. Compared with the best existing methods, we achieve much higher compression ratio. The dictionary prior we proposed is also used to restore noisy document images. Our dictionary-based restoration improves the document image quality, and the encoding effectiveness simultaneously. The second learning technology is layout structure detection for document images. Our layout detection method is faster and more efficient, compared with conventional methods. Using this technology, we construct sparse feature set for document images, which is then used in our novel, efficient document image searching system.

Degree

Ph.D.

Advisors

Bouman, Purdue University.

Subject Area

Statistics|Electrical engineering|Computer science

Off-Campus Purdue Users:
To access this dissertation, please log in to our
proxy server
.

Share

COinS