Computer Science and Engineering, Department of


Date of this Version



G. Nagy and S. C. Seth, "Hierarchical image representation with application to optically scanned documents", Proc. 7th Int. Conference on Pattern Recognition (ICPR), 347-349, 1984.


The objective of the research to be pursued is to develop a schema for representing raster-digitized (scanned) documents. The representation should not only retain the spatial structure of a printed document but also facilitate automatic labeling of its components, such as text, figures, subtitles, and figure captions, and allow the extraction of important relationships among them, such as reading order. Intended applications include (1) data compression for document transmission and archival, and (2) document entry, without rekeying, into editing, formatting, and information retrieval systems.
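
As an illustration only, a hierarchical representation of this kind might be sketched as a tree of labeled rectangular blocks. The `Block` class, its field names, the sample coordinates, and the convention that children are stored in reading order are all assumptions made for this sketch; they are not taken from the paper.

```python
from dataclasses import dataclass, field
from typing import List, Tuple


@dataclass
class Block:
    """A rectangular page region (hypothetical structure, not the paper's schema)."""
    label: str                        # e.g. "page", "text", "figure", "caption"
    bbox: Tuple[int, int, int, int]   # (left, top, right, bottom) in pixels
    children: List["Block"] = field(default_factory=list)


def reading_order(block: Block) -> List[Block]:
    """Return the leaf blocks depth-first.

    Assumes each node's children are already stored in reading order.
    """
    if not block.children:
        return [block]
    leaves: List[Block] = []
    for child in block.children:
        leaves.extend(reading_order(child))
    return leaves


# A made-up page: a subtitle above a column holding text, a figure, and its caption.
page = Block("page", (0, 0, 1000, 1400), [
    Block("subtitle", (50, 50, 950, 120)),
    Block("column", (50, 150, 950, 1350), [
        Block("text", (50, 150, 950, 600)),
        Block("figure", (50, 620, 950, 1100)),
        Block("caption", (50, 1110, 950, 1180)),
    ]),
])

labels = [b.label for b in reading_order(page)]
```

In this sketch the nesting of blocks carries the spatial structure, the `label` field carries the component type, and a depth-first walk over the leaves gives one possible reading order.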