Computing, School of

School of Computing: Conference and Workshop Papers

Accessibility Remediation

If you are unable to use this item in its current form due to accessibility barriers, you may request remediation through our remediation request form.

A Trainable, Single-Pass Algorithm for Column Segmentation

Date of this Version

1995

Document Type

Article

Comments

Abstract

Column Segmentation logically precedes OCR in the document analysis process. The trainable algorithm described here, XYCUT, relies on horizontal and vertical binary profiles to produce an XY- tree representing the column structure of a page of a technical document in a single pass through the bit image. Training against ground truth adjusts a single, resolution independent, parameter using only local information and guided by an edit distance function. The algorithm correctly segments the page image for a (fairly) wide range of parameter values, although small, local and repairable errors may be made, an effect measured by a repair cost function.

Download

Included in

Computer Sciences Commons

COinS

Computing, School of

School of Computing: Conference and Workshop Papers

Accessibility Remediation

A Trainable, Single-Pass Algorithm for Column Segmentation

Date of this Version

Document Type

Comments

Abstract

Included in

Search

Browse

Author Corner

Links

Computing, School of

School of Computing: Conference and Workshop Papers

Accessibility Remediation

A Trainable, Single-Pass Algorithm for Column Segmentation

Authors

Date of this Version

Document Type

Comments

Abstract

Included in

Share

Search

Browse

Author Corner

Links