Yang, P. orcid.org/0000-0002-8553-7127, Antonacopoulos, A., Clausner, C. et al. (2 more authors) (2017) Effective geometric restoration of distorted historical document for large-scale digitisation. IET Image Processing, 11 (10). pp. 841-853. ISSN 1751-9659
Abstract
Due to storage conditions and material's non-planar shape, geometric distortion of the two-dimensional content is widely present in scanned document images. Effective geometric restoration of these distorted document images considerably increases character recognition rate in large-scale digitisation. For large-scale digitisation of historical books, geometric restoration solutions expect to be accurate, generic, robust, unsupervised and reversible. However, most methods in the literature concentrate on improving restoration accuracy for specific distortion effect, but not their applicability in large-scale digitisation. This study proposes an effective mesh based geometric restoration system (GRLSD) for large-scale distorted historical document digitisation. In this system, an automatic mesh generation based dewarping tool is proposed to geometrically model and correct arbitrary warping historical documents. An XML-based mesh recorder is proposed to record the mesh of distortion information for reversible use. A graphic user interface (GUI) toolkit is designed to visually display and manually manipulate the mesh for improving geometric restoration accuracy. Experimental results show that the proposed automatic dewarping approach efficiently corrects arbitrarily warped historical documents, with an improved performance over several state-of-the-art geometric restoration methods. By using XML mesh recorder and GUI toolkit, the GRLSD system greatly aids users to flexibly monitor and correct ambiguous points of mesh for the prevention of damaging historical document images without distortions in large-scale digitalisation.
Metadata
Item Type: | Article |
---|---|
Authors/Creators: |
|
Copyright, Publisher and Additional Information: | © 2017 The Institution of Engineering and Technology. This is an author-produced version of a paper subsequently published in IET Image Processing. Uploaded in accordance with the publisher's self-archiving policy. |
Keywords: | Geometric restoration; document processing; historical documents; large-scale digitalisation |
Dates: |
|
Institution: | The University of Sheffield |
Academic Units: | The University of Sheffield > Faculty of Engineering (Sheffield) > Department of Computer Science (Sheffield) |
Depositing User: | Symplectic Sheffield |
Date Deposited: | 12 Sep 2019 09:20 |
Last Modified: | 12 Sep 2019 09:20 |
Status: | Published |
Publisher: | Institution of Engineering and Technology (IET) |
Refereed: | Yes |
Identification Number: | 10.1049/iet-ipr.2016.0973 |
Open Archives Initiative ID (OAI ID): | oai:eprints.whiterose.ac.uk:150781 |