Marginal noise removal of document images

Kuo Chin Fan, Yuan Kai Wang, Tsann Ran Lay

Research output: Contribution to journalArticlepeer-review

59 Scopus citations

Abstract

Marginal noise is a common phenomenon in document analysis which results from the scanning of thick documents or skew documents. It usually appears in the front of a large and dark region around the margin of document images. Marginal noise might cover meaningful document objects, such as text, graphics and forms. The overlapping of marginal noise with meaningful objects makes it difficult to perform the task of segmentation and recognition of document objects. This paper proposes a novel approach to remove marginal noise. The proposed approach consists of two steps which are marginal noise detection and marginal noise deletion. Marginal noise detection will reduce an original document image into a smaller image, and then find marginal noise regions according to the shape length and location of the split blocks. After the detection of marginal noise regions, different removal methods are performed. A local thresholding method is proposed for the removal of marginal noise in gray-scale document images, whereas a region growing method is devised for binary document images. Experimenting with a wide variety of test samples reveals the feasibility and effectiveness of our proposed approach in removing marginal noises.

Original languageEnglish
Pages (from-to)2593-2611
Number of pages19
JournalPattern Recognition
Volume35
Issue number11
DOIs
StatePublished - Nov 2002

Keywords

  • Adaptive thresholding
  • Document image analysis
  • Image segmentation
  • Marginal noise removal
  • Preprocessing

Fingerprint

Dive into the research topics of 'Marginal noise removal of document images'. Together they form a unique fingerprint.

Cite this