Classification of document blocks using density feature and connectivity histogram

Kuo Chin Fan, Liang Shen Wang

Research output: Contribution to journalArticlepeer-review

16 Scopus citations

Abstract

In this paper, we present a document block classification algorithm to automatically classify different types of blocks embedded in a document image. Two kinds of features, density feature and connectivity histogram, are devised to achieve the classification goal. In our approach, segmented document blocks are first classified into text and non-text blocks via the density feature. Then, the connectivity histogram is utilized to further classify non-text blocks into image and graphics blocks. Experimental results reveal the feasibility of the new technique in classifying document blocks.

Original languageEnglish
Pages (from-to)955-962
Number of pages8
JournalPattern Recognition Letters
Volume16
Issue number9
DOIs
StatePublished - Sep 1995

Keywords

  • Block classification
  • Connectivity histogram
  • Density feature

Cite this