An overview of different binary methods for documents based on their features

Loading...
Thumbnail Image
Other Title
Authors
Fazeli, Fatemeh
Sarrafzadeh, Hossein
Shanbehzadeh, Jamshid
Author ORCID Profiles (clickable)
Degree
Grantor
Date
2013
Supervisors
Type
Conference Contribution - Paper in Published Proceedings
Ngā Upoko Tukutuku (Māori subject headings)
Keyword
binarization
thresholding
feature extraction
cost function
image preprocessing
image compression
ANZSRC Field of Research Code (2020)
Citation
Fazeli, F., Sarrafzadeh, A., and Shanbehzadeh, J. (2013). An overview of different binary methods for documents based on their features. Proceedings of the International MultiConference of Engineers and Computer Scientists, Hong Kong, 1, 491-495.
Abstract
This paper surveys binarization of document images. The main role of binarization is dimension and noise reduction. Binarization is one of the most important steps in preprocessing of document image understanding and compression. Image binarization means to classify image pixels into two classes, background and foreground. The input of this classification is a feature vector based on intensity values of image pixels. The new features are extracted from the first input vector and, according to the extracted features a cost function as a classifier is constructed. The intensity value that maximizes the cost function is considered as the boundary line of two classes. This paper divides the binarization algorithms into three groups. The first considers one input feature vector including intensity values of each pixel. The second one considers an input feature vector for each pixel based on the intensity value of the pixel and its neighbors. The third group is based on a combination of the first and second group of schemes.
Publisher
Newswood Ltd.
Link to ePress publication
DOI
Copyright holder
Newswood Ltd.
Copyright notice
All rights reserved
Copyright license
This item appears in: