Noise removal and binarization of scanned document images using clustering of features

Loading...
Thumbnail Image

Supplementary material

Other Title

Authors

Farahmand, Atena
Sarrafzadeh, Hossein
Shanbehzadeh, J.

Author ORCID Profiles (clickable)

Degree

Grantor

Date

2017-03

Supervisors

Type

Conference Contribution - Paper in Published Proceedings

Ngā Upoko Tukutuku (Māori subject headings)

Keyword

preprocessing
document noise
binarization
noise removal algorithms
clustering

ANZSRC Field of Research Code (2020)

Citation

Farahmand, A., Sarrafzadeh, A., & Shanbehzadeh, J. (2017, March). Noise Removal and Binarization of Scanned Document Images Using Clustering of Features. IMECS (Ed.), International MultiConference of Engineers and Computer Scientists (IMECS2017) (pp.410-414).

Abstract

Old documents are in printed form. Their archiving and retrieval is expensive according in terms of space requirement and physical search. One solution is to convert these documents into electronic form using scanners. The outputs of scanners are images contaminated with noise. The outcomes are more storage requirement and low OCR accuracy. A solution is noise reduction. This paper employs KFCM algorithm to cluster pixels into text, background and noise according to their features. As a result, noise removal and binarization is done simultaneously.

Publisher

Newswood and International Association of Engineers

Link to ePress publication

DOI

Copyright holder

Authors

Copyright notice

All rights reserved

Copyright license

This item appears in: