QUT ePrints

Texture for Script identification

Boles, Wageeh, Busch, Andrew, & Sridharan, Subramanian (2005) Texture for Script identification. IEEE Transcriptions on Pattern Analysis and Machine Intelligence, 27(11), pp. 1720-1731.

View at publisher

Abstract

The problem of determining the script and language of a document image has a number of important applications in the field of document analysis, such as indexing and sorting of large collections of such images, or as a precursor to optical character recognition (OCR). In this paper, we investigate the use of texture as a tool for determining the script of a document image, based on the observation that text has a distinct visual texture. An experimental evaluation of a number of commonly used texture features is conducted on a newly created script database, providing a qualitative measure of which features are most appropriate for this task. Strategies for improving classification results in situations with limited training data and multiple font types are also proposed.

Impact and interest:

51 citations in Scopus
Search Google Scholar™
29 citations in Web of Science®

Citation countsare sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 23167
Item Type: Journal Article
Keywords: Script Identification, Wavelets and Fractals, Texture, Document Analysis, Clustering, Classification and Association Rules
DOI: 10.1109/TPAMI.2005.227
ISSN: 0162-8828
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Pattern Recognition and Data Mining (080109)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Artificial Intelligence and Image Processing not elsewhere classified (080199)
Divisions: Past > QUT Faculties & Divisions > Faculty of Built Environment and Engineering
Deposited On: 17 Jun 2009 23:43
Last Modified: 25 Feb 2013 15:21

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page