Scoring-thresholding pattern based text classifier
Bijaksana, Moch Arif, Li, Yuefeng, & Algarni, Abdulmohsen (2013) Scoring-thresholding pattern based text classifier. In Selamat, Ali, Nguyen, Ngoc Thanh, & Haron, Habibollah (Eds.) Intelligent Information and Database Systems : 5th Asian Conference, ACIIDS 2013, Kuala Lumpur, Malaysia, March 18-20, 2013, Proceedings, Part I, Springer Berlin Heidelberg, Istana Hotel, Kuala Lumpur, Malaysia, pp. 206-215.
A big challenge for classification on text is the noisy of text data. It makes classification quality low. Many classification process can be divided into two sequential steps scoring and threshold setting (thresholding). Therefore to deal with noisy data problem, it is important to describe positive feature effectively scoring and to set a suitable threshold. Most existing text classifiers do not concentrate on these two jobs. In this paper, we propose a novel text classifier with pattern-based scoring that describe positive feature effectively, followed by threshold setting. The thresholding is based on score of training set, make it is simple to implement in other scoring methods. Experiment shows that our pattern-based classifier is promising.
Impact and interest:
Citation counts are sourced monthly from and citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from theindexing service can be viewed at the linked Google Scholar™ search.
|Item Type:||Conference Paper|
|Keywords:||Text classification, Pattern mining, Scoring, Thresholding|
|Divisions:||Current > Schools > School of Electrical Engineering & Computer Science
Current > QUT Faculties and Divisions > Science & Engineering Faculty
|Copyright Owner:||Copyright 2013 Springer-Verlag Berlin, Heidelberg|
Conference proceedings published, by Springer Verlag, will be available via SpringerLink.
or Lecture Notes in Computer Science
|Deposited On:||17 Apr 2013 05:17|
|Last Modified:||14 Jul 2013 21:54|
Repository Staff Only: item control page