Effective pattern taxonomy mining in text documents

Li, Yuefeng, Tao, Xiaohui, & Wu, Sheng-Tang (2008) Effective pattern taxonomy mining in text documents. In Shanahan, James (Ed.) CIKM 08 Proceedings of the 17th ACM Conference on Information and Knowledge Management, Association for Computing Machinery, Napa Valley, CA, pp. 1509-1510.

Many data mining techniques have been proposed for mining useful patterns in databases. However, how to use the discovered patterns effectively remains an open research issue, especially in text mining. Most existing methods adopt term-based approaches, all of which suffer from the problems of polysemy and synonymy. This paper presents an innovative technique, pattern taxonomy mining, to improve the effectiveness of using discovered patterns to find useful information. Substantial experiments on RCV1 demonstrate that the proposed solution achieves encouraging performance.

Impact and interest:

3 citations in Scopus

ID Code: 30447
Item Type: Conference Paper
Refereed: Yes
Keywords: Information filtering, Text mining, Pattern mining, Pattern taxonomy
DOI: 10.1145/1458082.1458360
ISBN: 978-1-59593-991-3
Subjects: Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > ARTIFICIAL INTELLIGENCE AND IMAGE PROCESSING (080100) > Pattern Recognition and Data Mining (080109)
Australian and New Zealand Standard Research Classification > INFORMATION AND COMPUTING SCIENCES (080000) > LIBRARY AND INFORMATION STUDIES (080700) > Records and Information Management (excl. Business Records and Information Management) (080708)
Deposited On: 12 Feb 2010 12:36
Last Modified: 29 Feb 2012 13:47
