Text feature selection for relevance discovery: A fusion-based approach
|
Abdullah Samaran A. Al Harbi Thesis
(PDF 5MB)
Available under License Creative Commons Attribution Non-commercial No Derivatives 4.0. |
Description
This thesis presents innovative and effective feature selection models and frameworks to select and weight relevant features that describe user information needs. The proposed techniques fuse different text features to overcome problems in latent Dirichlet allocation and the relevant features discovered by existing relevance discovery algorithms. The proposed models and frameworks extend multiple random sets to model and understand the complex relationships between different entities that affect the weighting process of topical terms at both document- and collection-levels. The proposed techniques can reduce uncertainties in discovered relevant features and significantly improve the performance of text mining applications.
Impact and interest:
Citation counts are sourced monthly from Scopus and Web of Science® citation databases.
These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.
Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.
Full-text downloads:
Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.
ID Code: | 199275 |
---|---|
Item Type: | QUT Thesis (PhD) |
Supervisor: | Li, Yuefeng & Xu, Yue |
Additional Information: | Executive Dean’s Commendation for Outstanding Doctoral Thesis |
Keywords: | Text Feature Selection, Relevance Discovery, User Information Needs, Random Sets, Uncertainty Reduction, Term Weighting, Feature Re-Ranking, Weight Scaling, Text Feature Fusion, Passage Relevance |
DOI: | 10.5204/thesis.eprints.199275 |
Divisions: | Past > QUT Faculties & Divisions > Science & Engineering Faculty Current > Schools > School of Mathematical Sciences |
Institution: | Queensland University of Technology |
Deposited On: | 23 Apr 2020 06:34 |
Last Modified: | 01 Nov 2021 06:34 |
Export: EndNote | Dublin Core | BibTeX
Repository Staff Only: item control page