Enhanced Topic Representation by Ambiguity Handling

, , , & (2022) Enhanced Topic Representation by Ambiguity Handling. In Chbeir, Richard, Huang, Helen, Silvestri, Fabrizio, Manolopoulos, Yannis, & Zhang, Yanchun (Eds.) Web Information Systems Engineering - WISE 2022: 23rd International Conference, Biarritz, France, November 1-3, 2022, Proceedings. Springer, Cham, Switzerland, pp. 357-369.

Free-to-read version at publisher website

Description

Most of the existing semantic-based topic models and topic
generation approaches use external knowledgebases or ontology to interpret
the meanings of the words. However, general ontologies do not cover
many ambiguous or specific domain-related words in a text collection.
Hence those ambiguous or domain-specific words are neglected in capturing
the meanings in topic generation. In this paper, we introduce an
approach to disambiguate the unmatched words in a text collection based
on related and similar meaning words. Word embeddings are applied
to discover similar or related words. We evaluated the topic generation
approach with our ambiguity handling technique with a set of state-of-theart
systems which uses an external ontology. Our approach outperformed,
and the generated topics were more meaningful. Our ambiguity handling
approach interpreted all the important words and included them in the
topic generation process.

Impact and interest:

0 citations in Scopus
Search Google Scholar™

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

ID Code: 235947
Item Type: Chapter in Book, Report or Conference volume (Conference contribution)
Series Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
ORCID iD:
Xu, Yueorcid.org/0000-0002-1137-0272
Li, Yuefengorcid.org/0000-0002-3594-8980
Measurements or Duration: 13 pages
Keywords: Ambiguity, Semantics, Concepts, Topic representation
DOI: 10.1007/978-3-031-20891-1_25
ISBN: 978-3-031-20890-4
Pure ID: 117154467
Divisions: Current > QUT Faculties and Divisions > Faculty of Science
Current > Schools > School of Computer Science
Copyright Owner: 2022 The Author(s)
Copyright Statement: This work is covered by copyright. Unless the document is being made available under a Creative Commons Licence, you must assume that re-use is limited to personal use and that permission from the copyright owner must be obtained for all other uses. If the document is available under a Creative Commons License (or other specified license) then refer to the Licence for details of permitted re-use. It is a condition of access that users recognise and abide by the legal requirements associated with these rights. If you believe that this work infringes copyright please provide details by email to qut.copyright@qut.edu.au
Deposited On: 14 Feb 2023 00:57
Last Modified: 23 May 2024 00:46