Combining word semantics within complex Hilbert space for information retrieval

Wittek, Peter, Koopman, Bevan, Zuccon, Guido, & Darányi, Sándor (2013) Combining word semantics within complex Hilbert space for information retrieval. In Lecture Notes in Computer Science : Quantum Interaction - 7th International Conference, QI 2013, Leicester, UK, Springer, University of Leicester, England.

View at publisher (open access)

Abstract

Complex numbers are a fundamental aspect of the mathematical formalism of quantum physics. Quantum-like models developed outside physics often overlooked the role of complex numbers. Specifically, previous models in Information Retrieval (IR) ignored complex numbers. We argue that to advance the use of quantum models of IR, one has to lift the constraint of real-valued representations of the information space, and package more information within the representation by means of complex numbers. As a first attempt, we propose a complex-valued representation for IR, which explicitly uses complex valued Hilbert spaces, and thus where terms, documents and queries are represented as complex-valued vectors. The proposal consists of integrating distributional semantics evidence within the real component of a term vector; whereas, ontological information is encoded in the imaginary component. Our proposal has the merit of lifting the role of complex numbers from a computational byproduct of the model to the very mathematical texture that unifies different levels of semantic information. An empirical instantiation of our proposal is tested in the TREC Medical Record task of retrieving cohorts for clinical studies.

Impact and interest:

Citation counts are sourced monthly from Scopus and Web of Science® citation databases.

These databases contain citations from different subsets of available publications and different time periods and thus the citation count from each is usually different. Some works are not in either database and no count is displayed. Scopus includes citations from articles published in 1996 onwards, and Web of Science® generally from 1980 onwards.

Citations counts from the Google Scholar™ indexing service can be viewed at the linked Google Scholar™ search.

Full-text downloads:

47 since deposited on 07 May 2014
8 in the past twelve months

Full-text downloads displays the total number of times this work’s files (e.g., a PDF) have been downloaded from QUT ePrints as well as the number of downloads in the previous 365 days. The count includes downloads for all files if a work has more than one.

ID Code: 69312
Item Type: Conference Paper
Refereed: No
Additional URLs:
DOI: 10.1007/978-3-642-54943-4_14
Divisions: Current > Institutes > Institute for Future Environments
Current > Schools > School of Information Systems
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Copyright Owner: Copyright 2014 Springer-Verlag Berlin Heidelberg
Deposited On: 07 May 2014 00:03
Last Modified: 28 Nov 2016 10:03

Export: EndNote | Dublin Core | BibTeX

Repository Staff Only: item control page