Discovering interesting information with advances in web technology

Nayak, Richi, Senellart, Pierre, Suchanek, Fabian M, & Varde, Aparna S (2012) Discovering interesting information with advances in web technology. SIGKDD Explorations, 14(2), pp. 63-81.

Abstract

The Web is a steadily evolving resource comprising much more than mere HTML pages. With its ever-growing data sources in a variety of formats, it provides great potential for knowledge discovery. In this article, we shed light on some interesting phenomena of the Web: the deep Web, which surfaces database records as Web pages; the Semantic Web, which defines meaningful data exchange formats; XML, which has established itself as a lingua franca for Web data exchange; and domain-specific markup languages, which are designed based on XML syntax with the goal of preserving semantics in targeted domains. We detail these four developments in Web technology, and explain how they can be used for data mining. Our goal is to show that all these areas can be as useful for knowledge discovery as the HTML-based part of the Web.

ID Code: 62117
Item Type: Journal Article
Refereed: No
Additional Information: Cannot find confirmation that articles are peer reviewed: http://www.sigkdd.org/explorations/issue.php?issue=current
Keywords: Web technology, Deep web, Semantic web, Data exchange formats, XML
DOI: 10.1145/2481244.2481255
ISSN: 1931-0145
Divisions: Current > Schools > School of Electrical Engineering & Computer Science
Current > QUT Faculties and Divisions > Science & Engineering Faculty
Deposited On: 27 Aug 2013 06:37
Last Modified: 28 Aug 2013 04:18
