Skip to main navigation Skip to search Skip to main content

Using symbolic objects to cluster web documents

Research output: Chapter in Book/Report/Conference proceedingConference contributionpeer-review

6 Scopus citations

Abstract

Web Clustering is useful for several activities in the WWW, from automatically building web directories to improve retrieval performance. Nevertheless, due to the huge size of the web, a linear mechanism must be employed to cluster web documents. The k-means is one classic algorithm used in this problem. We present a variant of the vector model to be used with the k-means algorithm. Our representation uses symbolic objects for clustering web documents. Some experiments were done with positive results and future work is optimistic.

Original languageEnglish
Title of host publicationProceedings of the 15th International Conference on World Wide Web
Pages967-968
Number of pages2
DOIs
StatePublished - 2006
Event15th International Conference on World Wide Web - Edinburgh, Scotland, United Kingdom
Duration: 23 May 200626 May 2006

Publication series

NameProceedings of the 15th International Conference on World Wide Web

Conference

Conference15th International Conference on World Wide Web
Country/TerritoryUnited Kingdom
CityEdinburgh, Scotland
Period23/05/0626/05/06

Keywords

  • Symbolic data analysis
  • Web clustering

Fingerprint

Dive into the research topics of 'Using symbolic objects to cluster web documents'. Together they form a unique fingerprint.

Cite this