The Web Science Trust

Measuring Hyperlink Distances: Wikipedia Case Study

Paim, Rodrigo R. and Figueiredo, Daniel R. (2011) Measuring Hyperlink Distances: Wikipedia Case Study. pp. 1-4. In: Proceedings of the ACM WebSci'11, June 14-17 2011, Koblenz, Germany.

[img]
Preview
PDF - Requires a PDF viewer such as GSview, Xpdf or Adobe Acrobat Reader
156Kb
[img]
Preview
Image (PNG)
30Kb

Official URL: http://www.websci11.org/fileadmin/websci/Posters/120_paper.pdf

Abstract

Hyperlinks are a fundamental aspect of the Web, as they play a major role in accomplishing important functions such as document clustering and document ranking. Despite var- ious facets of hyperlink analysis, in this work we consider a novel aspect of hyperlinks, namely their distance. How far in terms of contextual similarity will a hyperlink take you? We consider classical distance functions that capture the similarity between documents as well as propose a new distance function, an IDF-based generalization of Jaccard distance. We characterize the distance distribution of hy- perlinks considering Wikipedia as a case study. Our results indicate that hyperlink distances are strongly skewed, with the majority of hyperlinks exhibiting very long distances.

Item Type:Conference or Workshop Item (Poster)
Web Science Comments:WebSci Conference 2011
Subjects:WS1 Computer Science
Web Science Events > Web Science 2011
ID Code:524
Deposited By: Lisa Sugiura
Deposited On:07 Jun 2011 16:37
Last Modified:25 Oct 2011 17:11

Repository Staff Only: item control page

EPrints Logo
Web Science Repository is powered by EPrints 3 which is developed by the School of Electronics and Computer Science at the University of Southampton. More information and software credits.