MSN Search WhitePaper
gps31
08-24-2004, 09:25 AM
http://research.microsoft.com/research/pubs/view.aspx?tr_id=754
Pyrrhonist
08-24-2004, 10:51 AM
For those of us who don't feel like clicking on the link, here's the quote from the article that gps31 is linking to:
MSR-TR-2004-50
Block-level Link Analysis
Deng Cai; Xiaofei He; Ji-Rong Wen; Wei-Ying Ma
June 2004
8 p.
Available Documents:
PDF 574 Kb
Link Analysis has shown great potential in improving the performance of web search. PageRank and HITS are two of the most popular algorithms. Most of the existing link analysis algorithms treat a web page as a single node in the web graph. However, in most cases, a web page contains multiple semantics and hence the web page might not be considered as the atomic node. In this paper, the web page is partitioned into blocks using the vision-based page segmentation algorithm. By extracting the page-to-block, block-to-page relationships from link structure and page layout analysis, we can construct a semantic graph over the WWW such that each node exactly represents a single semantic topic. This graph can better describe the semantic structure of the web. Based on block-level link analysis, we proposed two new algorithms, Block Level PageRank and Block Level HITS, whose performances we study extensively using web data.
Keywords: Web information retrieval, VIsion-based Page Segmentation, Graph Model, Link Analysis
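For anyone who wants a feel for the idea in the abstract, here is a rough sketch in Python. It is not the paper's implementation; the abstract gives no formulas, so everything below is an assumption made for illustration: the matrix names page_to_block and block_to_page, the choice to build the page graph as their product, the block weights in the toy data, and the damping factor of 0.85. Page segmentation itself is skipped and the blocks are simply given by hand.

```python
# Illustrative sketch only: a PageRank run over a page graph that is built
# from page-to-block and block-to-page relationships. All names and numbers
# are hypothetical and are not taken from the paper.

def matmul(a, b):
    """Multiply two matrices given as lists of rows."""
    return [[sum(a[i][k] * b[k][j] for k in range(len(b)))
             for j in range(len(b[0]))]
            for i in range(len(a))]

def row_normalize(m):
    """Make every row sum to 1; an all-zero row becomes uniform."""
    out = []
    for row in m:
        total = sum(row)
        n = len(row)
        out.append([v / total for v in row] if total > 0 else [1.0 / n] * n)
    return out

def pagerank(transition, damping=0.85, iters=100):
    """Plain power iteration on a row-stochastic transition matrix."""
    n = len(transition)
    rank = [1.0 / n] * n
    for _ in range(iters):
        rank = [(1 - damping) / n
                + damping * sum(rank[i] * transition[i][j] for i in range(n))
                for j in range(n)]
    return rank

def block_level_pagerank(page_to_block, block_to_page, damping=0.85, iters=100):
    """Assumed construction: page graph = page_to_block x block_to_page."""
    page_graph = row_normalize(matmul(page_to_block, block_to_page))
    return pagerank(page_graph, damping, iters)

# Toy data: 3 pages, 4 blocks.
# Page 0 has a main-content block (weight 0.9) and a footer block (weight 0.1).
# Pages 1 and 2 each consist of a single block.
page_to_block = [
    [0.9, 0.1, 0.0, 0.0],
    [0.0, 0.0, 1.0, 0.0],
    [0.0, 0.0, 0.0, 1.0],
]
# Block 0 links to page 1, block 1 links to page 2,
# blocks 2 and 3 both link back to page 0.
block_to_page = [
    [0.0, 1.0, 0.0],
    [0.0, 0.0, 1.0],
    [1.0, 0.0, 0.0],
    [1.0, 0.0, 0.0],
]

ranks = block_level_pagerank(page_to_block, block_to_page)
print(ranks)
```

In this toy setup, page 0 links to page 1 from its main content and to page 2 only from its footer, so page 1 ends up ranked above page 2. A page-level PageRank that treats page 0 as one node would split its vote evenly between the two.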
gps31
08-24-2004, 10:58 AM
Here's the link to the PDF:
ftp://ftp.research.microsoft.com/pub/tr/TR-2004-50.pdf
Oh cool, thank you, that's really interesting! :cool: