Professional Documents
Culture Documents
Milyir
Milyir
Introduction
Hyperlink analysis HITS Algorithm Example Page Ranking Example
Classification of rankers
Content based
Connectivity based
Web Page Authors use hyperlinks can give them valuable information content.
navigational aids
point to high quality pages that might be on the same topic as the page containing the hyperlink.
Hyperlink analysis algorithms make either one or both of the following simplifying assumptions:
Assumption 1. A hyperlink from page A to page B is a recommendation of page B by the author of page A. Assumption 2. If page A and page B are connected by a hyperlink, then they might be on the same topic.
The power of hyperlink analysis comes from the fact that it uses the content of other pages to rank the current page.
HITS ALGORITHM
AUTHORITY
HUBS
Links to authorities
Indegree number of incoming links to a given node, used to measure the authoritativeness. Outdegree number of outgoing links from a given node, here it is used to measure the hubness.
HITS algorithm tries to determine good hubs and authorities. Given a user query, the algorithm iteratively computes hub and authority scores for each node in the neighborhood graph, and then ranks the nodes by those scores. A document that points to many others is a good hub, and a document that many documents point to is a good authority.
Let A be the adjacency matrix of the neighborhood graph. Denote the authority weight vector by v and the hub weight vector by u
Ranking is a process of ordering the returned documents in decreasing order of relevance, that is so that the best answers are on the top.
Basic Concepts
->web pages with high value will be ranked higher
->PageRank can only be increased or improved by getting quality links from other web pages.
-> There should be a page or pages that must be more importance than the others with the same topic in the world wide web.
VOTE When Page 1 links out to Page 2, then Page 1 cast a vote to Page 2.
BACKLINKS When Page 1 links out to Page 2 internally, then Page 2 has a Backlink from Page 1. INTERNAL LINKS Links from web pages within your website. OUTGOING LINKS Links to other web pages within a web site
INBOUND LINK
OUTBOUND LINK
DANGLING LINK
The average PageRank number of pages is always one. Inbound Links will increase PageRank value of a page. Outbound Links will loss a portion of PageRank to the linked page.
Page A has two backlines - a Backlink from Page 1 with PageRank value of 4 and a Backlink from Page 2 with PageRank value of 2. Page 1 has two outbound Links and Page 2 has only one Outbound Link.
THANK YOU