• Gold M - Skip to Main Content.
  • University of Minnesota
  • Search U of M
  • CSE Home
  • IT Home
  • Directories
  • One Stop
  • myU
Computer Science & Engineering
Prospective Students
Current Students
Alumni
Industry

Computer Science & Engineering

  • Department Info
    • About Us
    • Contact Info
    • Department News
    • Giving
  •  
  • Admissions
    • Undergraduate
    • Graduate
  •  
  • Academics
    • Undergraduate
    • Graduate
  •  
  • People
    • Faculty
    • Graduate Students
  •  
  • Research
    • Research Areas
    • Tech Reports
    • Related Centers
  •  
  • Resources
    • Forms
    • Systems Help
    • Faculty Portal locked external link
    • Computing Facilities
    • Department Wiki locked external link
    • Employment
  •  
  • Site Map
  •  
  •  
Institute of Technology Logo
Home > Research > Tech Reports
Browse reports by year:
[ ALL 1990 1991 1992 1993 1994 1995 1996 1997 1998 1999 2000 2001 2002 2003 2004 2005 2006 2007 2008 2009 ]
Browse report authors:
[ ALL A B C D E F G H I J K L M N O P Q R S T U V W X Y Z ]
Browse reports by title:
[ ALL A B C D E F G H I J K L M N O P Q R S T U V W X Y Z ]

University of Minnesota - Computer Science and Engineering Technical Report Abstract

Usage Meets Link Analysis: Towards Improving Site Specific and Intranet Search via Usage Statistics

Report Number: 04-019
Date of Submission: 5/24/2004

Authors:
   
   

View Report:
   PDF format

Abstract:

In this paper, we explore the possibility of incorporating usage statistics to improve ranking quality in site specific and intranet search engines. We introduce a number of usage based ranking approaches including a PageRank extension, Usage aware PageRank (UPR), an extension to HITS (UHITS), and a naive approach that uses number of visits to pages as a quality measure. We compare these methods against each other and against two major link analysis approaches (PageRank and HITS). We investigate weighting schemes that take into account the probability of visiting a page directly (by typing or via bookmarks), as well as the relative probability of following a particular link from a given page. Both of these probabilities can be approximated from usage logs. We developed a site specific search engine (http://usearch.cs.umn.edu/), and incorporated the above methods. The parameter space for UPR and UHITS are sampled to examine the effects of varying usage emphasis factors. Experimental results are carried out on a medium size domain, cs.umn.edu, with 20K static web pages. We provide both global and query dependent comparisons. Experiments suggest that UPR is promising and has a number of desirable properties. It generalizes PageRank and inherits basic PageRank properties. It is also stable and flexible. The emphasis given to usage information is controlled via two parameters. If the parameters are set to zero, the algorithm reduces to the original PageRank algorithm; if they are set to one, the emphasis shifts to the usage graph; for values in between, both of the graphs are used with the specified weights. UPR is relatively inexpensive. The usage graph can be updated incrementally and efficiently as new usage information becomes available. A UPR iteration has a space/time complexity similar to a PageRank iteration.

Related Links

  • U of M Research centers and institutes
  • Undergraduate Research Opportunities Program
  • Experts@Minnesota
  • Office of Graduate School Outreach
  • IT Faculty & research
  • Colloquia
  • Talks

 

  • ©2006 - 2009 Regents of the University of Minnesota. All rights reserved.
  • Privacy
  • Contact U of M
  • Contact CSE
  • CSE Employment
  • Site Map
  • The University of Minnesota is an equal opportunity educator and employer.
  • Last modified on July 23, 2008