Algorithms for Top-k Personalized PageRank Queries

Manish Gupta

IIT Bombay
Mumbai, India

Amit Pathak

IIT Bombay
Mumbai, India

Soumen Chakrabarti

IIT Bombay
Mumbai, India

ABSTRACT

In entity-relation (ER) graphs (V,E), nodes V represent typed entities and edges E represent typed relations. For dynamic personalized PageRank queries, nodes are ranked by their steady-state probabilities obtained using the standard random surfer model. In this work, we propose a framework to answer top-k graph conductance queries. Our top-k ranking technique leads to a 4X speedup, and overall, our system executes queries 200-1600X faster than whole-graph PageRank. Some queries might contain hard predicates, i.e., predicates that must be satisfied by the answer nodes. E.g., we may seek authoritative papers on public key cryptography, but only those written during 1997. We extend our system to handle hard predicates. Our system achieves these substantial query speedups while consuming only 10-20% of the space taken by a regular text index.

Categories & Subject Descriptors

H.3.1 [Information Systems]: Information Storage and Retrieval: Content Analysis and Indexing; H.3.3 [Information Systems]: Information Search and Retrieval

General Terms

Algorithms, Experimentation, Measurement, Performance

Keywords

top-k, PageRank, HubRank, node deletion, personalized

Introduction and Related Work

Graph proximity queries of the form ``entities related to a set of keywords'' can be answered by searching for answer nodes in the vicinity of nodes matching the keywords. Such graph conductance queries assume that a keyword-originated prestige starts at the nodes matching the query words and flows through the graph following edges, and the rank of a node depends on the amount of prestige that reaches it. Several techniques have been proposed in the literature to compute such search rankings; one of them is HubRank [3]. HubRank builds on Personalized PageRank [4] and Berkhin's [1] Bookmark Coloring Algorithm (BCA).
Consider a graph $G=(V,E)$ where each edge $(u,v)\in E$ is associated with a \emph{conductance} $C(v,u)$: the probability of a ``random surfer'' walking from $u$ to~$v$; $\sum_v C(v,u)=1$. The Personalized PageRank Vector (PPV) for a teleport vector $r$ is then defined as:
$$p_r = \alpha C p_r + (1-\alpha) r = (1-\alpha)(\mathbb{I} - \alpha C)^{-1} r$$
where $1-\alpha$ is the teleport probability (typically $0.2$--$0.25$). $r$ is set such that $r(u)>0$ only if $u$ is a match node and $\sum_u r(u)=1$. When $r(u)=1$ for a single node $u$, we denote its PPV by $PPV_u$. Given a hubset $H$ of nodes with precomputed $PPV_h\ \forall h \in H$, we can use Berkhin's [1] asynchronous push to compute $p_r$ for a general $r$, as shown in the basic push algorithm.
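For concreteness, the core push iteration can be sketched as follows. This is a simplified, hypothetical Python illustration under our own naming (it omits the hubset lookups and keeps only the score estimates $\hat p_r$ and residuals $q$); each push lets a node keep a $(1-\alpha)$ share of its residual and forwards the $\alpha$ share along out-edges weighted by conductance:

```python
from collections import defaultdict

def push(out_edges, C, r, alpha=0.8, eps=1e-6):
    """Berkhin-style asynchronous push (illustrative sketch).

    C[(v, u)] is the conductance of edge u -> v; r is the teleport
    vector. Residual mass q is spread until every entry is below eps,
    accumulating lower-bound scores p_hat for the PPV.
    """
    p_hat = defaultdict(float)           # lower bounds on p_r
    q = defaultdict(float, r)            # residual, initialised to teleport
    frontier = [u for u, w in r.items() if w > eps]
    while frontier:
        u = frontier.pop()
        qu = q[u]
        if qu <= eps:
            continue
        q[u] = 0.0
        p_hat[u] += (1 - alpha) * qu     # node keeps (1-alpha) of residual
        for v in out_edges.get(u, ()):   # forward alpha share along edges
            q[v] += alpha * qu * C[(v, u)]
            if q[v] > eps:
                frontier.append(v)
    return p_hat
```

On termination, $\hat p_r$ underestimates the true PPV by at most the leftover residual mass, which is the property the top-k bounds below exploit.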

Basic TopK Framework


For ad-hoc search applications, it is adequate to report the answer nodes with the top-$k$ personalized PageRank values. At some point during the execution of the basic push algorithm, let $u_1,u_2,\ldots$ be the nodes sorted in non-increasing order of their scores ($\hat p_r$). Then, by the proposition below, $u_1, u_2, \ldots, u_k$ are guaranteed to be the best $k$ answer nodes iff $\hat p_r(u_k) \geq \hat p_r(u_{k+1})+\parallel q\parallel_1$. This is the basic idea behind early termination of the algorithm while still guaranteeing top-\textit{k} answers.

WeakestProp
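As an illustration, the termination check implied by this proposition can be sketched in Python (hypothetical names; `p_hat` holds the current score estimates and `q` the residual vector):

```python
def topk_can_stop(p_hat, q, k):
    """Top-k early-termination check (illustrative sketch).

    Since no score can grow by more than ||q||_1 before the push
    terminates, the current top k are final once the k-th score beats
    the (k+1)-th score by that margin.
    """
    scores = sorted(p_hat.values(), reverse=True)
    if len(scores) <= k:
        return False                     # not enough candidates yet
    q_norm = sum(q.values())             # ||q||_1; residuals are >= 0
    return scores[k - 1] >= scores[k] + q_norm
```

The check is cheap relative to a push step, so it can be invoked periodically during the push loop.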


In search applications, if the user asks for $\underline K=20$ responses, it is typically acceptable if the system returns a few more (say $\overline K=40$). So, we propose that the number of responses be bracketed in $[\underline K,\overline K]$. This substantially increases the success rate of the termination check: about half the queries terminate through it. Also, the actual rank $K^{*}$ at which the termination check succeeds is typically very close to $\underline K$. Further, we refine the proposition as:

TheoreticalProp
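The bracketed variant accepts any cut-off in $[\underline K,\overline K]$ at which the margin test succeeds. A hypothetical Python sketch (returns the first admissible rank $K^{*}$, or `None` if the check fails everywhere in the bracket):

```python
def bracketed_can_stop(p_hat, q, k_lo, k_hi):
    """Bracketed termination check (illustrative sketch).

    Scans ranks k_lo..k_hi and returns the first rank K* at which the
    k-th score exceeds the (k+1)-th by the residual mass ||q||_1,
    meaning the top K* answers are final.
    """
    scores = sorted(p_hat.values(), reverse=True)
    q_norm = sum(q.values())
    for k in range(k_lo, min(k_hi, len(scores) - 1) + 1):
        if scores[k - 1] >= scores[k] + q_norm:
            return k                     # K*: return the top K* answers
    return None
```

Allowing any rank in the bracket rather than exactly $\underline K$ is what raises the success rate of the check.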


Basic Push Algorithm



The maximum increase in $\hat p_r(u)$ due to the flow of $q(u)$ that can happen (until push-algorithm termination) is $(1-\alpha) q(u) + \alpha^2 q(u)$. Similarly, the maximum increase in $\hat p_r(u)$ due to the flow of residual from nodes in $V$ other than $u$ is $\alpha(\parallel q\parallel_1 -q(u))$. Thus, $\hat p_r(u)$ can increase by at most $(1-\alpha) q(u) + \alpha^2 q(u)+\alpha(\parallel q\parallel_1 -q(u)) = (1-\alpha)^2 q(u)+\alpha\parallel q\parallel_1$. However, this tighter upper bound depends on $q(u)$, so we would need to maintain lower and upper bounds separately, unlike in the earlier proposition. So, we relax the bound as:

BestProp
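For illustration, a termination check using the per-node bound $(1-\alpha)^2 q(u)+\alpha\parallel q\parallel_1$ derived above could be sketched as follows (hypothetical Python; this is the book-keeping-heavier variant, which the relaxed proposition simplifies by dropping the per-node $q(u)$ term):

```python
def topk_can_stop_refined(p_hat, q, k, alpha=0.8):
    """Refined termination check (illustrative sketch).

    Each node's score can rise by at most
    (1-alpha)^2 * q(u) + alpha * ||q||_1 before termination, a tighter
    ceiling than ||q||_1 alone; the top k are final once every node
    outside them cannot overtake the k-th score.
    """
    ranked = sorted(p_hat, key=p_hat.get, reverse=True)
    if len(ranked) <= k:
        return False
    q_norm = sum(q.values())
    kth = p_hat[ranked[k - 1]]
    return all(
        kth >= p_hat[v] + (1 - alpha) ** 2 * q.get(v, 0.0) + alpha * q_norm
        for v in ranked[k:]
    )
```

The per-node $q(u)$ term is what forces separate lower and upper bounds; relaxing it away lets a single sorted scan suffice.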


Top-k termination check for Basic Push



The relaxed proposition needs less book-keeping than the exact per-node bound. These better bounds provide a 6% reduction in query time without any change in accuracy; more queries quit earlier and at a lower $K^{*}$.

Hard Predicates


Consider the query ``find top-\textit{k} papers related to XML published in 2008''. This enforces that the answer nodes returned should be papers published in 2008. In this section, we extend the basic top-\textit{k} framework to answer queries with such hard predicates efficiently. The target nodes returned as answer nodes must strictly satisfy the hard predicates. One way is to modify ``basic top-\textit{k} for soft predicate queries'' so that a node is considered for the heap $M$ of the basic push algorithm only if it belongs to the target set. We call this \textbf{\textit{naiveTopk}}. We propose a node-deletion algorithm that builds on the idea that ranking non-target nodes is not needed in the presence of hard predicates. It can delete nodes without affecting authority flow in the remaining graph. We then use it to delete non-target nodes from the graph while executing push.
Node Deletion Algorithm

Let $V$ contain a special sink node $s$ with a self-loop of $C(s,s)=1$. The aim of the node-deletion algorithm is to delete a node $u$ from graph $G$ and adapt the graph structure to create $G'=(V',E')$ such that, for any teleport vector $r'$ of dimension $|V'|$ over $G'$, $p_{r'}'(v) = p_r(v)$ for all nodes $v \in V'-\{s\}$, where $p_{r'}'(v)$ is computed over $G'$, $r(v) = r'(v)$ for $v \in V'$, and $r(v) = 0$ for $v \notin V'$.
Let $u$ be the node to be deleted, $v$ one of the in-neighbors of $u$, and $w$ one of the out-neighbors of $u$. Let $q(v)$ be the residual at node $v$ at some instant during execution of the push algorithm. Consider the simple case when $u$ does not have a self-loop. $v$ passes $\alpha q(v) C(u,v)$ on to $u$; $u$ then keeps $(1-\alpha)\left[\alpha q(v) C(u,v)\right]$ for itself and passes $\alpha \left[\alpha q(v) C(u,v)\right] C(w,u)$ on to $w$. The latter can be achieved by increasing $C(w,v)$ by $\alpha C(u,v) C(w,u)$. To account for the self-endorsement of $u$, the retained $(1-\alpha)\left[\alpha q(v) C(u,v)\right]$ is grounded by sending it to the sink node, i.e., by increasing $C(s,v)$ by $(1-\alpha)C(u,v)$. The node-deletion algorithm also handles the case with a self-loop at node $u$.

Node Deletion Algorithm
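For the simple case without a self-loop at $u$, the rewiring step can be sketched as follows (a hypothetical Python illustration; the conductance matrix is stored as a dict keyed by (destination, source), and `'SINK'` stands for the sink node $s$):

```python
def delete_node(C, u, sink='SINK', alpha=0.8):
    """Node-deletion rewiring (illustrative sketch, no self-loop at u).

    C[(w, v)] is the conductance of edge v -> w. Flow v -> u -> w is
    replaced by a direct edge v -> w, and the (1-alpha) share u would
    have kept is grounded at the sink, preserving PPV scores of the
    remaining nodes.
    """
    ins = [v for (t, v) in C if t == u and v != u]    # in-neighbors of u
    outs = [w for (w, s) in C if s == u and w != u]   # out-neighbors of u
    for v in ins:
        cuv = C.pop((u, v))
        for w in outs:
            # flow v -> u -> w becomes a direct edge v -> w
            C[(w, v)] = C.get((w, v), 0.0) + alpha * cuv * C[(w, u)]
        # the share u would keep for itself is grounded at the sink
        C[(sink, v)] = C.get((sink, v), 0.0) + (1 - alpha) * cuv
    for w in outs:
        del C[(w, u)]
    return C
```

Note that each in-neighbor's total out-conductance is preserved: the removed $C(u,v)$ is redistributed as $\alpha C(u,v)\sum_w C(w,u) + (1-\alpha)C(u,v) = C(u,v)$.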



The algorithm takes $O(inDegree(u) \times outDegree(u))$ time and can potentially increase the number of edges by $inDegree(u)\times outDegree(u)-(inDegree(u)+outDegree(u))$.
Ranking only target nodes (DeletePush)

While performing DeletePush, we first pick a node $u$, delete all the ``deletable'' non-target entity nodes reachable from $u$, and then perform push from $u$. Deleting a non-target node avoids any further pushes from it, thereby saving some work. A single node deletion can bloat the number of edges, so we need to judiciously pick the victim nodes (non-target entities) to be deleted, keeping in mind the following observations about social networks.
1. The block structure phenomenon [5] observed in social networks partitions the graph into clusters, making it likely that the edges to be added due to a node deletion already exist in the graph.
2. The indegrees and outdegrees of the nodes in the graph follow a power law [2]. Since a large number of nodes have very small indegree or outdegree, they can be deleted without significant edge blowup.
As we do not want a large amount of time to be spent in node deletion, we use a conservative approach: we do a local search in the (out-)neighborhood of node $u$ (as push may terminate before authority flows to far-off nodes) and delete a non-target, non-hubset node only if its deletion does not blow up the number of edges.
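This conservative victim test might look as follows (a hypothetical sketch consistent with the rewiring above; `max_new_edges` is an illustrative budget parameter, not from the paper, and the conductance matrix is again a dict keyed by (destination, source)):

```python
def safe_to_delete(C, u, hubset, targets, max_new_edges=0):
    """Victim-selection heuristic (illustrative sketch).

    A node qualifies for deletion only if it is neither a target nor a
    hubset node, and rewiring it would grow the edge set by at most
    max_new_edges after accounting for the removed incident edges.
    """
    if u in hubset or u in targets:
        return False
    ins = [v for (t, v) in C if t == u and v != u]
    outs = [w for (w, s) in C if s == u and w != u]
    # edges the rewiring would create that do not already exist
    new_edges = sum(1 for v in ins for w in outs if (w, v) not in C)
    # deletion also removes len(ins) + len(outs) incident edges
    return new_edges - (len(ins) + len(outs)) <= max_new_edges
```

By the power-law observation above, most candidate victims have tiny in- or out-degree, so this test usually succeeds cheaply.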

Experiments


We used a $1994$ snapshot of the CiteSeer graph, which has $74000$ nodes and $289000$ edges with a $55$ MB text index. The $1.9$M CiteSeer queries have an average of $2.68$ words per query. All but the first $100$K queries were used to train and tune our index. We fixed $[\underline{K},\overline{K}]$ as $[20,40]$. We used samples of typical size $10000$ from the first $100000$ queries as test data, and a hubset of size $15000$ generated using the ``naive one-shot'' hub inclusion policy [3]. For the DeletePush experiments, we selected the slowest $1000$ queries from the above sample.

Push times averaged across queries vs. fraction of push time allowed in termination checks. (The top line uses no termination checks.)


As the figure shows, the top-k termination check is fast and effective. Quit checks take a very small amount of time and typically give a 4X speed boost. (To control the fraction of time spent in quit checks, here we timed recent quit checks and invoked them only when enough time had been spent in push loops.) Also reassuring is that as little as 4% of the time invested in quit checks results in robust gains.

Now, let us compare DeletePush with naiveTopk for hard predicates. We varied the target set size by applying different hard predicates on publication years.

Comparison of top-k algorithms


Note that the time required by DeletePush does not decrease in proportion to the decrease in the number of pushes, because of deletion overheads. The comparison figure shows that DeletePush works better when the target set sizes are not too large. Thus, by applying a top-k framework over the basic push algorithm, we efficiently answer graph conductance queries with hard predicates, achieving better query processing times with low indexing space.

REFERENCES

[1] P. Berkhin. Bookmark-coloring approach to personalized pagerank computing. Internet Mathematics, 3(1):41-62, Jan. 2007.

[2] A. Z. Broder, R. Kumar, F. Maghoul, P. Raghavan, S. Rajagopalan, R. Stata, A. Tomkins, and J. L. Wiener. Graph structure in the web. Computer Networks, 33(1-6):309-320, 2000.

[3] S. Chakrabarti. Dynamic personalized PageRank in entity-relation graphs. In WWW, Banff, May 2007.

[4] G. Jeh and J. Widom. Scaling personalized web search. In WWW Conference, pages 271-279, 2003.

[5] S. D. Kamvar, T. H. Haveliwala, C. D. Manning, and G. H. Golub. Exploiting the block structure of the web for computing PageRank. Technical report, Stanford University, Mar. 2003.