PivotBrowser: A Tag-Space Image Searching Prototype

Xiaoyan Li

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
kricel_lee@yahoo.com.cn

Lidan Shou

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
should@zju.edu.cn

Gang Chen

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
cg@zju.edu.cn

Xiaolong Zhang

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
xiaolongzhang@zju.edu.cn

Tianlei Hu

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
htl@zju.edu.cn

Jinxiang Dong

College of Computer Science, Zhejiang University
Hangzhou, P.R. China 310027
djx@zju.edu.cn

Copyright is held by the World Wide Web Conference Committee (IW3C2). Distribution of these papers is limited to classroom use, and personal use by others.
WWW 2008, April 21-25, 2008, Beijing, China.
ACM 978-1-60558-085-2/08/04.

ABSTRACT

We propose a novel iterative searching and refining prototype for tagged images. This prototype, named PivotBrowser, captures semantically similar tag sets in a structure called a pivot. By constructing a pivot for a textual query, PivotBrowser first selects candidate images that are possibly relevant to the query. The tags attached to these candidate images are then shortlisted according to their relevance to the pivot. The shortlisted tags are clustered, and one of the tag clusters is used to select the results from the candidate images. Ranking of the images in each partition is based on their relevance to the tag cluster. Guided by the tag clusters presented, a user can perform searching and iterative query refinement.

Categories & Subject Descriptors

H.4 [Information Systems Applications]: Miscellaneous

General Terms

Algorithms, Design

Keywords

tag, inconsistency, ambiguity, relevance

1 Introduction

Tagging-based search systems are known to be prone to semantic errors and limitations [2]. To name a few: (1) different users may use different tags (possibly synonyms) to describe the same object, causing inconsistency in tagging; (2) polysemy (a single term having multiple meanings) in a query causes ambiguity, and such a query is often hard to refine; (3) the distribution of tag usage is usually skewed and has a long tail. As a result, images with rare tags cannot be easily found, while queries with rare tags may need to be expanded to a larger scope. We propose PivotBrowser, an iterative searching and refining prototype for tagged images. PivotBrowser employs a novel tag-based structure called a pivot to address the above problems. Our approach differs from previous work on social tag clustering [1] in that it handles both synonymy and ambiguity.

2 The pivot browsing scheme

To introduce the concept of a pivot, we first define the tag atom, which relies on the availability of a tag thesaurus. The tag thesaurus contains lexical relevance information for all tags, such as synonyms ("flower, bloom, blossom"), spelling variations (plural, abbreviation, etc.), and other highly relevant terms ("film" vs. "movie"). A good example of a tag thesaurus is the one used in WordNet [3]. A tag atom $\widehat{A}$ is a set of tags that satisfies the following requirements: (1) if $\widehat{A}$ contains a tag $t$, it must also contain all lexically relevant tags of $t$ as defined in the thesaurus; (2) any two tags $t_i$ and $t_j$ in $\widehat{A}$ must be lexically relevant to each other. Note that one tag may appear in multiple tag atoms, as it can have more than one lexical meaning in the thesaurus. Therefore, given a universe of tags $\{t_i\}$, we can precompute an inverted list of all possible tag atoms based on the tag thesaurus, referred to below as TAIL. Each entry in TAIL maps a tag to the tag atoms that contain it:

$t_i \rightarrow \{\widehat{A} \mid t_i \in \widehat{A}\}$

A pivot atom of tag $t$, denoted as $PA(t)$, is defined as the union of all tag atoms which contain $t$ (that is, those in the entry of $t$ in TAIL). A pivot of $n$ tags, $P(t_1, t_2, \ldots, t_n)$, is defined as the set containing all pivot atoms of its tags:

$P(t_1, t_2, \ldots, t_n) = \{PA(t_1), PA(t_2), \ldots, PA(t_n)\}$
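The definitions of tag atom, TAIL, and pivot above can be illustrated with a minimal Python sketch. It assumes the tag atoms have already been derived from a thesaurus; the atoms listed in `TAG_ATOMS`, the function names, and the fallback for tags missing from TAIL are hypothetical choices made for illustration and are not taken from the prototype.

```python
from collections import defaultdict


def build_tail(tag_atoms):
    """Precompute the inverted list: tag -> indices of the tag atoms containing it."""
    tail = defaultdict(list)
    for idx, atom in enumerate(tag_atoms):
        for tag in atom:
            tail[tag].append(idx)
    return dict(tail)


def pivot_atom(tag, tail, tag_atoms):
    """PA(t): union of all tag atoms that contain t."""
    merged = set()
    for idx in tail.get(tag, []):
        merged |= tag_atoms[idx]
    # Assumption: a tag unknown to the thesaurus forms a pivot atom by itself.
    return frozenset(merged) if merged else frozenset({tag})


def pivot(tags, tail, tag_atoms):
    """P(t_1, ..., t_n): the pivot atoms of the given tags, in query order."""
    return [pivot_atom(t, tail, tag_atoms) for t in tags]


# Hypothetical tag atoms; "film" appears twice because it has two senses.
TAG_ATOMS = [
    frozenset({"flower", "bloom", "blossom"}),
    frozenset({"film", "movie"}),
    frozenset({"film", "coating"}),
]
TAIL = build_tail(TAG_ATOMS)

if __name__ == "__main__":
    for atom in pivot(["flower", "film"], TAIL, TAG_ATOMS):
        print(sorted(atom))
```

Storing atom indices in TAIL rather than the atoms themselves keeps each entry small, and a pivot atom is built with one lookup followed by a union over the listed atoms.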

2.1 The precomputation

2.2 Pivot browsing

The interactive pivot browsing is an iterative process consisting of the following three phases:

(1) First, when a user issues a query $Q$ containing tags $\{q_1,\ldots,q_n\}$, the system looks up TAIL to find the tag atoms for each query tag $q_i$. By merging the tag atoms for each query tag, we obtain a pivot $P(Q)=\{PA(q_1),\ldots,PA(q_n)\}$. For each tag set $S$ supported by $P(Q)$, we look up the inverted index of the image database to find the images in which all tags in $S$ co-occur. These images are saved as a candidate image set $I_{can}$, and all tags associated with them (except for the query tags in $Q$) are saved as a candidate tag set $T_{can}$ for further consideration in the subsequent phases.

(2) Second, all candidate tags in $T_{can}$ undergo a selection pass, which yields the top-$k$ candidates most relevant to $P(Q)$. The relevance value of each tag is saved as its weight. The tag selection method is discussed in Section 2.3.

(3) Third, the $k$ tags output by the previous phase are clustered on the fly using the graph-partitioning algorithm proposed in [4]. The affinity metric for clustering is based on the precomputed tag-to-tag affinity values. These $k$ tags, grouped into their clusters, are then presented to the user for a new round of tag selection and refinement. Meanwhile, one of the tag clusters (by default the most compact one) is used to select images from $I_{can}$: only candidates with tags that appear in the cluster are selected. The output images are ranked by the relevance between the tag vector of each image and the weighted vector of the cluster, where the weights come from the results of phase 2. The user can also choose another tag cluster for image selection and browsing. If the user subsequently adds a tag to the query or removes one from it, the pivot browsing process enters the next iteration and returns to phase 1.
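The candidate selection of phase 1 and the image ranking of phase 3 can be sketched in Python as follows. Several points are assumptions rather than details from the text: a tag set "supported by" the pivot is taken to contain one tag from each pivot atom, the relevance between an image and a cluster is taken to be cosine similarity over a binary image tag vector, and the cluster weights are supplied by hand instead of being computed by the tag selection and clustering steps. All data and function names are hypothetical.

```python
from itertools import product
from math import sqrt


def candidate_images(pivot, inverted_index):
    """Phase 1: images in which all tags of some supported tag set co-occur."""
    if not pivot:
        return set()
    candidates = set()
    # Assumption: a supported tag set picks one tag from each pivot atom.
    for tag_set in product(*pivot):
        postings = [inverted_index.get(tag, set()) for tag in tag_set]
        candidates |= set.intersection(*postings)
    return candidates


def candidate_tags(images, image_tags, query):
    """T_can: all tags on the candidate images except the query tags."""
    tags = set()
    for image in images:
        tags |= image_tags[image]
    return tags - set(query)


def rank_images(images, image_tags, cluster_weights):
    """Phase 3: keep images sharing a tag with the cluster, ranked by cosine similarity."""
    cluster_norm = sqrt(sum(w * w for w in cluster_weights.values()))
    scored = []
    for image in images:
        tags = image_tags[image]
        overlap = [cluster_weights[t] for t in tags if t in cluster_weights]
        if not overlap:
            continue  # no tag of this image appears in the cluster
        score = sum(overlap) / (sqrt(len(tags)) * cluster_norm)
        scored.append((score, image))
    # Highest score first; ties broken by image id for determinism.
    return [image for score, image in sorted(scored, key=lambda p: (-p[0], p[1]))]


# Hypothetical image database and its inverted index (tag -> image ids).
IMAGE_TAGS = {
    "img1": {"window", "glass", "house"},
    "img2": {"windows", "glass"},
    "img3": {"window", "desktop", "software"},
    "img4": {"door", "house"},
}
INVERTED_INDEX = {}
for image, tags in IMAGE_TAGS.items():
    for tag in tags:
        INVERTED_INDEX.setdefault(tag, set()).add(image)

QUERY = ["window"]
PIVOT = [frozenset({"window", "windows"})]      # hypothetical P(Q)
CLUSTER_WEIGHTS = {"glass": 0.9, "house": 0.6}  # hypothetical weighted tag cluster

I_CAN = candidate_images(PIVOT, INVERTED_INDEX)
T_CAN = candidate_tags(I_CAN, IMAGE_TAGS, QUERY)
RANKED = rank_images(I_CAN, IMAGE_TAGS, CLUSTER_WEIGHTS)

if __name__ == "__main__":
    print(sorted(I_CAN), sorted(T_CAN), RANKED)
```

In this toy example the pivot lets the query "window" also reach the image tagged only with "windows", and choosing a different cluster, say one containing "desktop" and "software", would surface `img3` instead.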

2.3 Selecting pivot-relevant tags

3 Results and conclusion

**Figure 1:** A search result page for the query "window"

We ran 200 unique queries on the prototype, executing each query 100 times. Table 1 presents the average CPU time for selecting the candidate image set from the inverted index of the image database (SelectI), generating the top-$k$ relevant tags (SelectT), clustering the $k$ tags (ClusterT), and ranking the results (Rank). The time for creating a pivot is negligible. The results in the table show that the query cost is dominated by the selection on the inverted index of the image database.

In conclusion, the pivot browsing scheme realizes effective query expansion and image searching in the tag space at low computational and storage cost. It can therefore help users find the intended results more effectively than conventional methods. We believe that pivot browsing can potentially become a general tag-space search paradigm that is not limited to images.

For future work, we plan to conduct a usability study on PivotBrowser. We will also consider a comprehensive study on incorporating visual feature comparison, as well as other tag selection and clustering strategies, into it.

**Table 1:** Average CPU time of each query phase

Phase	SelectI	SelectT	ClusterT	Rank
CPU Time (ms)	535.3	15.5	97.5	78.1
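The dominance of SelectI can be checked directly from the figures in Table 1. The short Python sketch below sums the four phase times and computes each phase's share; it assumes the phases run one after another so that their sum approximates the per-query CPU cost, with pivot creation left out as negligible.

```python
# Average CPU time per phase in milliseconds, copied from Table 1.
cpu_ms = {"SelectI": 535.3, "SelectT": 15.5, "ClusterT": 97.5, "Rank": 78.1}

# Assumption: phases run sequentially, so the total is their sum.
total_ms = sum(cpu_ms.values())
shares = {phase: ms / total_ms for phase, ms in cpu_ms.items()}
dominant = max(shares, key=shares.get)

if __name__ == "__main__":
    for phase, ms in cpu_ms.items():
        print(f"{phase:8s} {ms:6.1f} ms  {shares[phase]:6.1%}")
    print(f"total    {total_ms:6.1f} ms, dominant phase: {dominant}")
```

Under this assumption the total is about 726 ms per query, of which SelectI accounts for roughly 74%.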

REFERENCES

[1] G. Begelman, P. Keller, and F. Smadja. Automated Tag Clustering: Improving search and exploration in the tag space. In WWW Collaborative Web Tagging Workshop, 2006.

[2] S. A. Golder and B. A. Huberman. Usage patterns of collaborative tagging systems. J. Inf. Sci., 32(2):198-208, 2006.

[3] G. A. Miller, R. Beckwith, C. Fellbaum, D. Gross, and K. Miller. Introduction to WordNet: an on-line lexical database. International Journal of Lexicography, 3(4):235-244, 1990.

[4] S. White and P. Smyth. A spectral clustering approach to finding communities in graphs. In SDM, 2005.
