Finding Specification Pages According to Attributes
This paper presents a method for finding a specification page on the web for a given object (e.g., ``Titanic'') and its class label (e.g., ``film''). A specification page for an object is a web page which gives concise attribute-value information about the object (e.g., ``director''-``James Cameron'' for ``Titanic''). A simple unsupervised method using layout and symbolic decoration cues was applied to a large number of the web pages to acquire the class attributes. We used these acquired attributes to select a representative specification page for a given object from the web pages retrieved by a normal search engine. Experimental results revealed that our method greatly outperformed the normal search engine in terms of specification retrieval.
Sponsor of The CIO Dinner