Marius Pasca, Benjamin Van Durme
Within the larger area of automatic acquisition of knowledge from the Web, we introduce a method for extracting relevant attributes, or quantifiable properties, for various classes of objects. The method extracts attributes such as "capital city" and "President" for the class Country, or "cost," "manufacturer" and "side effects" for the class Drug, without relying on any expensive language resources or complex processing tools. In a departure from previous approaches to large-scale information extraction, we explore the role of Web query logs, rather than Web documents, as an alternative source of class attributes. The quality of the extracted attributes recommends query logs as a valuable, albeit little explored, resource for information extraction.
Subjects: 13. Natural Language Processing; 10. Knowledge Acquisition
Submitted: Sep 29, 2006