Current Issues in Markup-Based Knowledge Extraction

Authors

Udo Kruschwitz

Proceedings:

Acquiring (and Using) Linguistic (and World) Knowledge for Information Access

Volume

Issue:

Papers from the 2002 AAAI Spring Symposium

Track:

Contents

Downloads:

Download PDF

Abstract:

Extracting content from Web pages can be useful for a number of reasons. Our motivation is to help a user in the search for documents in subdomains of the Web such as company sites and intranets. Unlike online product catalogues, the data sources we are interested in are of heterogenous nature. A model that reflects the underlying semantic structure of the document collection can be very helpful. However, it is difficult to get hold of a domain model that can easily be plugged into such a system. We have been working on this problem for some time now and this paper will report our ongoing work in the field of markup-based knowledge extraction. Markup is used to identify conceptual information. This enables us to build a simple domain model automatically. Such a model can be used to enhance standard search facilities by engaging a user in a system initiated dialogue. Another aspect of ongoing research is the improvement of the domain model using ideas adopted from collaborative filtering.

Spring

Papers from the 2002 AAAI Spring Symposium

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.