Semantic Indexing of Document Bases

Authors

Roberto Basili and Maria Teresa Pazienza

Track:

Contents

Downloads:

Abstract:

Browsing and navigating into a document base can be significantly improved by an easy access to textual sources. Many efficient indexing and search techniques have been proposed in the literature. Word vectors are commonly used to approximate the notion of document content and to support matching algorithms during the retrieval process. Efficiency criteria push for linear non-recursive representations. The text of a document is never processed for its linguistic information content. The gap between the implicit content of (a set of) texts and the rich structured formats (i.e. networks) able support intelligent browsing is well known. In this paper the overall architecture of a language oriented methodology of document processing for a content driven retrieval is described. Lexical acquisition modules are integrated with indexing and browsing ones, in order to support a significant semantic coverage and to guarantee portability throughout different domains. The experience in the development of different IR systems (based on linguistic processing of document content) used to demonstrate feasibility and strengthness of the methodology.

Cookie	Duration	Description
cookielawinfo-checkbox-analytics	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Analytics".
cookielawinfo-checkbox-functional	11 months	The cookie is set by GDPR cookie consent to record the user consent for the cookies in the category "Functional".
cookielawinfo-checkbox-necessary	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookies is used to store the user consent for the cookies in the category "Necessary".
cookielawinfo-checkbox-others	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Other.
cookielawinfo-checkbox-performance	11 months	This cookie is set by GDPR Cookie Consent plugin. The cookie is used to store the user consent for the cookies in the category "Performance".
viewed_cookie_policy	11 months	The cookie is set by the GDPR Cookie Consent plugin and is used to store whether or not user has consented to the use of cookies. It does not store any personal data.