Unsupervised Discovery of Event Scenarios from Texts

Cosmin A Bejan

We propose a new unsupervised learning approach for discovering event scenarios from texts. We interpret an event scenario as a collection of related events that characterize a specific situation. The approach uses the Latent Dirichlet Allocation (LDA) probabilistic model to automatically learn the probability distribution of events corresponding to event scenarios. We performed experiments on an event annotated corpus and compared the automatically extracted event scenarios with frame scenarios defined in FrameNet. The results show a better coverage for those event scenarios that are described in more detail in the event annotated corpus. When compared with a smoothed unigram model, the event scenario model achieves a perplexity reduction of 93.46%.

Subjects: 13. Natural Language Processing; 13.1 Discourse

Submitted: Feb 25, 2008

This page is copyrighted by AAAI. All rights reserved. Your use of this site constitutes acceptance of all of AAAI's terms and conditions and privacy policy.