AAAI Publications, Sixth International AAAI Conference on Weblogs and Social Media

Unsupervised Real-Time Company Name Disambiguation in Twitter
Agustín D. Delgado Muñoz, Raquel Martínez Unanue, Alberto Pérez García-Plaza, Víctor Fresno

Last modified: 2012-05-20

Abstract


This paper presents a new approach to disambiguating company names on the Twitter social network. We focus on lightening the process of comparing company profiles with tweets in order to obtain a competitive real-time system. To this end, we use only the home page of each company as the information source from which to build a single profile per company. We then compute the similarity between a tweet and a profile by comparing the content of the tweet with the content of the profile. Neither step uses any other external information source, and the whole process is unsupervised. We have evaluated our system on the WePS-3 CLEF ORM test corpus, obtaining encouraging results.
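
As a rough illustration of the two steps described above (building a profile from a home page, then scoring a tweet against it), the following is a minimal sketch only. The abstract does not specify the similarity measure, the text preprocessing, or any decision threshold, so the cosine similarity over raw term counts, the function names, and the `threshold` parameter are all assumptions made for this example and should not be read as the authors' method.

```python
import re
from collections import Counter
from math import sqrt

# Assumption: a simple lowercase alphanumeric tokenizer; the paper's
# actual preprocessing is not described in the abstract.
TOKEN = re.compile(r"[a-z0-9]+")


def tokenize(text):
    """Split text into lowercase alphanumeric tokens."""
    return TOKEN.findall(text.lower())


def build_profile(home_page_text):
    """Build a term-count profile from the text of a company's home page."""
    return Counter(tokenize(home_page_text))


def similarity(tweet, profile):
    """Cosine similarity between a tweet and a profile (assumed measure)."""
    tweet_vec = Counter(tokenize(tweet))
    # Counter returns 0 for terms that are absent from the profile.
    dot = sum(count * profile[term] for term, count in tweet_vec.items())
    norm = (sqrt(sum(c * c for c in tweet_vec.values()))
            * sqrt(sum(c * c for c in profile.values())))
    return dot / norm if norm else 0.0


def is_about_company(tweet, profile, threshold=0.3):
    """Hypothetical decision rule: accept the tweet if it is similar enough."""
    return similarity(tweet, profile) >= threshold


profile = build_profile("Apple designs iPhone, iPad and Mac computers")
related = similarity("new iphone from apple", profile)
unrelated = similarity("apple pie recipe with cinnamon", profile)
```

In this toy example the tweet that mentions both "iphone" and "apple" scores higher than the recipe tweet, which shares only the ambiguous term "apple" with the profile. Because the profile is built once per company and each tweet needs only one pass over its own tokens, a scheme of this kind stays cheap enough for real-time use, which is the property the abstract emphasizes.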
