AAAI Publications, 2013 AAAI Spring Symposium Series

Font Size: 
The Ubuntu Chat Corpus for Multiparticipant Chat Analysis
David C. Uthus, David W. Aha

Last modified: 2013-03-15

Abstract


We present the Ubuntu Chat Corpus as a data source for multiparticipant chat analysis. This addresses the problem of the lack of a large, publicly suitable corpora for research in this medium. The advantages of using this corpus for research is its large number of chat messages, its multiple languages, its technical nature, and all of the original chat messages are in the public domain.

Keywords


Multiparticipant Chat; Corpus

Full Text: PDF