AAAI Publications, First AAAI Conference on Human Computation and Crowdsourcing

Font Size: 
Conversations in the Crowd: Collecting Data for Task-Oriented Dialog Learning
Walter Stephen Lasecki, Ece Kamar, Dan Bohus

Last modified: 2013-11-03


A major challenge in developing dialog systems is obtaining realistic data to train the systems for specific domains. We study the opportunity for using crowdsourcing methods to collect dialog datasets. Specifically, we introduce ChatCollect, a system that allows researchers to collect conversations focused around definable tasks from pairs of workers in the crowd. We demonstrate that varied and in-depth dialogs can be collected using this system, then discuss ongoing work on creating a crowd-powered system for parsing semantic frames. We then discuss research opportunities in using this approach to train and improve automated dialog systems in the future.


conversational data; dialog data; data collection; crowdsourcing

Full Text: PDF