AAAI Publications, Thirty-Second AAAI Conference on Artificial Intelligence

Font Size: 
Consonant-Vowel Sequences as Subword Units for Code-Mixed Languages
Upendra Kumar, Vishal Singh, Chris Andrew, Santhoshini Reddy, Amitava Das

Last modified: 2018-04-29

Abstract


In this research work, we develop a state-of-art model for identifying sentiment in Hindi-English code-mixed language. We introduce new phonemic sub-word units for Hindi-English code-mixed text along with a hierarchical deep learning model which uses these sub-word units for predicting sentiment. The results indicate that the model yields a significant increase in accuracy as compared to other models.

Keywords


Code Mixing; Deep Learning

Full Text: PDF