Recognizing Unseen Attribute-Object Pair with Generative Model

Zhixiong Nan; Yang Liu; Nanning Zheng; Song-Chun Zhu

doi:10.1609/aaai.v33i01.33018811

Authors

Zhixiong Nan Xi'an Jiaotong University
Yang Liu University of California, Los Angeles
Nanning Zheng Xi'an Jiaotong University
Song-Chun Zhu University of California, Los Angeles

DOI:

https://doi.org/10.1609/aaai.v33i01.33018811

Abstract

In this paper, we are studying the problem of recognizing attribute-object pairs that do not appear in the training dataset, which is called unseen attribute-object pair recognition. Existing methods mainly learn a discriminative classifier or compose multiple classifiers to tackle this problem, which exhibit poor performance for unseen pairs. The key reasons for this failure are 1) they have not learned an intrinsic attributeobject representation, and 2) the attribute and object are processed either separately or equally so that the inner relation between the attribute and object has not been explored. To explore the inner relation of attribute and object as well as the intrinsic attribute-object representation, we propose a generative model with the encoder-decoder mechanism that bridges visual and linguistic information in a unified end-to-end network. The encoder-decoder mechanism presents the impressive potential to find an intrinsic attribute-object feature representation. In addition, combining visual and linguistic features in a unified model allows to mine the relation of attribute and object. We conducted extensive experiments to compare our method with several state-of-the-art methods on two challenging datasets. The results show that our method outperforms all other methods.

Recognizing Unseen Attribute-Object Pair with Generative Model

Authors

DOI:

Abstract

Downloads

Published

How to Cite

Issue

Section

Information

Developed By

Subscription