Part Annotations via Pairwise Correspondence
Subhransu Maji, Gregory Shakhnarovich

Last modified: 2012-07-15


We explore the use of an interface for marking pairs of points on two images that are in correspondence with one another, as a way of collecting part annotations. The interface supports annotation of structurally diverse visual categories, such as chairs and buildings, for which it is difficult to define a set of parts, or landmarks, that are consistent, nameable, or uniquely defined across all instances of the category. It allows flexibility in annotation: for example, the landmarks can be instance-specific, are not constrained by language, and can be many-to-one. It also requires few category-specific instructions. We compare our approach with two popular methods of collecting part annotations, (1) drawing bounding boxes for a set of parts and (2) annotating a set of landmarks, in terms of annotation setup overhead, cost, difficulty, applicability, and utility, and identify scenarios where one method is better suited than the others. Preliminary experiments suggest that such annotations between a sparse set of pairs can be used to bootstrap many high-level visual recognition tasks, such as part discovery and semantic saliency.

