Intelligent Integration and Use of Text, Image, Video, and Audio Corpora
Papers from the 1997 AAAI Spring Symposium
Alex Hauptmann and Michael Witbrock, Program Cochairs
Technical Report SS-97-03. Published by The AAAI Press, Menlo Park, California. This technical report is also available in book and CD format.
Please Note: Abstracts are linked to individual titles, and will appear in a separate browser window. Full-text versions of the papers are linked to the abstract text. Access to full text may be restricted to AAAI members. PDF file sizes may be large!
Contents
A Similarity Measure for Automatic Audio Classification / 1
Jonathan Foote
Integration of a Large Text and Audio Corpus using Speaker Identification / 8
Deb Roy and Carl Malamud
Music Information Retrieval Using Audio Input / 12
Lloyd A. Smith, Rodger J. McNab and lan H. Witten
Show and Tell: Using Speech Input for Image Interpretation and Annotation / 17
Rohini K Srihari, Zhongfei Zhang and Ranjiv Chopra
Analysis of Gesture and Action in Technical Talks for Video Indexing / 25
Shanon X Ju, Michael J. Black Scott Minneman and Don Kimber
Exploration in a Large Corpus: Research on the Integration of Eye Gaze and Speech with Visual Information in a Virtual Reality System / 32
C. R. Voss, J. Gurney and J. Walrath
Acquiring and Integrating Knowledge from Images for a Large Scale Hypermedia System / 37
John H. Boose, Larry Baum and Randy J. Kelley
Finding Photograph Captions Multimodally on the World Wide Web / 45
Neil C. Rowe and Brian Frew
Integrating Image Content and its Associated Text in a Web Image Retrieval Agent / 52
Victoria Meza and Jesus Favela
Challenges in the Fusion of Video and Audio for Robust Speech Recognition / 57
Jer-Sen Chen and Oscar N. Garcia
Improving Acoustic Models by Watching Television / 61
Michael Witbrock and Alexander G. Hauptmann
Metadata for Integrating Chinese Text and Speech Documents in a Multi-media Retrieval System / 64
Yue-Shi Lee and Hsin-Hsi Chen
Studying Search and Archiving in a Real Audio Database / 70
Julia Hirschberg and Steve Whitaker
Integration of Pattern Information and Natural Language Information for Image Analysis and Retrieval / 77
Yasuhiko Watanabe and Makoto Nagao
A General Framework To Control Media that only Exist in Time / 85
Philippe Joly and Philippe Lepain
Reasoning about Form and Content of Multimedia Objects (Extended Abstract) / 89
Carlo Meghini, Fabrizio Sebastiani and Umberto Straccia
Integrating Text and Face Detection for Finding Informative Poster Frames / 95
Michael Smith, Shumeet Baluja and Henry A. Rowley
Segmentation, Content Extraction and Visualization of Broadcast News Video Using Multistream Analysis / 102
Mark Maybury, Andrew Merlino and James Ravson
Spotting by Association in News Video / 113
Yuichi Nakamura and Takeo Kanade
Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library / 120
Alexander G. Hauptmann and Michael J. Witbrock
A Practical Video Database Based on Language and Image Analysis / 127
Yiqing Liang, Bede Liu, Wayne Wolf and Thomas Yeh
Efficient Archiving and Content-based Retrieval of Video Information on the Web / 133
Behzad Shahrarav and David C. Gibbon