Intelligent Integration and Use of Text, Image, Video, and Audio Corpora
Papers from the 1997 AAAI Spring Symposium
Alex Hauptmann and Michael Witbrock, Program Cochairs
Technical Report SS-97-03. Published by The AAAI Press, Menlo Park, California. This technical report is also available in book and CD format.
Please Note: Abstracts are linked to individual titles, and will appear in a separate browser window. Full-text versions of the papers are linked to the abstract text. Access to full text may be restricted to AAAI members. PDF file sizes may be large!
Contents
A Similarity Measure for Automatic Audio Classification / 1
Jonathan Foote
Integration of a Large Text and Audio Corpus using Speaker Identification / 8
Deb Roy and Carl Malamud
Music Information Retrieval Using Audio Input / 12
Lloyd A. Smith, Rodger J. McNab and lan H. Witten
Show and Tell: Using Speech Input for Image Interpretation and Annotation / 17
Rohini K Srihari, Zhongfei Zhang and Ranjiv Chopra
Analysis of Gesture and Action in Technical Talks for Video Indexing / 25
Shanon X Ju, Michael J. Black Scott Minneman and Don Kimber
Exploration in a Large Corpus: Research on the Integration of Eye Gaze and Speech with Visual Information in a Virtual Reality System / 32
C. R. Voss, J. Gurney and J. Walrath
Acquiring and Integrating Knowledge from Images for a Large Scale Hypermedia System / 37
John H. Boose, Larry Baum and Randy J. Kelley
Finding Photograph Captions Multimodally on the World Wide Web / 45
Neil C. Rowe and Brian Frew
Integrating Image Content and its Associated Text in a Web Image Retrieval Agent / 52
Victoria Meza and Jesus Favela
Challenges in the Fusion of Video and Audio for Robust Speech Recognition / 57
Jer-Sen Chen and Oscar N. Garcia
Improving Acoustic Models by Watching Television / 61
Michael Witbrock and Alexander G. Hauptmann
Metadata for Integrating Chinese Text and Speech Documents in a Multi-media Retrieval System / 64
Yue-Shi Lee and Hsin-Hsi Chen
Studying Search and Archiving in a Real Audio Database / 70
Julia Hirschberg and Steve Whitaker
Integration of Pattern Information and Natural Language Information for Image Analysis and Retrieval / 77
Yasuhiko Watanabe and Makoto Nagao
A General Framework To Control Media that only Exist in Time / 85
Philippe Joly and Philippe Lepain
Reasoning about Form and Content of Multimedia Objects (Extended Abstract) / 89
Carlo Meghini, Fabrizio Sebastiani and Umberto Straccia
Integrating Text and Face Detection for Finding Informative Poster Frames / 95
Michael Smith, Shumeet Baluja and Henry A. Rowley
Segmentation, Content Extraction and Visualization of Broadcast News Video Using Multistream Analysis / 102
Mark Maybury, Andrew Merlino and James Ravson
Spotting by Association in News Video / 113
Yuichi Nakamura and Takeo Kanade
Informedia News-on-Demand: Using Speech Recognition to Create a Digital Video Library / 120
Alexander G. Hauptmann and Michael J. Witbrock
A Practical Video Database Based on Language and Image Analysis / 127
Yiqing Liang, Bede Liu, Wayne Wolf and Thomas Yeh
Efficient Archiving and Content-based Retrieval of Video Information on the Web / 133
Behzad Shahrarav and David C. Gibbon
AAAI Digital Library
AAAI relies on your generous support through membership and donations. If you find these resources useful, we would be grateful for your support.