Behzad Shahrarav and David C. Gibbon
This paper summarizes an ongoing work in multimedia processing aimed at the automated archiving and selective retrieval of textual, pictorial and auditory information contained in video programs. Video processing performs the task representing the visual information using a small subset of the video frames. Linguistic processing refines the closed caption text, generates table of contents, and creates links to relevant multimedia documents. Audio and video information are compressed, and indexed based on their temporal association with the selected video frames and processed text. The derived information is used to automatically generate a hypermedia rendition of the program contents. This provides a compact representation of the information contained in the video program. It also serves as a textual and pictorial index for selective retrieval of the full-motion video program. A fully automatic system has been set up that generates HyperText Markup Language (HTML) renditions of television programs, and makes them available for access over the Internet within seconds of their broadcast.