TOOLBOX

BROWSE TOPICS

RESOURCES

ABOUT THIS SITE

pmwiki.org
pmwiki-2.2.0-beta65

edit SideBar

Hannah Frost
Media Preservation Librarian
Stanford Digital Library Systems & Services

Conversations and meeting with Reid Smith and Bruce Buchanan (15-Nov-2006, 06-Mar-2007, 23-Mar-2007)

Question: What would you do if you could?

  • We are building an A/V Digitization facility. HF is working with Chris Lacinak (former CTO at Vidipax, now a consultant) to select equipment and approach.
  • MPEG-2 has been used so far, but future recompression will be a problem (see McDonough paper for details).
  • STD capture, 4:2:2 sampling, in AVI or MOV, is an excellent, inexpensive approach for today. STD makes sense for now, given it is the format for most old video.
  • We pay close attention to the decisions taken by the Library of Congress. They are considering Motion JPEG 2000. This is lossless, but one gets about 3:1 improvement in storage vs. AVI or MOV. However, there is not yet a lot of experience, nor equipment. NLM has some experience (Pearson and Gill).
  • Motion JPEG2000 has not been selected as our target video preservation format because the interpretation of the standard is not consistent across the commercially-available tools and technology, which are limited in number at this point anyway. We are leaning toward standard definition uncompressed video (4:2:2) in Quicktime format.
  • We are working with Corpus Christi College on the Parker on the Web project using Aware's JPEG2000 technology for images.
  • Homogenization: Archive digital video in NTSC, rather than maintaining original format (NTSC, PAL, SECAM).
  • Maintain original video so that we can go back to it if needed (e.g., for authenticity purposes or in the event of loss of the digital files).
    • RGS: This will also help with "future-proofing" in case new technology (e.g., better compression algorithms, cheaper disk storage) suggests a new approach at some point in the future.

Question: What metadata do you include with the video and audio materials (nothing is visible on the Web site)?

  • Make metadata a primary focus of the project.
  • There isn't a standard yet for the library and archive field re video metadata, and not a lot of consensus. However, New York university ... and some others have come to agreement.
  • Indexing is a labor-intensive, manual process. For static media, people read the original material and select appropriate index terms and metadata.
  • For the Buckminster Fuller project, we used the item-level description that was already available in EAD markup and FilemakerPro format, but created file-level records to record technical metadata about the digital audio and video files. When we are ready to ingest this content into our repository system, full METS records will be created, which will include the descriptive (mapped to MODS), technical, and other preservation metadata (including select PREMIS elements). Some of the elements were pulled from the SMPTE metadata dictionary (which is much more complex than is needed for Stanford's purposes).
  • Henry Lowood has had some experience with speech recognition software for video indexing for the Silicon Genesis project.

Question: Names of video services; Prices.

  • Vidipax. Excellent facilities and reputation, full range of services, including "top tier." Example: Video tape cleaning before transfer. This can have important quality implications, depending on how the original video has been stored. Cost: ~$200-300/hr and up. There may have been a recent personnel shakeup.
  • AV Geeks selected by Stanford for the bulk of the digitization for the RBF archive on the basis of samples sent to three vendors and a competitive bidding process. The big plus is very low rates. Re customer service and quality, "you get what you pay for."
  • We are working with Monaco Digital Film Labs in San Francisco for uncompressed video for the current Stanford campaign. All metadata is encoded in METS—the standard for archiving and packaging—for the repository system.

Question: Are you serving the Bucky pages from a database? If so, what database, what code on the server side?

  • Streaming is provided by Stanford's streaming service. Contact: Jeff Bornstein.
  • Lucene is used for search.

Question: If Stanford were to act as a regional archive for AI videos, what would be the preferred form of delivery from AAAI?

  • Stanford is moving away from offline media, like DVDs. The primary interest is in retaining preservation masters as digital files. Hard drives are the preferred form of delivery.

Question: How have you approached copyright permission?

  • This is an important question, and one which we handle differently depending on each case. Many of our digital collections are public domain. In the case of Bucky Fuller, we obtained permission from the Fuller Estate to digitize and stream the materials. In some cases there, we received permissions from relevant broadcasting companies or other producers, but sometimes requests were denied.
  • It is becoming standard practice at Stanford Libraries to negotiate for the right to digitize and distribute for educational purposes when we acquire a collection. But of course in many cases we will not have the right to do so with previously acquired material. Unlike other institutions (e.g., Harvard), we don't try to obtain ownership of the copyright to the collections we acquire.
  • Despite the copyright issues complicated by digital technology, we are committed to digitizing for preservation purposes and will develop access methods that are consistent with the law. This may mean providing access to derivative (use copy) files via a stand-alone computer workstation (off the net) in the library itself, or ideally we will work toward developing a networked delivery environment where we have fine-tuned control on access and can authenticate users to access the content lawfully but with some flexibility (i.e., not restricted to the physical library).

Miscellaneous

  • Share what is learned from the AAAI project discussions with other libraries (e.g., MIT, CMU, Edinburgh). There may be a chance to help move the library conservation field forward. There may also be a possibility to use the AAAI videos in an experimental fashion with the new Stanford A/V Digitization facility.
  • It is common archivist practice to add 60 sec. of color bars and a 1 KHz tone at the beginning of each archive video.
  • The project manager for the Feigenbaum archive is Will Snow.
  • Stanford's metadata librarian is Nancy Hoebelheinrich.
  • Sarah Timby processes robotics videos received from AAAI. She works for Henry Lowood.
  • Lauren Schoenthaler is the Senior University Counsel who works with the Stanford Libraries on copyright issues.
  • We discussed the Joshua Lederberg Papers, part of the National Library of Medicine Profiles in Science. The Finding Aid is interesting. The metadata is encoded with Dublin Core.
<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/" title="The Dublin Core metadata Element Set" />
<meta name="DC.Title" content="Finding Aid to the Joshua Lederberg Papers, 1904-2002" />

<meta name="DC.Publisher" content="U.S. National Library of Medicine" />
<meta name="DC.Date.Issued" content="2005-05-20" />
<meta name="DC.Date.Modified" content="2006-11-01" />
<meta name="NLMDC.Date.Expiration" content="2007-11-01" />
<meta name="NLM.Contact.Email" content="hmdweb@nlm.nih.gov" />
<meta name="DC.Type" content="Finding Aids" />
<meta name="NLM.Permanence.Level" content="Permanent: Dynamic Content" />
<meta name="NLM.Permanence.Guarantor" content="U.S. National Library of Medicine" />

<meta name="DC.Rights" content="Public Domain" />
<meta name="DC.Language" content="eng" />

<meta name="dc.title" content="Finding Aid to the Joshua Lederberg Papers, 1904-2002" />
<meta name="DC.Author" content="Lederberg, Joshua" />
<meta name="DC.Subject.Mesh" content=" Beadle, George W. " />
<meta name="DC.Subject.Mesh" content=" Cavalli-Sforza, Luca" />
<meta name="DC.Subject.Mesh" content=" Crick, Francis" />
<meta name="DC.Subject.Mesh" content=" Crow, James F. " />
<meta name="DC.Subject.Mesh" content=" Davis, Bernard D. " />
<meta name="DC.Subject.Mesh" content=" Delbruck, Max" />
<meta name="DC.Subject.Mesh" content=" Demerec, Milislav" />
<meta name="DC.Subject.Mesh" content=" Djerassi, Carl" />
<meta name="DC.Subject.Mesh" content=" Edwards, Philip R. " />

<meta name="DC.Subject.Mesh" content=" Feigenbaum, Edward A. " />
<meta name="DC.Subject.Mesh" content=" Hayes, William" />
<meta name="DC.Subject.Mesh" content=" Horowitz, Norman H. " />
<meta name="DC.Subject.Mesh" content=" Iino, Tetsuo" />
<meta name="DC.Subject.Mesh" content=" Klein, George " />
<meta name="DC.Subject.Mesh" content=" Lederberg, Esther" />
<meta name="DC.Subject.Mesh" content=" Lederberg, Joshua" />
<meta name="DC.Subject.Mesh" content=" Lein, Joseph" />
<meta name="DC.Subject.Mesh" content=" Luria, Salvador E. " />
<meta name="DC.Subject.Mesh" content=" Morse, M. Larry" />
<meta name="DC.Subject.Mesh" content=" Muller, Herman J. " />
<meta name="DC.Subject.Mesh" content=" Nossal, Gustav J. V. " />
<meta name="DC.Subject.Mesh" content=" Novick, Aaron" />
<meta name="DC.Subject.Mesh" content=" Sagan, Carl" />
<meta name="DC.Subject.Mesh" content=" Shortliffe, Edward H. " />
<meta name="DC.Subject.Mesh" content=" Sonneborn, Tracy M. " />
<meta name="DC.Subject.Mesh" content=" Stocker, Bruce A. D. " />

<meta name="DC.Subject.Mesh" content=" Tatum, Edward L. " />
<meta name="DC.Subject.Mesh" content=" Watson, James" />
<meta name="DC.Subject.Mesh" content=" Zinder, Norton D. " />
<meta name="DC.Subject.Mesh" content=" American Society for Microbiology " />
<meta name="DC.Subject.Mesh" content=" Arms Control and Disarmament Agency (ACDA) " />
<meta name="DC.Subject.Mesh" content=" Bristol Laboratories " />
<meta name="DC.Subject.Mesh" content=" Carnegie Commission " />
<meta name="DC.Subject.Mesh" content=" Center for Advanced Studies in Behavioral Sciences (CASBS) " />
<meta name="DC.Subject.Mesh" content=" Chief of Naval Operations Executive Panel (CNO/CEP)" />
<meta name="DC.Subject.Mesh" content=" Columbia University " />
<meta name="DC.Subject.Mesh" content=" Committee on International Security and Arms Control (CISAC) " />
<meta name="DC.Subject.Mesh" content=" Defense Science Board (DSB) " />
<meta name="DC.Subject.Mesh" content=" Federation of American Scientists " />
<meta name="DC.Subject.Mesh" content=" Genetics Society of America  " />
<meta name="DC.Subject.Mesh" content=" Jane Coffin Childs Memorial Fund for Medical Research " />
<meta name="DC.Subject.Mesh" content=" National Academy of Sciences " />
<meta name="DC.Subject.Mesh" content=" National Academy of Sciences Institute of Medicine " />

<meta name="DC.Subject.Mesh" content=" National Institutes of Health  " />
<meta name="DC.Subject.Mesh" content=" National Research Council " />
<meta name="DC.Subject.Mesh" content=" Office of Technology Assessment " />
<meta name="DC.Subject.Mesh" content=" Risk Assessment and Management Commission (RAMC) " />
<meta name="DC.Subject.Mesh" content=" Rockefeller University " />
<meta name="DC.Subject.Mesh" content=" Space Science Board  " />
<meta name="DC.Subject.Mesh" content=" Stanford University " />
<meta name="DC.Subject.Mesh" content=" United States House of Representatives " />
<meta name="DC.Subject.Mesh" content=" United States Senate " />
<meta name="DC.Subject.Mesh" content=" University of Wisconsin " />
<meta name="DC.Subject.Mesh" content=" World Health Organization " />
<meta name="DC.Subject.Mesh" content=" Yale University " />
<meta name="DC.Subject.Mesh" content="Advisory Committees" />
<meta name="DC.Subject.Mesh" content="Advisory Committees" />
<meta name="DC.Subject.Mesh" content="Artificial Intelligence " />
<meta name="DC.Subject.Mesh" content="Bacteria" />
<meta name="DC.Subject.Mesh" content="Bacteriophages " />

<meta name="DC.Subject.Mesh" content="Biological Warfare " />
<meta name="DC.Subject.Mesh" content="Bioterrorism" />
<meta name="DC.Subject.Mesh" content="Cell Culture " />
<meta name="DC.Subject.Mesh" content="Communicable Diseases" />
<meta name="DC.Subject.Mesh" content="Conservation of Natural Resources" />
<meta name="DC.Subject.Mesh" content="Developmental Biology" />
<meta name="DC.Subject.Mesh" content="DNA " />
<meta name="DC.Subject.Mesh" content="Drug Resistance, Microbial " />
<meta name="DC.Subject.Mesh" content="Escherichia Coli " />
<meta name="DC.Subject.Mesh" content="Eugenics" />
<meta name="DC.Subject.Mesh" content="Evolution" />
<meta name="DC.Subject.Mesh" content="Exobiology " />
<meta name="DC.Subject.Mesh" content="Expert Systems" />
<meta name="DC.Subject.Mesh" content="Genetic Engineering " />
<meta name="DC.Subject.Mesh" content="Genetics " />
<meta name="DC.Subject.Mesh" content="Genetics, Biochemical " />
<meta name="DC.Subject.Mesh" content="Genetics, Microbial " />

<meta name="DC.Subject.Mesh" content="Health Policy" />
<meta name="DC.Subject.Mesh" content="Information Science " />
<meta name="DC.Subject.Mesh" content="Lysogeny " />
<meta name="DC.Subject.Mesh" content="Mental Health" />
<meta name="DC.Subject.Mesh" content="Mental Retardation" />
<meta name="DC.Subject.Mesh" content="Mutation " />
<meta name="DC.Subject.Mesh" content="Neoplasms" />
<meta name="DC.Subject.Mesh" content="Recombination, Genetic " />
<meta name="DC.Subject.Mesh" content="Risk Assessment" />
<meta name="DC.Subject.Mesh" content="Transduction, Genetic " />
<meta name="DC.Subject.Mesh" content="Transformation, Bacterial" />
<meta name="DC.Title" content="Joshua Lederberg Papers
1904-2002" />
<meta name="DC.Type" content="text" />
<meta name="DC.Format" content="manuscripts" />
  • In April, work will start on the Arthur Kornberg papers.
  • Consider use of the SAMMA system for automated migration of media assets.

Contributions?

Tags: Frost
AAAI Home   Recent Changes   Edit   History   Print   Contact Us
Page last modified on December 01, 2007, at 05:58 AM