Information Integration on the Web (IIWeb)
Papers from the 2007 AAAI Workshop
Ullas Nambiar and Zaiqing Nie, Program Cochairs
Technical Report WS-07-14 published by The AAAI Press, Menlo Park, California
This technical report is also available in book and CD format.
Contents
Organizing Committee / vii
Ullas Nambiar and Zaiqing Nie
Preface / vii
Ullas Nambiar and Zaiqing Nie
Semantic Data Integration Environment for Biomedical Research (Mediator Infrastructure for Information Integration) / 1
Vadim Astakhov, Jeffrey S. Grethe, Edward Ross, David Little, Brian Sanders, and Amarnath Gupta
Learning Extractors from Unlabeled Text Using Relevant Databases / 10
Kedar Bellare and Andrew McCallum
Information Integration for the Masses / 16
Jim Blythe, Dipsy Kapoor, Craig A. Knoblock, Kristina Lerman, and Steven Minton
A Platform for Scalable, Collaborative, Structured Information Integration / 22
Kurt Bollacker, Patrick Tufts, Tomi Pierce, and Robert Cook
Citepack: An Autonomous Agent for Discovering and Integrating Research Sources / 28
Christopher H. Brooks, Yeh Fang, Ketaki Joshi, Papanii Okai, and Xia Zhou
Author Disambiguation using Error-driven Machine Learning with a Ranking Loss Function / 32
Aron Culotta, Pallika Kanani, Robert Hall,Michael Wick, and Andrew McCallum
Efficient Strategies for Improving Partitioning-Based Author Coreference by Incorporating Web Pages as Graph Nodes / 38
Pallika Kanani and Andrew McCallum
Query Rewriting for Semantic Web Information Integration / 44
Dave Kolas
Using Regulatory Instructions for Information Extraction / 50
Thomas Y. Lee
Name Disambiguation Using Web Connection / 56
Yiming Lu, Zaiqing Nie, Taoyuan Cheng, Ying Gao, and Ji-Rong Wen
On the Stable Marriage of Maximum Weight Royal Couples / 62
Anan Marie and Avigdor Gal
Mining Heterogeneous Transformations for Record Linkage / 68
Matthew Michelson and Craig A. Knoblock
Probabilistic Representations for Integrating Unreliable Data Sources / 74
David Mimno, Andrew McCallum, and Gerome Miklau
Putting Semantic Information Extraction on the Map: Noisy Label Models for Fact Extraction / 80
Chris Pal, Gideon Mann, and Richard Minerich
Exploiting Social Annotation for Automatic Resource Discovery / 86
Anon Plangprasopchok and Kristina Lerman
BioFederator: A Data Federation System for Bioinformatics on the Web / 92
Ahmed Radwan, Akmal Younis, Sawsan Khuri, Mauricio A. Hernandez, Howard Ho, Lucian Popa, and Shivkumar Shivaji
Answering Top K Queries Efficiently with Overlap in Sources or Source Paths / 98
Louiqa Raschid, Maria Esther Vidal, Yao Wu, Felix Naumann, and Jens Bleiholder
Data Integration Support for Mashups / 104
Andreas Thor, David Aumueller, and Erhard Rahm