An exploration of the principles underlying redundancy-based factoid question answering
Title | An exploration of the principles underlying redundancy-based factoid question answering |
Publication Type | Journal Articles |
Year of Publication | 2007 |
Authors | Jimmy Lin |
Journal | ACM Transactions on Information Systems |
Volume | 25 |
Issue | 2 |
Date Published | 2007/04// |
ISBN Number | 1046-8188 |
Keywords | Data redundancy, Web search |
Abstract | The so-called “redundancy-based” approach to question answering represents a successful strategy for mining answers to factoid questions such as “Who shot Abraham Lincoln?” from the World Wide Web. Through contrastive and ablation experiments with Aranea, a system that has performed well in several TREC QA evaluations, this work examines the underlying assumptions and principles behind redundancy-based techniques. Specifically, we develop two theses: that stable characteristics of data redundancy allow factoid systems to rely on external “black box” components, and that despite embodying a data-driven approach, redundancy-based methods encode a substantial amount of knowledge in the form of heuristics. Overall, this work attempts to address the broader question of “what really matters” and to provide guidance for future researchers. |
URL | http://doi.acm.org/10.1145/1229179.1229180 |
DOI | 10.1145/1229179.1229180 |