Open Access. Powered by Scholars. Published by Universities.®

Digital Commons Network

University of Massachusetts - Amherst

2006

Algorithms

Articles 1 - 1 of 1

Full-Text Articles in Entire DC Network

Minimal Test Collections For Retrieval Evaluation, Ben Carterette, James Allan, Ramesh Sitaraman, Jan 2006

Ramesh Sitaraman

Accurate estimation of information retrieval evaluation metrics such as average precision requires large sets of relevance judgments. Building sets large enough for evaluation of real-world implementations is at best inefficient, at worst infeasible. In this work we link evaluation with test collection construction to gain an understanding of the minimal judging effort that must be done to have high confidence in the outcome of an evaluation. A new way of looking at average precision leads to a natural algorithm for selecting documents to judge and allows us to estimate the degree of confidence by defining a distribution over possible document …