Published 2009 | Version public
Book Section - Chapter

Interactively Optimizing Information Retrieval Systems as a Dueling Bandits Problem

Abstract

We present an on-line learning framework tailored towards real-time learning from observed user behavior in search engines and other information retrieval systems. In particular, we only require pairwise comparisons which were shown to be reliably inferred from implicit feedback (Joachims et al., 2007; Radlinski et al., 2008b). We will present an algorithm with theoretical guarantees as well as simulation results.

Additional Information

Copyright 2009 by the author(s)/owner(s). The work was funded under NSF Award IIS-0713483, NSF CAREER Award 0237381, and a gift from Yahoo! Research. The first author is also partly funded by a Microsoft Research Graduate Fellowship and a Yahoo! Key Technical Challenges Grant. The authors also thank Robert Kleinberg, Josef Broder and the anonymous reviewers for their helpful comments.

Additional details

Identifiers

Eprint ID
49538
DOI
10.1145/1553374.1553527
Resolver ID
CaltechAUTHORS:20140910-105428880

Related works

Funding

NSF
IIS-0713483
NSF CAREER Award
0237381
Yahoo! Research
Microsoft Research Graduate Fellowship
Yahoo! Key Technical Challenges Grant

Dates

Created
2014-09-11
Created from EPrint's datestamp field
Updated
2021-11-10
Created from EPrint's last_modified field