Basit öğe kaydını göster

dc.contributor.authorDinçer, B.T.
dc.contributor.authorOunis, I.
dc.contributor.authorMacDonald, C.
dc.date.accessioned2020-11-20T16:49:06Z
dc.date.available2020-11-20T16:49:06Z
dc.date.issued2014
dc.identifier.isbn9783319060279
dc.identifier.issn0302-9743
dc.identifier.urihttps://doi.org/10.1007/978-3-319-06028-6_3
dc.identifier.urihttps://hdl.handle.net/20.500.12809/6103
dc.descriptionCity of Amsterdam;et al.;Google;Microsoft Research;Textkernel;The Netherlands Organization for Scientific Research (NWO)en_US
dc.description36th European Conference on Information Retrieval, ECIR 2014, 13 April 2014 through 16 April 2014, Amsterdam, 105000en_US
dc.description.abstractThe aim of optimising information retrieval (IR) systems using a risk-sensitive evaluation methodology is to minimise the risk of performing any particular topic less effectively than a given baseline system. Baseline systems in this context determine the reference effectiveness for topics, relative to which the effectiveness of a given IR system in minimising the risk will be measured. However, the comparative risk-sensitive evaluation of a set of diverse IR systems - as attempted by the TREC 2013 Web track - is challenging, as the different systems under evaluation may be based upon a variety of different (base) retrieval models, such as learning to rank or language models. Hence, a question arises about how to properly measure the risk exhibited by each system. In this paper, we argue that no model of information retrieval alone is representative enough in this respect to be a true reference for the models available in the current state-of-the-art, and demonstrate, using the TREC 2012 Web track data, that as the baseline system changes, the resulting risk-based ranking of the systems changes significantly. Instead of using a particular system's effectiveness as the reference effectiveness for topics, we propose several remedies including the use of mean within-topic system effectiveness as a baseline, which is shown to enable unbiased measurements of the risk-sensitive effectiveness of IR systems. © 2014 Springer International Publishing Switzerland.en_US
dc.item-language.isoengen_US
dc.publisherSpringer Verlagen_US
dc.item-rightsinfo:eu-repo/semantics/closedAccessen_US
dc.titleTackling biased baselines in the risk-sensitive evaluation of retrieval systemsen_US
dc.item-typeconferenceObjecten_US
dc.contributor.departmenten_US
dc.contributor.departmentTempDinçer, B.T., Department of Statistics and Computer Engineering, Muğla University, 48000 Muğla, Turkey; Ounis, I., School of Computing Science, University of Glasgow, Glasgow G12 8QQ, United Kingdom; MacDonald, C., School of Computing Science, University of Glasgow, Glasgow G12 8QQ, United Kingdomen_US
dc.identifier.doi10.1007/978-3-319-06028-6_3
dc.identifier.volume8416 LNCSen_US
dc.identifier.startpage26en_US
dc.identifier.endpage38en_US
dc.relation.journalLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)en_US
dc.relation.publicationcategoryKonferans Öğesi - Uluslararası - Kurum Öğretim Elemanıen_US


Bu öğenin dosyaları:

DosyalarBoyutBiçimGöster

Bu öğe ile ilişkili dosya yok.

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster