Basit öğe kaydını göster

dc.contributor.authorArslan, Ahmet
dc.contributor.authorDinçer, Bekir Taner
dc.date.accessioned2020-11-20T14:40:32Z
dc.date.available2020-11-20T14:40:32Z
dc.date.issued2019
dc.identifier.issn1386-4564
dc.identifier.issn1573-7659
dc.identifier.urihttps://doi.org/10.1007/s10791-018-9347-9
dc.identifier.urihttps://hdl.handle.net/20.500.12809/763
dc.description0000-0002-0660-7239en_US
dc.descriptionWOS: 000493691800002en_US
dc.description.abstractA typical information retrieval (IR) system applies a single retrieval strategy to every information need of users. However, the results of the past IR experiments show that a particular retrieval strategy is in general good at fulfilling some type of information needs while failing to fulfil some other type, i.e., high variation in retrieval effectiveness across information needs. On the other hand, the same results also show that an information need that a particular retrieval strategy failed to fulfil could be fulfilled by one of the other existing retrieval strategies. The challenge in here is therefore to determine in advance what retrieval strategy should be applied to which information need. This challenge is related to the robustness of IR systems in retrieval effectiveness. For an IR system, robustness can be defined as fulfilling every information need of users with an acceptable level of satisfaction. Maintaining robustness in retrieval effectiveness is a long-standing challenge and in this article we propose a simple but powerful method as a remedy. The method is a selective approach to index term weighting and for any given query (i.e., information need) it predicts the "best" term weighting model amongst a set of alternatives, on the basis of the frequency distributions of query terms on a target document collection. To predict the best term weighting model, the method uses the Chi-square statistic, the statistic of the Chi-square goodness-of-fit test. The results of the experiments, performed using the official query sets of the TREC Web track and the Million Query track, reveal in general that the frequency distributions of query terms provide relevant information on the retrieval effectiveness of term weighting models. In particular, the results show that the selective approach proposed in this article is, on average, more effective and more robust than the most effective single term weighting model.en_US
dc.description.sponsorshipTUBITAK, scientific and technological research projects funding programTurkiye Bilimsel ve Teknolojik Arastirma Kurumu (TUBITAK) [114E558]en_US
dc.description.sponsorshipThis work is supported by TUBITAK, scientific and technological research projects funding program, under Grant 114E558. Any opinions, findings and conclusions or recommendations expressed in this material are the authors' and do not necessarily reflect those of the sponsor.en_US
dc.item-language.isoengen_US
dc.publisherSpringeren_US
dc.item-rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectChi-Square Goodness-Of-Fiten_US
dc.subjectIndex Term Weightingen_US
dc.subjectRobustness in Retrieval Effectivenessen_US
dc.subjectSelective Information Retrievalen_US
dc.titleA selective approach to index term weighting for robust information retrieval based on the frequency distributions of query termsen_US
dc.item-typearticleen_US
dc.contributor.departmentMÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.contributor.institutionauthorDinçer, Bekir Taner
dc.identifier.doi10.1007/s10791-018-9347-9
dc.identifier.volume22en_US
dc.identifier.issue6en_US
dc.identifier.startpage543en_US
dc.identifier.endpage569en_US
dc.relation.journalInformation Retrieval Journalen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster