Basit öğe kaydını göster

dc.contributor.authorGöksel, Gökhan
dc.contributor.authorArslan, Ahmet
dc.contributor.authorDinçer, Bekir Taner
dc.date.accessioned2023-07-03T08:19:38Z
dc.date.available2023-07-03T08:19:38Z
dc.date.issued2023en_US
dc.identifier.citationGöksel G, Arslan A, Dinçer BT. 2023. A selective approach to stemming for minimizing the risk of failure in information retrieval systems. PeerJ Computer Science 9:e1175 https://doi.org/10.7717/peerj-cs.1175en_US
dc.identifier.otherPMCID: PMC10280253
dc.identifier.urihttps://doi.org/10.7717/peerj-cs.1175
dc.identifier.urihttps://hdl.handle.net/20.500.12809/10795
dc.description.abstractStemming is supposed to improve the average performance of an information retrieval system, but in practice, past experimental results show that this is not always the case. In this article, we propose a selective approach to stemming that decides whether stemming should be applied or not on a query basis. Our method aims at minimizing the risk of failure caused by stemming in retrieving semantically-related documents. The proposed work mainly contributes to the IR literature by proposing an application of selective stemming and a set of new features that derived from the term frequency distributions of the systems in selection. The method based on the approach leverages both some of the query performance predictors and the derived features and a machine learning technique. It is comprehensively evaluated using three rule-based stemmers and eight query sets corresponding to four document collections from the standard TREC and NTCIR datasets. The document collections, except for one, include Web documents ranging from 25 million to 733 million. The results of the experiments show that the method is capable of making accurate selections that increase the robustness of the system and minimize the risk of failure (i.e., per query performance losses) across queries. The results also show that the method attains a systematically higher average retrieval performance than the single systems for most query sets.en_US
dc.item-language.isoengen_US
dc.relation.isversionof10.7717/peerj-cs.1175en_US
dc.item-rightsinfo:eu-repo/semantics/openAccessen_US
dc.subjectSelective information retrievalen_US
dc.subjectSelective stemmingen_US
dc.subjectRobustnessen_US
dc.titleA selective approach to stemming for minimizing the risk of failure in information retrieval systemsen_US
dc.item-typearticleen_US
dc.contributor.departmentMÜ, Mühendislik Fakültesi, Bilgisayar Mühendisliği Bölümüen_US
dc.contributor.authorID0000-0002-0660-7239en_US
dc.contributor.institutionauthorDinçer, Bekir Taner
dc.relation.journalPeerJ Computer Scienceen_US
dc.relation.publicationcategoryMakale - Uluslararası Hakemli Dergi - Kurum Öğretim Elemanıen_US


Bu öğenin dosyaları:

Thumbnail

Bu öğe aşağıdaki koleksiyon(lar)da görünmektedir.

Basit öğe kaydını göster