A Methodology for Collection Selection in Heterogeneous Contexts

Faïza Abbaci; Jacques Savoy; Michel Beigbeder

Communication Dans Un Congrès Année : 2002

A Methodology for Collection Selection in Heterogeneous Contexts

(1) , (2) , (1)

1
2

Faïza Abbaci

Fonction : Auteur

Département Réseaux, Information, Multimédia

Jacques Savoy

Fonction : Auteur

Université de Neuchâtel = University of Neuchatel

Michel Beigbeder

Fonction : Auteur
PersonId : 840581

Département Réseaux, Information, Multimédia

Résumé

In this paper we demonstrate that in an ideal Distributed Information Retrieval environment, taking the ability of each collection server to return relevant documents into account when selecting collections can be effective. Based on this assumption, we suggest a new approach to resolve the collection selection problem. In order to predict a collection's ability to return relevant documents, we inspect a limited number n of documents retrieved from each collection and analyze the proximity of search keywords within them. In our experiments, we vary the underlying parameter n of our suggested model to define the most appropriate number of top documents to be inspected. Moreover, we evaluate the retrieval effectiveness of our approach and compare it with both the centralized indexing and the CORI approaches [1], [16]. Preliminary results from these experiments, conducted on WT10g test collection, tend to demonstrate that our suggested method can achieve appreciable retrieval effectiveness.

Mots clés

Information retrieval distributed information retrieval collection selection results merging strategy evaluation

Domaines

Modélisation et simulation

Florent Breuil : Connectez-vous pour contacter le contributeur

https://hal-emse.ccsd.cnrs.fr/emse-00948054

Soumis le : lundi 17 février 2014-16:29:37

Dernière modification le : mardi 17 septembre 2024-15:46:01

Dates et versions

emse-00948054 , version 1 (17-02-2014)

Identifiants

HAL Id : emse-00948054 , version 1

Citer

Faïza Abbaci, Jacques Savoy, Michel Beigbeder. A Methodology for Collection Selection in Heterogeneous Contexts. ITCC2002, International Conference on Information Technology: Coding and Computing, Apr 2002, Las Vegas, United States. 7p. ⟨emse-00948054⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EMSE RIM-ENSMSE UR-LSTI-ENSMSE ISCODE-ENSMSE FAYOL-ENSMSE ISCOD-ENSMSE TDS-MACS INSTITUT-MINES-TELECOM

68 Consultations

0 Téléchargements

A Methodology for Collection Selection in Heterogeneous Contexts

Résumé

Mots clés

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager