Focused retrieval with proximity scoring
Abstract
This paper presents a scoring method for information retrieval based on the proximity of the query terms within documents. The method first assigns to each position in the document a fuzzy proximity value that depends on its closeness to the surrounding query keywords. These proximity values can then be summed over any range of text -- including any passage or any element -- and, after normalization, this sum is used as the relevance score for that extent. Experiments on the Wikipedia collection used in the INEX 2008 evaluation campaign are presented and discussed.
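The scoring idea can be illustrated with a minimal sketch. The abstract does not specify the influence function, how several query terms are combined, or the normalization, so the choices below are assumptions made only for illustration: a triangular influence function of half-width `k`, a minimum over query terms (a conjunctive reading of the query), and normalization by extent length. All function names are hypothetical.

```python
def term_proximity(tokens, term, k):
    """Fuzzy proximity of each position to the nearest occurrence of `term`.

    Assumption: a triangular influence function that equals 1 at an
    occurrence and decreases linearly to 0 at distance k.
    """
    positions = [i for i, tok in enumerate(tokens) if tok == term]
    values = []
    for x in range(len(tokens)):
        nearest = min((abs(x - p) for p in positions), default=None)
        if nearest is None:
            values.append(0.0)  # the term never occurs in the document
        else:
            values.append(max(0.0, (k - nearest) / k))
    return values


def query_proximity(tokens, query_terms, k):
    """Combine per-term proximities at each position.

    Assumption: the query is read as a conjunction, modelled by the
    fuzzy minimum over terms.
    """
    per_term = [term_proximity(tokens, t, k) for t in query_terms]
    return [min(vals) for vals in zip(*per_term)]


def extent_score(proximity, start, end):
    """Sum the proximity values over [start, end) and normalize.

    Assumption: normalization divides by the extent length.
    """
    length = end - start
    if length <= 0:
        return 0.0
    return sum(proximity[start:end]) / length
```

Because the proximity values are computed once per document position, the same list can be reused to score any passage or element by changing only `start` and `end`.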