ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information
Résumé
We present in this paper some experiments on the Wikipedia collection used in the INEX 2009 evaluation campaign with an information retrieval method based on proximity. The idea of the method is to assign to each position in the document a fuzzy proximity value depending on its closeness to the surrounding keywords. These proximity values can then be summed on any range of text - including any passage or any element - and after normalization this sum is used as the relevance score for the extent. To take into account the semantic tags, we define a contextual operator which allow to consider at query time only the occurrences of terms that appear in a given semantic context.