Structured Content-Only Information Retrieval Using Term Proximity and Propagation of Title Terms
Abstract
Our experiments in the 2006 INEX ad hoc track ranked documents by the proximity of the query terms within them. More precisely, we define an influence function around each occurrence of a query term. For an occurrence in the text itself, this function decreases linearly from 1 to 0 with the distance to the occurrence. When a query term appears in a title of a structured document, its influence is uniformly 1 from the beginning to the end of the corresponding (sub-)section. We use Boolean queries, and these influence functions are combined along the query tree using fuzzy logic. The score of any part of a document is the sum of the resulting influence function at the root of the query tree over the range of that part. We present and comment on the results.
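The scoring scheme summarized above can be illustrated with a minimal Python sketch. It is not the authors' implementation, and several details are assumptions the abstract does not state: the half-width `k` of the linear influence function, the use of `max` to merge overlapping occurrences of the same term, and the choice of `min` and `max` as the fuzzy AND and OR operators. The names `term_influence`, `evaluate`, `score`, and the `title_spans` structure are hypothetical.

```python
from typing import List, Sequence, Set, Tuple, Union

# Hypothetical query tree: a term (str) or an (operator, children) pair.
Query = Union[str, Tuple[str, list]]
# Hypothetical section descriptor: (start, end, terms found in its title).
TitleSpan = Tuple[int, int, Set[str]]


def term_influence(tokens: Sequence[str], term: str, k: int,
                   title_spans: Sequence[TitleSpan] = ()) -> List[float]:
    """Influence of one query term at every token position."""
    n = len(tokens)
    infl = [0.0] * n
    for p, tok in enumerate(tokens):
        if tok != term:
            continue
        # Linear decrease from 1 at the occurrence to 0 at distance k
        # (k is an assumed parameter).
        for x in range(max(0, p - k + 1), min(n, p + k)):
            # Assumption: overlapping occurrences are merged with max.
            infl[x] = max(infl[x], 1.0 - abs(x - p) / k)
    # A term in a section title has influence 1 over the whole section.
    for start, end, title_terms in title_spans:
        if term in title_terms:
            for x in range(start, end):
                infl[x] = 1.0
    return infl


def evaluate(query: Query, tokens: Sequence[str], k: int,
             title_spans: Sequence[TitleSpan] = ()) -> List[float]:
    """Combine influence functions along the Boolean query tree."""
    if isinstance(query, str):
        return term_influence(tokens, query, k, title_spans)
    op, children = query
    curves = [evaluate(c, tokens, k, title_spans) for c in children]
    # Assumption: fuzzy AND is min, fuzzy OR is max.
    combine = min if op == "and" else max
    return [combine(values) for values in zip(*curves)]


def score(query: Query, tokens: Sequence[str], k: int, start: int, end: int,
          title_spans: Sequence[TitleSpan] = ()) -> float:
    """Sum the root influence function over the range [start, end)."""
    return sum(evaluate(query, tokens, k, title_spans)[start:end])
```

As a usage example under these assumptions, `score(("and", ["a", "b"]), ["a", "x", "x", "b", "x"], 4, 0, 5)` scores a five-token element for the query `a AND b`. Passing `title_spans=[(0, 5, {"a"})]` models the case where `a` appears in the title of the section covering those five tokens.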
Domains
Computer Science [cs]
Origin: Files produced by the author(s)