Dealing with Structured Documents in Information Retrieval Systems - Mines Saint-Étienne
Conference paper, Year: 1999

Abstract

In this paper we suggest how hypertext links and the content of HTML pages can be used to cluster pages into what we call Web documents. We put forward a method to automatically construct a hierarchy of Web documents and, with the help of an abstraction function, the context hierarchy of a site. This hierarchy is represented by a graph whose links are typed as structural. Structural links between nodes reveal a context relationship. The context hierarchy, along with the graph of the pages underlying the site, are used to better index and retrieve the pages. Furthermore, it permits a new operator to be added to the IRS (Information Retrieval System) query language, whereby the user will be able to differentiate the context from the subject of his queries.
No file deposited

Dates and versions

emse-00941327 , version 1 (03-02-2014)

Identifiers

  • HAL Id : emse-00941327 , version 1

Cite

Fernando Aguiar, Doan Bich-Liên, Michel Beigbeder. Dealing with Structured Documents in Information Retrieval Systems. World Conference on the WWW and Internet, Oct 1999, Honolulu, United States. 10 p. ⟨emse-00941327⟩
