Skip to Main content Skip to Navigation
Reports

SASASQ : Système d'Apprentissage Supervisé Automatique pour la Classification des Questions

Abstract : Most question&answer systems are based on three main axes: question classification and analysis, documents retrieval and answer extraction. The performance in every stage affects the final result. The classification of questions appears as an important task because it deduces the type of expected answers. In this paper, we present a method of improving of the performance of classifier, based on the linguistic analysis (semantic, syntactic and morphological) and statistical approaches guided by a layered semantic hierarchy of fine grained questions types. In fact, we propose two methods of questions expansion. The first, aims to add for each word the synonyms matching it contextual sence, and the second adds a high representation "generalisation" for the noun. Various features of representation of documents, term frequency and machine learning algorithms are studied. Experiments conducted on real data are presented show an improvement of the precision in the classification of questions.
Document type :
Reports
Complete list of metadatas

https://hal-emse.ccsd.cnrs.fr/emse-00679940
Contributor : Florent Breuil <>
Submitted on : Friday, March 16, 2012 - 4:16:34 PM
Last modification on : Wednesday, June 24, 2020 - 4:19:08 PM

Identifiers

  • HAL Id : emse-00679940, version 1

Citation

Ali Harb. SASASQ : Système d'Apprentissage Supervisé Automatique pour la Classification des Questions. 2009. ⟨emse-00679940⟩

Share

Metrics

Record views

113