Natural language processing (NLP) and association rules (AR)-based knowledge extraction for intelligent fault analysis: a case study in semiconductor industry

Fault analysis (FA) is the process of collecting and analyzing data to determine the cause of a failure. It plays an important role in ensuring the quality in manufacturing process. Traditional FA techniques are time-consuming and labor-intensive, relying heavily on human expertise and the availability of failure inspection equipment. In semiconductor industry, a large amount of FA reports are generated by experts to record the fault descriptions, fault analysis path and fault root causes. With the development of Artificial Intelligence, it is possible to automate the industrial FA process while extracting expert knowledge from the vast FA report data. The goal of this research is to develop a complete expert knowledge extraction pipeline for FA in semiconductor industry based on advanced Natural Language Processing and Machine Learning. Our research aims at automatically predicting the fault root cause based on the fault descriptions. First, the text data from the FA reports are transformed into numerical data using Sentence Transformer embedding. The numerical data are converted into latent spaces using Generalized-Controllable Variational AutoEncoder. Then, the latent spaces are classified by Gaussian Mixture Model. Finally, Association Rules are applied to establish the relationship between the labels in the latent space of the fault descriptions and that of the fault root cause. The proposed algorithm has been evaluated with real data of semiconductor industry collected over three years. The average correctness of the predicted label achieves 97.8%. The method can effectively reduce the time of failure identification and the cost during the inspection stage.

Mots clés

Fault analysis Natural language processing GCVAE GMM Association rules

Domaines

Mathématiques [math] Sciences de l'ingénieur [physics]

Florent Breuil : Connectez-vous pour contacter le contributeur

https://hal-emse.ccsd.cnrs.fr/emse-04278681

Soumis le : vendredi 10 novembre 2023-10:33:30

Dernière modification le : mardi 17 septembre 2024-15:45:55

Dates et versions

emse-04278681 , version 1 (10-11-2023)

Identifiants

HAL Id : emse-04278681 , version 1
DOI : 10.1007/s10845-023-02245-7

Citer

Zhiqiang Wang, Kenneth Ezukwoke, Anis Hoayek, Mireille Batton-Hubert, Xavier Boucher. Natural language processing (NLP) and association rules (AR)-based knowledge extraction for intelligent fault analysis: a case study in semiconductor industry. Journal of Intelligent Manufacturing, 2023, ⟨10.1007/s10845-023-02245-7⟩. ⟨emse-04278681⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

EMSE PRES_CLERMONT CNRS FAYOL-ENSMSE LIMOS DEMO-ENSMSE CLERMONT-AUVERGNE-INP INSTITUT-MINES-TELECOM

69 Consultations

0 Téléchargements