Exploring learning techniques for edge AI taking advantage of NVMs

Michele Martemucci; François Rummens; Tifenn Hirtzlin; Adrien F. Vincent; Sylvain Saighi; Elisa Vianello

Poster De Conférence Année : 2023

Exploring learning techniques for edge AI taking advantage of NVMs

(1) , (1) , (2) , (3) , (3) , (2)

1
2
3

Michele Martemucci

Fonction : Auteur

Laboratoire Systèmes-sur-puce et Technologies Avancées

François Rummens

Fonction : Auteur

Laboratoire Systèmes-sur-puce et Technologies Avancées

Tifenn Hirtzlin

Fonction : Auteur

Département Composants Silicium

Adrien F. Vincent

Fonction : Auteur

Laboratoire de l'intégration, du matériau au système

Sylvain Saighi

Fonction : Auteur

Laboratoire de l'intégration, du matériau au système

Elisa Vianello

Fonction : Auteur

Département Composants Silicium

Résumé

The relatively recent development and remarkable results of Artificial Neural Networks (ANNs) are due to the construction of gigantic databases and algorithmic innovations requiring large hardware resources, which results in equally substantial energy consumptions. As Artificial Intelligence (AI) is now being embedded more and more into various connected objects, ranging from medical implants to autonomous cars, it is clear that the algorithmic and hardware solutions available in data centres will not be able to cover all the AI integration needs. The field of microelectronics has been working for several years now on the development of emerging memory technologies with the aim of integrating Non-Volatile Memory (NVM) within computing units. In a conventional processor architecture, such co-integration between the computation units and the memory would simplify the memory hierarchy, but also increase the bandwidth between computation and data access. In this study, we explore the potential of two non-volatile memory technologies, HfO2-based FeRAM [1] and OxRAM [2], for enabling on-chip learning systems. Notably, the quasi-infinite reading endurance of OxRAM devices and their poor writing endurance makes them suitable for inference-only applications, whereas the reported large writing endurance of FeRAM device would effectively allow moving training on-chip as well. Eventually, the migration of inference and learning from data centres to edge devices will allow them to adapt to the evolution of input data, to specialize each device to its user, to retain private data and offer faster service. To validate the feasibility of this approach, we designed a test chip in the 22nm FDSOI technology node. The primary objective of this chip is to demonstrate the implementation of a hybrid FeRAM/OxRAM memory circuit capable of storing the synaptic weights of a Neural Network (NN) during learning/inference phases, while accelerating NN training at the edge. Eventually, by incorporating synaptic metaplasticity in Binarized Neural Networks [3], the chip addresses the issue of catastrophic forgetting. The chip consists of two sub-cores, each comprising four 16kbit FeRAM arrays and one 16kbit OxRAM array. One FeRAM array and the OxRAM array can be operated simultaneously. The circuit leverages the OxRAM array to build a near-memory computing inference engine to accelerate the inference/feedforward pass of training, while FeRAM arrays store an 8-bit quantized version of the floating-point weights optimized during training.

Domaines

Micro et nanotechnologies/Microélectronique

Fichier principal

Abstract_Memrisys.pdf (103.64 Ko)

Origine	Fichiers produits par l'(les) auteur(s)

Contributeur MAP CEA : Connectez-vous pour contacter le contributeur

https://cea.hal.science/cea-04539479

Soumis le : mardi 9 avril 2024-16:30:35

Dernière modification le : mercredi 15 mai 2024-13:50:04

Dates et versions

cea-04539479 , version 1 (09-04-2024)

Identifiants

HAL Id : cea-04539479 , version 1

Citer

Michele Martemucci, François Rummens, Tifenn Hirtzlin, Adrien F. Vincent, Sylvain Saighi, et al.. Exploring learning techniques for edge AI taking advantage of NVMs. MEMRISYS 2023 - The 6th International Conference on Memristive Materials, Devices & Systems, Nov 2023, Turin, Italy. , 2023. ⟨cea-04539479⟩

Exporter

BibTeX XML-TEI Dublin Core DC Terms EndNote DataCite

Collections

CEA UGA CNRS IMS-BORDEAUX IMS-BORDEAUX-FUSION DRT LETI LIST CEA-GRE GS-COMPUTER-SCIENCE GS-SPORT-HUMAN-MOVEMENT DSCIN

32 Consultations

15 Téléchargements

Exploring learning techniques for edge AI taking advantage of NVMs

Résumé

Domaines

Dates et versions

Identifiants

Citer

Exporter

Collections

Partager