AIAI Seminar - 8 May 2023 - Talk by Ana Ozaki

 

Speaker:        Ana Ozaki

 

Title:                Querying Neural Networks: The BERT Case

 

Abstract: 

We investigate an approach for extracting knowledge from trained neural networks based on Angluin’s exact learning model with membership and equivalence queries to an oracle. In this approach, the oracle is a trained neural network. We consider Angluin’s classical algorithm for exact learning Horn theories and study the necessary changes to make it applicable to learn from neural networks. In particular, we have to consider that trained neural networks may not behave as Horn oracles, meaning that their underlying target theory may not be Horn. We propose a new algorithm that aims at extracting a Horn-approximation of the target theory and that is guaranteed to terminate in exponential time (in the worst case) and in polynomial time if the target is Horn. To showcase the applicability of the approach, we perform experiments on BERT based language models and extract rules that expose biases in these models.

Authors: Sophie Blum, Raoul Koudijs, Ana Ozaki, Samia Touileb

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

 

May 08 2023 -

AIAI Seminar - 8 May 2023 - Talk by Ana Ozaki

AIAI Seminar hosted by Ana Ozaki

G.03, Informatics Forum