Doctoral colloquium - Štefan Pócoš (27.2.2023)
Monday 27.2.2023 at 13:10, Lecture room I/9
Štefan Pócoš:
Explainability and Interpretability of Deep Neural Networks
Abstract:
Artificial neural networks (ANNs) have been shown to achieve outstanding accuracy on many tasks. Although their performance is approaching its peak in some cases, accuracy is not the only aspect that concerns researchers. One of the downsides of modern ANNs is their lack of explainability and interpretability, which is easily demonstrated by fooling them with adversarial examples. We examined this aspect of ANNs by applying several methods to fool them and visualizing their internal behavior when fooled. We will also discuss our plans for future research, including the use of modern attention-based networks.