explainable AI

Introduction to Explainable AI: why should we understand AI decisions?

Is XAI really that important? Why should we try to explain our models' predictions? A short introduction to explainable AI.

Detecting and Reducing Bias in Data

In contrast to the shallow models used in the past, most current deep learning systems extract features automatically, and to do so they tend to rely on huge amounts of labeled data. Although the quality of the datasets used to train neural networks has a huge impact on model performance, these datasets are often noisy, biased, and sometimes even contain incorrectly labeled samples. Moreover, deep neural networks (DNNs) are black-box models that usually have tens of layers with millions of parameters and complex latent spaces, which makes their decisions very hard to interpret. Such fragile models are increasingly used to solve very sensitive and critical tasks. The demand for clear reasoning and correct decisions is therefore very high, especially when DNNs are used in transportation (autonomous cars), healthcare, legal systems, finance, and the military.

To address these challenges, the project aims to develop methods of Explainable Artificial Intelligence (XAI) that may help uncover and reduce bias in data. The project involves investigating and integrating explainability into new and existing Artificial Intelligence systems, and mostly focuses on Deep Neural Networks in the field of Computer Vision. One way of categorizing XAI methods is to divide them into local and global explanations: a local analysis aims to explain a single prediction of a model, whereas a global one tries to explain how the model works as a whole. The project aims to develop novel methods of both local and global explainability to help explain deep neural network-based systems, in order to justify their decisions, control their reasoning process, and discover new knowledge.
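To make the local/global distinction concrete, here is a minimal sketch of a local explanation with LIME. It uses a scikit-learn classifier on tabular data to stay self-contained (the project itself targets computer vision, where lime's image explainer plays the same role); the dataset and model below are illustrative choices, not part of the project.

from sklearn.datasets import load_iris
from sklearn.ensemble import RandomForestClassifier
from lime.lime_tabular import LimeTabularExplainer

# Train a black-box model whose individual predictions we want to explain
data = load_iris()
model = RandomForestClassifier(random_state=0).fit(data.data, data.target)

# LIME perturbs the input and fits a simple surrogate model around it
explainer = LimeTabularExplainer(
    data.data,
    feature_names=data.feature_names,
    class_names=list(data.target_names),
    discretize_continuous=True,
)

# Local explanation: which features drove this *single* prediction?
# (A global method would instead describe the model's behavior overall.)
exp = explainer.explain_instance(data.data[0], model.predict_proba, num_features=4)
print(exp.as_list())  # (feature condition, weight) pairs for this one sample

Each printed pair is a feature condition and its weight toward the predicted class for that single sample; nothing here claims to summarize the model globally.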

Brief introduction to XAI

Brief introduction to visual local explainability methods: Local Interpretable Model-Agnostic Explanations (LIME), Layer-wise Relevance Propagation (LRP), and counterfactuals