Open Access. Powered by Scholars. Published by Universities.®

Physical Sciences and Mathematics Commons

Computer Engineering

Western University

Attribution robustness

Full-Text Articles in Physical Sciences and Mathematics

Attribution Robustness Of Neural Networks, Sunanda Gamage Feb 2024

Electronic Thesis and Dissertation Repository

While deep neural networks have demonstrated excellent learning capabilities, explaining their predictions remains a challenge because of their black-box nature. Attribution methods, or feature significance methods, are tools for explaining model predictions; they facilitate model debugging and human-machine collaborative decision-making, and help establish trust and compliance in critical applications. Recent work has shown that the attributions of neural networks can be distorted by imperceptible adversarial input perturbations, which makes attributions unreliable as an explainability method. This thesis addresses the research problem of attribution robustness in neural networks and introduces novel techniques that enable robust training at scale.

Firstly, a novel generic framework …