Importance-Driven Deep Learning System Testing

Abstract

Deep Learning (DL) systems are key enablers for engineering intelligent applications due to their ability to solve complex tasks such as image recognition and machine translation. Nevertheless, using DL systems in safety- and security-critical applications requires testing evidence of their dependable operation. Recent research in this direction focuses on adapting testing criteria from traditional software engineering as a means of increasing confidence in their correct behaviour. However, these criteria are inadequate for capturing the intrinsic properties exhibited by DL systems. We bridge this gap by introducing DeepImportance, a systematic testing methodology accompanied by an Importance-Driven (IDC) test adequacy criterion for DL systems. Applying IDC makes it possible to establish a layer-wise functional understanding of the importance of DL system components and to use this information to guide the generation of semantically diverse test sets. Our empirical evaluation on several DL systems, across multiple DL datasets and with state-of-the-art adversarial generation techniques, demonstrates the usefulness and effectiveness of DeepImportance and its ability to guide the engineering of more robust DL systems.

Metadata

Item Type:	Proceedings Paper
Authors/Creators:	Gerasimou, Simos (https://orcid.org/0000-0002-2706-5272); Eniser, Hasan Ferit; Sen, Alper
Dates:	Accepted: 9 December 2019; Published: 2020
Institution:	The University of York
Academic Units:	The University of York > Faculty of Sciences (York) > Computer Science (York)
Depositing User:	Pure (York)
Date Deposited:	10 Feb 2020 15:20
Last Modified:	18 Jul 2025 02:37
Status:	Published
Open Archives Initiative ID (OAI ID):	oai:eprints.whiterose.ac.uk:156747

Download

Accepted Version

Filename: PID6349913.pdf

Description: ICSE2020.pdf
