Adversarial Attack on Machine Learning Models
V. Sahaya Sakila1, Sandeep M2, Praveen Hari Krishna N3

1V. Sahaya Sakila, Assistant Professor, Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram, Chennai (Tamil Nadu), India.

2Sandeep M, Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram, Chennai (Tamil Nadu), India.

3Praveen Hari Krishna N, Department of Computer Science and Engineering, SRM Institute of Science and Technology, Ramapuram, Chennai (Tamil Nadu), India.

Manuscript received on 08 April 2019 | Revised Manuscript received on 15 April 2019 | Manuscript Published on 26 July 2019 | PP: 431-434 | Volume-8 Issue-6S4 April 2019 | Retrieval Number: F10880486S419/19©BEIESP | DOI: 10.35940/ijitee.F1088.0486S419

© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open-access article under the CC-BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Machine Learning (ML) models are applied in a variety of tasks such as network intrusion detection and malware classification. Yet, these models are vulnerable to a class of malicious inputs known as adversarial examples: slightly perturbed inputs that the ML model classifies incorrectly. The mitigation of these adversarial inputs remains an open problem. As a step towards understanding adversarial examples, we show that they are not drawn from the same distribution as the original data and can thus be detected using statistical tests. Using this knowledge, we introduce a complementary approach to identify specific inputs that are adversarial. Specifically, we augment our ML model with an additional output class, and train the model to assign all adversarial inputs to this class.
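The following is a minimal sketch, not the authors' implementation, of the two ideas stated in the abstract: (1) adversarial inputs follow a different distribution than clean inputs and can be flagged with a statistical two-sample test, and (2) the classifier can be augmented with an extra output class trained to absorb adversarial inputs. The toy dataset, linear model, FGSM-style perturbation, test statistic, and the name ADV_CLASS are all illustrative assumptions.

import numpy as np
from scipy.stats import ks_2samp
from sklearn.datasets import make_classification
from sklearn.linear_model import LogisticRegression

# Toy binary classification problem and a simple linear classifier (assumption).
X, y = make_classification(n_samples=2000, n_features=20, random_state=0)
clf = LogisticRegression(max_iter=1000).fit(X, y)

# FGSM-style perturbation for logistic regression: step in the sign of the loss
# gradient, which for this linear model is sign(w), flipped according to the true label.
eps = 0.5
w = clf.coef_[0]
X_adv = X + eps * np.sign(w) * np.where(y == 1, -1.0, 1.0)[:, None]

# (1) Statistical detection: compare the distribution of maximum class probabilities
# on clean vs. perturbed inputs with a two-sample Kolmogorov-Smirnov test.
conf_clean = clf.predict_proba(X).max(axis=1)
conf_adv = clf.predict_proba(X_adv).max(axis=1)
stat, p_value = ks_2samp(conf_clean, conf_adv)
print(f"KS statistic={stat:.3f}, p-value={p_value:.2e}")  # small p-value => distributions differ

# (2) Extra-class defence: retrain with an additional label that marks adversarial inputs.
ADV_CLASS = 2  # hypothetical index of the new "adversarial" output class
X_aug = np.vstack([X, X_adv])
y_aug = np.concatenate([y, np.full(len(X_adv), ADV_CLASS)])
clf_aug = LogisticRegression(max_iter=1000).fit(X_aug, y_aug)
print("Fraction of adversarial inputs routed to the extra class:",
      (clf_aug.predict(X_adv) == ADV_CLASS).mean())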

Keywords: Adversarial Attacks, Generative Adversarial Network, Robust Classification.
Scope of the Article: Classification