IoT Anomaly Detection using Multivariate
Soundararajan Ezekiel1, Abdullah Ali Alshehri2, Larry Pearlstein3, Xin-Wen Wu4, Adam Lutz5

1Soundararajan Ezekiel*, Computer Science, Indiana University of Pennsylvania, Indiana, Pennsylvania, USA.
2Abdullah Ali Alshehri, Electrical Engineering, King Abdulaziz University, Rabigh, Saudi Arabia.
3Larry Pearlstein, Electrical Engineering, The College of New Jersey, Ewing, NJ, USA.
4Xin-Wen Wu, Electrical Engineering, The College of New Jersey, Ewing, NJ, USA.
5Adam Lutz, Electrical Engineering, The College of New Jersey, Ewing, NJ, USA.
Manuscript received on January 12, 2020. | Revised Manuscript received on January 22, 2020. | Manuscript published on February 10, 2020. | PP: 1662-1669 | Volume-9 Issue-4, February 2020. | Retrieval Number: D1323029420/2020©BEIESP | DOI: 10.35940/ijitee.D1323.029420
Open Access | Ethics and Policies | Cite | Mendeley
© The Authors. Blue Eyes Intelligence Engineering and Sciences Publication (BEIESP). This is an open access article under the CC BY-NC-ND license (http://creativecommons.org/licenses/by-nc-nd/4.0/)

Abstract: Devices associated with Internet of Things are typically constrained in their resources and do not have the computational power necessary to analyze their input and detect anomalies that occur. Smart devices or and environmental sensors that measure temperature, air quality, or seismic activity are all built for specific purposes with minimal resources and often do not have enough security in place to protect against infiltration or detect abnormal behavior. Additionally, because these devices and sensors are typically always connected and transmit constant data in near real-time, the high dimensionality of the raw readings are extremely computationally intensive to analyze. A possible solution to reduce the dimensionality of the data while also extracting the most significant features is to use multivariate analysis techniques such as Principal Component Analysis. PCA is a method of multivariate analysis meant to reduce the size of matrices while not only keeping the most significant variables but also learning the interactions between them. In this paper, we propose exploring anomaly detection in IoT using multivariate analysis techniques to reduce the dimensionality of sensor input to reduce the computational complexity of analysis and learning the most significant variables. While the normal conditions of sensor data are often readily available, the size of the data makes it difficult to precisely determine instances of targeted anomalies. In this study, PCA is used to analyze the available features of the data and from them can determine the sensors under normal conditions. Once the normal conditions are determined, outliers which constitute anomalies can be determined through techniques such as Mahalanobis distance to determine the variance of each observation from the normal distribution. Our work can also be expanded to use other methods of dimensionality reduction and feature extraction such as t-Distributed Stochastic Neighbor Embedding. 
Keywords: Internet of Things, Anomaly Detection, Multivariate Analysis, Principal Component Analysis, Dimensionality Reduction.
Scope of the Article:  Seismic Evaluation of Building Nonstructural Components