论文标题
对工业物联网中时间序列数据的深度异常检测:一种沟通效率的智障联盟学习方法
Deep Anomaly Detection for Time-series Data in Industrial IoT: A Communication-Efficient On-device Federated Learning Approach
论文作者
论文摘要
由于边缘设备故障(即异常)严重影响工业物联网(IIOT)工业产品的生产,因此准确,及时检测异常变得越来越重要。此外,边缘设备收集的数据可能包含用户的私人数据,这在近年来用户隐私要求公众关注时,挑战当前检测方法。以此重点,本文提出了一个新的沟通效率的联合学习(FL)基于基于IIT中时间序列数据的深度异常检测框架。具体而言,我们首先引入了FL框架,以使分散的边缘设备能够协作训练异常检测模型,从而可以提高其泛化能力。其次,我们提出了一个基于注意机制的卷积神经网络长期记忆(AMCNN-LSTM)模型,以准确检测异常。 AMCNN-LSTM模型使用基于注意机制的CNN单元来捕获重要的细粒度特征,从而防止记忆丧失和梯度分散问题。此外,该模型保留了LSTM单元在预测时间序列数据中的优势。第三,为了使所提出的框架适应工业异常检测的及时性,我们提出了一种基于顶部\ textit {k}选择的梯度压缩机制,以提高沟通效率。对四个现实世界数据集的广泛实验研究表明,与不使用梯度压缩方案的联合学习框架相比,提出的框架可以准确,及时检测异常,并将通信开销降低50 \%。
Since edge device failures (i.e., anomalies) seriously affect the production of industrial products in Industrial IoT (IIoT), accurately and timely detecting anomalies is becoming increasingly important. Furthermore, data collected by the edge device may contain the user's private data, which is challenging the current detection approaches as user privacy is calling for the public concern in recent years. With this focus, this paper proposes a new communication-efficient on-device federated learning (FL)-based deep anomaly detection framework for sensing time-series data in IIoT. Specifically, we first introduce a FL framework to enable decentralized edge devices to collaboratively train an anomaly detection model, which can improve its generalization ability. Second, we propose an Attention Mechanism-based Convolutional Neural Network-Long Short Term Memory (AMCNN-LSTM) model to accurately detect anomalies. The AMCNN-LSTM model uses attention mechanism-based CNN units to capture important fine-grained features, thereby preventing memory loss and gradient dispersion problems. Furthermore, this model retains the advantages of LSTM unit in predicting time series data. Third, to adapt the proposed framework to the timeliness of industrial anomaly detection, we propose a gradient compression mechanism based on Top-\textit{k} selection to improve communication efficiency. Extensive experiment studies on four real-world datasets demonstrate that the proposed framework can accurately and timely detect anomalies and also reduce the communication overhead by 50\% compared to the federated learning framework that does not use a gradient compression scheme.