论文标题
基于深度强化学习的逃避生成对抗网络用于僵尸网络检测
Deep Reinforcement Learning based Evasion Generative Adversarial Network for Botnet Detection
论文作者
论文摘要
基于机器学习的僵尸网络探测器是对抗性逃避攻击的潜在目标。几项研究作品采用了对抗训练的对抗性培训,该培训是由生成对抗网(GAN)产生的样品,使僵尸网络探测器擅长识别对抗性逃避。但是,合成逃避可能不会遵循输入样品的原始语义。本文提出了一个新颖的GAN模型,该模型利用深度加固学习(DRL)来探索语义意识到的样本并同时硬化其检测。 DRL代理用于攻击充当僵尸网络检测器的GAN的歧视者。在GAN培训期间,经纪人对判别器进行了训练,这有助于GAN发电机比没有DRL的案件提前收敛。我们将此模型相关,即[“重生gan”或基于深度强化的基于学习的逃避生成对抗网络],因为在DRL的帮助下,它通过让其生成器探索在语义限制内的逃避样本来最大程度地减少GAN的工作。在GAN训练期间,进行了攻击以调整代理商学习精心扰动的歧视者权重。相关的不需要对ML分类器的对抗训练,因为它可以充当对抗性语义感知的僵尸网络检测模型。代码将在https://github.com/rhr407/relevagan上找到。
Botnet detectors based on machine learning are potential targets for adversarial evasion attacks. Several research works employ adversarial training with samples generated from generative adversarial nets (GANs) to make the botnet detectors adept at recognising adversarial evasions. However, the synthetic evasions may not follow the original semantics of the input samples. This paper proposes a novel GAN model leveraged with deep reinforcement learning (DRL) to explore semantic aware samples and simultaneously harden its detection. A DRL agent is used to attack the discriminator of the GAN that acts as a botnet detector. The discriminator is trained on the crafted perturbations by the agent during the GAN training, which helps the GAN generator converge earlier than the case without DRL. We name this model RELEVAGAN, i.e. ["relive a GAN" or deep REinforcement Learning-based Evasion Generative Adversarial Network] because, with the help of DRL, it minimises the GAN's job by letting its generator explore the evasion samples within the semantic limits. During the GAN training, the attacks are conducted to adjust the discriminator weights for learning crafted perturbations by the agent. RELEVAGAN does not require adversarial training for the ML classifiers since it can act as an adversarial semantic-aware botnet detection model. Code will be available at https://github.com/rhr407/RELEVAGAN.