论文标题
用于评估生成对抗网络的神经-AI界面
A Neuro-AI Interface for Evaluating Generative Adversarial Networks
论文作者
论文摘要
生成的对抗网络(GAN)越来越多地吸引了计算机视觉,自然语言处理,语音合成和类似领域的关注。但是,评估gan的性能仍然是一个开放且具有挑战性的问题。现有的评估指标主要使用自动统计方法来衡量真实图像和生成图像之间的差异。他们通常需要大量样本量进行评估,并且不能直接反映人类对图像质量的看法。在这项工作中,我们介绍了一种称为Neuroscore的评估度量,以评估gan的性能,该指标通过利用大脑信号更直接地反映了心理感受的图像质量。我们的结果表明,Neuroscore的性能优于当前评估指标:(1)与人类判断更一致; (2)评估过程需要少量的样本; (3)它能够按照gan进行对图像的质量进行排名。提出了基于卷积的神经网络(CNN)的神经AI界面,以直接从GAN生成的图像中预测神经科,而无需神经反应。重要的是,我们表明,在网络训练阶段包括神经反应可以显着提高所提出模型的预测能力。代码和数据可以在此链接中引用:https://github.com/villawang/neuro-ai-interface。
Generative adversarial networks (GANs) are increasingly attracting attention in the computer vision, natural language processing, speech synthesis and similar domains. However, evaluating the performance of GANs is still an open and challenging problem. Existing evaluation metrics primarily measure the dissimilarity between real and generated images using automated statistical methods. They often require large sample sizes for evaluation and do not directly reflect human perception of image quality. In this work, we introduce an evaluation metric called Neuroscore, for evaluating the performance of GANs, that more directly reflects psychoperceptual image quality through the utilization of brain signals. Our results show that Neuroscore has superior performance to the current evaluation metrics in that: (1) It is more consistent with human judgment; (2) The evaluation process needs much smaller numbers of samples; and (3) It is able to rank the quality of images on a per GAN basis. A convolutional neural network (CNN) based neuro-AI interface is proposed to predict Neuroscore from GAN-generated images directly without the need for neural responses. Importantly, we show that including neural responses during the training phase of the network can significantly improve the prediction capability of the proposed model. Codes and data can be referred at this link: https://github.com/villawang/Neuro-AI-Interface.