论文标题
分布检测是否可以学习?
Is Out-of-Distribution Detection Learnable?
论文作者
论文摘要
监督学习旨在在培训和测试数据来自同一分布的假设下培训分类器。为了缓解上述假设,研究人员研究了一个更现实的设置:分布(OOD)检测,其中测试数据可能来自培训期间未知的类(即OOD数据)。由于OOD数据的不可用和多样性,良好的概括能力对于有效的OOD检测算法至关重要。为了研究OOD检测的概括,在本文中,我们研究了OOD检测的近似正确的(PAC)学习理论,研究人员提出了一个开放问题。首先,我们发现了OOD检测的可学习性的必要条件。然后,使用这种条件,我们证明了在某些情况下可学习检测的几个不可能的定理。尽管不可能的定理令人沮丧,但我们发现这些不可能定理的某些条件在某些实际情况下可能不存在。基于这一观察结果,我们接下来给出了一些必要和足够的条件,以在某些实际情况下表征OOD检测的可学习性。最后,我们还根据我们的OOD理论为几种代表性的OOD检测工作提供了理论支持。
Supervised learning aims to train a classifier under the assumption that training and test data are from the same distribution. To ease the above assumption, researchers have studied a more realistic setting: out-of-distribution (OOD) detection, where test data may come from classes that are unknown during training (i.e., OOD data). Due to the unavailability and diversity of OOD data, good generalization ability is crucial for effective OOD detection algorithms. To study the generalization of OOD detection, in this paper, we investigate the probably approximately correct (PAC) learning theory of OOD detection, which is proposed by researchers as an open problem. First, we find a necessary condition for the learnability of OOD detection. Then, using this condition, we prove several impossibility theorems for the learnability of OOD detection under some scenarios. Although the impossibility theorems are frustrating, we find that some conditions of these impossibility theorems may not hold in some practical scenarios. Based on this observation, we next give several necessary and sufficient conditions to characterize the learnability of OOD detection in some practical scenarios. Lastly, we also offer theoretical supports for several representative OOD detection works based on our OOD theory.