关于计算决策树的概率解释

论文标题

关于计算决策树的概率解释

On Computing Probabilistic Explanations for Decision Trees

论文作者

Arenas, Marcelo, Barceló, Pablo, Romero, Miguel, Subercaseaux, Bernardo

论文摘要

正式的XAI（可解释的AI）是一个增长的领域，专注于对ML模型做出的决策的数学保证的计算解释。在正式XAI内部，研究最多的案例之一是解释决策树所采取的选择，因为它们传统上被视为最容易解释的模型之一。最近的工作重点是研究“充分理由”的计算，这种解释是通过提供$ x $的$ x $的子集$ y $来解释决策树$ t $和实例$ x $，从而解释了$ t（x）$的决定。 $ x $的分类为$ t $。 It has been argued, however, that sufficient reasons constitute a restrictive notion of explanation, and thus the community has started to study their probabilistic counterpart, in which one requires that the probability of $T(z) = T(x)$ must be at least some value $δ\in (0, 1]$, where $z$ is a random instance that is compatible with $y$. Our paper settles the computational complexity of $Δ$ - 季节对决策树的盛行，表明（1）发现$δ$ - 五个季节的尺寸很小，（2）找到$δ$ - $Δ$ - 五个季节的包含在最小的纳入中，不接受多项式算法（除非P = np）（除非py = np）。足够的季节很容易计算，我们回答了Izza等人最初提出的两个开放问题。

Formal XAI (explainable AI) is a growing area that focuses on computing explanations with mathematical guarantees for the decisions made by ML models. Inside formal XAI, one of the most studied cases is that of explaining the choices taken by decision trees, as they are traditionally deemed as one of the most interpretable classes of models. Recent work has focused on studying the computation of "sufficient reasons", a kind of explanation in which given a decision tree $T$ and an instance $x$, one explains the decision $T(x)$ by providing a subset $y$ of the features of $x$ such that for any other instance $z$ compatible with $y$, it holds that $T(z) = T(x)$, intuitively meaning that the features in $y$ are already enough to fully justify the classification of $x$ by $T$. It has been argued, however, that sufficient reasons constitute a restrictive notion of explanation, and thus the community has started to study their probabilistic counterpart, in which one requires that the probability of $T(z) = T(x)$ must be at least some value $δ\in (0, 1]$, where $z$ is a random instance that is compatible with $y$. Our paper settles the computational complexity of $δ$-sufficient-reasons over decision trees, showing that both (1) finding $δ$-sufficient-reasons that are minimal in size, and (2) finding $δ$-sufficient-reasons that are minimal inclusion-wise, do not admit polynomial-time algorithms (unless P=NP). This is in stark contrast with the deterministic case ($δ= 1$) where inclusion-wise minimal sufficient-reasons are easy to compute. By doing this, we answer two open problems originally raised by Izza et al. On the positive side, we identify structural restrictions of decision trees that make the problem tractable, and show how SAT solvers might be able to tackle these problems in practical settings.

下载PDF全文

下载文献需遵守相关版权规定

论文标题