IMLI：基于Maxsat的可解释分类规则学习的增量框架

论文标题

IMLI：基于Maxsat的可解释分类规则学习的增量框架

IMLI: An Incremental Framework for MaxSAT-Based Learning of Interpretable Classification Rules

论文作者

Ghosh, Bishwamittra, Meel, Kuldeep S.

论文摘要

由于最终用户需要了解由于学习系统而导致的决策背后的推理，因此在关键领域（例如医学诊断，法律，教育）中广泛采用了机器学习。可解释学习的计算棘手性使从业者设计了启发式技术，这些技术无法为权衡的准确性和解释性提供声音。在过去的十年中，最近基于MaxSat的方法（称为MIC）的成功激发，该方法旨在减少学习以结合性正常形式（CNF）表达的可解释规则的问题，以达到MaxSat查询。虽然显示MICT的精度与其他最先进的黑盒分类器相似，同时生成了小型可解释的CNF公式，但MIC的运行时性能显着滞后，并且在实践中呈现不可用的渲染方法。在这种情况下，作者提出了一个问题：是否有可能实现两全其美的最佳，即，可以在扩展到现实世界实例的同时利用MaxSat求解器的可解释学习框架？在本文中，我们迈出了肯定的上述问题的一步。我们提出IMLI：基于MAXSAT的增量方法，通过基于分区的培训方法来实现可扩展的运行时性能。对UCI存储库产生的基准测试的广泛实验表明，IMLI最多可实现三个数量级的运行时间改进，而不会丧失准确性和可解释性。

The wide adoption of machine learning in the critical domains such as medical diagnosis, law, education had propelled the need for interpretable techniques due to the need for end users to understand the reasoning behind decisions due to learning systems. The computational intractability of interpretable learning led practitioners to design heuristic techniques, which fail to provide sound handles to tradeoff accuracy and interpretability. Motivated by the success of MaxSAT solvers over the past decade, recently MaxSAT-based approach, called MLIC, was proposed that seeks to reduce the problem of learning interpretable rules expressed in Conjunctive Normal Form (CNF) to a MaxSAT query. While MLIC was shown to achieve accuracy similar to that of other state of the art black-box classifiers while generating small interpretable CNF formulas, the runtime performance of MLIC is significantly lagging and renders approach unusable in practice. In this context, authors raised the question: Is it possible to achieve the best of both worlds, i.e., a sound framework for interpretable learning that can take advantage of MaxSAT solvers while scaling to real-world instances? In this paper, we take a step towards answering the above question in affirmation. We propose IMLI: an incremental approach to MaxSAT based framework that achieves scalable runtime performance via partition-based training methodology. Extensive experiments on benchmarks arising from UCI repository demonstrate that IMLI achieves up to three orders of magnitude runtime improvement without loss of accuracy and interpretability.

下载PDF全文

下载文献需遵守相关版权规定

论文标题