论文标题
Run2survive:基于生存分析的算法选择的决策理论方法
Run2Survive: A Decision-theoretic Approach to Algorithm Selection based on Survival Analysis
论文作者
论文摘要
算法选择(AS)从最适合算法问题类的特定实例的固定候选算法中自动选择算法,其中“适用性”通常是指算法的运行时。由于候选算法的运行时间可能非常长,因此通常在时间限制下生成算法选择模型的训练数据,因为并非所有算法都在所有情况下都可以完成。因此,培训数据通常包含审查信息,因为算法的真实运行时仍未知。但是,许多标准AS方法无法以适当的方式处理此类信息。另一方面,生存分析(SA)自然支持审查的数据,并提供适当的方法来学习算法运行时的学习分布模型,正如我们在这项工作中所证明的那样。我们利用这样的模型作为复杂的决策理论方法的基础来选择算法,我们将其配置为Run2survive。此外,利用了这种框架,我们提倡采用一种规避风险的算法选择方法,其中避免了超时的优先级。在用标准基准Aslib进行的广泛实验研究中,我们的方法被证明具有很高的竞争力,在许多情况下,甚至在方法中甚至都优于最先进的方法。
Algorithm selection (AS) deals with the automatic selection of an algorithm from a fixed set of candidate algorithms most suitable for a specific instance of an algorithmic problem class, where "suitability" often refers to an algorithm's runtime. Due to possibly extremely long runtimes of candidate algorithms, training data for algorithm selection models is usually generated under time constraints in the sense that not all algorithms are run to completion on all instances. Thus, training data usually comprises censored information, as the true runtime of algorithms timed out remains unknown. However, many standard AS approaches are not able to handle such information in a proper way. On the other side, survival analysis (SA) naturally supports censored data and offers appropriate ways to use such data for learning distributional models of algorithm runtime, as we demonstrate in this work. We leverage such models as a basis of a sophisticated decision-theoretic approach to algorithm selection, which we dub Run2Survive. Moreover, taking advantage of a framework of this kind, we advocate a risk-averse approach to algorithm selection, in which the avoidance of a timeout is given high priority. In an extensive experimental study with the standard benchmark ASlib, our approach is shown to be highly competitive and in many cases even superior to state-of-the-art AS approaches.