自动现金：自主分类算法选择具有深Q-Network

论文标题

自动现金：自主分类算法选择具有深Q-Network

Auto-CASH: Autonomous Classification Algorithm Selection with Deep Q-Network

论文作者

Mu, Tianyu, Wang, Hongzhi, Wang, Chunnan, Liang, Zheng

论文摘要

各种数据源生成的大量数据集对机器学习算法选择和超参数配置提出了挑战。对于特定的机器学习任务，通常需要足够的时间来选择适当的算法并配置其超参数。如果可以自动解决算法选择和超参数优化的问题，则该任务将通过性能保证更有效地执行。这样的问题也称为现金问题。早期工作要么需要大量的人工劳动，要么遭受高时间或空间复杂性的苦难。在我们的工作中，我们介绍了基于元学习的预先培训模型Auto-Cash，以更有效地解决现金问题。 Auto-Cash是第一种利用深Q网络来自动选择每个数据集的元功能的方法，从而在不引入过多人类劳动的情况下大大降低了时间的成本。为了证明我们的模型的有效性，我们对120个现实世界分类数据集进行了广泛的实验。与经典和最先进的现金方法相比，实验结果表明，自动现金在较短的时间内实现了更好的性能。

The great amount of datasets generated by various data sources have posed the challenge to machine learning algorithm selection and hyperparameter configuration. For a specific machine learning task, it usually takes domain experts plenty of time to select an appropriate algorithm and configure its hyperparameters. If the problem of algorithm selection and hyperparameter optimization can be solved automatically, the task will be executed more efficiently with performance guarantee. Such problem is also known as the CASH problem. Early work either requires a large amount of human labor, or suffers from high time or space complexity. In our work, we present Auto-CASH, a pre-trained model based on meta-learning, to solve the CASH problem more efficiently. Auto-CASH is the first approach that utilizes Deep Q-Network to automatically select the meta-features for each dataset, thus reducing the time cost tremendously without introducing too much human labor. To demonstrate the effectiveness of our model, we conduct extensive experiments on 120 real-world classification datasets. Compared with classical and the state-of-art CASH approaches, experimental results show that Auto-CASH achieves better performance within shorter time.

下载PDF全文

下载文献需遵守相关版权规定

论文标题