论文标题
具有基于项目集的生成模型的合成数据集生成
Synthetic Dataset Generation with Itemset-Based Generative Models
论文作者
论文摘要
本文提出了基于现有基于项目集的生成模型的三个不同的数据生成器,这些数据生成器量身定制为交易数据集。所有这些发电机都是直观且易于实现并表现出令人满意的性能。通过三种不同方法来评估每个发电机的质量,以捕获原始数据集结构的保留程度。
This paper proposes three different data generators, tailored to transactional datasets, based on existing itemset-based generative models. All these generators are intuitive and easy to implement and show satisfactory performance. The quality of each generator is assessed by means of three different methods that capture how well the original dataset structure is preserved.