论文标题
基于标题的创意行业形象生成数据集
A Creative Industry Image Generation Dataset Based on Captions
论文作者
论文摘要
大多数图像生成方法都难以精确地控制生成图像的属性,例如结构,比例,形状等,从而限制了其在概念设计和图形设计等创意行业中的大规模应用等。使用提示和草图是可控性的实用解决方案。现有数据集缺乏提示或草图,并且不是为创意行业设计的。这是我们工作的主要贡献。 a)这是涵盖创意行业领域最重要领域的第一个数据集,并用及时和草图标记。 b)我们在测试集中提供多个参考图像,并为每个参考提供细粒度分数,这些参考图像对测量很有用。 c)我们将两个最先进的模型应用于我们的数据集,然后找到一些缺点,例如提示比草图高。
Most image generation methods are difficult to precisely control the properties of the generated images, such as structure, scale, shape, etc., which limits its large-scale application in creative industries such as conceptual design and graphic design, and so on. Using the prompt and the sketch is a practical solution for controllability. Existing datasets lack either prompt or sketch and are not designed for the creative industry. Here is the main contribution of our work. a) This is the first dataset that covers the 4 most important areas of creative industry domains and is labeled with prompt and sketch. b) We provide multiple reference images in the test set and fine-grained scores for each reference which are useful for measurement. c) We apply two state-of-the-art models to our dataset and then find some shortcomings, such as the prompt is more highly valued than the sketch.