The Creativity of Text-to-Image Generation

Paper Title

The Creativity of Text-to-Image Generation

Paper Author

Oppenlaender, Jonas

Paper Abstract

Text-guided synthesis of images has made a giant leap towards becoming a mainstream phenomenon. With text-to-image generation systems, anybody can create digital images and artworks. This provokes the question of whether text-to-image generation is creative. This paper expounds on the nature of human creativity involved in text-to-image art (so-called "AI art") with a specific focus on the practice of prompt engineering. The paper argues that the current product-centered view of creativity falls short in the context of text-to-image generation. A case exemplifying this shortcoming is provided and the importance of online communities for the creative ecosystem of text-to-image art is highlighted. The paper provides a high-level summary of this online ecosystem drawing on Rhodes' conceptual four P model of creativity. Challenges for evaluating the creativity of text-to-image generation and opportunities for research on text-to-image generation in the field of Human-Computer Interaction (HCI) are discussed.

Paper Title