Paper Title

It Isn't Sh!tposting, It's My CAT Posting

Authors

Parthsarthi Rawat, Sayan Das, Jorge Aguirre, Akhil Daphara

Abstract

In this paper, we describe a novel architecture that generates hilarious captions for a given input image. The architecture is split into two halves: image captioning and hilarious text conversion. It starts with a pre-trained CNN model (VGG16 in this implementation) and applies an attention LSTM on top of it to generate a normal caption. These normal captions are then fed to our hilarious text conversion transformer, which converts the text into something hilarious while maintaining the context of the input image. The two halves can also be used separately: the seq2seq transformer alone can generate a hilarious caption from an input sentence. This paper aims to help everyday users be lazier and more hilarious at the same time by generating captions with CATNet.
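The two-stage design described in the abstract can be sketched as a simple composition of two callables. This is only an illustration of the control flow, not the authors' implementation: the names `TwoStagePipeline`, `Captioner`, `Humorizer`, and both stub functions are hypothetical, and the stubs stand in for the trained VGG16 + attention LSTM captioner and the seq2seq transformer, neither of which is specified in detail here.

```python
from dataclasses import dataclass
from typing import Any, Callable

# Hypothetical interfaces; the abstract does not define them.
Captioner = Callable[[Any], str]   # image -> plain caption (stage 1)
Humorizer = Callable[[str], str]   # plain sentence -> funny sentence (stage 2)


@dataclass
class TwoStagePipeline:
    """Illustrative composition of the two halves named in the abstract."""
    captioner: Captioner
    humorizer: Humorizer

    def caption_image(self, image: Any) -> str:
        # Stage 1 produces a plain caption, stage 2 rewrites it.
        plain = self.captioner(image)
        return self.humorizer(plain)

    def rewrite_sentence(self, sentence: str) -> str:
        # Text-only use: skip stage 1 and feed the sentence to stage 2.
        return self.humorizer(sentence)


# Stand-in stubs so the sketch runs without any trained model.
def stub_captioner(image: Any) -> str:
    return "a cat sitting on a couch"


def stub_humorizer(sentence: str) -> str:
    return sentence + " (plotting world domination)"


pipeline = TwoStagePipeline(stub_captioner, stub_humorizer)
print(pipeline.caption_image(None))
print(pipeline.rewrite_sentence("a dog in the park"))
```

Keeping the two stages behind separate callables mirrors the abstract's point that the text-conversion half can be used on its own with a sentence as input.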
