《Look, Imagine and Match: Improving Textual-Visual Cross-Modal Retrieval with Generative Models》

Date: 2021-01-02

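Source: CVPR 2018

1. Introduction

This is the first paper to use both GANs and reinforcement learning (RL) for cross-media retrieval. The network handles three cross-media tasks at once: cross-media retrieval, image captioning, and text-to-image synthesis. For the latter two tasks, the paper gives only visualized results and no quantitative analysis. The paper was published at CVP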