Paper: "MUREL: Multimodal Relational Reasoning for Visual Question Answering", key-point translation + extensions

Date: 2020-12-25

Multimodal attentional networks are currently state-of-the-art models for Visual Question Answering (VQA) tasks involving real images. In this paper, we propose MuRe