Paper "MUREL: Multimodal Relational Reasoning for Visual Question Answering Remi": Key Points Translated and Extended

Multimodal attentional networks are currently state-of-the-art models for Visual Question Answering (VQA) tasks involving real images. In this paper, we propose MuRe