Survey: VQA

Date: 2020-12-26

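VQA: Given an image and a question in natural language, the task requires reasoning over the visual elements of the image, together with general knowledge, to infer the correct answer.

Differences from object-detection-based tasks:
Object recognition - classify the main object in the image.
Object detection - by ...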