论文-《Bottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering 》重点翻译+扩展

论文下载 摘要Abstract Top-down: Top-down visual attention mechanisms have been used extensively in image captioning and visual question answering (VQA) to enable deeper image understanding through fine-grai
相关文章
相关标签/搜索