Visual Question Answering: Datasets, Algorithms, and Future Challenges.
Computer Vision and Image Understanding(2017)
摘要
•Comparison of visual question answering (VQA) with related computer vision tasks.•Critical review of all major VQA datasets and evaluation metrics.•Comprehensive review and comparison of existing methods for VQA.•All major datasets have language and difficulty bias that critically affects VQA.•Recommendations for future VQA datasets and evaluation metrics to combat bias.
更多查看译文
关键词
Image understanding,Natural language processing,Vision and language
AI 理解论文
溯源树
样例
生成溯源树,研究论文发展脉络