2016 EMNLP EMNLP 2016

Human Attention in Visual Question Answering: Do Humans and Deep Networks look at the same regions?

The Questioner