2017 ICCV ICCV 2017

Multi-Modal Factorized Bilinear Pooling With Co-Attention Learning for Visual Question Answering