An example of attention on image of a flower vase

Modifying Stacked Attention Networks for VQA

An example of attention on image of a flower vase

Modifying Stacked Attention Networks for VQA

We modified the architecture of Stacked Attention Network (SANs), which originally utilized image features to project attention on query-vector, for the problem of Image QA, by trying different combinations of attention on image features and query vector. On the dataset we used, the modified models performed on par with the original model proposed in paper.