Towards Transparent AI Systems: Interpreting Visual Question Answering Models
International Conference on Machine Learning (ICML) Workshop on Visualization for Deep Learning, 2016
Yash Goyal, Akrit Mohapatra,
Devi Parikh, Dhruv Batra
[Best Student Paper]
Interactive Visualizations: Question and Image
In this paper, we experimented with two visualization methods -- guided backpropagation and occlusion -- to interpret deep learning models for the task of Visual Question Answering. Specifically, we find what part of the input (pixels in images or words in questions) the VQA model focuses on while answering a question about an image.
CloudCV: Large-Scale Distributed Computer Vision as a Cloud Service
Book Chapter, Mobile Cloud Visual Media Computing
Harsh Agrawal, Clint Solomon Mathialagan, Yash Goyal, Neelima Chavali, Prakriti Banik, Akrit Mohapatra, Ahmed Osman, Dhruv Batra
Editors: Gang Hua, Xian-Sheng Hua. Springer, 2015.
We present a comprehensive system to provide access to state-of-the-art distributed computer vision algorithms as a cloud service through a Web Interface and APIs.