About Me

I am a Computer Scientist at the Center for Vision Technologies at SRI International, where I primarily work with Ajay Divakaran, Yi Yao and Giedrius Burachas on applying Deep Learning to Computer Vision and Natural Language Processing tasks. I completed my Master's in Computer Engineering at Virginia Tech (VT). While at VT, I had the honor of working in the Computer Vision Lab, advised by Prof. Devi Parikh and in close collaboration with Prof. Dhruv Batra.

I believe in connection. In my research, I seek to facilitate human-AI and AI-AI connections to solve tasks effectively. Currently, I am trying to do so by imbuing computers with the intelligence to comprehend language and images. I believe this is a crucial step towards having seamless and natural interactions with humans.


  • 2018: I presented our work on using an image-retrieval game to evaluate the efficacy of AI explanations/rationalizations at the VQA/Visual Dialog Workshop at CVPR 2018. arXiv paper coming soon.
  • 2017: I hosted the annual SRI CVT Lab Day Team Challenge with a human-AI collaborative image-retrieval game with AIs that can "rationalize/explain" their decisions to humans!
  • 2017: I came in 3rd place in SRI CVT's Shark Tank Competition - a competition where recent graduates pitch a proposal idea.
  • 2017: I started working at SRI International - Center for Vision Technologies
  • 2017: I graduated from Virginia Tech and successfully defended my thesis!
  • 2017: I gave a talk at MACV 2017, UPenn on The Visual 20 Questions Game. Link to slides (voice overlay coming soon)
  • 2016: Paper on VQA Relevance accepted at EMNLP 2016! Title: Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions
  • 2016: Paper on CoNLL Shared Task on Discourse Parsing accepted at CoNLL 2016! We ranked within the top 10 worldwide!
  • 2016: I won the Employee of the Fortnight award as an intern at Blue River Technology! (Update: Acquired by John Deere as of September 2017)
  • 2016: I am interning at a Deep Learning agri-tech startup called Blue River Technology over the summer of 2016. (Update: Acquired by John Deere as of September 2017)
  • 2016: I will be serving as the Vice President of Tau Beta Pi, Virginia Beta Chapter, from Fall 2016 for a year!
  • 2015: I served as a reviewer for CVPR 2016!
  • 2015: An online demo for predicting plausibility of natural language triplet sentences.
  • 2014: Our face-recognition UAV was featured in national news: Deccan Chronicle, Indian Express, etc.
  • 2013: I won a silver medal at SRM University Research Day for my white-paper presentation on an Electro-Mechanical Exoskeleton.
  • 2012: I won an Academic Merit Scholarship from SRM University that waives a part of my tuition for excellent academic performance.


When I am not making multi-layer neural networks, you will usually find me playing the guitar (part of the School of Rock, Princeton band), or lindy-hop/balboa dancing! I try my hand at cooking/baking too at times. I am the proud inventor of a more interesting (read: unhealthier) way to make Ramen Noodles, the recipe for which I will put up soon. Check back later for more!

As a child, I was always interested in engineering simple gadgets and software. I started an informal research society in my middle school to encourage middle schoolers to take an interest in science by engineering simple gadgets. My previous website features some of those naive endeavors along with some (a little less naive) projects I did as an undergrad.

Press Coverage

  • Deccan Chronicle, Indian Express, Engineering.Careers360: UAV with Facial Recognition Capabilities, Click here

    Dr. Erik Brynjolfsson, the Director of the MIT Center for Digital Business, tweeted:

Projects / Publications:

The Art of Deep Connection - Towards Natural and Pragmatic Conversational Agent Interactions. [Master's Thesis], Virginia Tech E-Library, 2017

Make RBF Networks Fast Again - Exploiting Multi-Threaded Computing to Speed Up RBF Networks. [draft paper] [code]

Ray, Arijit, Gordon Christie, Mohit Bansal, Dhruv Batra, and Devi Parikh. "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions." In EMNLP 2016. [pdf] [code] [Video]

Prashant Chandrasekar, Xuan Zhang, Saurabh Chakravarty, Arijit Ray, John Krulick, and Alla Rozovskaya. "The Virginia Tech System at CoNLL-2016 Shared Task on Shallow Discourse Parsing." In CoNLL Shared Task (2016).

Object Prediction using Image Context: Sequentially predict the next object in an image by reasoning over the image context seen so far.

Online Demo for Predicting Plausibility of Common Sense Assertions: Enter a three-phrase tuple to get a plausibility score based on joint language-vision common-sense reasoning.

Learning to Listen: Matching Cover songs with Original Productions: Match Original Songs to Cover Songs using an Ensemble of Supervised and Unsupervised Approaches.

Ray, Arijit, Kishan Prudhvi Guddanti, and N. Chellammal. "An Approach to Intelligent Traction Control Using Regression Networks and Anomaly Detection." Applied Artificial Intelligence 29.6 (2015): 597-616.

