About Me

I am a Computer Scientist at the Center for Vision Technologies at SRI International, where I work on applying Deep Learning to Computer Vision and Natural Language Processing tasks. I finished graduate school (Master's) majoring in Computer Engineering at Virginia Tech (VT). While at VT, I had the honor of working in the Computer Vision Lab with Prof. Devi Parikh and Prof. Dhruv Batra.

I believe in connection. In my research, I seek to facilitate human-AI and AI-AI connections to solve tasks effectively. Currently, I am trying to do so by imbuing the intelligence to comprehend languages and images to computers. I believe this is a crucial step towards having seamless and natural interactions with humans.


I enjoy hiking, biking, and lasertag! I try my hand at baking and cooking too at times. I am the proud inventor of a more interesting (read unhealthier) way to make Ramen Noodles, the recipe of which I will put up soon. Recently, I have taken up an interest in Swing Dancing! Check back later for more!

Recent News:

Press Coverage

  • Deccan Chronicle, Indian Express, Engineering.Careers360: UAV with Facial Recognition Capabilities, Click here

Projects / Publications:

Make RBF Networks Fast Again- Exploiting Multi-Threaded Computing to Speed Up RBF Networks. [draft paper] [code]

Ray, Arijit, Gordon Christie, Mohit Bansal, Dhruv Batra, and Devi Parikh, "Question Relevance in VQA: Identifying Non-Visual And False-Premise Questions.", in EMNLP 2016. [pdf] [code] [Video]

Prashant Chandrasekar, Xuan Zhang, Saurabh Chakravarty, Arijit Ray, John Krulick, and Alla Rozovskaya, "The Virginia Tech System at CoNLL-2016 Shared Task on Shallow Discourse Parsing", In CoNLL Shared Task (2016).

Object Prediction using Image Context: Predict next object in an image reasoned on present image context in a sequential manner.

Online Demo for Predicting Plausibility of Common Sense Assertions: Enter a three-phrase tuple to assess the plausibility score based on a joint language-vision common-sense reasoning.

Learning to Listen: Matching Cover songs with Original Productions: Match Original Songs to Cover Songs using an Ensemble of Supervised and Unsupervised Approaches.

Ray, Arijit, Kishan Prudhvi Guddanti, and N. Chellammal. "An Approach to Intelligent Traction Control Using Regression Networks and Anomaly Detection." Applied Artificial Intelligence 29.6 (2015): 597-616.

Contact Me

Have a question?

Best way to reach me would be to drop an email to ray93 at vt dot edu. Urgent? Call me on (78one) five 18-8 three twentysix! Please don't spam me!