ECE 6554: Advanced Computer
Vision, Spring 2016
Electrical
and Computer
Engineering Department,
Virginia
Tech
Meets: TR 5:00 pm to 6:15 pm in Lavery Hall Room 345
Instructor:
Devi
Parikh
Email: parikh@vt.edu
Forum: https://scholar.vt.edu/portal/site/s16ece6554
Course overview Project video Pre-requisite Requirements Important dates Schedule Resources
This is a graduate course in computer vision. The focus of this course is to survey and critique published approaches in computer vision. We will read and analyze the strengths and weaknesses of research papers on a variety of important topics pertaining to visual recognition and identify open research questions. See the schedule for a list of topics we will cover.
An introduction to computer vision or equivalent course. A machine learning or pattern recognition course may be beneficial.
Following are the requirements to successfully complete this course:
Paper reviews Leading Discussion Presentations Project
List of papers, particularly "Seeds / pointers for presenters:” is being updated.
Date | Topic and papers | Presenter, Discussion leads |
01/19 | Introduction |
Devi [slides] |
01/21 | Convolutional Neural Networks (and the deep learning "revolution") See: Dhruv Batra's Deep Learning Class (Fall 2015) https://computing.ece.vt.edu/~f15ece6504/ for more details. |
Presenter: Ram |
01/26 | Local features-based
image descriptions Local Convolutional Features With Unsupervised Training for Image Retrieval. M. Paulin, M. Douze, Z. Harchaoui, J.Mairal, F.Perronin and C. Schmid. ICCV 2015 Object Categorization by Learned Universal Visual Dictionary. J. Winn, A. Criminisi and T. Minka. ICCV 2005. [project page] A
Performance Evaluation of Local Descriptors.
K. Mikolajczyk and C. Schmid. CVPR 2003. |
Presenter: Mincan, Orson “For” discussion lead: nuo “Against” discussion lead: Dusold |
01/28 | Object discovery Unsupervised object discovery and localization in the wild: part-based matching with bottom-up region proposals. M. Cho, S. Kwak, C. Schmid, and J. Ponce. CVPR, 2015 Foreground
Focus: Finding Meaningful Features in Unlabeled Images.
Y. J. Lee and K. Grauman. BMVC 2008. [project
page] |
Presenter: Swazoo “For” discussion lead: Murat “Against” discussion lead: Latha |
02/02 | Object detection Faster R-CNN: Towards real-time object detection with region proposal networks. S.Ren, K. He, R. Girshick and J. Sun NIPS 2015. [code]
Histograms
of Oriented Gradients for Human Detection.
N. Dalal and B. Triggs. CVPR 2005. [video]
[PASCAL
datasets] |
Presenter: nuo “For” discussion lead: Alorf “Against” discussion lead: Mincan |
02/04 | Object proposals Edge boxes: Locating object proposals from edges. C. L. Zitnick and P. Dollar. ECCV 2014. [code]
Category
Independent Object Proposals. I.
Endres and D. Hoiem. ECCV 2010. [project] |
Presenter: Latha “For” discussion lead: Swazoo “Against” discussion lead: Murat |
02/09 | Segmentation DeepEdge: A Multi-Scale Bifurcated Deep Network for Top-Down Contour Detection. G. Bertasius, J. Shi, L. Torresani. CVPR 2015
Combining
Top-down and Bottom-up Segmentation.
E. Borenstein, E. Sharon and S. Ullman. CVPR
workshop 2004. [data] |
Presenter: Yash “For” discussion lead: Xiao “Against” discussion lead: Abraham |
02/11 | Pose Deeppose: Human pose estimation via deep neural networks. A. Toshev and C. Szegedy. CVPR 2014. Articulated Pose Estimation using Flexible Mixtures of Parts. Y. Yang, D. Ramanan. CVPR 2011.
Poselets:
Body Part Detectors Trained Using 3D Human Pose Annotations.
L. Bourdev and J. Malik. ICCV 2009. [code] |
Presenter: Rich F “For” discussion lead: Orson “Against” discussion lead: Xiao |
02/16 | Context Inside-Outside Net: Detecting Objects in Context with Skip Pooling and Recurrent Neural Networks. S. Bell, C.L. Zitnick, K. Bala, R. Girshick. arXiv:1512.04143 2015 The role of context for object detection and semantic segmentation in the wild. R. Mottaghi, X. Chen, X. Liu, N.G. Cho, S.W. Lee, S. Fidler and R. Urtasun. CVPR 2014
An
Empirical Study of Context in Object Detection.
S. Divvala, D. Hoiem, J. Hays, A. Efros and M. Hebert. CVPR 2009. [project
page] |
Presenter: Aroma “For” discussion lead: Arjun “Against” discussion lead: Mostafa |
02/18 | 3D layout Single Image 3D Without a Single 3D Image. D. Fouhey, W. Hussain, A. Gupta, M. Heber. ICCV 2015. Unfolding an Indoor Origami World. D. Fouhey, A. Gupta, and M. Hebert. ECCV 2014. [project page] Box In the Box: Joint 3D Layout and Object Reasoning from Single Images. A. Schwing, S. Fidler, M. Pollefeys and R. Urtasun. ICCV 2013
|
Presenter: Pooja “For” discussion lead: Abraham “Against” discussion lead: Rich F |
02/23 | Holistic scene
understanding Write review for and discuss: TextonBoost: Joint Appearance, Shape and Context Modeling for Multi-Class Object Recognition and Segmentation. J. Shotton, J. Winn, C. Rother and A. Criminisi. ECCV 2006. [project page] [data] [code] Seeds / pointers for presenters: Describing the Scene as a Whole: Joint Object Detection, Scene Classification and Semantic Segmentation. J. Yao, S. Fidler and R. Urtasun. CVPR 2012. |
Presenter:
Murat
“For” discussion lead: Mostafa “Against” discussion lead: Jinwoo |
02/25 | Groups of objects Write review for and discuss: Recognition Using Visual Phrases. M. Sadeghi and A. Farhadi. CVPR 2011. Seeds / pointers for presenters: Automatic Discovery of Groups of Objects for Scene Understanding. C. Li, D. Parikh and T. Chen. CVPR 2012. [project page] |
Presenter: Alorf “For” discussion lead: Aroma “Against” discussion lead:
Arjun |
03/01 | Project proposals due Saliency What makes a patch distinct. R. Margolin, A. Tal and L. Zelnik-Manor. CVPR 2013.
Learning
to Detect a Salient Object.
T. Liu, J. Sun, N. Zheng, X. Tang, H. Shum. CVPR 2007. [results]
[data]
[code] |
Presenter: Xiao “For” discussion lead: Graham “Against” discussion lead: Aroma |
03/03 | Importance VIP: Finding Important People in Images. C.S. Mathialagan, A.C. Gallagher. and D. Batra. CVPR 2015. [demo]
Understanding
and Predicting Importance in Images.
A. Berg, T. Berg, H. Daume, J. Dodge, A. Goyal, X. Han, A, Mensch, M.
Mitchell, A. Sood, K. Stratos and K. Yamaguchi. CVPR 2012. [UIUC
sentence dataset] [ImageClef
dataset] |
Presenter: Arjun “For” discussion lead: Arijit “Against” discussion lead: Adithya |
03/08 | Spring break: no class |
N/A |
03/10 | Spring break: no class |
N/A |
03/15 | Images of people |
Presenter: Arijit “For” discussion lead: Mincan “Against” discussion lead: nuo |
03/17 | Action Recognition Action recognition with trajectory-pooled deep-convolutional descriptors. L. Wang, Q. Yu, and T. Xiao. arXiv preprint arXiv:1505.04868 (2015) Action recognition by dense trajectories. In Computer Vision and Pattern Recognition. H. Wang, A. Kläser, C. Schmid. and C.L Liu. CVPR 2011. Action
Recognition from a Distributed Representation of Pose and Appearance.
S. Maji, L. Bourdev and J. Malik. CVPR 2011. [code] |
Presenter: Graham “For” discussion lead: Jinwoo “Against” discussion lead: Alorf |
03/22 | Global and high-level
image descriptors
Efficient
Object Category Recognition Using Classemes. L.
Torresani, M. Szummer and A. Fitzgibbon. ECCV 2010. [code
and data] CNN features off-the-shelf: an astounding baseline for recognition. Razavian AS, Azizpour H, Sullivan J, Carlsson S. CVPRW 2014.
Objects
as Attributes for Scene Classification.
L.-J. Li, H. Su, Y. Lim and L. Fei-Fei, 1st International Workshop on
Parts and Attributes, ECCV 2010. |
Presenter: Adithya “For” discussion lead: Harsh “Against” discussion lead: Orson |
03/24 | No class because Devi is traveling |
N/A |
03/29 | Attributes Write review for and discuss: Learning To Detect Unseen Object Classes by Between-Class Attribute Transfer. C. Lampert, H. Nickisch and S. Harmeling. CVPR 2009. [project page with data] Seeds / pointers for presenters: Describing Objects by Their Attributes. A. Farhadi, I. Endres, D. Hoiem and D. Forsyth, CVPR 2009. [data] Relative Attributes. D. Parikh and K. Grauman. ICCV 2011. [code and data] Panda: Pose aligned networks for deep attribute modeling, N. Zhang, M. Paluri, MA. Ranzato, T. Darrell, L. Bourdev. CVPR 2014 |
Presenter: Jinwoo “For” discussion lead: Adithya “Against” discussion lead: Yash |
03/31 | No class because Devi is traveling |
N/A |
04/05 | No class because Devi is traveling |
N/A |
04/07 | No class because Devi is traveling |
N/A |
04/12 | Language and images Show and tell: A neural image caption generator. O. Vinyals, A. Toshev, S. Bengio. and D. Erhan. arXiv 2014. Deep Visual-Semantic Alignments for Generating Image Descriptions. A. Karpathy, L. Fei-Fei. CVPR 2015. VQA: Visual Question Answering. S. Antol*, A. Agrawal*, J. Lu, M. Mitchell, D. Batra, C. L. Zitnick, and D. Parikh ICCV 2015. [Project Page][code][demo] Baby
Talk: Understanding and Generating Simple Image Descriptions.
G. Kulkarni, V. Premraj, S. Dhar, S. Li, Y. Choi, A. C. Berg and T. L.
Berg. CVPR 2012. |
Presenter: Harsh “For” discussion lead: Yash “Against” discussion lead: Arijit |
04/14 | Human-in-the-loop |
Presenter: Sneha “For” discussion lead: Aishwarya “Against” discussion lead: Swazoo |
04/19 | Crowdsourcing Cost-Effective HITs for Relative Similarity Comparisons. MJ. Wilber, IS. Kwak, SJ. Belongie. AAAI Human Computation and Crowdsourcing 2014 |
Presenter:
Dusold
“For” discussion lead: Latha “Against” discussion lead: Sneha |
04/21 | Applications |
Presenter: Abraham “For” discussion lead: Rich F “Against” discussion lead: Aishwarya |
04/26 | No class because Devi is traveling |
N/A |
04/28 | Human abilities Write review for and discuss: Rapid natural scene categorization in the near absence of attention. L. Fei-Fei, R. VanRullen, C. Koch and P. Perona. PNAS 2002. Seeds / pointers for presenters: What Do We Perceive in a Glance of a Real-World Scene? L. Fei-Fei, A. Iyer, C. Koch and P. Perona. Journal of Vision, 2007. |
Presenter: Aishwarya “For” discussion lead: Dusold “Against” discussion lead: Graham |
05/03 | Big data |
Presenter: Mostafa “For” discussion lead: Sneha “Against” discussion lead: Harsh
|
05/08 | Final project presentations 9:00 am to 1:00 pm in Whittemore 654.
|
|
05/11 | Project video due by 11:59 am (noon) |
Other code and data:
Similar courses:
This course has been inspired by the following two courses: