ECE 5554 & ECE 4984 Fall 2015

ECE 5554: Computer Vision, Fall 2015

ECE 4984: Introduction to Computer Vision, Fall 2015

Electrical and Computer Engineering Department
Virginia Tech

Meets:
Tuesdays and Thursdays (TR)
From 3:30 PM to 4:45 PM
In Goodwin Hall (GOODW) 135

Instructor:
Devi Parikh
Email: parikh@vt.edu
Office: 440 Whittemore Hall
Office hours: Fridays 3:00 pm to 4:00 pm (starting 09/11/2015)

TA:
Jiasen Lu
Email: jiasenlu@vt.edu
Office: 264 Whittemore Hall
Office Hours: Mondays 10:00 am to 11:30 am
Wednesdays 1:30 pm to 3:00 pm

Announcements Course overview Requirements Schedule Class Projects References

Announcements:

Welcome to ECE 5554!

Course overview:

"To see is to know what is where by looking"- David Marr

Over the past few decades, machines have come a long way in their ability to "see". Some examples are autonomous navigators like Mars Rover and Google car, medical imaging technologies, pedestrian detectors in vehicles, image search engines, face detectors in cameras and Facebook, aids for the visually impaired, control-free video games, and industrial automation systems.

In this introductory Computer Vision course, we will learn how to "teach machines to see". We will explore several fundamental concepts including image formation, feature detection, segmentation, multiple view geometry, recognition, and video processing. We will use these concepts to build applications that aid machines to see the world around them.

Requirements:

Problem Sets (55% of final grade): You will be given 6 problem sets, one approximately every two weeks. These will involve a combination of conceptual questions and programming problems. The programming problems will provide hands-on experience working with techniques covered in or related to the lectures. All code and written responses must be completed individually and submitted to Scholar. Most problem sets will take significant time to complete. Please start early. Problem Set 0 (PS0) will be worth 5% the final grade, and the remaining 5 problem sets will be worth 10% each.

Project (25% of final grade): Your project can be about applying any of the techniques we studied in class to real world problems. You can also extend a technique, or empirically analyze it. Comparisons between two approaches are also welcome. It is wonderful if you design and evaluate a novel approach to an important existing or new vision problem. Be creative! You must work in teams of two or more. The instructor may make exceptions. If you want to work on a project alone, come talk to the instructor first. The following are deliverables for your project. Only one team-member should submit each deliverable. That is, only one submission (for each deliverable) per team please.

Proposal (10% of project grade): A 1-page description that describes the following:

Problem statement: Clearly state the goal of your project. When someone uses your system, what is the expected input to the system, and what is the desired output?
Approach: Describe the technical approach you plan to employ.
Experiments and results: Describe the experimental setup you will follow, which datasets you will use, which existing code you will exploit, what you will implement yourself, and what you would define as a success for the project. If you plan on collecting your own data, describe what data collection protocol you will follow. Provide a list of experiments you will perform. Describe what you expect the experiments to reveal, or what is uncertain about the potential outcomes.

Project Report Webpage (50% of project grade): We expect you to have completed your project by this time. The only thing we expect between this and the final poster presentation is incorporating feedback from the instructor and/or TA. The report should be in the form of a webpage. It should be a nice, professional looking, visual, self-contained webpage describing your project. We will link to all project pages from the class webpage. Any student in the class should be able to clearly understand your report such that he/she can implement it.Below is an outline of what we expect the webpage to cover:

Abstract: One or two sentences on the motivation behind the problem you are solving. One or two sentences describing the approach you took. One or two sentences on the main result you obtained.
Teaser figure: A figure that conveys the main idea behind the project or the main application being addressed.
Introduction: Motivation behind the problem you are solving, what applications it has, any brief background on the particular domain you are working in (if not regular RBG photographs), etc. If you are using a new way to solve an existing problem, briefly mention and describe the existing approaches and tell us how your approach is new.
Approach: Describe very clearly and systematically your approach to solve the problem. Tell us exactly what existing implementations you used to build your system. Tell us what obstacles you faced and how you addressed them. Justify any design choices or judgment calls you made in your approach.
Experiments and results: Provide details about the experimental set up (number of images/videos, number of datasets you experimented with, train/test split if you used machine learning algorithms, etc.). Describe the evaluation metrics you used to evaluate how well your approach is working. Include clear figures and tables, as well as illustrative qualitative examples if appropriate. Be sure to include obvious baselines to see if your approach is doing better than a naive approach (e.g. for classification accuracy, how well would a classifier do that made random decisions?). Also discuss any parameters of your algorithms, and tell us how you set the values of those parameters. You can also show us how the performance varies as you change those parameter values. Be sure to discuss any trends you see in your results, and explain why these trends make sense. Are the results as expected? Why?
Qualitative results: Show several visual examples of inputs/outputs of your system (success cases and failures) that help us better understand your approach.
Conclusion and future work: Conclusion would likely make the same points as the abstract. Discuss any future ideas you have to make your approach better.
References: List out all the references you have used for your work.
Examples:

See this for a webpage template.
See this for an example of a nice, professional looking page.
See this for an example of how to lay out the various details of your project. You may need to provide more details than this, beause you will not be submitting an associated paper to accompany the webpage. So the page should be self-contained.

Poster Presentation (40% of project grade): A poster presentation that describes the same points as the proposal and the project report webpage. A sample poster template can be downloaded from here. Describe the results you have obtained. This may involve some additional experiments, etc. based on the instructor's and/or TA's feedback on your report webpage. Live demos when applicable are highly encouraged. Prepare a 1 minute schpeel that presents the main summary of your work. The instructor and TA will ask further questions as necessary.

Due Dates: All problem sets/reports are to be submitted by the due date noted on the assignment. Deadlines are firm. Anything from 1 minute to 24 hours is one day late.

Late Day policy: Throughout the term you have an allowance of four free late days for your submissions, meaning you can accrue up to four days in late submissions with no penalty. For example, you could turn in one assignment four days late, or two assignments/project proposal/project report webpage each one day late. Once you have used all your free late days, a late submission will not be accepted. Please plan ahead so you can spend your late days wisely. In particular, note that we expect you will find the earlier assignments easier than those later in the course. We will count a full additional day as having passed for submissions 1 minute to 24 hours late. Submissions that are one day late after all the late days are used up will receive only up to 50% credit. Submissions that are two days late after all the late days are used up will receive only up to 25% credit. After that, 0 credit.

Exams (15% of final grade): There will be an in-class final exam. The exam is closed everything, but one 8.5 x 11 cheat sheet is allowed (two sides).

Participation and attendance (5% of final grade): Participation in class and regular attendance is expected. If for whatever reason you are absent, it is your responsibility to find out what you missed that day. Note that attendance and participation do factor into the final grade.

Textbook: Computer Vision: Algorithms and Applications, by Rick Szeliski. An electronic copy is available free online here. Some background reading on object recognition is from Kristen Grauman and Bastian Leibe's short book on Visual Object Recognition.

Scholar: The scholar webpage for this course is here (https://scholar.vt.edu/portal/site/vision_fall2015).

All problem sets, proposals and links to project report pages must be submitted in the Assignments section. No hard copies are accepted.

Any questions relating to the course must be first raised in the Forums section, before reaching out to the Instructor/TA.

Lectures and problem sets will be available on this webpage (not on scholar).

Schedule (tentative):

Day/Date	Topic	Readings and links	Lectures	Assignments, Exams, Deadlines
Tue 8/25/2015	Course Intro	Sec 1.1-1.3	Intro [ppt]	PS0 out

Thu 8/27/2015	Features and filters	Sec 3.1.1-2, 3.2	Linear filters [ppt]
Mon 08/31/2015				PS0 due
Tue 9/1/2015		Sec 3.2.3, 4.2 Seam carving paper Seam carving video	Gradients [ppt]
Thu 9/3/2015		Sec 3.3.2-4	Edges and binary image analysis [ppt]	PS1 out
Tue 9/8/2015		Sec 10.5 Texture Synthesis	Texture [ppt]
Thu 9/10/2015		Sec 2.3.2 Foundations of Color, B. Wandell Lotto Lab illusions	Color [ppt]

Tue 9/15/2015	Grouping and fitting	Sec 5.2-5.4 k-means demo	Segmentation and clustering [ppt]
Wed 9/16/2015				PS1 due
Thu 9/17/2015		Sec 4.3.2 Hough Transform demo Excerpt from Ballard & Brown	Hough transform [ppt]
Tue 9/22/2015		Sec 5.1.1	Deformable contours [ppt]	PS2 out
Thu 9/24/2015		Sec 2.1.1, 2.1.2, 6.1.1	Alignment and 2D image transformations [ppt]

Tue 9/29/2015	Multiple views and motion	Sec 3.6.1, 6.1.4	Homography and image warping [ppt] Notes on homography matrix	Project proposals due
Thu 10/1/2015		Sec 4.1	Local invariant features 1 [ppt]
Mon 10/5/2015				PS2 due
Tue 10/6/2015		Sec 4.1	Local invariant features 2 [ppt]	PS3 out
Thu 10/8/2015		Sec 11.1.1, 11.2-11.5	Image formation [ppt]
Tue 10/13/2015		Sec 11.1.1, 11.2-11.5 Epipolar geometry demo Audio camera, O'Donovan et al. Virtual viewpoint video, Zitnick et al.	Epipolar geometry and stereo [ppt]
Thu 10/15/2015		Tomasi Kanade 1992 Affine structure fom motion Epipolar geometry and projective structure from motion	Structure from motion [ppt]

Mon 10/19/2015				PS3 due
Tue 10/20/2015	Recognition	Grauman & Leibe Ch 1-4 (3 is review) Grauman & Leibe Ch 5,6 Szeliski 14.3 Video Google demo by Sivic et al., paper	Indexing local features and instance recognition [ppt]
Thu 10/22/2015		Grauman & Leibe Ch 7,8.1,9.1,11.1 Szeliski 14.1	Intro to category recognition [ppt]
Tue 10/27/2015		Grauman & Leibe Ch 7,8.1,9.1,11.1 Szeliski 14.1 Viola-Jones face detection paper (for additional reference)	Face detection [ppt]
Thu 10/29/2015		Grauman & Leibe Ch 11.3 11.4 Szeliski 14.4	Discriminative classifiers for image recognition [ppt]
Tue 11/3/2015		Grauman & Leibe Ch 11.3 11.4 Szeliski 14.4	Parts-based models [ppt]	PS4 out
Thu 11/5/2015		Eigenfaces Fisherfaces Face recognition: A Literature Survey (pp 1-26)	Face recognition [ppt]

Tue 11/10/2015	Video processing	8.4,12.6.4	Motion and optical flow [ppt]
Thu 11/12/2015			No class
Mon 11/16/2015				PS4 due
Tue 11/17/2015		8.4,12.6.4 Davis & Bobick paper: The Representation and Recognition of Action Using Temporal Templates Stauffer & Grimson paper: Adaptive Background Mixture Models for Real-Time Tracking.	Background subtraction, action recognition [ppt]
Thu 11/19/2015		5.1.2, 4.1.4	Tracking [ppt]	PS5 out
Sat 11/21/2015			Project report due	Project report due Template
Tue 11/24/2015			No class (Thanksgiving)
Thu 11/26/2015			No class (Thanksgiving)

Tue 12/1/2015			Guest lectures: Convolutional Neural Networks (CNNs) [ppt]: Michael Cogswell Learning and Using Common Sense [ppt]: Xiao Lin
Wed 12/2/2015				PS5 due
Thu 12/3/2015			Poster Presentations 2:30 pm - 5:30 pm GoodWin Hall Atrium	List of class projects
Tue 12/8/2015			Visual Question Answering (VQA) + SPOT Evaluations
Thu 12/10/2015			Reading day (no class)
Tue 12/15/2015			Final Exam 10:05AM to 12:05PM	Final Exam