Professional Documents
Culture Documents
ECE/CSE 576
Linda Shapiro
Professor of Computer Science & Engineering
Professor of Electrical & Computer Engineering
Course Information
• Time:
– Mondays and Wednesdays, 1:30 -2:50pm
• Location:
– More 234
• Contact:
– shapiro@cs.uw.edu , CSE 634
• TAs:
– Bindita Chaudhuri
– bindita@cs.washington.edu
– Deepali Aneja
– deepalia@cs.washington.edu
• Website:
– https://courses.cs.washington.edu/courses/cse576/19sp/
2
3
4
The car is in front of the pole
Sky
Person
White
Horse
Car
Road
Shadow
1m
Wheel
5
Computer Vision
Brightness
9
Measurement
Brightness
http://www.newworldencyclopedia.org/entry/Same_color_illusion 10
Slide Credit: Alyosha Efros
Measurement
Length
Müller-Lyer Illusion
11
http://www.michaelbach.de/ot/sze_muelue/index.html Slide Credit: Alyosha Efros
Image Enhancement
12
Image Enhancement
13
Image Enhancement
14
Seam Carving
less
important
18
The Digital Michelangelo Project, Levoy et al.
19
20
21
22
23
24
Google’s 3D Maps
Structure estimation from tourist photos
25
Apple’s 3D maps
https://www.youtube.com/watch?v=InIVv-LsgZE
26
Computer Vision
28
Source: S. Seitz
Vision-based interaction: Xbox Kinect
29
How hard is computer vision?
30
“In 1966, Minsky hired a first-year
undergraduate student and assigned him
a problem to solve over the summer:
connect a television camera to a
computer and get the machine to
describe what it sees.”
Crevier 1993, pg. 88
34
Why is vision so hard?
• Ill-posed problem
35
Challenges 1: view point variation
36
Michelangelo 1475-1564 slide by Fei Fei, Fergus & Torralba
Challenges 2: illumination
37
slide credit: S. Ullman
Challenges 3:
occlusion
38
Magritte, 1957 slide by Fei Fei, Fergus & Torralba
Challenges 4: scale
39
slide by Fei Fei, Fergus & Torralba
Challenges 5: deformation
40
slide by Fei Fei, Fergus & Torralba Xu, Beihong 1943
Challenges 6: background clutter
41
Klimt, 1913 slide by Fei Fei, Fergus & Torralba
Challenges 7: object intra-class variation
42
slide by Fei-Fei, Fergus & Torralba
Challenges 8: local ambiguity
43
slide by Fei-Fei, Fergus & Torralba
Challenges 9: the world behind the image
44
Slide Credit: Alyosha Efros
What Works Today?
45
• Reading license plates, zip codes, checks
46
Svetlana Lazebnik
Biometrics
47
Source: S. Seitz
Mobile visual search: Google Goggles
48
Face detection
49
Source: S. Seitz
Smile detection
51
Object recognition (in supermarkets)
LaneHawk by EvolutionRobotics
“A smart camera is flush-mounted in the checkout lane, continuously watching
for items. When an item is detected and recognized, the cashier verifies the
quantity of items that were found under the basket, and continues to close the
transaction. The item can remain under the basket, and with LaneHawk,you are
assured to get paid for it… “
52
Safety
53
Security
54
Automotive safety
Oct 9, 2010. "Google Cars Drive Themselves, in Traffic". The New York Times. John Markoff
June 24, 2011. "Nevada state law paves the way for driverless cars". Financial Post.
Christine Dobby
Aug 9, 2011, "Human error blamed after Google's driverless car sparks five-vehicle crash"
. The Star (Toronto)
56
Vision-based interaction: Xbox Kinect
57
Augmented reality, consumer products
58
http://nconnex.com/wp/
Special effects: shape and motion capture
59
Source: S. Seitz
Vision for robotics, space exploration
NASA'S Mars Exploration Rover Spirit captured this westward view from atop
a low plateau where Spirit spent the closing months of 2007.
61
Classification of 22q11.2DS
62
Computer vision in other scientific fields
63
Computer vision research in biology
http://www.vision.caltech.edu/visipedia/
http://leafsnap.com/ 64
Computer vision in cosmology
http://astrometry.net/
65
Computer vision research in healthcare
66
Computer vision in the real-world
• Most examples are less than 5 years old
• Very active research area. Many new
applications to come.
• A website of computer vision industries
maintained by Prof. David Lowe (UBC):
http://www.cs.ubc.ca/~lowe/vision.html
67
Topics
68
Grading (tentative)
• Three regular assignments (50%)
– Using Qt (cross platform UI in c++) qt.nokia.com
– Use of interactive UIs for exploring and gaining
intuition
1. Filters, edge detection, segmentation
2. Creating panoramas
3. Content-based image retrieval
• One Course Project (30%)
• One Exam (20%)
69
Project 1: Image Filtering
70
Project 2: Panorama Stitching
71
Project 3: Content-Based Image Retrieval
72
Final Course Project: Machine Learning for
Some Kind of Recognition
• b
• d
• a • c 73
Books