• Tidak ada hasil yang ditemukan

CS4495/6495 Introduction to Computer Vision

N/A
N/A
Protected

Academic year: 2024

Membagikan "CS4495/6495 Introduction to Computer Vision"

Copied!
52
0
0

Teks penuh

(1)

1A-L1 Introduction

CS4495/6495

Introduction to Computer Vision

(2)

Outline

What is computer vision?

State of the art

Why is this hard?

Course overview

Software

(3)

Why study Computer Vision?

Images (and movies) have become ubiquitous in both production and consumption.

Therefore applications to manipulate images (movies) are becoming core.

As are systems that extract information from imagery

Surveillance

Building 3D representations

Motion capture assisted

(4)

Why study Computer Vision?

But most of all…

It is a really deep and cool set of problems!

(5)

Every picture tells a story

Goal of computer vision is to write computer

programs that can interpret images

Steve Seitz

(6)

Making sense of a picture

We want to extract meaning out of an image/sequence of images

This is different from image processing, which is mainly concerned with transforming images

Image processing operations such as blurring, thresholding etc. are often used as part of CV algorithms

(7)

Making sense of a picture

Look at this scene carefully…

(8)
(9)

Making sense of a picture

What items could you identify? How did you recognize them?

What about other objects/spaces/time of day etc.?

(10)

Current state of the art

Can computers match (or beat) human vision?

Yes and no (but mostly no!)

Humans are much better at “hard” things

Computers can be better at “easy” things

Though getting really good at labeling using

machine learning techniques. Only a little on that in this course.

Steve Seitz

(11)

Current state of the art

The next slides show some examples of what current vision systems can do

(12)

Optical character recognition (OCR)

Handwritten Digit recognition

Technology to convert scanned docs to text

If you have a scanner, it probably came with OCR software

License plate readers

http://en.wikipedia.org/wiki/Automatic_number_plate_recognition

Steve Seitz

(13)

Face detection and more…

Most digital cameras can detect faces…

(14)

Face detection and more…

Some can detect blinking or smiling…

SONY

“Smile Shutter”

(15)

Face detection and more…

And some can even recognize you!

(16)

Object recognition (in supermarkets)

Evolution Robotics Retail developed LaneHawk™, a

retail loss-prevention solution that helps turn bottom-of-

basket (BOB) losses and in-cart losses into profits in real time.

The company was acquired by Datalogic 5 years later!

(17)

Object recognition (in mobile devices!)

(18)

The Matrix movies, ESC Entertainment, XYZRGB, NRC

Special effects: shape capture

Steve Seitz

(19)

Pirates of the Caribbean Industrial Light and Magic www.ilm.com

Special effects: motion capture

Steve Seitz

(20)

Earth viewers (3D modeling)

Image from Microsoft’s Virtual Earth

(see also: Google Earth) Steve Seitz

(21)

Smart cars

Mobileye

Slide content courtesy of Amnon Shashua

(22)

Smart cars are here!

(23)

Sports

Sportvision first down line Steve Seitz

(24)

Vision-based interaction (and games)

Nintendo Wii has camera-based IR

tracking built in. Steve Seitz

(25)

But the game changer:

(26)

Security and surveillance

(27)

Medical imaging

Image guided surgery Grimson et al., MIT 3D imaging

MRI, CT

Steve Seitz

(28)

Current state of the art

This is just a taste of the state of the art.

Some of these are less than 5 years old, most less than 10

This is a very active research area, and rapidly changing

Many new apps in the next 5 years

(29)

Why is this hard?

(30)

Simple scene right?

Dark square Light square

Edward Adelson

(31)

Really?

Edward Adelson

(32)

Really!

Edward Adelson

(33)

Vision is NOT Image Processing

In the previous example, the two squares have exactly the same measurement of intensity.

So, seeing is not the same as measuring properties in the image.

Rather, “seeing” is building a percept of what is in the world based upon the measurements

made by an imaging sensor.

(34)

Building models from change (1)

Michael Black

(35)

Building models from change (1)

Left Image Michael Black

(36)

Building models from change (1)

Right Image Michael Black

(37)

Building models from change (2)

Dan Kersten

http://vision.psych.umn.edu/users/kersten/kersten-lab/shadows.html

(38)

Building models from change (3)

Dan Kersten

http://vision.psych.umn.edu/users/kersten/kersten-lab/shadows.html

(39)

Interpreting images

The previous example is one where the human system is again “wrong” – nothing is moving

upwards. But feels like the best interpretation.

Our goal is to develop your understanding of some of what it takes to go from image to

interpretation.

(40)

Course overview

(41)

A little bit of pedagogy…

(42)

A little bit of pedagody…

Computational Models (Math!)

(43)

Computational Models (Math!)

A little bit of pedagody…

Algorithm

(44)

Computational Models (Math!)

A little bit of pedagogy…

Real Images Scene Ground truth

Algorithm

(45)

Computational Models (Math!)

Real Images Algorithm

A little bit of pedagogy…

Introduction to Computer

Vision

(46)

Topic outline

1. INTRODUCTION

2. IMAGE PROCESSING FOR COMPUTER VISION 3. CAMERA MODELS AND VIEWS

4. FEATURES AND MATCHING 5. LIGHTNESS AND BRIGHTNESS 6. IMAGE MOTION

7. MOTION AND TRACKING

8. CLASSIFICATION AND RECOGNITION 9. MISCELLANEOUS OPERATIONS

10. HUMAN VISION

(47)

Problem sets

8 problem sets (PS0 to PS7)

(48)

Policies

Blackboard-level conversations OK, esp. on forums

Write your own code

Ask questions on forum first, then contact TA/instructor

(49)

Exam

There will be a final exam.

It’s not hard – it simply designed to require folks to go back over the slides (and text) and remember what we’ve learned.

(50)

Grading

The general rubric is 85% of the final grade is based upon the problem sets.

15% is the final.

(51)

Software

Embedded programming exercises (in Octave)

Matlab/Octave: Primary platform for exercises, problem sets

Python + NumPy + OpenCV: You can submit

your problem set solutions in Python, but there will be very limited support

(52)

Learning goals

What do you expect to learn from this course?

Note down somewhere and track your progress.

In the end, you may not have learnt everything you expected.

At the same time, you may have learnt some things you did not know about at all

Referensi

Dokumen terkait

vertical stripes, blue, green and black from left to right. In the

To see how this is a problem, first consider using the previous code on this picture: from SimpleCV import Color, Image. img

Original image Binarized images with different threshold levels 176 Otsu 183 70% 191 80% Number of pixels Pixel intensity 0:black, 255:white Cropped image Binarized images with

To allow fair comparison, we make other methods work on the same set of orig- inal images, in which six image streams are shown from the left to the right: a is the original video

Then the left matrix is called the matrix of cofactors from A, and the matrix on the right that is the transpose of the left is called the adjointof A and denoted by adjA.. Then the

Different kinds of noise • We said that Gaussian averaging was a reasonable thing to do if the noise was independent at each pixel and centered about zero such as if created by a

Screen Viewer right eye S left eye To synthesize the left-view and the right-view images using the estimated depth map, we compute the parallax value for each pixel in an image from

The visible point is in shadow because the black triangle occludes the light source.. More complex lighting interactions can then be