How Do Humans Do It? | Introduction

Available In Following Subtitles

English

Variant 1

Posted on: May 15, 2025

Video by: First Principles of Computer Vision

First Principles of Computer Vision is a lecture series presented by Shree Nayar who is faculty in the Computer Science Department, School of Engineering and Applied Sciences, Columbia University. Computer Vision is the enterprise of building machines that “see.” This series focuses on the physical and mathematical underpinnings of vision and has been designed for students, practitioners, and enthusiasts who have no prior knowledge of computer vision.

Instantly generate YouTube summary, transcript and subtitles!

Install Tubelator On Chrome

Video Summary & Chapters

No chapters for this video generated yet.

Video Transcript

0:04

Before we begin to develop tools to help us solve vision problems,

0:08

it's worth taking a look at how our human visual system works.

0:13

So here you see the human eye and the visual cortex.

0:17

Here you see the eye in the front.

0:20

The eye has a lens which projects the three-dimensional world

0:24

onto a two-dimensional image.

0:27

This two-dimensional image is being formed on the retina,

0:30

which is in the back here.

0:32

The retina, by the way, has some cells within it that do some early visual processing.

0:38

So there's a little bit of information reduction.

0:40

that's happening on the retina itself.

0:43

And then the reduced image, so to speak, travels through the optic nerve right here and goes

0:50

to the lateral geniculate nucleus, which acts like a relay.

0:54

It's able to figure out what information needs to go to which part of the brain.

0:58

So it sends that information then back to the visual cortex right here.

1:03

And you can see that different parts of the visual cortex have been given different colors.

1:08

And that's to show you the parts that are responsible for analysis of shape, of color,

1:15

of motion, of texture, and so on and so forth.

1:19

So there's a lot we know about the human visual system, and yet it's amazing to me how little

1:25

we know.

1:26

We know, for instance, roughly where motion analysis takes place, but we have no idea

1:33

exactly what the circuit diagram, if you will,

1:37

is of that particular part of the brain.

1:40

We don't know how the neurons are connected to each other

1:43

and what their weights are.

1:45

So we don't have a detailed architecture

1:48

or a circuit, if you will, that can be mapped to silicon

1:51

so we can emulate the human visual system.

1:54

So in short, vision is easy for us,

1:58

but we're very far from understanding

2:01

how we actually do it.

2:03

So what do we do?

2:05

Well, we reinvent.

2:08

This might sound unfortunate to you, but not quite. As you can imagine, there are many applications of vision that require

2:16

functionality and precision that go well beyond what the human visual system is capable of.

2:23

While human vision is remarkable in its versatility and is able to cope with many complex real-world situations,

2:30

it is more of a qualitative system than a quantitative one.

2:35

For instance, if you want to know how many millimeters this pencil is,

2:41

in terms of its length, the human visual system can only give you very rough estimates.

2:47

Such estimates are not useful in many domains, such as factory automation or medical imaging.

2:54

While no computer vision system has yet been developed that is as versatile as the human one,

2:59

there are many computer vision systems in use today

3:03

that demonstrate much higher precision and reliability than ours.

3:09

In short, for many tasks that require vision,

3:12

the human visual system may indeed be the wrong system to emulate.

3:17

Furthermore, human vision is more fallible than we like to believe.

3:22

You see, when you and I perceive something incorrectly,

3:25

we do not have a voice in our head telling us we are wrong.

3:28

We see what we see and we believe it to be accurate.

3:33

To demonstrate this, let's take a look at some well-known optical illusions.

Download extension to view full transcript.

Install Tubelator On Chrome

YouTube First AI Assistant

Install On Chrome

AI Art For This Video No image generated for this video yet but here is the example.

ai art

0:09

Prompt

spider man in aladdin style, bright colors, hyper quality, high detail, high resolution, --video --s 750 --v 6. 0 --ar 1:2

ai images

Explore more in Education

Repeat-After-Me Story + SHADOWING English Speaking Practice

by English Coach Chad

تحرير النفس من الكسل 2 - د. محمد خير الشعال

by د.محمد خير الشعال

Dự đoán giá Bitcoin trong năm 2025 - Cơ hội x mấy đây?

Hướng dẫn 2 Airdrop làm là có thưởng (miễn phí)

Jennifer Senior: For parents, happiness is a very high bar