Neural Networks from Scratch - Part 5: Hidden Layer Activation Functions

Available In Following Subtitles

English

Variant 1

Chinese

Variant 1 Variant 2

Posted on: Aug 6, 2024

Video by: sentdex

Learn about hidden layer activation functions like rectified linear, step, and sigmoid functions in this video on neural networks. Explore the purpose and usage of these activation functions to deepen your understanding of neural networks. Dive into the topic with nnfs.io resources and enhance your knowledge.

Instantly generate YouTube summary, transcript and subtitles!

Install Tubelator On Chrome

Video Summary & Chapters

No chapters for this video generated yet.

Video Transcript

0:05

What's going on everybody welcome to part 5 of the neural networks from scratch series.

0:10

In this video we're going to be getting into our activation functions.

0:13

Specifically in this video we're going to talk about the rectified linear as well as the step function and sigmoid functions.

0:22

And then we'll save the softmax probably for the next video.

0:25

As always this video is based on the book which you can get at nnfs.io

0:31

Oh, and with that, let's jump into it.

0:34

And first, let's just cover a overview

0:36

of what are these activation functions?

0:39

What are they doing?

0:40

Why are we using them?

0:41

And so on.

0:42

And first, we're gonna start with the step function,

0:45

otherwise known as the unit step function,

0:47

or possibly the Hevaside after Oliver Hevaside.

0:50

So let's get into that.

0:52

So for an activation function,

0:54

you can use a variety of different functions.

0:56

In this case, we're just gonna look

0:57

at a step function to start.

0:58

so a step function is very...

1:00

simple. The idea is if your input to this function is greater than zero, then the output

1:05

will be a one. Otherwise, the output is going to be a zero. So that's all there is to this

1:12

step function. And now let's consider using this step function as an activation function.

1:17

So now taking the step function and using it as an activation function, the way that

1:22

this works in your entire neural network is each neuron in the hidden layers and the output

1:27

layers will have this activation function and this comes into play after

1:32

you do the inputs times the weights plus the bias. So that value of inputs times

1:37

the weights plus the bias is what is being fed through your activation

1:41

function. So as you can see here as we tweak the weights we can see how that

1:46

impacts that output and the same thing is true with the bias as we tweak a

1:50

bias we can see what kind of changes might be made. The big takeaway here is

1:55

the output is quite literally a 0 or a 1, so given some pretty complex inputs and...

2:00

tweaking of weights and biases we can see that for sure the output of this

2:05

activation function is and will always be either a 0 or a 1, nothing more than

2:10

that. And this 1 here or a 0 or 1, the output from this activation function

2:16

that becomes the input to the next neurons. So on the macro level again

2:20

every neuron in your hidden layers as well as your output layer is going to

2:24

have an activation function associated. Now what's most common and what you'll

2:29

see and soon learn why. Generally the output layer is going to have a different activation

2:34

function than what you're using in your hidden layers, but for now we're just kind of showing

2:37

an example of a variety of step functions being used as activation functions in this

2:42

network. So with that in mind and in mind of the zero and one only outputs, that brings

2:48

us to the next activation function which is the sigmoid activation function. Now while

2:55

you can have a neural network with a activation function that's a step function.

2:58

It quickly became obvious.

3:00

that using something like a sigmoid activation function was a little more easy to train a

3:05

neural network or a little more reliable for training a neural network due to the granularity

3:10

of the output.

3:11

So again the sigmoid activation function comes in after you do the inputs times the weights

3:16

plus the bias.

Video Summary & Chapters

0:05

1. 神经网络激活功能概述

介绍神经网络激活功能及其作用

0:42

2. 步骤功能激活

讨论步骤功能作为激活功能的简单性和输出特性

2:09

3. 步骤功能输出特性

讨论步骤功能输出为0或1，作为下一神经元输入

2:51

4. Sigmoid激活功能介绍

探讨Sigmoid激活功能的优势和细微性

3:35

5. 神经网络损失计算

下一步是计算神经网络的错误和损失

4:08

6. 输出优化器问题

描述输出1不被优化器了解的问题

4:24

7. 修正线性单元激活函数

介绍修正线性单元激活函数的作用

5:15

8. Sigmoid激活函数问题

讨论Sigmoid激活函数的消失梯度问题

6:03

9. 激活函数用途

探讨为何在神经网络中使用激活函数的目的

7:09

10. 线性激活函数限制

讨论线性激活函数对非线性数据拟合的限制

8:58

11. 修正线性激活功能优势

解释修正线性激活函数的快速和强大之处

9:04

12. 工作原理

探讨修正线性激活函数的工作原理

9:11

13. 加强输入

了解如何增加体重来修正输入

9:37

14. 神经元激活

探讨神经元激活的影响

11:24

15. 隐藏层模型

分析具有两个隐藏层的神经元模型

13:16

16. 重量影响输出

探讨重量如何影响输出曲线

13:45

17. 改变激活点

调整神经元的激活点来改变输出轨道

14:35

18. 神经元激活与禁用

调整神经元激活和禁用区域

15:12

19. 神经元设置与曲线调整

使用神经元对应影像形状进行设置和调整

15:40

20. 神经元权重调整

补偿和正确排列神经元权重

16:35

21. 神经元连接与重量调整

调整连接和神经元重量

17:03

22. 神经元激活顶点设置

设置神经元激活顶点位置

17:48

23. 神经元激活功能适应

线性与非线性激活功能适应

18:15

24. 模拟输入范围

模拟输入范围激活发生地点

18:23

25. 神经元的功能和输出

整体神经网络功能和输出的探讨

18:37

26. 神经元的激活作用

探讨神经元激活对整体神经网络功能的影响

19:13

27. 深度学习挑战

解释神经网络行为和挑战，以及对人工智能决策的引导

19:55

28. 神经网络的构建

探讨从底层构建神经网络的重要性和目的

20:37

29. 隐藏层神经元调整

讨论隐藏层神经元之间的重量调整和优化

21:01

30. 神经元数量与改进

神经元数量与性能改进的关系探讨

21:28

31. 神经元的协作

神经元之间的协作和影响范围的讨论

21:58

32. 非线性隐藏层

探讨为解决非线性问题而使用多个隐藏层的必要性

22:09

33. 激活功能

学习修正线性激活函数的实现

23:59

34. 修正线性对象

编写修正线性单元对象的前向方法

25:06

35. 活性激活功能

实现修正线性单元激活函数

25:56

36. 层组织

讨论如何组织不同类型的层

26:14

37. NNFS包装

介绍使用NNFS包装的方法

26:44

38. 创建代码文件

使用NNFS工具创建代码文件

26:57

39. 获取样本代码

介绍如何获取样本代码和使用GitHub页面

27:14

40. NNFS包的用途

探讨使用NNFS包的主要原因和目标

27:41

41. 设置随机种子

讨论设置随机种子和默认数据类型

28:18

42. 数据类型设置

解释设置默认数据类型为NumPy的重要性

28:49

43. 重新设置数据

说明使用NNFS包重新设置数据集的过程

29:05

44. 生成数据集

展示如何使用功能生成数据集

29:49

45. 课程回顾

回顾课程内容和学习神经网络的重要性

31:06

46. 挑战性数据集的染色

数据集挑战，染色使其更加明显

33:01

47. 导入和创建数据集

导入并创建螺旋数据集的功能和标签

33:51

48. 删除神经网络层

删除一层神经网络层的审查和提醒

34:33

49. 独特功能描述

数据集中两个独特功能的描述

34:42

50. 层面一个

定义神经网络中的第一个隐藏层

34:55

51. 激活功能定义

介绍神经网络中的激活功能

35:42

52. 层1前进

将数据传递到第一个隐藏层

37:25

53. 消极值处理

处理神经网络中的消极值问题

37:47

54. 解决网络死亡

解决神经网络中所有值为零的问题

38:21

55. 激活功能重要性

探讨激活功能的作用及重要性

39:11

56. 培训全路

总结神经网络培训的过程

Video Transcript

0:05

正在发生什么,欢迎大家进入第五部分神经网络从切断系列。

0:10

在这个视频中,我们将进入我们的激活功能。

0:13

具体而言,在本视频中,我们将谈论修正的线性以及步骤功能和 Sigmoid功能。

0:22

然后我们将保存 softmax 可能为下一个视频。

0:25

像往常一样,这个视频是基于你可以在 nnfs.io 获得的书籍。

0:31

哦,与此同时,让我们跳进它。

0:34

首先,让我们来讲一个概述。

0:36

这些激活功能是什么?

0:39

他们正在做什么?

0:40

我们为什么使用它们?

0:41

那么就这样。

0:42

首先,我们将从步骤功能开始,

0:45

另一方面,被称为单元步骤功能,

0:47

也许是奥利弗·赫瓦西德之后的哈瓦西德。

0:50

所以,让我们进入这一点。

0:52

因此,对于一个激活功能,

0:53

你可以使用各种不同的功能。

0:56

在这种情况下,我们只会看

0:57

在一个步骤的功能开始。

0:58

因此,步骤功能非常简单。

1:00

想法是,如果您的输入到这个函数大于零,那么输出将是一个

1:06

否则,输出将是零。

1:10

所以这就是这个步骤功能,现在让我们考虑使用这个步骤功能作为激活功能。

1:17

所以现在采取步骤功能并使用它作为一个

1:20

激活功能,它在整个神经网络中运作的方式是

1:24

每一个神经元在隐藏层和输出层

1:28

将有这个激活功能,

1:30

然后,你在输入后进入游戏。

1:33

有时,重量加上偏见。

1:36

因此,输入的值加倍重量加上偏见

1:38

它是通过你的激活功能被喂养的。

1:42

正如你所看到的那样,当我们推翻重量时,

1:45

我们可以看到它如何影响这个产量,

1:48

对偏见也是如此。

1:49

当我们推翻偏见时,我们可以看到什么样的变化。

1:52

可以做。这里的主要 takeaway是输出相当字面为0或1。

1:58

一些相当复杂的输入和重量和偏见,我们可以肯定看到

2:03

这个激活功能的输出是并将永远是0或1。

2:09

和这个 1 这里,或 0 或 1 这个激活功能的输出,

2:16

这将成为下一个神经元的输入。因此,在宏观层面,再次,每一个神经元

2:21

在您的隐藏层以及您的输出层将有启动功能

2:26

现在,什么是最常见的,你会看到什么,很快就了解为什么,一般来说,

2:32

输出层将具有与您正在使用的活动不同的激活功能

2:35

在你隐藏的层面上,但现在我们只是展示各种例子。

2:39

在这个网络中使用步骤功能作为激活功能。

2:45

而且,考虑到只有0和1的输出,这将使我们进入下一个激活功能。

2:51

这是 Sigmoid 激活功能。

2:54

现在,虽然您可以有一个神经网络,具有激活功能,这是一个步骤功能,

2:59

很快就显然,使用像 Sigmoid 激活功能的一些东西是很少的。

3:04

更容易训练神经网络或更可靠的训练神经网络

Video Summary & Chapters

No chapters for this video generated yet.

Video Transcript

0:05

正在发生什么,欢迎大家进入第五部分神经网络从切断系列。

0:10

在这个视频中,我们将进入我们的激活功能。

0:13

具体而言,在本视频中,我们将谈论修正的线性以及步骤功能和 Sigmoid功能。

0:22

然后我们将保存 softmax 可能为下一个视频。

0:25

像往常一样,这个视频是基于你可以在 nnfs.io 获得的书籍。

0:31

哦,与此同时,让我们跳进它。

0:34

首先,让我们来讲一个概述。

0:36

这些激活功能是什么?

0:39

他们正在做什么?

0:40

我们为什么使用它们?

0:41

那么就这样。

0:42

首先,我们将从步骤功能开始,

0:45

另一方面,被称为单元步骤功能,

0:47

也许是奥利弗·赫瓦西德之后的哈瓦西德。

0:50

所以,让我们进入这一点。

0:52

因此,对于一个激活功能,

0:53

你可以使用各种不同的功能。

0:56

在这种情况下,我们只会看

0:57

在一个步骤的功能开始。

0:58

因此,步骤功能非常简单。

1:00

想法是,如果您的输入到这个函数大于零,那么输出将是 1

1:06

否则,输出将是0,所以只有这个步骤功能。

1:13

现在,让我们考虑使用这个步骤功能作为激活功能。

1:17

所以现在采取步骤功能并使用它作为激活功能。

1:21

它在你的整个神经网络中运作的方式是

1:24

每一个神经元在隐藏层和输出层

1:28

将有这个激活功能,

1:30

然后,你在输入后进入游戏。

1:33

有时,重量加上偏见。

1:36

因此,输入的值加倍重量加上偏见

1:38

它是通过你的激活功能被喂养的。

1:42

正如你所看到的那样,当我们推翻重量时,

1:45

我们可以看到它如何影响这个产量。

1:48

对偏见也是如此。

1:49

当我们推翻偏见时,我们可以看到什么样的变化。

1:52

可以做。这里的主要 takeaway是输出相当字面为0或1。

1:58

一些相当复杂的输入和重量和偏见,我们可以肯定看到

2:04

这个激活功能的输出是并将永远是0或1。

2:09

和这个 1 这里,或 0 或 1 这个激活功能的输出,

2:16

这将成为下一个神经元的输入。因此,在宏观层面,再次,每一个神经元

2:21

在您的隐藏层以及您的输出层将有启动功能

2:26

现在,什么是最常见的,你会看到什么,很快就了解为什么,一般来说,

2:32

输出层将具有与您正在使用的活动不同的激活功能

2:35

在你隐藏的层面上,但现在我们只是展示各种例子。

2:39

在这个网络中使用步骤功能作为激活功能。

2:45

而且,考虑到只有0和1的输出,这将使我们进入下一个激活功能。

2:51

这是 Sigmoid 激活功能。

2:54

现在,虽然您可以有一个神经网络,具有激活功能,这是一个步骤功能,

2:59

很快就显然,使用像 Sigmoid 激活功能的一些东西是很少的。

3:04

更容易训练神经网络或更可靠的训练神经网络

Download extension to view full transcript.

Install Tubelator On Chrome

YouTube First AI Assistant

Install On Chrome

AI Art For This Video No image generated for this video yet but here is the example.

ai art

0:09

Prompt

spider man in aladdin style, bright colors, hyper quality, high detail, high resolution, --video --s 750 --v 6. 0 --ar 1:2

ai images

Explore more in Education

Repeat-After-Me Story + SHADOWING English Speaking Practice

by English Coach Chad

تحرير النفس من الكسل 2 - د. محمد خير الشعال

by د.محمد خير الشعال

Dự đoán giá Bitcoin trong năm 2025 - Cơ hội x mấy đây?

Hướng dẫn 2 Airdrop làm là có thưởng (miễn phí)

Jennifer Senior: For parents, happiness is a very high bar