Original PPT download: pan.baidu.com/s/1nv54p9R, password: 3mty

 

Key concepts that need hands-on practice and deep understanding:

Deep Learning:

  • Softmax Function (output-layer normalization; an activation similar to sigmoid, used for classification with more than two classes, while sigmoid handles binary classification)

          1) (figure from the original post, not reproduced here)

          2) Softmax output of each neuron: y_i = exp(z_i) / Σ_j exp(z_j), where the z_i are the inputs to the softmax layer (formula figures in the original post; a numeric sketch follows below)
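A minimal numeric sketch of the per-neuron softmax output described above, assuming NumPy is available; the function name and the example logits are illustrative, not from the original slides.

```python
import numpy as np

def softmax(z):
    """Softmax over the output layer: y_i = exp(z_i) / sum_j exp(z_j)."""
    z = z - np.max(z)        # subtract the max for numerical stability
    e = np.exp(z)
    return e / e.sum()

# Three-class example: the largest logit gets the largest probability,
# and the outputs sum to 1, so they can be read as class probabilities.
print(softmax(np.array([3.0, 1.0, -3.0])))  # ~[0.88, 0.12, 0.00]
```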

DNN(Deep Neural Networks):

  • MSE (Mean Squared Error) / CE (Cross Entropy)

          - Used to minimize the total cost over the softmax layer's outputs; CE works better than MSE here (a comparison sketch follows this list)

          - MSE to minimize: C = Σ_i (y_i - ŷ_i)² (formula figure in the original post)

          - CE to minimize: C = -Σ_i ŷ_i ln(y_i) (formula figure in the original post)
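A small sketch comparing the two costs on a softmax output y against a one-hot target ŷ; pure NumPy, and the example vectors are made up for illustration.

```python
import numpy as np

def mse(y, y_hat):
    # Squared error: C = sum_i (y_i - y_hat_i)^2
    return np.sum((y - y_hat) ** 2)

def cross_entropy(y, y_hat):
    # Cross entropy: C = -sum_i y_hat_i * ln(y_i)
    return -np.sum(y_hat * np.log(y + 1e-12))

y_hat  = np.array([1.0, 0.0, 0.0])     # one-hot target
y_bad  = np.array([0.1, 0.6, 0.3])     # poor softmax output
y_good = np.array([0.9, 0.05, 0.05])   # good softmax output

# CE penalizes a confidently wrong prediction much more sharply than MSE,
# which gives larger gradients far from the target and hence faster training.
print(mse(y_bad, y_hat),  cross_entropy(y_bad, y_hat))
print(mse(y_good, y_hat), cross_entropy(y_good, y_hat))
```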

  • Mini-batch & batch_size (decides how many examples are in one mini-batch) & epoch

          - batch: during training, the full dataset is split into several equally sized batches, and each training step feeds one batch into the network instead of the whole dataset

          - epoch: one epoch is a single pass of forward and backward propagation over all of the batches, i.e. over the entire training set once

          - mini-batch gradient descent performs better in practice than plain full-batch gradient descent (a loop sketch follows below)
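A schematic mini-batch training loop showing how batch_size and epoch interact; the `update` callback is just a stand-in for one gradient step, not a real implementation.

```python
import numpy as np

def train(X, y, batch_size=32, epochs=10, update=lambda xb, yb: None):
    """Schematic mini-batch loop: `update` stands in for one gradient step."""
    n = len(X)
    for epoch in range(epochs):                  # one epoch = one pass over all batches
        idx = np.random.permutation(n)           # reshuffle the data each epoch
        for start in range(0, n, batch_size):
            batch = idx[start:start + batch_size]
            update(X[batch], y[batch])           # one parameter update per mini-batch

X, y = np.random.randn(100, 8), np.random.randint(0, 2, 100)
train(X, y, batch_size=32, epochs=2)             # 4 mini-batches per epoch (last one smaller)
```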

  • Vanishing Gradient Problem
  • ReLU (Rectified Linear Unit)

          - As an activation function, it is typically used when the number of layers is quite large.

          - For every input greater than 0 its derivative is the constant 1; this constant gradient helps the network train faster, so ReLU is commonly used in deep stacks of hidden layers

          - Special case of MaxOut: a MaxOut unit with one of its linear pieces fixed at zero reproduces ReLU (figure in the original post; a sketch follows below)
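A sketch of ReLU and of how a two-piece MaxOut unit reduces to ReLU when one of its linear pieces is fixed at zero; NumPy, with made-up weights for illustration.

```python
import numpy as np

def relu(z):
    # Constant derivative of 1 for z > 0, 0 otherwise: the active region keeps
    # a usable error signal, which mitigates the vanishing gradient problem.
    return np.maximum(0.0, z)

def maxout(x, W, b):
    # A MaxOut unit takes the max over k linear pieces: max_k (W_k x + b_k).
    return np.max(W @ x + b)

# With one piece fixed at w = 0, b = 0, MaxOut reproduces ReLU(w.x + b):
x = np.array([2.0])
w, b0 = np.array([1.5]), -1.0
W = np.array([[1.5], [0.0]])
b = np.array([b0, 0.0])
print(relu(w @ x + b0), maxout(x, W, b))  # both print 2.0
```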

  • Learnable activation function
  • Adaptive learning rate (the learning rate is the step size of each parameter update toward the minimum of the cost function; loosely, how fast we descend to that minimum)

          - Use a large learning rate first, then switch to a smaller one as training proceeds (a decay sketch follows below)
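One common way to realize "large rate first, small rate later" is a 1/√t decay of the learning rate, as used in Adagrad-style schedules; a minimal sketch, with an arbitrary initial rate:

```python
def decayed_lr(initial_lr, t):
    # Start large, shrink over time: eta_t = eta / sqrt(t + 1).
    return initial_lr / (t + 1) ** 0.5

for t in range(0, 100, 20):
    print(t, round(decayed_lr(0.1, t), 4))
```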

  • Momentum

          - Use the Adam optimizer (Adaptive Moment Estimation, roughly RMSProp combined with Momentum; a sketch follows below)
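A minimal sketch of a momentum update and of the Adam update that builds on it; the function names and hyperparameter defaults are illustrative, not tied to any particular library.

```python
import numpy as np

def momentum_step(w, grad, v, lr=0.01, beta=0.9):
    # Movement = beta * previous movement - lr * current gradient.
    v = beta * v - lr * grad
    return w + v, v

def adam_step(w, grad, m, v, t, lr=0.001, b1=0.9, b2=0.999, eps=1e-8):
    # Adam keeps a momentum-like average m and a squared-gradient average v;
    # t is the update count, starting at 1.
    m = b1 * m + (1 - b1) * grad
    v = b2 * v + (1 - b2) * grad ** 2
    m_hat = m / (1 - b1 ** t)          # bias correction
    v_hat = v / (1 - b2 ** t)
    return w - lr * m_hat / (np.sqrt(v_hat) + eps), m, v
```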

  • Overfitting Problem

          - Use early stopping: stop training when the error on a validation set stops improving (a sketch follows below)
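A sketch of early stopping driven by a validation set; `model`, `train_step`, and `val_loss` are placeholders for whatever training code is in use.

```python
def train_with_early_stopping(model, train_step, val_loss, max_epochs=100, patience=5):
    """Stop when the validation loss has not improved for `patience` epochs."""
    best, waited = float("inf"), 0
    for epoch in range(max_epochs):
        train_step(model)                 # one epoch over the training set
        loss = val_loss(model)            # evaluate on a held-out validation set
        if loss < best:
            best, waited = loss, 0
        else:
            waited += 1
            if waited >= patience:
                break                     # stop before the model overfits further
    return model
```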

  • Weight Decay (train with a dropout rate of p%; at test time, scale the weights by (1 - p%) before producing the output)
  • Dropout (randomly drop neurons during training)

          - Changes the structure of the network while training; works better than MaxOut (a sketch follows below)
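A sketch of dropout at training time and of the (1 - p) weight scaling at test time described above; NumPy, with the dropout rate p as a free parameter.

```python
import numpy as np

def dropout_train(a, p=0.5):
    # Training: each activation is dropped (set to 0) with probability p,
    # so the effective network structure changes on every forward pass.
    mask = np.random.rand(*a.shape) >= p
    return a * mask

def dropout_test_weights(w, p=0.5):
    # Testing: no neurons are dropped; instead the weights are scaled by (1 - p)
    # so expected activations match what the thinned training networks produced.
    return w * (1 - p)

print(dropout_train(np.ones(4), p=0.5), dropout_test_weights(np.ones(4), p=0.5))
```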

CNN(Convolutional Neural Networks):

  • Image recognition is well suited to CNNs because of three important properties of images:

          1) Patterns are much smaller than the whole image

          2) The same patterns appear in different regions

          3) Subsampling pixels does not change the object

  • filter & channel
  • stride (step size)
  • zero-padding
  • max-pooling
  • flatten
  • fewer parameters because of weight sharing (a CNN sketch follows below)
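A minimal CNN sketch tying the pieces above together (filters/channels, stride, zero-padding, max-pooling, flatten, and a softmax output), assuming tf.keras is installed; the input size and layer sizes are arbitrary choices for illustration.

```python
import tensorflow as tf
from tensorflow.keras import layers

# 28x28 grayscale input (1 channel); each Conv2D layer learns a set of
# shared-weight filters, which is why a CNN needs far fewer parameters
# than a fully connected network on the same image.
model = tf.keras.Sequential([
    tf.keras.Input(shape=(28, 28, 1)),
    layers.Conv2D(filters=16, kernel_size=3, strides=1,
                  padding="same", activation="relu"),   # zero-padding keeps 28x28
    layers.MaxPooling2D(pool_size=2),                   # subsample to 14x14
    layers.Conv2D(filters=32, kernel_size=3, strides=1,
                  padding="same", activation="relu"),
    layers.MaxPooling2D(pool_size=2),                   # subsample to 7x7
    layers.Flatten(),                                    # flatten feature maps to a vector
    layers.Dense(10, activation="softmax"),              # class probabilities
])
model.summary()
```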