改进深度神经网络 36

35 TensorFlow 基础与自动微分机制 2026/01/06
34 深度学习框架（Deep Learning Frameworks）的价值与选择标准 2026/01/06
33 训练一个使用了 Softmax 的分类器 2026/01/06
32 Softmax 回归 2026/01/06
31 测试时的 Batch Normalization（Batch Norm at Test Time） 2026/01/06
30 Batch Normalization 为何有效？ 2026/01/06
29 将 Batch Norm 拟合进神经网络（Fitting Batch Norm into a Neural Network） 2026/01/06
28 归一化网络的激活函数（Normalizing activations in a network） 2026/01/06
27 超参数调优的两种策略 —— “熊猫方式” vs. “鱼子酱方式” 2026/01/06
26 为超参数选择合适的采样尺度（Scale） 2026/01/06
25 超参数调试（Hyperparameter Tuning） 2026/01/06
24 深度学习中的优化挑战 —— 局部最优 vs 鞍点 vs 平稳段 2026/01/06
23 学习率衰减（Learning Rate Decay） 2026/01/06
22 Adam 优化算法（Adam Optimization Algorithm） 2026/01/02
21 RMSprop（Root Mean Square Propagation）优化算法 2026/01/02
20 动量梯度下降法（Gradient Descent with Momentum） 2026/01/02
19 指数加权平均中的偏差修正（Bias Correction in Exponentially Weighted Averages） 2026/01/02
18 了解指数加权平均 2026/01/02
17 指数加权平均（Exponentially Weighted Averages） 2026/01/02
16 了解小批量梯度下降法 2026/01/02
15 优化算法 —— Mini-batch Gradient Descent（小批量梯度下降） 2026/01/02
约书亚·本吉奥访谈 2026/01/02
14 梯度检查（Gradient Checking）实现要点 2026/01/02
13 梯度检查（Gradient Checking） 2026/01/02
12 梯度的数值近似（Numerical Approximation of Gradients） 2026/01/02
11 深度神经网络的权重初始化（Weight Initialization for Deep Networks） 2026/01/02
10 梯度消失与梯度爆炸问题（Vanishing ／ Exploding Gradients） 2026/01/02
09 输入归一化（Normalizing Inputs） 2026/01/02
08 神经网络中的其他正则化方法 2026/01/02
07 了解 Drop out 2026/01/02
06 Dropout 正则化 2025/12/27
05 为什么正则化能减少过拟合？ 2025/12/27
04 神经网络中的正则化（Regularization in Neural Networks） 2025/12/27
03 机器学习基本配方（Basic Recipe for Machine Learning） 2025/12/27
02 Bias（偏差）与 Variance（方差） 2025/12/27
01 训练集（Train）、开发集（Dev）和测试集（Test） 2025/12/26