Question: In AI algorithms, what is Batch Normalization, and what does it do?
Overview:
Batch Normalization is a technique for neural networks designed to mitigate internal covariate shift in deep-network training and to accelerate convergence. Internal covariate shift refers to the drift in the distribution of each layer's inputs as training progresses. By normalizing the input distribution, Batch Normalization helps the network converge faster and improves its generalization.
Algorithm:
Batch Normalization normalizes the inputs of each mini-batch. It introduces two learnable parameters, a scale factor and a shift factor, which rescale and shift the normalized values so the network retains its expressive power.
The procedure is as follows:
1. For a mini-batch of inputs $x^{(1)}, \dots, x^{(m)}$, compute the mean $\mu$ and variance $\sigma^2$:
- $\mu \leftarrow \frac{1}{m} \sum_{i=1}^{m} x^{(i)}$
- $\sigma^2 \leftarrow \frac{1}{m} \sum_{i=1}^{m} (x^{(i)} - \mu)^2$
2. Normalize the inputs:
- $\hat{x}^{(i)} \leftarrow \frac{x^{(i)} - \mu}{\sqrt{\sigma^2 + \epsilon}}$
where $\epsilon$ is a small positive constant added for numerical stability.
3. Scale and shift the normalized values:
- $y^{(i)} \leftarrow \gamma \hat{x}^{(i)} + \beta$
where $\gamma$ and $\beta$ are learnable parameters.
4. Output the values $y^{(1)}, \dots, y^{(m)}$ as the input to the next layer.
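The four steps above can be sketched directly in NumPy. This is a minimal sketch assuming a hypothetical 3×2 toy batch, with $\gamma = 1$ and $\beta = 0$ (their typical initial values):

```python
import numpy as np

X = np.array([[1.0, 2.0],
              [3.0, 4.0],
              [5.0, 12.0]])            # toy mini-batch: m=3 samples, 2 features
eps = 1e-5                             # small constant for numerical stability

mu = X.mean(axis=0)                    # step 1: per-feature mean
var = X.var(axis=0)                    # step 1: per-feature variance
x_hat = (X - mu) / np.sqrt(var + eps)  # step 2: normalize
gamma = np.ones(2)                     # step 3: learnable scale (initial value 1)
beta = np.zeros(2)                     # step 3: learnable shift (initial value 0)
y = gamma * x_hat + beta               # step 4: output to the next layer

print(y.mean(axis=0))                  # ≈ [0, 0]: zero mean per feature
print(y.std(axis=0))                   # ≈ [1, 1]: unit variance per feature
```

With these initial values, the output simply has zero mean and unit variance per feature; during training the network learns $\gamma$ and $\beta$ to restore whatever scale and shift serve the task best.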
Formula derivation:
We first write down the normalized value $\hat{x}^{(i)}$ and the scaled-and-shifted value $y^{(i)}$.
From steps 2 and 3 of the procedure above:
$$\hat{x}^{(i)} = \frac{x^{(i)} - \mu}{\sqrt{\sigma^2 + \epsilon}}$$
$$y^{(i)} = \gamma \hat{x}^{(i)} + \beta$$
where $\mu$ is the batch mean, $\sigma^2$ is the batch variance, $\epsilon$ is a small constant for numerical stability, and $\gamma$ and $\beta$ are learnable parameters.
Computation steps:
- Compute the mean and variance of each mini-batch's inputs.
- Normalize the inputs.
- Scale and shift the normalized values.
Python code example:
Below is a sample Python implementation of Batch Normalization, intended for a synthetic dataset:
import numpy as np

class BatchNormalization:
    def __init__(self, epsilon=1e-5):
        self.epsilon = epsilon  # small constant for numerical stability
        self.gamma = None       # learnable scale, initialized on first forward pass
        self.beta = None        # learnable shift, initialized on first forward pass
        self.mean = None
        self.var = None

    def forward(self, X):
        if self.gamma is None:  # lazy initialization: scale 1, shift 0
            self.gamma = np.ones(X.shape[1])
            self.beta = np.zeros(X.shape[1])
        self.X = X              # cache the input for the backward pass
        self.mean = np.mean(X, axis=0)
        self.var = np.var(X, axis=0)
        self.X_normalized = (X - self.mean) / np.sqrt(self.var + self.epsilon)
        out = self.gamma * self.X_normalized + self.beta
        return out

    def backward(self, dout):
        m = self.X.shape[0]     # mini-batch size
        dX_normalized = dout * self.gamma
        dvar = np.sum(dX_normalized * (self.X - self.mean), axis=0) * -0.5 * (self.var + self.epsilon) ** (-1.5)
        dmean = np.sum(dX_normalized * (-1 / np.sqrt(self.var + self.epsilon)), axis=0) + dvar * np.mean(-2 * (self.X - self.mean), axis=0)
        dX = dX_normalized / np.sqrt(self.var + self.epsilon) + dvar * 2 * (self.X - self.mean) / m + dmean / m
        self.dgamma = np.sum(dout * self.X_normalized, axis=0)
        self.dbeta = np.sum(dout, axis=0)
        return dX
Code details:
- Constructor `__init__`: initializes the parameters of the BatchNormalization class, where `epsilon` is a small constant for numerical stability.
- Forward pass `forward`: computes the mean and variance of the mini-batch input, normalizes it, applies the scale and shift, and returns the result.
- Backward pass `backward`: computes the gradients of each parameter according to the forward-pass formulas and returns the gradient of the input. Here `dout` is the gradient arriving from the next layer, `dX` is the gradient with respect to the input, and `self.dgamma` and `self.dbeta` are the gradients of the scale and shift parameters.
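The backward-pass formulas can be sanity-checked against a numerical finite-difference gradient. The sketch below re-implements the forward and backward passes as standalone functions (the names and toy data are hypothetical, chosen just for this check) and compares the analytic gradient of a dummy loss $L = \sum_{i} \text{dout}^{(i)} \, y^{(i)}$ with its numerical estimate:

```python
import numpy as np

def bn_forward(X, gamma, beta, eps=1e-5):
    mu = X.mean(axis=0)
    var = X.var(axis=0)
    x_hat = (X - mu) / np.sqrt(var + eps)
    return gamma * x_hat + beta

def bn_backward(X, gamma, dout, eps=1e-5):
    m = X.shape[0]
    mu = X.mean(axis=0)
    var = X.var(axis=0)
    dx_hat = dout * gamma
    dvar = np.sum(dx_hat * (X - mu), axis=0) * -0.5 * (var + eps) ** (-1.5)
    dmu = np.sum(dx_hat * (-1 / np.sqrt(var + eps)), axis=0) \
          + dvar * np.mean(-2 * (X - mu), axis=0)
    return dx_hat / np.sqrt(var + eps) + dvar * 2 * (X - mu) / m + dmu / m

rng = np.random.default_rng(0)
X = rng.normal(size=(8, 3))        # toy batch: 8 samples, 3 features
gamma = rng.normal(size=3)
beta = rng.normal(size=3)
dout = rng.normal(size=(8, 3))     # upstream gradient dL/dy

dX = bn_backward(X, gamma, dout)   # analytic gradient

# Central-difference numerical gradient of L = sum(dout * y) w.r.t. X
h = 1e-5
dX_num = np.zeros_like(X)
for i in range(X.shape[0]):
    for j in range(X.shape[1]):
        Xp, Xm = X.copy(), X.copy()
        Xp[i, j] += h
        Xm[i, j] -= h
        dX_num[i, j] = (np.sum(dout * bn_forward(Xp, gamma, beta))
                        - np.sum(dout * bn_forward(Xm, gamma, beta))) / (2 * h)

err = np.max(np.abs(dX - dX_num))
print(err)  # should be tiny, far below 1e-4
```

A close match confirms that the chain rule through $\mu$, $\sigma^2$, and $\hat{x}$ has been applied consistently.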
The above covers Batch Normalization's overview, algorithm, formula derivation, computation steps, and a Python code example. As a widely used technique, Batch Normalization effectively mitigates internal covariate shift in deep neural networks and accelerates training.
This original article is protected by copyright. When reposting, please cite the source: https://www.johngo689.com/823038/