PyTorch Custom Activation Functions: Forward and Backward Propagation for Sigmoid

August 31, 2023, 9:59 PM • Python

Contents

– Sigmoid
+ Formula
+ Derivation
+ Advantages
+ Disadvantages
+ Custom Sigmoid
+ Comparison with the Torch implementation
+ Visualization

import matplotlib
import matplotlib.pyplot as plt
import numpy as np
import torch
import torch.nn as nn
import torch.nn.functional as F

%matplotlib inline

plt.rcParams['figure.figsize'] = (7, 3.5)
plt.rcParams['figure.dpi'] = 150
plt.rcParams['axes.unicode_minus'] = False

Sigmoid

Formula

$$
\text{sigmoid}(x) = \sigma(x) = \frac{1}{1+e^{-x}}
$$

Derivation

$$
\begin{aligned}
\sigma'(x) &= \left[(1+e^{-x})^{-1}\right]' \\
&= (-1)(1+e^{-x})^{-2}(-1)e^{-x} \\
&= (1+e^{-x})^{-2}e^{-x} \\
&= \frac{e^{-x}}{(1+e^{-x})^2} \\
&= \frac{1+e^{-x}-1}{(1+e^{-x})^2} \\
&= \frac{1+e^{-x}}{(1+e^{-x})^2} - \frac{1}{(1+e^{-x})^2} \\
&= \frac{1}{1+e^{-x}}\left(1-\frac{1}{1+e^{-x}}\right) \\
&= \sigma(x)\left(1-\sigma(x)\right)
\end{aligned}
$$
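The closed form above can be sanity-checked numerically. The following is a minimal sketch, not part of the original post: the helper names `sigmoid` and `sigmoid_grad` and the step size `h` are illustrative choices. It compares the derived expression with a central finite difference in double precision.

```python
import torch

def sigmoid(x):
    # Direct implementation of sigma(x) = 1 / (1 + exp(-x)).
    return 1 / (1 + torch.exp(-x))

def sigmoid_grad(x):
    # Closed form from the derivation: sigma'(x) = sigma(x) * (1 - sigma(x)).
    s = sigmoid(x)
    return s * (1 - s)

# Double precision keeps the finite-difference round-off small.
x = torch.linspace(-5, 5, 11, dtype=torch.float64)
h = 1e-6  # step size for the central difference (illustrative choice)

numeric = (sigmoid(x + h) - sigmoid(x - h)) / (2 * h)
analytic = sigmoid_grad(x)

max_err = (numeric - analytic).abs().max().item()
print(f"max |numeric - analytic| = {max_err:.2e}")
```

The two results should agree to many decimal places, and the derivative reaches its maximum of 0.25 at x = 0.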

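Sigmoid is used for the outputs of hidden-layer neurons. Its range is (0, 1): it maps any real number into the interval (0, 1), so it can also be used for binary classification. As an activation function, sigmoid has the following advantages and disadvantages: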

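Advantages:

The output range is bounded, so values are less likely to diverge as they propagate through the network.
The output lies in (0, 1), so sigmoid can be used in the output layer, where the output is interpreted as a probability.
It suppresses both tails and is sensitive to small changes around the middle, which helps classification.
It works well when the features differ in complex ways or when the differences between them are not particularly large.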
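As a small illustration of the "output as probability" point, the sketch below applies `torch.sigmoid` to some made-up raw scores (the `logits` values and the 0.5 threshold are assumptions chosen for the example, not taken from the original post).

```python
import torch

# Hypothetical raw scores (logits) from the last layer of a binary classifier.
logits = torch.tensor([-3.0, -0.5, 0.0, 0.5, 3.0])

# Sigmoid maps each logit into (0, 1), read here as P(class = 1).
probs = torch.sigmoid(logits)

# Thresholding at 0.5 is an illustrative choice; it is equivalent to logit > 0.
preds = (probs > 0.5).long()

print(probs)
print(preds)
```

Large negative and large positive scores are squashed toward 0 and 1, while scores near 0 stay close to 0.5, which is the "suppress the tails, stay sensitive in the middle" behaviour described above.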

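Disadvantages:

Vanishing gradients: during backpropagation the gradient with respect to w becomes very small, so w updates very slowly. Initialization therefore needs particular care, to avoid large initial values that push neurons into the saturated region.
The output is not zero-centered, so neurons in later layers receive inputs with a non-zero mean, which affects the gradients. Suppose all inputs to a later neuron are positive (e.g. x > 0 elementwise). Then the local gradients with respect to w all share the same sign, so during backpropagation the components of w are either all updated in the positive direction or all in the negative direction. This couples the updates together and slows convergence. When training with batches, different samples in a batch may produce different signs, so summing their gradients mitigates the problem somewhat.
The exponential is expensive to compute, so sigmoid is computationally inefficient.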
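The first two drawbacks can be observed directly with autograd. This is a minimal sketch under assumed inputs: the sample points in `x`, the single linear neuron `f = w . a`, and the vector size are all illustrative and do not come from the original post.

```python
import torch

torch.manual_seed(0)

# 1) Saturation: the local gradient sigma'(x) peaks at 0.25 (at x = 0)
#    and shrinks rapidly as |x| grows.
x = torch.tensor([0.0, 2.0, 5.0, 10.0], requires_grad=True)
torch.sigmoid(x).sum().backward()
print(x.grad)

# 2) Non-zero-centered inputs: for a single neuron f = w . a with every a > 0,
#    df/dw = a, so all components of w's gradient share the sign of the
#    upstream gradient.
a = torch.sigmoid(torch.randn(6))        # outputs of a previous sigmoid layer, all in (0, 1)
w = torch.randn(6, requires_grad=True)
f = (w * a).sum()
f.backward()                             # the upstream gradient is +1 here
print(w.grad)
```

In the first part the gradient at x = 10 is several orders of magnitude smaller than at x = 0. In the second part every entry of `w.grad` is positive; with a negative upstream gradient they would all be negative instead.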

Custom Sigmoid

class SelfDefinedSigmoid(torch.autograd.Function):
    @staticmethod
    def forward(ctx, inp):
        # sigma(x) = 1 / (1 + exp(-x))
        result = 1 / (1 + torch.exp(-inp))
        # Save the output: backward only needs sigma(x), not the input.
        ctx.save_for_backward(result)
        return result

    @staticmethod
    def backward(ctx, grad_output):
        result, = ctx.saved_tensors
        # Chain rule: upstream gradient times sigma'(x) = sigma(x) * (1 - sigma(x)).
        return grad_output * result * (1 - result)

class Sigmoid(nn.Module):
    # Module wrapper so the custom function can be used like nn.Sigmoid.
    def forward(self, x):
        return SelfDefinedSigmoid.apply(x)
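One way to check a hand-written `backward` is `torch.autograd.gradcheck`, which compares the analytical gradient with finite differences. The sketch below repeats the custom function so that it runs on its own; the input size and seed are arbitrary choices, and `gradcheck` is used with its default tolerances.

```python
import torch

class SelfDefinedSigmoid(torch.autograd.Function):
    @staticmethod
    def forward(ctx, inp):
        result = 1 / (1 + torch.exp(-inp))
        ctx.save_for_backward(result)
        return result

    @staticmethod
    def backward(ctx, grad_output):
        result, = ctx.saved_tensors
        return grad_output * result * (1 - result)

torch.manual_seed(0)

# gradcheck expects double-precision inputs with requires_grad=True.
x = torch.randn(8, dtype=torch.double, requires_grad=True)

# Returns True when analytical and numerical gradients agree within tolerance;
# raises an error otherwise.
ok = torch.autograd.gradcheck(SelfDefinedSigmoid.apply, (x,))
print(ok)
```

Double precision matters here: in single precision the finite-difference estimate is usually too noisy for the default tolerances.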

Comparison with the Torch implementation


torch.manual_seed(0)

sigmoid = Sigmoid()
inp = torch.randn(5, requires_grad=True)
out = sigmoid((inp + 1).pow(2))

print(f'Out is\n{out}')

out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nFirst call\n{inp.grad}")

out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nSecond call\n{inp.grad}")

inp.grad.zero_()
out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nCall after zeroing gradients\n{inp.grad}")

Out is
tensor([0.9984, 0.6223, 0.8005, 0.9213, 0.5018],
       grad_fn=<SelfDefinedSigmoidBackward>)

First call
tensor([ 0.0080,  0.3322, -0.3765,  0.2275, -0.0423])

Second call
tensor([ 0.0159,  0.6643, -0.7530,  0.4549, -0.0845])

Call after zeroing gradients
tensor([ 0.0080,  0.3322, -0.3765,  0.2275, -0.0423])


torch.manual_seed(0)
inp = torch.randn(5, requires_grad=True)
out = torch.sigmoid((inp + 1).pow(2))

print(f'Out is\n{out}')

out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nFirst call\n{inp.grad}")

out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nSecond call\n{inp.grad}")

inp.grad.zero_()
out.backward(torch.ones_like(inp), retain_graph=True)
print(f"\nCall after zeroing gradients\n{inp.grad}")

Out is
tensor([0.9984, 0.6223, 0.8005, 0.9213, 0.5018], grad_fn=<SigmoidBackward>)

First call
tensor([ 0.0080,  0.3322, -0.3765,  0.2275, -0.0423])

Second call
tensor([ 0.0159,  0.6643, -0.7530,  0.4549, -0.0845])

Call after zeroing gradients
tensor([ 0.0080,  0.3322, -0.3765,  0.2275, -0.0423])

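The results above show that the custom implementation produces the same outputs and gradients as torch's built-in sigmoid. In both runs the second call doubles the values because gradients accumulate in inp.grad until they are zeroed.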
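Instead of comparing printed tensors by eye, the same check can be done programmatically with `torch.allclose`. This is a sketch under assumptions: the custom function is repeated so the block runs on its own, and double precision is used (unlike the float32 runs above) so that the default tolerances of `allclose` are comfortably met.

```python
import torch

class SelfDefinedSigmoid(torch.autograd.Function):
    @staticmethod
    def forward(ctx, inp):
        result = 1 / (1 + torch.exp(-inp))
        ctx.save_for_backward(result)
        return result

    @staticmethod
    def backward(ctx, grad_output):
        result, = ctx.saved_tensors
        return grad_output * result * (1 - result)

torch.manual_seed(0)

# Two independent leaf tensors holding the same values, one per implementation.
inp_a = torch.randn(5, dtype=torch.float64, requires_grad=True)
inp_b = inp_a.detach().clone().requires_grad_(True)

out_a = SelfDefinedSigmoid.apply((inp_a + 1).pow(2))
out_b = torch.sigmoid((inp_b + 1).pow(2))

out_a.sum().backward()
out_b.sum().backward()

same_out = torch.allclose(out_a, out_b)
same_grad = torch.allclose(inp_a.grad, inp_b.grad)
print(same_out, same_grad)
```

Using separate leaf tensors avoids the gradient accumulation seen in the "Second call" output, so each `.grad` holds exactly one backward pass.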

Visualization


inp = torch.arange(-8, 8, 0.1, requires_grad=True)
out = sigmoid(inp)
out.sum().backward()

inp_grad = inp.grad

plt.plot(inp.detach().numpy(),
         out.detach().numpy(),
         label=r"$\sigma(x)=\frac{1}{1+e^{-x}} $",
         alpha=0.7)
plt.plot(inp.detach().numpy(),
         inp_grad.numpy(),
         label=r"$\sigma'(x)$",
         alpha=0.5)
plt.grid()
plt.legend()
plt.show()

Original: https://blog.csdn.net/jasneik/article/details/123952718
Author: jasneik
Title: PyTorch Custom Activation Functions: Forward and Backward Propagation for Sigmoid

