Loss Functions and Backpropagation

Definition and Role of Loss Functions

In deep learning, a loss function measures the error between the output a model predicts and the true value. Three key points:
1. The smaller the loss, the better.
2. It quantifies the gap between the actual output and the target.
3. It provides the basis for updating the network's parameters (backpropagation); see the sketch below.
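
A minimal sketch of how these three points play out in PyTorch (the model, data, and learning rate here are illustrative placeholders, not from the original):

import torch

model = torch.nn.Linear(3, 1)                             # placeholder model
optimizer = torch.optim.SGD(model.parameters(), lr=0.01)  # assumed optimizer
loss_fn = torch.nn.MSELoss()

x = torch.randn(4, 3)         # dummy inputs
y = torch.randn(4, 1)         # dummy targets

optimizer.zero_grad()
loss = loss_fn(model(x), y)   # point 2: the gap between output and target
loss.backward()               # point 3: backpropagation computes gradients
optimizer.step()              # parameters move so the loss shrinks (point 1)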

Common Loss Functions

Common loss functions for regression include: mean squared error (Mean Squared Error, MSE), mean absolute error (Mean Absolute Error Loss, MAE), Huber loss (a combination of MSE and MAE that takes the strengths of both, also known as Smooth Mean Absolute Error Loss), and quantile loss (Quantile Loss). A short Huber example follows below.
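
Of these, Huber loss can be tried directly via PyTorch's nn.HuberLoss; a minimal sketch (the delta value is an arbitrary choice for illustration):

import torch
from torch import nn

pred = torch.tensor([0.0, 0.0])
target = torch.tensor([0.5, 3.0])   # one small error, one outlier-sized error
huber = nn.HuberLoss(delta=1.0)     # quadratic for |error| <= delta, linear beyond
print(huber(pred, target))          # (0.5*0.5**2 + (3.0 - 0.5)) / 2 = 1.3125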

Common loss functions for classification include: cross-entropy loss (Cross Entropy Loss), hinge loss (Hinge Loss), the 0/1 loss, exponential loss, and log loss / log-likelihood loss (Log-likelihood Loss). A hinge-loss sketch follows below; cross-entropy gets its own section later.
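
For example, a multi-class hinge loss is available in PyTorch as nn.MultiMarginLoss; a minimal sketch (the scores and margin are illustrative):

import torch
from torch import nn

scores = torch.tensor([[0.2, 0.5, 0.1]])   # class scores for one sample
label = torch.tensor([1])                  # the true class
hinge = nn.MultiMarginLoss(margin=1.0)
# averages max(0, margin - scores[label] + scores[i]) over classes i != label
print(hinge(scores, label))                # (0.7 + 0.6) / 3 ≈ 0.4333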

Mean Absolute Error Loss (L1Loss)

PyTorch's L1Loss takes the absolute difference element by element, then reduces with either 'mean' or 'sum'.
For example, with x = [1, 2, 3] and y = [1, 2, 5]:
With reduction='mean', the result is (|1-1| + |2-2| + |5-3|) / 3 = 2/3
With reduction='sum', the result is |1-1| + |2-2| + |5-3| = 2
Code example:
import torch
from torch.nn import L1Loss

# Loss inputs must be floating point
input = torch.tensor([1, 2, 3], dtype=torch.float32)
target = torch.tensor([1, 2, 5], dtype=torch.float32)
# Reshape to (N, C, H, W) to mimic a batched image tensor
input = torch.reshape(input, (1, 1, 1, 3))
target = torch.reshape(target, (1, 1, 1, 3))

loss = L1Loss()                  # default reduction='mean'
output = loss(input, target)
print(output)                    # tensor(0.6667)

loss_sum = L1Loss(reduction='sum')
print(loss_sum(input, target))   # tensor(2.)

Mean Squared Error Loss (MSELoss)

Much like mean absolute error loss, except that each difference is squared before the reduction. For the same x = [1, 2, 3] and y = [1, 2, 5] with reduction='mean': ((1-1)² + (2-2)² + (5-3)²) / 3 = 4/3.
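
The same example with nn.MSELoss, a minimal sketch mirroring the L1Loss code above:

import torch
from torch.nn import MSELoss

input = torch.tensor([1, 2, 3], dtype=torch.float32)
target = torch.tensor([1, 2, 5], dtype=torch.float32)
loss = MSELoss()             # default reduction='mean'
print(loss(input, target))   # tensor(1.3333), i.e. 4/3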

Differences between MAE and MSE:

MSE converges faster than MAE: under gradient descent, the gradient of the MSE loss scales with the error (for L = (1/N) Σ (ŷᵢ - yᵢ)², the gradient with respect to ŷᵢ is (2/N)(ŷᵢ - yᵢ)), while the gradient of the MAE loss always has magnitude 1 (the sign of the error). So the MSE gradient changes with the size of the error, whereas the MAE gradient stays at magnitude 1 even close to the optimum, which is unfavorable for training. The autograd sketch after this list demonstrates the difference.
MAE is more robust to outliers: looking at the loss itself, MSE squares the error, so outliers produce disproportionately large terms; looking at the underlying assumptions, MSE corresponds to assuming Gaussian-distributed errors while MAE corresponds to Laplace-distributed errors, and the Laplace distribution is itself more robust to outliers.
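
A small autograd sketch confirming the gradient behavior described above (the values are chosen for illustration):

import torch
import torch.nn.functional as F

pred = torch.tensor([0.5, 2.0, 10.0], requires_grad=True)
target = torch.zeros(3)

F.mse_loss(pred, target, reduction='sum').backward()
print(pred.grad)    # 2*(pred - target) -> tensor([ 1.,  4., 20.]): scales with the error

pred.grad = None    # reset before the second backward pass
F.l1_loss(pred, target, reduction='sum').backward()
print(pred.grad)    # sign(pred - target) -> tensor([1., 1., 1.]): constant magnitude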

Cross-Entropy Loss Function (Cross-entropy loss function)

This one is a bit more complicated, and I haven't fully understood it yet.
While reading through reference articles, I found one that explains the cross-entropy loss function very well.
There are plenty of other loss functions besides these; while learning the basics I'll stick with the ones above for now. A small numerical check of cross-entropy follows.
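
Even without the full derivation, PyTorch's CrossEntropyLoss can be checked numerically: for raw scores (logits) x and true class c it computes -x[c] + log(Σⱼ exp(x[j])), i.e. LogSoftmax followed by negative log-likelihood. A minimal sketch:

import torch
from torch.nn import CrossEntropyLoss

x = torch.tensor([[0.1, 0.2, 0.3]])   # logits for one sample, 3 classes
y = torch.tensor([1])                 # true class index
loss = CrossEntropyLoss()
print(loss(x, y))                               # tensor(1.1019)
print(-x[0, 1] + torch.logsumexp(x[0], dim=0))  # same value, computed by hand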

Using a Loss Function to Compute the Error on a Neural Network

import torch
import torchvision
from torch import nn
from torch.nn import Conv2d, MaxPool2d, Flatten, Linear, CrossEntropyLoss
from torch.utils.data import DataLoader

# CIFAR-10 test split; ToTensor turns PIL images into (C, H, W) float tensors
dataset = torchvision.datasets.CIFAR10("dataset2", train=False,
                                       transform=torchvision.transforms.ToTensor(),
                                       download=True)
dataloader = DataLoader(dataset, batch_size=1)

class Test(nn.Module):
    def __init__(self):
        super(Test, self).__init__()
        # Three conv + pool stages shrink the 32x32 input to 4x4
        self.conv1 = Conv2d(3, 32, 5, padding=2)
        self.maxpool1 = MaxPool2d(kernel_size=2)
        self.conv2 = Conv2d(32, 32, 5, padding=2)
        self.maxpool2 = MaxPool2d(kernel_size=2)
        self.conv3 = Conv2d(32, 64, 5, padding=2)
        self.maxpool3 = MaxPool2d(kernel_size=2)
        self.flatten = Flatten()         # 64 channels * 4 * 4 = 1024 features
        self.linear1 = Linear(1024, 64)
        self.linear2 = Linear(64, 10)    # one score per CIFAR-10 class

    def forward(self, x):
        x = self.conv1(x)
        x = self.maxpool1(x)
        x = self.conv2(x)
        x = self.maxpool2(x)
        x = self.conv3(x)
        x = self.maxpool3(x)
        x = self.flatten(x)
        x = self.linear1(x)
        x = self.linear2(x)              # raw logits; CrossEntropyLoss applies softmax itself
        return x

loss = CrossEntropyLoss()
test = Test()
for data in dataloader:
    imgs, targets = data
    outputs = test(imgs)
    result_loss = loss(outputs, targets)
    result_loss.backward()   # computes gradients; no parameter update happens yet
    print(result_loss)

Output (one loss value per batch; the numbers hover around -ln(1/10) ≈ 2.303, which is exactly what an untrained 10-class classifier producing near-uniform probabilities should give):

tensor(2.2068, grad_fn=<NllLossBackward0>)
tensor(2.2500, grad_fn=<NllLossBackward0>)
tensor(2.2562, grad_fn=<NllLossBackward0>)
tensor(2.4017, grad_fn=<NllLossBackward0>)
tensor(2.4001, grad_fn=<NllLossBackward0>)
tensor(2.3810, grad_fn=<NllLossBackward0>)
tensor(2.3225, grad_fn=<NllLossBackward0>)
tensor(2.3851, grad_fn=<NllLossBackward0>)
tensor(2.2274, grad_fn=<NllLossBackward0>)
tensor(2.3469, grad_fn=<NllLossBackward0>)
tensor(2.4058, grad_fn=<NllLossBackward0>)
tensor(2.1119, grad_fn=<NllLossBackward0>)
tensor(2.3562, grad_fn=<NllLossBackward0>)
tensor(2.2470, grad_fn=<NllLossBackward0>)
tensor(2.1172, grad_fn=<NllLossBackward0>)
tensor(2.2596, grad_fn=<NllLossBackward0>)
tensor(2.3484, grad_fn=<NllLossBackward0>)
tensor(2.2497, grad_fn=<NllLossBackward0>)
tensor(2.2346, grad_fn=<NllLossBackward0>)
tensor(2.3989, grad_fn=<NllLossBackward0>)
tensor(2.2627, grad_fn=<NllLossBackward0>)
tensor(2.4221, grad_fn=<NllLossBackward0>)
tensor(2.3888, grad_fn=<NllLossBackward0>)
tensor(2.1550, grad_fn=<NllLossBackward0>)
tensor(2.3529, grad_fn=<NllLossBackward0>)
tensor(2.3224, grad_fn=<NllLossBackward0>)
tensor(2.3853, grad_fn=<NllLossBackward0>)
tensor(2.3900, grad_fn=<NllLossBackward0>)
tensor(2.1328, grad_fn=<NllLossBackward0>)
tensor(2.3807, grad_fn=<NllLossBackward0>)
tensor(2.3733, grad_fn=<NllLossBackward0>)
tensor(2.3658, grad_fn=<NllLossBackward0>)
tensor(2.3882, grad_fn=<NllLossBackward0>)
......
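
The loop above only computes gradients; nothing updates the weights, which is why the loss never improves. Continuing the same script, a sketch of an actual update step (the optimizer and learning rate are my assumptions, not part of the original):

optim = torch.optim.SGD(test.parameters(), lr=0.01)  # assumed optimizer
for data in dataloader:
    imgs, targets = data
    outputs = test(imgs)
    result_loss = loss(outputs, targets)
    optim.zero_grad()        # clear gradients left over from the previous batch
    result_loss.backward()   # backpropagate through the network
    optim.step()             # apply the gradients: this is the actual learning
    print(result_loss)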

Original: https://blog.csdn.net/qq_52237775/article/details/122788481
Author: Star_.
Title: 损失函数与反向传播 (Loss Functions and Backpropagation)

