YOLOX改进之损失函数修改（上）

2023年6月17日上午6:26 • 人工智能 • 阅读 145

文章内容：如何在YOLOX官网代码中修改– 置信度预测损失

环境：pytorch1.8

损失函数修改内容：

（1）置信度预测损失更换：二元交叉熵损失替换为 FocalLoss或者 VariFocalLoss

（2）定位损失更换：IOU损失替换为GIOU、CIOU、EIOU以及 a-IOU系列

提示：使用之前可以先了解YOLOX及上述损失函数原理

参考链接：

使用方法：直接替换即可

代码修改过程：

1、置信度预测损失更换之 FocalLoss（不需要创建新的py文件）

使用：直接在YOLOX-main/yolox/models/yolo_head.py的YOLOXHead类中创建focal_loss方法

（1）首先找到置信度预测损失计算位置loss_obj，并进行替换（位置在386-405行左右）


        loss_iou = (
            self.iou_loss(bbox_preds.view(-1, 4)[fg_masks], reg_targets)
        ).sum() / num_fg

        loss_obj = (
            self.focal_loss(obj_preds.sigmoid().view(-1, 1), obj_targets)
        ).sum() / num_fg
        loss_cls = (
            self.bcewithlog_loss(
                cls_preds.view(-1, self.num_classes)[fg_masks], cls_targets
            )
        ).sum() / num_fg

（2）创建focal_loss方法，放到def get_l1_target（…）之前即可，代码如下：

def focal_loss(self, pred, gt):
        pos_inds = gt.eq(1).float()
        neg_inds = gt.eq(0).float()
        pos_loss = torch.log(pred+1e-5) * torch.pow(1 - pred, 2) * pos_inds * 0.75
        neg_loss = torch.log(1 - pred+1e-5) * torch.pow(pred, 2) * neg_inds * 0.25
        loss = -(pos_loss + neg_loss)
        return loss

2、置信度预测损失更换之 VariFocalLoss（代码较多，所以额外创建新的py文件）

步骤一：YOLOX-main/yolox/models文件夹下创建varifocalloss.py文件，内容如下：

import torch.nn as nn
import torch.nn.functional as F

def reduce_loss(loss, reduction):
    """Reduce loss as specified.

    Args:
        loss (Tensor): Elementwise loss tensor.

        reduction (str): Options are "none", "mean" and "sum".

    Return:
        Tensor: Reduced loss tensor.

"""
    reduction_enum = F._Reduction.get_enum(reduction)

    if reduction_enum == 0:
        return loss
    elif reduction_enum == 1:
        return loss.mean()
    elif reduction_enum == 2:
        return loss.sum()

def weight_reduce_loss(loss, weight=None, reduction='mean', avg_factor=None):
    """Apply element-wise weight and reduce loss.

    Args:
        loss (Tensor): Element-wise loss.

        weight (Tensor): Element-wise weights.

        reduction (str): Same as built-in losses of PyTorch.

        avg_factor (float): Avarage factor when computing the mean of losses.

    Returns:
        Tensor: Processed loss values.

"""

    if weight is not None:
        loss = loss * weight

    if avg_factor is None:
        loss = reduce_loss(loss, reduction)
    else:

        if reduction == 'mean':
            loss = loss.sum() / avg_factor

        elif reduction != 'none':
            raise ValueError('avg_factor can not be used with reduction="sum"')
    return loss

def varifocal_loss(pred,
                   target,
                   weight=None,
                   alpha=0.75,
                   gamma=2.0,
                   iou_weighted=True,
                   reduction='mean',
                   avg_factor=None):
    """Varifocal Loss _
    Args:
        pred (torch.Tensor): The prediction with shape (N, C), C is the
            number of classes
        target (torch.Tensor): The learning target of the iou-aware
            classification score with shape (N, C), C is the number of classes.

        weight (torch.Tensor, optional): The weight of loss for each
            prediction. Defaults to None.

        alpha (float, optional): A balance factor for the negative part of
            Varifocal Loss, which is different from the alpha of Focal Loss.

            Defaults to 0.75.

        gamma (float, optional): The gamma for calculating the modulating
            factor. Defaults to 2.0.

        iou_weighted (bool, optional): Whether to weight the loss of the
            positive example with the iou target. Defaults to True.

        reduction (str, optional): The method used to reduce the loss into
            a scalar. Defaults to 'mean'. Options are "none", "mean" and
            "sum".

        avg_factor (int, optional): Average factor that is used to average
            the loss. Defaults to None.

"""

    assert pred.size() == target.size()
    pred_sigmoid = pred.sigmoid()
    target = target.type_as(pred)
    if iou_weighted:
        focal_weight = target * (target > 0.0).float() + \
            alpha * (pred_sigmoid - target).abs().pow(gamma) * \
            (target  0.0).float()
    else:
        focal_weight = (target > 0.0).float() + \
            alpha * (pred_sigmoid - target).abs().pow(gamma) * \
            (target  0.0).float()
    loss = F.binary_cross_entropy_with_logits(
        pred, target, reduction='none') * focal_weight
    loss = weight_reduce_loss(loss, weight, reduction, avg_factor)
    return loss

class VarifocalLoss(nn.Module):

    def __init__(self,
                 use_sigmoid=True,
                 alpha=0.75,
                 gamma=2.0,
                 iou_weighted=True,
                 reduction='mean',
                 loss_weight=1.0):
        """Varifocal Loss _
        Args:
            use_sigmoid (bool, optional): Whether the prediction is
                used for sigmoid or softmax. Defaults to True.

            alpha (float, optional): A balance factor for the negative part of
                Varifocal Loss, which is different from the alpha of Focal
                Loss. Defaults to 0.75.

            gamma (float, optional): The gamma for calculating the modulating
                factor. Defaults to 2.0.

            iou_weighted (bool, optional): Whether to weight the loss of the
                positive examples with the iou target. Defaults to True.

            reduction (str, optional): The method used to reduce the loss into
                a scalar. Defaults to 'mean'. Options are "none", "mean" and
                "sum".

            loss_weight (float, optional): Weight of loss. Defaults to 1.0.

"""
        super(VarifocalLoss, self).__init__()
        assert use_sigmoid is True, \
            'Only sigmoid varifocal loss supported now.'
        assert alpha >= 0.0
        self.use_sigmoid = use_sigmoid
        self.alpha = alpha
        self.gamma = gamma
        self.iou_weighted = iou_weighted
        self.reduction = reduction
        self.loss_weight = loss_weight

    def forward(self,
                pred,
                target,
                weight=None,
                avg_factor=None,
                reduction_override=None):
        """Forward function.

        Args:
            pred (torch.Tensor): The prediction.

            target (torch.Tensor): The learning target of the prediction.

            weight (torch.Tensor, optional): The weight of loss for each
                prediction. Defaults to None.

            avg_factor (int, optional): Average factor that is used to average
                the loss. Defaults to None.

            reduction_override (str, optional): The reduction method used to
                override the original reduction method of the loss.

                Options are "none", "mean" and "sum".

        Returns:
            torch.Tensor: The calculated loss
"""
        assert reduction_override in (None, 'none', 'mean', 'sum')
        reduction = (
            reduction_override if reduction_override else self.reduction)
        if self.use_sigmoid:
            loss_cls = self.loss_weight * varifocal_loss(
                pred,
                target,
                weight,
                alpha=self.alpha,
                gamma=self.gamma,
                iou_weighted=self.iou_weighted,
                reduction=reduction,
                avg_factor=avg_factor)
        else:
            raise NotImplementedError
        return loss_cls

步骤二：在YOLOX-main/yolox/models/yolo_head.py中调用VarifocalLoss

（1）导入

from .varifocalloss import VarifocalLoss

（2）在init中实例化

self.varifocal = VarifocalLoss(reduction='none')

（3）替换原有的置信度预测损失loss_obj


        loss_iou = (
            self.iou_loss(bbox_preds.view(-1, 4)[fg_masks], reg_targets)
        ).sum() / num_fg

        loss_obj = (self.varifocal(obj_preds.view(-1, 1), obj_targets)
        ).sum() / num_fg
        loss_cls = (
            self.bcewithlog_loss(
                cls_preds.view(-1, self.num_classes)[fg_masks], cls_targets)
        ).sum() / num_fg

效果：根据个人数据集而定。FocalLoss与VariFocalLoss在我的数据集上均能提升，模型越大效果越明显。（但是在yolox-tiny上FocalLoss效果AP50会低于原来）

Original: https://blog.csdn.net/weixin_45679938/article/details/122343945
Author: 你的陈某某
Title: YOLOX改进之损失函数修改（上）

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/629022/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

dataframe 设置空值_2019-12-19(三)对DataFrame空值数据记录进行的各种选择(1)

”’ 上期回顾：学习对DataFrame数据记录进行各项选择前的初步和整体的了解。本次：我们将学习对DataFrame数据记录进行的各种选择！因为2…

人工智能 2023年7月8日
0078
Python matplotlib 实时数据动画

; 文章目录一、实时数据可视化的数据准备 * 01.设置图表主题样式 02 使用样例数据二、使用电影票房数据制作动画一、实时数据可视化的数据准备 import pandas …

人工智能 2023年7月17日
0080
CUDA11.3以及PyTorch-GPU版本安装

目录 1 前言 2 CUDA&cuDNN * 2.1 查看硬件 2.2 安装 2.3 验证 3 PyTorch * 3.1 安装 3.2 验证 4 总结 1 前言本笔记仅…

人工智能 2023年6月16日
0095
机器学习 | 回归问题

机器学习 | 回归问题更多内容，关注wx公众号：数据分析这件小事儿对于监督学习，其基本问题就是使用特征向量x预测响应变量y，如果响应变量y为连续变量，则称为回归问题。用x来预…

人工智能 2023年6月18日
0088
对波士顿数据集的回归分析

1.掌握建立模型的必要步骤，划分数据集、实例化模型、建立模型、评估模型 2.掌握回归算法的建模和评估 1.根据数据集找出哪3个特征对房屋价值影响最大 2.建立回归模型预测波士顿房屋…

人工智能 2023年6月19日
0090
1.2 信息系统开发方法

1.2 信息系统开发方法 1.信息系统常用的开发方法包括结构化方法、面向对象方法、原型化方法、面向服务的方法。结构化开发方法将系统的生命周期划分为系统规划、系统分析、系统设计、…

人工智能 2023年6月26日
0080
快速上手：图聚类入门 Graph Clustering

硕士研究工作基本告一段落了，静候佳音中～其实一直想总结一下图节点聚类的一些工作，算是一个逗号吧。个人总结，若有错误欢迎指正。本文从问题定义入手，再到近几年的工作，最后进行横向对…

人工智能 2023年6月19日
00246
COMSC

原文：Consensus One-step Multi-view Subspace Clustering 创新点：传统的子空间聚类分为两个步骤。首先是学一个亲和矩阵，也就是原文中…

人工智能 2023年6月2日
0080
对sklearn中transform()和fit_transform()的深入理解

在用机器学习解决问题时，往往要先对数据进行预处理。其中，z-score归一化和Min-Max归一化是最常用的两种预处理方式，可以通过sklearn.preprocessing模块导…

人工智能 2023年7月5日
0067
一文搞懂时间序列预测模型（2）：ARIMA模型的理论与实践

本文通过一段时间的长江流量数据集来实战演示ARIMA模型的理论、建模及调参选择过程，其中包括数据准备、随机性、稳定性检验。本文旨在通过实践的操作过程，完成ARIMA模型的分享，相信…

人工智能 2023年7月18日
0073
机器学习代码笔记-2-简单线性回归

Out[ ]: [<matplotlib.lines.line2d at 0x7fdfcee72f50>]</matplotlib.lines.line2d&gt…

人工智能 2023年6月4日
0083
CART 分类决策树

1. Cart树简介 Cart模型是一种决策树模型，它即可以用于分类，也可以用于回归，其学习算法分为下面两步：（1）决策树生成：用训练数据生成决策树，生成树尽可能大（2）决策树…

人工智能 2023年6月30日
0067
正交试验案例分析全步骤

一、案例说明 1.案例背景为了研究磁疗对烫伤治疗的消肿效果，某研究所对白鼠进行试验，选取强度（A）、磁疗时间（B）和振动（C）三个因素，部分数据参考如下： 2.分析目的用正交设…

人工智能 2023年7月16日
0061
pandas 涉及内容的用法

1.1 DataFrame 的构建 DataFrame 是由索引和内容组成的，索引有: 行索引和列索引；创建方式： pd.DataFrame(ndarray数据，index=[‘…

人工智能 2023年7月8日
0069
python数据处理—-分组和聚合（高级）

什么是聚合？在SQL中我们经常使用 GROUP BY 将某个字段,按不同的取值进行分组, 在pandas中也有groupby函数 *分组之后,每组都会有至少1条数据, 将这些数据…

人工智能 2023年7月15日
0083
论文阅读：Oriented RepPoints for Aerial Object Detection (CVPR 2022)

paper:https://arxiv.org/abs/2105.11111code:GitHub – LiWentomng/OrientedRepPoints: Th…

人工智能 2023年7月10日
0094

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

YOLOX改进之损失函数修改（上）

大家都在看