【DIoU CIoU】DIoU和CIoU损失函数理解及代码实现

2023年7月12日下午3:16 • 人工智能 • 阅读 65

文章目录

1 引言
2 问题分析
3 作者思考
4 DIoU Loss计算过程
5 CIoU Loss计算过程
6 IoU/GIoU/DIoU/CIoU代码实现可视化
7 感谢链接

1 引言

目标检测任务的损失函数由Classificition Loss和Bounding Box Regeression Loss两部分构成。

Bounding Box Regression Loss Function的演进路线是：
Smooth L1 Loss –> IoU Loss –> GIoU Loss –> DIoU Loss –> CIoU Loss

之前写到了 Smooth L1 Loss 、 IoU Loss 和 GIoU Loss。
本文介绍DIoU Loss 和 CIoU Loss。

2 问题分析

GIoU Loss 存在的问题：

出现下图中的情况时，IoU和GIoU的值都一样，此时GIoU退化为IoU, 无法区分其相对位置关系。
收敛的比较慢
回归的还不够准确

; 3 作者思考

基于IoU和GIoU存在的问题，作者提出了两个问题：

问题一：直接最小化预测框与目标框之间的归一化距离是否可行，以达到更快的收敛速度。
问题二：如何使回归在与目标框有重叠甚至包含时更准确、更快。

好的目标框回归损失应该考虑三个重要的几何因素： 重叠面积，中心点距离，长宽比。

针对问题一，作者提出了DIoU Loss，相对于GIoU Loss收敛速度更快，DIoU Loss考虑了重叠面积(IoU)和中心点距离(d 2 c 2 \frac{d^{2}}{c^{2}}c 2 d 2 )，但没有考虑到长宽比；

针对问题二，作者提出了CIoU Loss，其收敛的精度更高，以上三个因素都考虑到了。

4 DIoU Loss计算过程

Distance-IoU(DIoU) Loss计算过程如下：

图中，b表示预测框中心点坐标，b g t b^{gt}b g t表示GT框中心点坐标。ρ 2 ( b , b g t ) ρ^2(b, b^{gt})ρ2 (b ,b g t )表示两中心点的距离的平方，c 2 c^2 c 2表示两矩形最小外接矩形的对角线长度的平方。

DIoU损失能直接最小化两个box之间的距离，因此收敛速度更快。

L D I o U = 1 − D I o U L_{DIoU}=1-DIoU L D I o U =1 −D I o U
当两个框重合时，L D I o U = 0 L_{DIoU}=0 L D I o U =0；当两个框相距无穷远时，L D I o U = 2 L_{DIoU}=2 L D I o U =2，故0 ≤ L D I o U < 2 0≤L_{DIoU}。

可以将DIoU替换IoU用于NMS算法当中，也即论文提出的DIoU-NMS, 实验结果表明有一定的提升。

DIoU相比于GIoU的优点：
DIoU Loss可以直接优化2个框之间的距离，比GIoU Loss收敛速度更快
对于目标框包裹预测框的情况，DIoU Loss可以收敛的很快，而GIoU Loss此时退化为IoU Loss收敛速度较慢

; 5 CIoU Loss计算过程

Complete-IoU(CIoU) Loss计算过程如下：在DIoU的基础上，考虑长宽比α v αv αv。

其中，α αα是用于做trade-off的参数，v v v是用来衡量长宽比一致性的参数。

CIoU Loss function的定义为

L C I o U = 1 − C I o U L_{CIoU}=1-CIoU L C I o U =1 −C I o U

！注意！： CIoU loss的梯度类似于DIoU loss，但还要考虑v v v的梯度。在长宽在 [0, 1] 的情况下，w 2 + h 2 w^2+h^2 w 2 +h 2的值通常很小，会导致梯度爆炸，因此在1 w 2 + h 2 \frac{1}{w^2+h^2}w 2 +h 2 1 实现时将替换成1。

6 IoU/GIoU/DIoU/CIoU代码实现可视化

import numpy as np
import cv2
import torch
import math

def CountIOU(RecA, RecB):
    xA = max(RecA[0], RecB[0])
    yA = max(RecA[1], RecB[1])
    xB = min(RecA[2], RecB[2])
    yB = min(RecA[3], RecB[3])

    interArea = max(0, xB - xA + 1) * max(0, yB - yA + 1)

    RecA_Area = (RecA[2] - RecA[0] + 1) * (RecA[3] - RecA[1] + 1)
    RecB_Area = (RecB[2] - RecB[0] + 1) * (RecB[3] - RecB[1] + 1)

    iou = interArea / float(RecA_Area + RecB_Area - interArea)

    return iou

def Giou(rec1,rec2):

    x1,y1,x2,y2 = rec1
    x3,y3,x4,y4 = rec2
    iou = CountIOU(rec1,rec2)
    area_C = (max(x1,x2,x3,x4)-min(x1,x2,x3,x4))*(max(y1,y2,y3,y4)-min(y1,y2,y3,y4))
    area_1 = (x2-x1)*(y1-y2)
    area_2 = (x4-x3)*(y3-y4)
    sum_area = area_1 + area_2

    w1 = x2 - x1
    w2 = x4 - x3
    h1 = y1 - y2
    h2 = y3 - y4
    W = min(x1,x2,x3,x4)+w1+w2-max(x1,x2,x3,x4)
    H = min(y1,y2,y3,y4)+h1+h2-max(y1,y2,y3,y4)

    Area = W * H

    add_area = sum_area - Area

    end_area = (area_C - add_area)/area_C
    giou = iou - end_area
    return giou

def Diou(bboxes1, bboxes2):
    rows = bboxes1.shape[0]
    cols = bboxes2.shape[0]
    dious = torch.zeros((rows, cols))
    if rows * cols == 0:
        return dious
    exchange = False
    if bboxes1.shape[0] > bboxes2.shape[0]:
        bboxes1, bboxes2 = bboxes2, bboxes1
        dious = torch.zeros((cols, rows))
        exchange = True

    w1 = bboxes1[:, 2] - bboxes1[:, 0]
    h1 = bboxes1[:, 3] - bboxes1[:, 1]
    w2 = bboxes2[:, 2] - bboxes2[:, 0]
    h2 = bboxes2[:, 3] - bboxes2[:, 1]

    area1 = w1 * h1
    area2 = w2 * h2

    center_x1 = (bboxes1[:, 2] + bboxes1[:, 0]) / 2
    center_y1 = (bboxes1[:, 3] + bboxes1[:, 1]) / 2
    center_x2 = (bboxes2[:, 2] + bboxes2[:, 0]) / 2
    center_y2 = (bboxes2[:, 3] + bboxes2[:, 1]) / 2

    inter_max_xy = torch.min(bboxes1[:, 2:],bboxes2[:, 2:])
    inter_min_xy = torch.max(bboxes1[:, :2],bboxes2[:, :2])
    out_max_xy = torch.max(bboxes1[:, 2:],bboxes2[:, 2:])
    out_min_xy = torch.min(bboxes1[:, :2],bboxes2[:, :2])

    inter = torch.clamp((inter_max_xy - inter_min_xy), min=0)
    inter_area = inter[:, 0] * inter[:, 1]
    inter_diag = (center_x2 - center_x1)**2 + (center_y2 - center_y1)**2
    outer = torch.clamp((out_max_xy - out_min_xy), min=0)
    outer_diag = (outer[:, 0] ** 2) + (outer[:, 1] ** 2)
    union = area1+area2-inter_area
    dious = inter_area / union - (inter_diag) / outer_diag
    dious = torch.clamp(dious,min=-1.0,max = 1.0)
    if exchange:
        dious = dious.T
    return dious

def bbox_overlaps_ciou(bboxes1, bboxes2):
    rows = bboxes1.shape[0]
    cols = bboxes2.shape[0]
    cious = torch.zeros((rows, cols))
    if rows * cols == 0:
        return cious
    exchange = False
    if bboxes1.shape[0] > bboxes2.shape[0]:
        bboxes1, bboxes2 = bboxes2, bboxes1
        cious = torch.zeros((cols, rows))
        exchange = True

    w1 = bboxes1[:, 2] - bboxes1[:, 0]
    h1 = bboxes1[:, 3] - bboxes1[:, 1]
    w2 = bboxes2[:, 2] - bboxes2[:, 0]
    h2 = bboxes2[:, 3] - bboxes2[:, 1]

    area1 = w1 * h1
    area2 = w2 * h2

    center_x1 = (bboxes1[:, 2] + bboxes1[:, 0]) / 2
    center_y1 = (bboxes1[:, 3] + bboxes1[:, 1]) / 2
    center_x2 = (bboxes2[:, 2] + bboxes2[:, 0]) / 2
    center_y2 = (bboxes2[:, 3] + bboxes2[:, 1]) / 2

    inter_max_xy = torch.min(bboxes1[:, 2:],bboxes2[:, 2:])
    inter_min_xy = torch.max(bboxes1[:, :2],bboxes2[:, :2])
    out_max_xy = torch.max(bboxes1[:, 2:],bboxes2[:, 2:])
    out_min_xy = torch.min(bboxes1[:, :2],bboxes2[:, :2])

    inter = torch.clamp((inter_max_xy - inter_min_xy), min=0)
    inter_area = inter[:, 0] * inter[:, 1]
    inter_diag = (center_x2 - center_x1)**2 + (center_y2 - center_y1)**2
    outer = torch.clamp((out_max_xy - out_min_xy), min=0)
    outer_diag = (outer[:, 0] ** 2) + (outer[:, 1] ** 2)
    union = area1+area2-inter_area
    u = (inter_diag) / outer_diag
    iou = inter_area / union
    with torch.no_grad():
        arctan = torch.atan(w2 / h2) - torch.atan(w1 / h1)
        v = (4 / (math.pi ** 2)) * torch.pow((torch.atan(w2 / h2) - torch.atan(w1 / h1)), 2)
        S = 1 - iou
        alpha = v / (S + v)
        w_temp = 2 * w1
    ar = (8 / (math.pi ** 2)) * arctan * ((w1 - w_temp) * h1)
    cious = iou - (u + alpha * ar)
    cious = torch.clamp(cious,min=-1.0,max = 1.0)
    if exchange:
        cious = cious.T
    return cious

img = np.zeros((512,512,3), np.uint8)
img.fill(255)

RecA = [30,30,300,300]
RecB = [60,60,350,340]

cv2.rectangle(img, (RecA[0],RecA[1]), (RecA[2],RecA[3]), (0, 255, 0), 5)
cv2.rectangle(img, (RecB[0],RecB[1]), (RecB[2],RecB[3]), (255, 0, 0), 5)

IoU = CountIOU(RecA,RecB)
GIoU = Giou(RecA,RecB)
RecA_tensor,RecB_tensor = torch.tensor([RecA]), torch.tensor([RecB])
DIoU = Diou(RecA_tensor,RecB_tensor)
CIoU = bbox_overlaps_ciou(RecA_tensor,RecB_tensor)

font = cv2.FONT_HERSHEY_SIMPLEX

cv2.putText(img,"IOU = %.2f"%IoU,(130, 150),font,0.8,(0,0,0),2)
cv2.putText(img,"GIOU = %.2f"%GIoU,(130, 180),font,0.8,(0,0,0),2)
cv2.putText(img,"DIOU = %.2f"%DIoU,(130, 210),font,0.8,(0,0,0),2)
cv2.putText(img,"CIOU = %.2f"%CIoU,(130, 240),font,0.8,(0,0,0),2)

cv2.imshow("image",img)
cv2.waitKey()
cv2.destroyAllWindows()

结果输出：

7 感谢链接

DIoU和CIOU用于目标检测与实例分割，作者已开源，可参考：

https://github.com/Zzh-tju?tab=repositories

其它感谢链接：

https://zhuanlan.zhihu.com/p/94799295
https://zhuanlan.zhihu.com/p/104236411

Original: https://blog.csdn.net/weixin_45377629/article/details/124998517
Author: 寻找永不遗憾
Title: 【DIoU CIoU】DIoU和CIoU损失函数理解及代码实现

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/687878/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

高斯过程回归（Gaussian Process Regression）

在概率论和统计学中，高斯过程是指观测发生在连续域（例如：时域、空间域）中的一种特殊的概率模型 ; 1 基本概念在高斯过程，连续的输入空间的任何点与正态分布的随机变量相关，而且任何…

人工智能 2023年6月17日
00103
【RuntimeError: CUDA error: device-side assert triggered】问题与解决

RuntimeError: CUDA error: device-side assert triggered 问题描述 * 解决思路 – 发现问题：总结问题描述当…

人工智能 2023年7月20日
0068
[BEV系列]BEVFormer: Learning Bird’s-Eye-ViewRepresentation from Multi-Camera Images viaSpatiotemporal

论文链接：https://arxiv.org/pdf/2203.17270v1.pdf代码链接：https://github.com/zhiqi-li/BEVFormer 摘要（A…

人工智能 2023年6月2日
0061
windows10 + Python 3.6+cuda11.2 + cudnn8.1.1.33 + Tensorflow Objection API 环境配置与训练

cuda和cudnn的安装方法，网上有很多，就不在这里写了，只要去nvida的网站下载安装就好了。anaconda 我这边使用的是anaconda3 5.2.0。我这边用的显卡是R…

人工智能 2023年5月25日
0049
自然语言处理入门——新手上路

目录一、自然与语言与编程语言二、自然语言处理的层次三、自然语言处理的流派五、语料库六、开源工具七总结自然语言处理（NLP）是一门融合了计算机科学、人工智能以及语言…

人工智能 2023年6月16日
0064
基于朴素贝叶斯的垃圾邮件分类Python实现

背景垃圾邮件的问题一直困扰着人们，传统的垃圾邮件分类的方法主要有”关键词法”和”校验码法”等，然而这两种方法效果并不理想。其中，如…

人工智能 2023年7月4日
0095
使用“Opencv“时遇到terminate called after throwing an instance of ‘cv::Exception‘问题的解决方案

使用”Opencv”时遇到terminate called after throwing an instance of ‘cv::Excepti…

人工智能 2023年6月18日
00175
Python数据分析-房价预测及模型分析

摘要上一篇OF讲述了房价的影响因素，主要是房屋面积、卫生间数、卧室数。今天，我们通过建立模型来预测房价。机器学习中关于回归算法-数据发展的预测，包含了几个模型： 1、线性回归；…

人工智能 2023年7月16日
00107
R构建指数回归模型（Exponential Regression）

R构建指数回归模型（Exponential Regression）目录 R构建指数回归模型（Exponential Regression）指数回归（Exponential Re…

人工智能 2023年6月18日
0078
遗传算法的神经网络python实现源码

代码过程中，把代码过程较好的一些代码段记录起来，下边代码是关于遗传算法的神经网络python实现的代码，应该对大伙有一些用处。 from operator import itemg…

人工智能 2023年5月25日
0082
java中以字符分隔的字符串与字符串数组的相互转换

1.字符串数组拼接成一个以指定字符(包括空字符)分隔的字符串—— String.join()，JDK8的新特性 String[] strArray = {"aaa&quo…

人工智能 2023年6月6日
0067
Teams app LukcyDraw 的升级之路

我已经有很长一段时间没有更新我的 Teams App：LuckyDraw 了，有很多用户反馈给我，因为快到圣诞，新年和春节了，很多公司都开始要使用LuckyDraw来搞抽奖活动，希…

人工智能 2023年7月30日
0065
实测AIGC工作流，Stable Diffusion + Mubert 实现图片与音乐的转换生成

社区分享了不少文本生成图像的AIGC（AI生成内容）应用的突破，图像类的生成已经是”红海”了。我们需要寻找”蓝海”，近期出现了其他…

人工智能 2023年7月30日
0053
pytorch中nn.Dropout的使用技巧

dropout是Hinton老爷子提出来的一个用于训练的trick。在pytorch中，除了原始的用法以外，还有数据增强的用法（后文提到）。首先要知道，dropout是专门用于训…

人工智能 2023年7月21日
0054
人脸识别技术实现，机器学习分类，网络搭建

一。人脸识别技术 1.python编程环境构建下载anaconda，CUDA，CUDNN 下载完成后（开始键+r）输入cmd后输入conda，显示以下内容即可 2.程序实现环境…

人工智能 2023年7月2日
0060
机器学习强基计划0-4：通俗理解奥卡姆剃刀与没有免费午餐定理

目录 0 写在前面 1 奥卡姆剃刀原则 2 天下没有免费的午餐 3 丑小鸭定理 ; 0 写在前面机器学习强基计划聚焦深度和广度，加深对机器学习模型的理解与应用。”深&…

人工智能 2023年6月19日
0072

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

【DIoU CIoU】DIoU和CIoU损失函数理解及代码实现

文章目录

大家都在看