UNet3+详解

2023年6月16日下午3:16 • 人工智能 • 阅读 72

(1) 编码层

(2) 解码层

a.跳跃连接

b.分类引导模块（CGM）

c.特征聚合机制

d.深监督

e.混合损失函数

UNet3+解决的问题

UNet++是由UNet结构更改而来，然而他不能从全部尺度上捕获信息，仍有很大的提升空间。于是UNet3+主要是解决了可以从全尺度上获取信息的问题。

既然说到全尺度，那就将UNet、UNet++、UNet3+捕获信息的情况讲解下。

（1）UNet

UNet主要是将解码层（下采样层）与编码层（上采样层）中同层之间进行连接。U型结构的低层主要是获取细粒度的细节特征（捕获丰富的空间信息），高层主要是获取粗粒度的语义特征（提现位置信息），所以UNet的这种仅有同层之间的连接，使得他的上下层连接时存在信息代沟现象。

（2）UNet++

UNet++由UNet改编而来，具有嵌套、稠密的跳跃连接，一定程度缓解了UNet层次间的代沟问题，性能也相对提升了不少。

UNet3+的创新点

（1）UNet3+提出全尺度跳跃连接，此连接将来自不同尺度特征图的低级细节与高级语义相结合，最大程度使用全尺度的特征图，提高分割精度。

（2）深监督：深度监督从全尺度聚合特征图中学习层次代表。

（3）通过减少网络参数来提升计算效益。

（4）提出混合损失函数以增强器官边缘。

（5）设计了一个分类引导模型以减少在无器官图像中的过度分割现象。

UNet3+的结构体

由于我在初学该模型的时候，论文里的结构是分两个图提供的，当时我理解起来是有点小困难的，所以就想着画个完整的图分享给大家。

嘿，这是我第一次画网络图，不确定用哪个软件画比较好，就在派派（ipad）上手画的，有点乱乱的。等我有空时，再好好画画。如果大家有好用的画网络图的软件，也欢迎推荐我呀~

从左至右讲解~

(1) 编码Encoder

编码部分和UNet的编码部分是一样的。首先将输入的图像经过两次33卷积，每次卷积都紧跟着BatchNorm2d、ReLU。然后进行最大池操作，即 stride=2的22卷积。值得注意的是最下面一层即第五层卷积后不再进行下采样（最大池操作）。

解释 3*3卷积、BatchNorm2d、ReLU、最大池操作：

a. 3*3卷积

卷积操作影响特征通道，下采样（本模型下采样用的maxpool）影响分辨率。

b. BatchNorm2d

此处是为了将3*3卷积后的特征图进行数据归一化处理，防止后续的ReLU激活函数操作时，出现由于特征图数据过大，出现网络性能不稳定的问题。

c. ReLU

激活函数。

d. 最大池

最大池max pool使用的是2*2的卷积核，用来提取特征，使特征图分辨率缩小1倍。

(2) 解码Decoder

每个解码的实现机制是一样的，论文中是以Decoder3为例讲解的，我们也以Decoder3为例详细讲解，同时也介绍其它层的具体实现过程。

a. 跳跃连接 (此步骤在图中，由于图的跳跃线比较密集，没有将操作全部标在图上。)

Unet3+以全尺度连接为突出创新点，他的全尺度连接是在解码层实现的，具体来说就是该网络的每个解码层的特征图是通过5个尺度的特征图通过一定操作（torch.cat() 拼接）组成的。以Decoder3 为例详细介绍：

Decoder3的特征图是由来源于编码层中比它低层的Encoder1、Encoder2，和它同层的Encoder3，以及解码层中比它高层的Decoder4、Decoder5的特征图分别通过一些操作后构成的。那分别是做了哪些操作呢？

Encoder1：将特征图进行最大池无重叠操作，即kernel_size=4, stride=4的操作，记为maxpooling(4)，maxpooling是用来降低分辨率，达到和Decoder3层同样的分辨率，方便后续的特征图拼接操作；然后33卷积，使特征通道变为64；以及ReLU，总结就是maxpooling(4)、33conv、ReLU。

Encoder2：maxpooling(2)， 3*3Conv，ReLU。

Encoder3：3*3conv， ReLU. （因为是同层，他们的特征图分辨率是相同的，不需要最大池化操作改变分辨率。）

Decoder4：首先进行双线性上采样操作，此操作用来提高分辨率；然后33卷积，此操作用来改变特征通道为64；ReLU操作。总计即 bilinear upsample(2), 33conv, ReLU。

Decoder5：bilinear upsample(4), 3*3conv, ReLU。

即在Encoder中，比Decoder3低的encoder中做分辨率缩小操作（maxpool）；在decoder中，比decoder3高的decoder做分辨率变大操作（双线性上采样 bilinear upsample）。具体的操作倍数由层次决定。

接下来从下往上简要介绍其他解码层的情况。

Decoder5: = Encoder5，故不做任何处理。

Decoder4:

encoder1: maxpooling(8), 3*3conv, ReLU.

encoder2: maxpooling(4), 3*3conv, ReLU.

encoder3: maxpooling(2), 3*3conv, ReLU.

encoder4: 3*3conv, ReLU.

decoder5: bilinear upsample(2), 3*3conv, ReLU.

Decoder2:

encoder1: maxpooling(2), 3*3conv, ReLU.

encoder2: 3*3conv, ReLU.

Decoder3: bilinear umsample(2), 3*3conv, ReLU.

Decoder4: bilinear umsample(4), 3*3conv, ReLU.

Decoder5: bilinear umsample(8), 3*3conv, ReLU.

Decoder1:

encoder1: 3*3conv, ReLU.

Decoder2: bilinear umsample(2), 3*3conv, ReLU.

Decoder3: bilinear umsample(4), 3*3conv, ReLU.

Decoder4: bilinear umsample(8), 3*3conv, ReLU.

Decoder5: bilinear umsample(16), 3*3conv, ReLU.

b. 分类引导模块 classification-guided module(CGM)

1）提出原因

在大多数的医学图像分割中，在无器官的图像中出现假正率是一件必然发生的事情。这种事情的发生可能是由于存在于浅层中图像背景的噪声信息导致的过度分割现象。为实现更精确的分割结果，UNet3+提出通过加入一个额外的分类任务，预测输入的图片是否含有器官。

2）实现

从拥有最丰富语义信息的Encoder5中进行一系列操作，最后分割结果能进一步指导解码层中每一层的输出。

对Encoder5的一系列操作包括dropout，1*1卷积，自适应最大池AdaptiveMaxPool, Sigmoid操作，该一系列操作后得到一个2维张量；通过一个argmax函数，2维tensor转化为 {0,1} 中的一个单一输出，0代表无器官，1代表有器官；在每一个解码层中，将深监督阶段内bilinear up-sampling操作后的分割结果与分割结果0/1相乘。最后实现了将每层中的分割结果进行了分类。

【文章中说该模块的分类结果是在二值交叉上损失函数的优化下实现的，但我在作者提供的代码中没能找到该损失函数。随着深度学习知识的积累，我再一次复现了该模型，发现此处并没有使用二值交叉熵损失函数，而且此处也不需要损失函数。】

c. 特征聚合机制

为了将浅层空间信息与深层语义信息精密合并，提出了特征聚合机制，该机制是将跳跃连接组成的320个特征通道的特征图进一步聚合。具体操作是：将跳跃连接后的320个通道的特征图进行3*3卷积操作，BN数据归一化处理，ReLU激活。

d. 深监督

深监督即将编码层的每个decoder的输出进行了损失计算。

为了了解全尺度聚合特征图的阶层表达，在UNet3+上提出了全尺度深监督，在每一个解码层生成一个受ground truth监督的侧边输出。该步骤的操作包括：3*3conv，bilinear up-sampling, sigmoid。

深监督的具体操作是：将每个解码层的经过特征聚合机制生成的特征图的最后一层输入3*3卷积层内，之后伴随着一个双线性上采样bilinear up-sampling，此处上采样是为了将特征图分辨率恢复至输入图像水平。然后将上采样后得到的分割结果与分类模块的结果0/1相乘；将相乘后的结果经过sigmoid处理，得到的结果即深监督的输出结果。然后将深监督结果输入损失函数。

e. 混合损失函数

将混合损失函数用在了每个解码层深监督的后面，即将深监督中的最后一步sigmoid得到的操作结果再使用了混合损失函数。

混合损失函数：multi-scale structural sililarity index (MS-SSIM) loss (

)，focal loss (

), IoU loss (

为进一步提升器官的边界，提出多尺度结构相似性损失函数

。通过该函数，UNet3+将密切关注模糊边界，区域分布差异越大，他的MS-SSIM值越大。

focal loss是用在像素级的损失。MS-SSIM loss是用在通道级，IoU loss是用在特征图级。

UNet3+的代码实现解说

由于作者提供的代码内解释太少，对于我这种小白来说是有一定困难的，所以我将带有自己阅读笔记的代码分享给大家。

(1) UNet_3Plus.py

模型代码很不好，不够简洁。

-*- coding: utf-8 -*-
import torch
import torch.nn as nn
import torch.nn.functional as F
from layers import unetConv2
from init_weights import init_weights
'''
    UNet 3+
'''
class UNet_3Plus(nn.Module):

    def __init__(self, in_channels=3, n_classes=1, feature_scale=4, is_deconv=True, is_batchnorm=True): #in_channels=3&#x8868;&#x793A;&#x8F93;&#x5165;&#x7684;&#x662F;&#x5F69;&#x8272;&#x56FE;
        super(UNet_3Plus, self).__init__()
        self.is_deconv = is_deconv  #&#x4F5C;&#x7528;&#xFF1F;&#xFF1F;
        self.in_channels = in_channels
        self.is_batchnorm = is_batchnorm  #&#x4E00;&#x4E2A;&#x5377;&#x79EF;&#x5C42;&#x540E;&#x662F;&#x5426;&#x8FDB;&#x884C;&#x5F52;&#x4E00;&#x5316;&#x5904;&#x7406;&#xFF0C;&#x4EE5;&#x9632;&#x6B62;ReLU&#x5904;&#x7406;&#x65F6;&#xFF0C;&#x7531;&#x4E8E;&#x6570;&#x636E;&#x8FC7;&#x5927;&#xFF0C;&#x5BFC;&#x81F4;&#x7F51;&#x7EDC;&#x6027;&#x80FD;&#x4E0D;&#x7A33;&#x5B9A;&#x3002;
        self.feature_scale = feature_scale

        filters = [64, 128, 256, 512, 1024]  #feature channels

        ## -------------Encoder--------------
        #&#x8FD9;&#x662F;&#x4E00;&#x4E2A;&#x7F16;&#x7801;&#x5C42;
        self.conv1 = unetConv2(self.in_channels, filters[0], self.is_batchnorm)
        self.maxpool1 = nn.MaxPool2d(kernel_size=2)

        self.conv2 = unetConv2(filters[0], filters[1], self.is_batchnorm)
        self.maxpool2 = nn.MaxPool2d(kernel_size=2)

        self.conv3 = unetConv2(filters[1], filters[2], self.is_batchnorm)
        self.maxpool3 = nn.MaxPool2d(kernel_size=2)

        self.conv4 = unetConv2(filters[2], filters[3], self.is_batchnorm)
        self.maxpool4 = nn.MaxPool2d(kernel_size=2)

        self.conv5 = unetConv2(filters[3], filters[4], self.is_batchnorm)

        ## -------------Decoder--------------
        self.CatChannels = filters[0]  #&#x6BCF;&#x4E2A;decoder&#x7528;5&#x4E2A;&#x5C3A;&#x5EA6;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x8FDB;&#x884C;&#x62FC;&#x63A5;&#xFF0C;&#x6BCF;&#x4E2A;&#x5C3A;&#x5EA6;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x7684;&#x7279;&#x5F81;&#x901A;&#x9053;&#x90FD;&#x4E3A;64&#xFF0C;&#x5373;filter[0]
        self.CatBlocks = 5  #&#x6BCF;&#x4E2A;decoder&#x6709;&#x6765;&#x81EA;&#x4E94;&#x4E2A;&#x5C3A;&#x5EA6;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x8FDB;&#x884C;&#x62FC;&#x63A5;
        self.UpChannels = self.CatChannels * self.CatBlocks #&#x6BCF;&#x4E2A;decoder&#x62FC;&#x63A5;&#x540E;&#x7684;&#x7279;&#x5F81;&#x901A;&#x9053;&#x6570;&#x91CF; 320

        '''stage 4d'''
        #Deccoder4&#x4E2D;&#xFF0C;&#x83B7;&#x53D6;&#x8F83;&#x5C0F;&#x56DB;&#x5C42;&#x7684;&#x8BE6;&#x7EC6;&#x4FE1;&#x606F;&#x7684;&#x62FC;&#x63A5;&#x64CD;&#x4F5C;
        #&#x5BF9;En1&#x7684;&#x64CD;&#x4F5C; maxpooling(8), 64, 3*3
        # h1->320*320, hd4->40*40, Pooling 8 times
        self.h1_PT_hd4 = nn.MaxPool2d(8, 8, ceil_mode=True) #MaxPool2d(kernel_size, stride, ceil_mode) kernel_size&#x6307;&#x6700;&#x5927;&#x6C60;&#x7684;&#x7A97;&#x53E3;&#x5927;&#x5C0F;&#xFF0C;stride&#x662F;&#x4E00;&#x6B21;&#x79FB;&#x52A8;&#x7684;&#x6B65;&#x957F;&#xFF0C;ceil_mode&#x662F;&#x5411;&#x4E0A;&#x53D6;&#x6574;&#x3002;
        self.h1_PT_hd4_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1) #padding&#x5BF9;&#x5377;&#x79EF;&#x540E;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x8FDB;&#x884C;&#x4E86;&#x8FB9;&#x7F18;&#x50CF;&#x7D20;&#x7684;&#x4FEE;&#x8865;&#x3002;
        self.h1_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels) #&#x53C2;&#x6570;&#x4E3A;&#x7279;&#x5F81;&#x901A;&#x9053;&#x7684;&#x6570;&#x91CF;
        self.h1_PT_hd4_relu = nn.ReLU(inplace=True) #inplace=True &#x51FD;&#x6570;&#x4F1A;&#x628A;&#x8F93;&#x51FA;&#x76F4;&#x63A5;&#x8986;&#x76D6;&#x5230;&#x8F93;&#x5165;&#x4E2D;&#x3002;

        #&#x5BF9;En2&#x7684;&#x64CD;&#x4F5C;&#xFF0C;maxpooling(4), 64, 3*3
        # h2->160*160, hd4->40*40, Pooling 4 times
        self.h2_PT_hd4 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h2_PT_hd4_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd4_relu = nn.ReLU(inplace=True)

        #&#x5BF9;En3&#x7684;&#x64CD;&#x4F5C;&#xFF0C;maxpooling(2), 64, 3*3
        # h3->80*80, hd4->40*40, Pooling 2 times
        self.h3_PT_hd4 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h3_PT_hd4_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_PT_hd4_relu = nn.ReLU(inplace=True)

        #&#x5BF9;&#x540C;&#x5C42;En4&#x7684;&#x64CD;&#x4F5C;&#xFF0C;64&#xFF0C; 3*3&#x3002; &#x540C;&#x5C42;&#x6CA1;&#x6709;&#x6700;&#x5927;&#x6C60;&#x7684;&#x64CD;&#x4F5C;&#x3002;
        # h4->40*40, hd4->40*40, Concatenation
        self.h4_Cat_hd4_conv = nn.Conv2d(filters[3], self.CatChannels, 3, padding=1)
        self.h4_Cat_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h4_Cat_hd4_relu = nn.ReLU(inplace=True)

        #Decoder4&#x4E2D;&#xFF0C;&#x83B7;&#x53D6;&#x8F83;&#x5927;&#x5C42;&#x7684;&#x7C97;&#x7C92;&#x5EA6;&#x4FE1;&#x606F;&#x7684;&#x62FC;&#x63A5;&#x64CD;&#x4F5C;
        #bilinear upsample(2)
        # hd5->20*20, hd4->40*40, Upsample 2 times
        self.hd5_UT_hd4 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14  #scale_factor&#x6307;&#x5B9A;&#x8F93;&#x51FA;&#x4E3A;&#x8F93;&#x5165;&#x7684;&#x591A;&#x5C11;&#x500D;
        self.hd5_UT_hd4_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd4_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4)
        # #&#x7279;&#x5F81;&#x805A;&#x5408;&#x673A;&#x5236;
        self.conv4d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn4d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu4d_1 = nn.ReLU(inplace=True)

        '''stage 3d'''
        # h1->320*320, hd3->80*80, Pooling 4 times
        self.h1_PT_hd3 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h1_PT_hd3_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd3_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd3->80*80, Pooling 2 times
        self.h2_PT_hd3 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h2_PT_hd3_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd3_relu = nn.ReLU(inplace=True)

        # h3->80*80, hd3->80*80, Concatenation
        self.h3_Cat_hd3_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_Cat_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_Cat_hd3_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd4->80*80, Upsample 2 times
        self.hd4_UT_hd3 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd4_UT_hd3_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)  #&#x6CE8;&#x610F;&#x6B64;&#x5904;&#x5377;&#x79EF;&#x7684;&#x8F93;&#x5165;channels&#x4E3A;UpChannels,&#x5373;Decoder4&#x805A;&#x5408;&#x540E;&#x7684;&#x7279;&#x5F81;&#x901A;&#x9053;&#x6570;&#x91CF;&#x3002;
        self.hd4_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd3_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd4->80*80, Upsample 4 times
        self.hd5_UT_hd3 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd5_UT_hd3_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd3_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3)
        self.conv3d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn3d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu3d_1 = nn.ReLU(inplace=True)

        '''stage 2d '''
        # h1->320*320, hd2->160*160, Pooling 2 times
        self.h1_PT_hd2 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h1_PT_hd2_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd2_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd2->160*160, Concatenation
        self.h2_Cat_hd2_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_Cat_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_Cat_hd2_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd2->160*160, Upsample 2 times
        self.hd3_UT_hd2 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd3_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd2->160*160, Upsample 4 times
        self.hd4_UT_hd2 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd4_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd2->160*160, Upsample 8 times
        self.hd5_UT_hd2 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd5_UT_hd2_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd2_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2)
        self.conv2d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn2d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu2d_1 = nn.ReLU(inplace=True)

        '''stage 1d'''
        # h1->320*320, hd1->320*320, Concatenation
        self.h1_Cat_hd1_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_Cat_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_Cat_hd1_relu = nn.ReLU(inplace=True)

        # hd2->160*160, hd1->320*320, Upsample 2 times
        self.hd2_UT_hd1 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd2_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd2_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd2_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd1->320*320, Upsample 4 times
        self.hd3_UT_hd1 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd3_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd1->320*320, Upsample 8 times
        self.hd4_UT_hd1 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd4_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd1->320*320, Upsample 16 times
        self.hd5_UT_hd1 = nn.Upsample(scale_factor=16, mode='bilinear')  # 14*14
        self.hd5_UT_hd1_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd1_relu = nn.ReLU(inplace=True)

        # fusion(h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1)
        self.conv1d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn1d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu1d_1 = nn.ReLU(inplace=True)

        # output
        self.outconv1 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1) #&#x6700;&#x540E;&#x901A;&#x8FC7;3*3&#x5377;&#x79EF;&#xFF0C;&#x5C06;320&#x901A;&#x9053;&#x8F6C;&#x4E3A;n_classes&#x901A;&#x9053;&#x3002;

        # initialise weights
        for m in self.modules():
            if isinstance(m, nn.Conv2d):  #isinstance(object, classes)&#x5224;&#x65AD;&#x5B9E;&#x4F8B;object&#x662F;&#x5426;&#x4E3A;classes&#x7C7B;&#x578B;&#xFF0C;&#x662F;&#x5219;&#x8FD4;&#x56DE;true
                init_weights(m, init_type='kaiming') #&#x3010;&#x7406;&#x89E3;&#x6743;&#x91CD;&#x51FD;&#x6570;&#xFF1F;&#xFF1F;&#x3011;
            elif isinstance(m, nn.BatchNorm2d):
                init_weights(m, init_type='kaiming')

    def forward(self, inputs):
        ## -------------Encoder-------------
        h1 = self.conv1(inputs)  # h1->320*320*64

        h2 = self.maxpool1(h1)
        h2 = self.conv2(h2)  # h2->160*160*128

        h3 = self.maxpool2(h2)
        h3 = self.conv3(h3)  # h3->80*80*256

        h4 = self.maxpool3(h3)
        h4 = self.conv4(h4)  # h4->40*40*512

        h5 = self.maxpool4(h4)
        hd5 = self.conv5(h5)  # h5->20*20*1024

        ## -------------Decoder-------------
        #decoder4
        h1_PT_hd4 = self.h1_PT_hd4_relu(self.h1_PT_hd4_bn(self.h1_PT_hd4_conv(self.h1_PT_hd4(h1))))
        h2_PT_hd4 = self.h2_PT_hd4_relu(self.h2_PT_hd4_bn(self.h2_PT_hd4_conv(self.h2_PT_hd4(h2))))
        h3_PT_hd4 = self.h3_PT_hd4_relu(self.h3_PT_hd4_bn(self.h3_PT_hd4_conv(self.h3_PT_hd4(h3))))
        h4_Cat_hd4 = self.h4_Cat_hd4_relu(self.h4_Cat_hd4_bn(self.h4_Cat_hd4_conv(h4)))
        hd5_UT_hd4 = self.hd5_UT_hd4_relu(self.hd5_UT_hd4_bn(self.hd5_UT_hd4_conv(self.hd5_UT_hd4(hd5))))
        hd4 = self.relu4d_1(self.bn4d_1(self.conv4d_1(
            torch.cat((h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4), 1)))) # hd4->40*40*UpChannels

        #decoder3
        h1_PT_hd3 = self.h1_PT_hd3_relu(self.h1_PT_hd3_bn(self.h1_PT_hd3_conv(self.h1_PT_hd3(h1))))
        h2_PT_hd3 = self.h2_PT_hd3_relu(self.h2_PT_hd3_bn(self.h2_PT_hd3_conv(self.h2_PT_hd3(h2))))
        h3_Cat_hd3 = self.h3_Cat_hd3_relu(self.h3_Cat_hd3_bn(self.h3_Cat_hd3_conv(h3)))
        hd4_UT_hd3 = self.hd4_UT_hd3_relu(self.hd4_UT_hd3_bn(self.hd4_UT_hd3_conv(self.hd4_UT_hd3(hd4))))
        hd5_UT_hd3 = self.hd5_UT_hd3_relu(self.hd5_UT_hd3_bn(self.hd5_UT_hd3_conv(self.hd5_UT_hd3(hd5))))
        hd3 = self.relu3d_1(self.bn3d_1(self.conv3d_1(
            torch.cat((h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3), 1)))) # hd3->80*80*UpChannels

        #decoder2
        h1_PT_hd2 = self.h1_PT_hd2_relu(self.h1_PT_hd2_bn(self.h1_PT_hd2_conv(self.h1_PT_hd2(h1))))
        h2_Cat_hd2 = self.h2_Cat_hd2_relu(self.h2_Cat_hd2_bn(self.h2_Cat_hd2_conv(h2)))
        hd3_UT_hd2 = self.hd3_UT_hd2_relu(self.hd3_UT_hd2_bn(self.hd3_UT_hd2_conv(self.hd3_UT_hd2(hd3))))
        hd4_UT_hd2 = self.hd4_UT_hd2_relu(self.hd4_UT_hd2_bn(self.hd4_UT_hd2_conv(self.hd4_UT_hd2(hd4))))
        hd5_UT_hd2 = self.hd5_UT_hd2_relu(self.hd5_UT_hd2_bn(self.hd5_UT_hd2_conv(self.hd5_UT_hd2(hd5))))
        hd2 = self.relu2d_1(self.bn2d_1(self.conv2d_1(
            torch.cat((h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2), 1)))) # hd2->160*160*UpChannels

        #decoder1
        h1_Cat_hd1 = self.h1_Cat_hd1_relu(self.h1_Cat_hd1_bn(self.h1_Cat_hd1_conv(h1)))
        hd2_UT_hd1 = self.hd2_UT_hd1_relu(self.hd2_UT_hd1_bn(self.hd2_UT_hd1_conv(self.hd2_UT_hd1(hd2))))
        hd3_UT_hd1 = self.hd3_UT_hd1_relu(self.hd3_UT_hd1_bn(self.hd3_UT_hd1_conv(self.hd3_UT_hd1(hd3))))
        hd4_UT_hd1 = self.hd4_UT_hd1_relu(self.hd4_UT_hd1_bn(self.hd4_UT_hd1_conv(self.hd4_UT_hd1(hd4))))
        hd5_UT_hd1 = self.hd5_UT_hd1_relu(self.hd5_UT_hd1_bn(self.hd5_UT_hd1_conv(self.hd5_UT_hd1(hd5))))
        hd1 = self.relu1d_1(self.bn1d_1(self.conv1d_1(
            torch.cat((h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1), 1)))) # hd1->320*320*UpChannels

        #&#x8F93;&#x51FA;
        d1 = self.outconv1(hd1)  # d1->320*320*n_classes
        return F.sigmoid(d1) #&#x8F93;&#x51FA;&#x7684;&#x7ED3;&#x679C;&#x9700;&#x8981;&#x7ECF;&#x8FC7;sigmoid&#x6FC0;&#x6D3B;

'''
    UNet 3+ with deep supervision
'''
class UNet_3Plus_DeepSup(nn.Module):
    def __init__(self, in_channels=3, n_classes=1, feature_scale=4, is_deconv=True, is_batchnorm=True):
        super(UNet_3Plus_DeepSup, self).__init__()
        self.is_deconv = is_deconv
        self.in_channels = in_channels
        self.is_batchnorm = is_batchnorm
        self.feature_scale = feature_scale

        filters = [64, 128, 256, 512, 1024]

        ## -------------Encoder--------------
        self.conv1 = unetConv2(self.in_channels, filters[0], self.is_batchnorm)
        self.maxpool1 = nn.MaxPool2d(kernel_size=2)

        self.conv2 = unetConv2(filters[0], filters[1], self.is_batchnorm)
        self.maxpool2 = nn.MaxPool2d(kernel_size=2)

        self.conv3 = unetConv2(filters[1], filters[2], self.is_batchnorm)
        self.maxpool3 = nn.MaxPool2d(kernel_size=2)

        self.conv4 = unetConv2(filters[2], filters[3], self.is_batchnorm)
        self.maxpool4 = nn.MaxPool2d(kernel_size=2)

        self.conv5 = unetConv2(filters[3], filters[4], self.is_batchnorm)

        ## -------------Decoder--------------
        self.CatChannels = filters[0]
        self.CatBlocks = 5
        self.UpChannels = self.CatChannels * self.CatBlocks

        '''stage 4d'''
        # h1->320*320, hd4->40*40, Pooling 8 times
        self.h1_PT_hd4 = nn.MaxPool2d(8, 8, ceil_mode=True)
        self.h1_PT_hd4_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd4_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd4->40*40, Pooling 4 times
        self.h2_PT_hd4 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h2_PT_hd4_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd4_relu = nn.ReLU(inplace=True)

        # h3->80*80, hd4->40*40, Pooling 2 times
        self.h3_PT_hd4 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h3_PT_hd4_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_PT_hd4_relu = nn.ReLU(inplace=True)

        # h4->40*40, hd4->40*40, Concatenation
        self.h4_Cat_hd4_conv = nn.Conv2d(filters[3], self.CatChannels, 3, padding=1)
        self.h4_Cat_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h4_Cat_hd4_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd4->40*40, Upsample 2 times
        self.hd5_UT_hd4 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd5_UT_hd4_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd4_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4)
        self.conv4d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn4d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu4d_1 = nn.ReLU(inplace=True)

        '''stage 3d'''
        # h1->320*320, hd3->80*80, Pooling 4 times
        self.h1_PT_hd3 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h1_PT_hd3_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd3_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd3->80*80, Pooling 2 times
        self.h2_PT_hd3 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h2_PT_hd3_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd3_relu = nn.ReLU(inplace=True)

        # h3->80*80, hd3->80*80, Concatenation
        self.h3_Cat_hd3_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_Cat_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_Cat_hd3_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd4->80*80, Upsample 2 times
        self.hd4_UT_hd3 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd4_UT_hd3_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd3_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd4->80*80, Upsample 4 times
        self.hd5_UT_hd3 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd5_UT_hd3_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd3_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3)
        self.conv3d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn3d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu3d_1 = nn.ReLU(inplace=True)

        '''stage 2d '''
        # h1->320*320, hd2->160*160, Pooling 2 times
        self.h1_PT_hd2 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h1_PT_hd2_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd2_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd2->160*160, Concatenation
        self.h2_Cat_hd2_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_Cat_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_Cat_hd2_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd2->160*160, Upsample 2 times
        self.hd3_UT_hd2 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd3_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd2->160*160, Upsample 4 times
        self.hd4_UT_hd2 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd4_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd2->160*160, Upsample 8 times
        self.hd5_UT_hd2 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd5_UT_hd2_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd2_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2)
        self.conv2d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn2d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu2d_1 = nn.ReLU(inplace=True)

        '''stage 1d'''
        # h1->320*320, hd1->320*320, Concatenation
        self.h1_Cat_hd1_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_Cat_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_Cat_hd1_relu = nn.ReLU(inplace=True)

        # hd2->160*160, hd1->320*320, Upsample 2 times
        self.hd2_UT_hd1 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd2_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd2_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd2_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd1->320*320, Upsample 4 times
        self.hd3_UT_hd1 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd3_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd1->320*320, Upsample 8 times
        self.hd4_UT_hd1 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd4_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd1->320*320, Upsample 16 times
        self.hd5_UT_hd1 = nn.Upsample(scale_factor=16, mode='bilinear')  # 14*14
        self.hd5_UT_hd1_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd1_relu = nn.ReLU(inplace=True)

        # fusion(h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1)
        self.conv1d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn1d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu1d_1 = nn.ReLU(inplace=True)

        # -------------Bilinear Upsampling--------------     #&#x53CC;&#x7EBF;&#x6027;&#x4E0A;&#x91C7;&#x6837;&#x3010;&#x4EC0;&#x4E48;&#x5730;&#x65B9;&#x7528;&#x5230;&#x4E86;&#xFF1F;&#xFF1F;&#x3011;
        self.upscore6 = nn.Upsample(scale_factor=32,mode='bilinear')###&#x8BE5;&#x51FD;&#x6570;&#x6CA1;&#x7528;&#xFF0C;&#x5E94;&#x8BE5;&#x662F;&#x4F5C;&#x8005;&#x591A;&#x5199;&#x4E86;&#x3002;
        self.upscore5 = nn.Upsample(scale_factor=16,mode='bilinear')  #scale_factor&#x5904;&#x7406;&#xFF0C;&#x662F;&#x4E3A;&#x4E86;&#x8BA9;&#x8F93;&#x51FA;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x548C;&#x7B2C;&#x4E00;&#x5C42;&#x7684;&#x7279;&#x5F81;&#x56FE;&#x5C3A;&#x5BF8;&#x76F8;&#x540C;&#x3002;
        self.upscore4 = nn.Upsample(scale_factor=8,mode='bilinear')
        self.upscore3 = nn.Upsample(scale_factor=4,mode='bilinear')
        self.upscore2 = nn.Upsample(scale_factor=2, mode='bilinear')

        # DeepSup #&#x6DF1;&#x76D1;&#x7763;&#xFF1A;&#x5C06;decoder&#x7684;&#x6700;&#x540E;&#x4E00;&#x5C42;&#x8FDB;&#x884C;conv3*3&#xFF0C;bilinear upsampling, sigmoid
        self.outconv1 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv2 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv3 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv4 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv5 = nn.Conv2d(filters[4], n_classes, 3, padding=1)

        # initialise weights
        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                init_weights(m, init_type='kaiming')
            elif isinstance(m, nn.BatchNorm2d):
                init_weights(m, init_type='kaiming')

    def forward(self, inputs):
        ## -------------Encoder-------------
        h1 = self.conv1(inputs)  # h1->320*320*64

        h2 = self.maxpool1(h1)
        h2 = self.conv2(h2)  # h2->160*160*128

        h3 = self.maxpool2(h2)
        h3 = self.conv3(h3)  # h3->80*80*256

        h4 = self.maxpool3(h3)
        h4 = self.conv4(h4)  # h4->40*40*512

        h5 = self.maxpool4(h4)
        hd5 = self.conv5(h5)  # h5->20*20*1024

        ## -------------Decoder-------------
        #decoder4
        h1_PT_hd4 = self.h1_PT_hd4_relu(self.h1_PT_hd4_bn(self.h1_PT_hd4_conv(self.h1_PT_hd4(h1))))  #&#x4E0E;encoder1&#x8FDE;&#x63A5;&#xFF0C;relu( bn( conv( maxpool( h1 ) ) ) )
        h2_PT_hd4 = self.h2_PT_hd4_relu(self.h2_PT_hd4_bn(self.h2_PT_hd4_conv(self.h2_PT_hd4(h2))))  #&#x4E0E;encoder2&#x8FDE;&#x63A5;
        h3_PT_hd4 = self.h3_PT_hd4_relu(self.h3_PT_hd4_bn(self.h3_PT_hd4_conv(self.h3_PT_hd4(h3))))  #&#x4E0E;encoder3&#x8FDE;&#x63A5;
        h4_Cat_hd4 = self.h4_Cat_hd4_relu(self.h4_Cat_hd4_bn(self.h4_Cat_hd4_conv(h4)))   #&#x540C;&#x5C42;&#x8FDE;&#x63A5;-&#x4E0E;encoder4&#x8FDE;&#x63A5;&#xFF0C;relu( bn( conv( h4 ) ) )
        hd5_UT_hd4 = self.hd5_UT_hd4_relu(self.hd5_UT_hd4_bn(self.hd5_UT_hd4_conv(self.hd5_UT_hd4(hd5))))  #&#x4E0E;decoder5&#x8FDE;&#x63A5;&#xFF0C;relu( bn( conv( upsample ) ) )
        hd4 = self.relu4d_1(self.bn4d_1(self.conv4d_1(
            torch.cat((h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4), 1)))) # hd4->40*40*UpChannels   #&#x7279;&#x5F81;&#x805A;&#x5408;&#x673A;&#x5236; relu( bn( conv( cat( &#x4E94;&#x4E2A;&#x7279;&#x5F81;&#x56FE; ) ) ) ) &#xFF0C;  1&#x8868;&#x793A;&#x6309;&#x7EF4;&#x5EA6;1&#x62FC;&#x63A5;&#xFF0C;&#x5373;&#x6309;&#x884C;&#x62FC;&#x63A5;&#x3002;

        #decoder3
        h1_PT_hd3 = self.h1_PT_hd3_relu(self.h1_PT_hd3_bn(self.h1_PT_hd3_conv(self.h1_PT_hd3(h1))))
        h2_PT_hd3 = self.h2_PT_hd3_relu(self.h2_PT_hd3_bn(self.h2_PT_hd3_conv(self.h2_PT_hd3(h2))))
        h3_Cat_hd3 = self.h3_Cat_hd3_relu(self.h3_Cat_hd3_bn(self.h3_Cat_hd3_conv(h3)))
        hd4_UT_hd3 = self.hd4_UT_hd3_relu(self.hd4_UT_hd3_bn(self.hd4_UT_hd3_conv(self.hd4_UT_hd3(hd4))))
        hd5_UT_hd3 = self.hd5_UT_hd3_relu(self.hd5_UT_hd3_bn(self.hd5_UT_hd3_conv(self.hd5_UT_hd3(hd5))))
        hd3 = self.relu3d_1(self.bn3d_1(self.conv3d_1(
            torch.cat((h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3), 1)))) # hd3->80*80*UpChannels

        #decoder2
        h1_PT_hd2 = self.h1_PT_hd2_relu(self.h1_PT_hd2_bn(self.h1_PT_hd2_conv(self.h1_PT_hd2(h1))))
        h2_Cat_hd2 = self.h2_Cat_hd2_relu(self.h2_Cat_hd2_bn(self.h2_Cat_hd2_conv(h2)))
        hd3_UT_hd2 = self.hd3_UT_hd2_relu(self.hd3_UT_hd2_bn(self.hd3_UT_hd2_conv(self.hd3_UT_hd2(hd3))))
        hd4_UT_hd2 = self.hd4_UT_hd2_relu(self.hd4_UT_hd2_bn(self.hd4_UT_hd2_conv(self.hd4_UT_hd2(hd4))))
        hd5_UT_hd2 = self.hd5_UT_hd2_relu(self.hd5_UT_hd2_bn(self.hd5_UT_hd2_conv(self.hd5_UT_hd2(hd5))))
        hd2 = self.relu2d_1(self.bn2d_1(self.conv2d_1(
            torch.cat((h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2), 1)))) # hd2->160*160*UpChannels

        #decoder1
        h1_Cat_hd1 = self.h1_Cat_hd1_relu(self.h1_Cat_hd1_bn(self.h1_Cat_hd1_conv(h1)))
        hd2_UT_hd1 = self.hd2_UT_hd1_relu(self.hd2_UT_hd1_bn(self.hd2_UT_hd1_conv(self.hd2_UT_hd1(hd2))))
        hd3_UT_hd1 = self.hd3_UT_hd1_relu(self.hd3_UT_hd1_bn(self.hd3_UT_hd1_conv(self.hd3_UT_hd1(hd3))))
        hd4_UT_hd1 = self.hd4_UT_hd1_relu(self.hd4_UT_hd1_bn(self.hd4_UT_hd1_conv(self.hd4_UT_hd1(hd4))))
        hd5_UT_hd1 = self.hd5_UT_hd1_relu(self.hd5_UT_hd1_bn(self.hd5_UT_hd1_conv(self.hd5_UT_hd1(hd5))))
        hd1 = self.relu1d_1(self.bn1d_1(self.conv1d_1(
            torch.cat((h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1), 1)))) # hd1->320*320*UpChannels

        #&#x6DF1;&#x76D1;&#x7763;&#xFF1A;&#x5C06;&#x6BCF;&#x4E2A;decoder&#x7684;&#x6700;&#x540E;&#x4E00;&#x5C42;&#x8FDB;&#x884C;conv3*3&#xFF0C;bilinear upsampling, sigmoid
        #decoder5&#x7684;&#x6DF1;&#x76D1;&#x7763;
        d5 = self.outconv5(hd5)            #conv3*3
        d5 = self.upscore5(d5) # 16->256   #bilinear upsampling

        #decoder4&#x7684;&#x6DF1;&#x76D1;&#x7763;
        d4 = self.outconv4(hd4)
        d4 = self.upscore4(d4) # 32->256

        #decoder3&#x7684;&#x6DF1;&#x76D1;&#x7763;
        d3 = self.outconv3(hd3)
        d3 = self.upscore3(d3) # 64->256

        #decoder2&#x7684;&#x6DF1;&#x76D1;&#x7763;
        d2 = self.outconv2(hd2)
        d2 = self.upscore2(d2) # 128->256

        #decoder1&#x7684;&#x6DF1;&#x76D1;&#x7763;
        d1 = self.outconv1(hd1) # 256  #decoder1&#x4E0D;&#x505A;&#x4E0A;&#x91C7;&#x6837;&#x5904;&#x7406;
        return F.sigmoid(d1), F.sigmoid(d2), F.sigmoid(d3), F.sigmoid(d4), F.sigmoid(d5)  #&#x5C06;&#x6DF1;&#x76D1;&#x7763;&#x7684;&#x7ED3;&#x679C;&#x8FDB;&#x884C;sigmoid&#xFF0C;&#x505A;&#x6700;&#x540E;&#x5904;&#x7406;&#x3002;

'''
    UNet 3+ with deep supervision and class-guided module
'''
class UNet_3Plus_DeepSup_CGM(nn.Module):

    def __init__(self, in_channels=3, n_classes=1, feature_scale=4, is_deconv=True, is_batchnorm=True):
        super(UNet_3Plus_DeepSup_CGM, self).__init__()
        self.is_deconv = is_deconv   #&#x3010;&#x4EC0;&#x4E48;&#x5730;&#x65B9;&#x7528;&#x5230;&#x4E86;&#xFF1F;&#xFF1F;&#x3011;
        self.in_channels = in_channels
        self.is_batchnorm = is_batchnorm
        self.feature_scale = feature_scale ##&#x3010;&#x4EC0;&#x4E48;&#x5730;&#x65B9;&#x7528;&#x5230;&#x4E86;&#xFF1F;&#xFF1F;&#x3011;

        filters = [64, 128, 256, 512, 1024]

        ## -------------Encoder--------------
        self.conv1 = unetConv2(self.in_channels, filters[0], self.is_batchnorm)
        self.maxpool1 = nn.MaxPool2d(kernel_size=2)

        self.conv2 = unetConv2(filters[0], filters[1], self.is_batchnorm)
        self.maxpool2 = nn.MaxPool2d(kernel_size=2)

        self.conv3 = unetConv2(filters[1], filters[2], self.is_batchnorm)
        self.maxpool3 = nn.MaxPool2d(kernel_size=2)

        self.conv4 = unetConv2(filters[2], filters[3], self.is_batchnorm)
        self.maxpool4 = nn.MaxPool2d(kernel_size=2)

        self.conv5 = unetConv2(filters[3], filters[4], self.is_batchnorm)

        ## -------------Decoder--------------
        self.CatChannels = filters[0]
        self.CatBlocks = 5
        self.UpChannels = self.CatChannels * self.CatBlocks

        '''stage 4d'''
        # h1->320*320, hd4->40*40, Pooling 8 times
        self.h1_PT_hd4 = nn.MaxPool2d(8, 8, ceil_mode=True)
        self.h1_PT_hd4_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd4_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd4->40*40, Pooling 4 times
        self.h2_PT_hd4 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h2_PT_hd4_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd4_relu = nn.ReLU(inplace=True)

        # h3->80*80, hd4->40*40, Pooling 2 times
        self.h3_PT_hd4 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h3_PT_hd4_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_PT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_PT_hd4_relu = nn.ReLU(inplace=True)

        # h4->40*40, hd4->40*40, Concatenation
        self.h4_Cat_hd4_conv = nn.Conv2d(filters[3], self.CatChannels, 3, padding=1)
        self.h4_Cat_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.h4_Cat_hd4_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd4->40*40, Upsample 2 times
        self.hd5_UT_hd4 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd5_UT_hd4_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd4_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd4_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4)
        self.conv4d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn4d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu4d_1 = nn.ReLU(inplace=True)

        '''stage 3d'''
        # h1->320*320, hd3->80*80, Pooling 4 times
        self.h1_PT_hd3 = nn.MaxPool2d(4, 4, ceil_mode=True)
        self.h1_PT_hd3_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd3_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd3->80*80, Pooling 2 times
        self.h2_PT_hd3 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h2_PT_hd3_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_PT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_PT_hd3_relu = nn.ReLU(inplace=True)

        # h3->80*80, hd3->80*80, Concatenation
        self.h3_Cat_hd3_conv = nn.Conv2d(filters[2], self.CatChannels, 3, padding=1)
        self.h3_Cat_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.h3_Cat_hd3_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd4->80*80, Upsample 2 times
        self.hd4_UT_hd3 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd4_UT_hd3_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd3_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd4->80*80, Upsample 4 times
        self.hd5_UT_hd3 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd5_UT_hd3_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd3_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd3_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3)
        self.conv3d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn3d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu3d_1 = nn.ReLU(inplace=True)

        '''stage 2d '''
        # h1->320*320, hd2->160*160, Pooling 2 times
        self.h1_PT_hd2 = nn.MaxPool2d(2, 2, ceil_mode=True)
        self.h1_PT_hd2_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_PT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_PT_hd2_relu = nn.ReLU(inplace=True)

        # h2->160*160, hd2->160*160, Concatenation
        self.h2_Cat_hd2_conv = nn.Conv2d(filters[1], self.CatChannels, 3, padding=1)
        self.h2_Cat_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.h2_Cat_hd2_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd2->160*160, Upsample 2 times
        self.hd3_UT_hd2 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd3_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd2->160*160, Upsample 4 times
        self.hd4_UT_hd2 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd4_UT_hd2_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd2_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd2->160*160, Upsample 8 times
        self.hd5_UT_hd2 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd5_UT_hd2_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd2_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd2_relu = nn.ReLU(inplace=True)

        # fusion(h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2)
        self.conv2d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn2d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu2d_1 = nn.ReLU(inplace=True)

        '''stage 1d'''
        # h1->320*320, hd1->320*320, Concatenation
        self.h1_Cat_hd1_conv = nn.Conv2d(filters[0], self.CatChannels, 3, padding=1)
        self.h1_Cat_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.h1_Cat_hd1_relu = nn.ReLU(inplace=True)

        # hd2->160*160, hd1->320*320, Upsample 2 times
        self.hd2_UT_hd1 = nn.Upsample(scale_factor=2, mode='bilinear')  # 14*14
        self.hd2_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd2_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd2_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd3->80*80, hd1->320*320, Upsample 4 times
        self.hd3_UT_hd1 = nn.Upsample(scale_factor=4, mode='bilinear')  # 14*14
        self.hd3_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd3_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd3_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd4->40*40, hd1->320*320, Upsample 8 times
        self.hd4_UT_hd1 = nn.Upsample(scale_factor=8, mode='bilinear')  # 14*14
        self.hd4_UT_hd1_conv = nn.Conv2d(self.UpChannels, self.CatChannels, 3, padding=1)
        self.hd4_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd4_UT_hd1_relu = nn.ReLU(inplace=True)

        # hd5->20*20, hd1->320*320, Upsample 16 times
        self.hd5_UT_hd1 = nn.Upsample(scale_factor=16, mode='bilinear')  # 14*14
        self.hd5_UT_hd1_conv = nn.Conv2d(filters[4], self.CatChannels, 3, padding=1)
        self.hd5_UT_hd1_bn = nn.BatchNorm2d(self.CatChannels)
        self.hd5_UT_hd1_relu = nn.ReLU(inplace=True)

        # fusion(h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1)
        self.conv1d_1 = nn.Conv2d(self.UpChannels, self.UpChannels, 3, padding=1)  # 16
        self.bn1d_1 = nn.BatchNorm2d(self.UpChannels)
        self.relu1d_1 = nn.ReLU(inplace=True)

        # -------------Bilinear Upsampling--------------
        self.upscore6 = nn.Upsample(scale_factor=32,mode='bilinear')###
        self.upscore5 = nn.Upsample(scale_factor=16,mode='bilinear')
        self.upscore4 = nn.Upsample(scale_factor=8,mode='bilinear')
        self.upscore3 = nn.Upsample(scale_factor=4,mode='bilinear')
        self.upscore2 = nn.Upsample(scale_factor=2, mode='bilinear')

        # DeepSup
        self.outconv1 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv2 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv3 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv4 = nn.Conv2d(self.UpChannels, n_classes, 3, padding=1)
        self.outconv5 = nn.Conv2d(filters[4], n_classes, 3, padding=1)

        #&#x5206;&#x7C7B;&#x6307;&#x5F15;&#xFF0C;&#x4EC5;&#x7528;&#x4E8E;encoder5
        self.cls = nn.Sequential(
                    nn.Dropout(p=0.5),  #p&#x4E3A;&#x4E0D;&#x4FDD;&#x7559;&#x8282;&#x70B9;&#x6570;&#x7684;&#x6BD4;&#x4F8B;&#x3002;
                    nn.Conv2d(filters[4], 2, 1), #Conv2d&#x7684;&#x53C2;&#x6570;&#xFF1A;&#x8F93;&#x5165;&#x7684;&#x7279;&#x5F81;&#x901A;&#x9053;&#xFF0C;&#x5377;&#x79EF;&#x6838;&#x5C3A;&#x5BF8;&#xFF0C;&#x6B65;&#x957F;
                    nn.AdaptiveMaxPool2d(1), #&#x81EA;&#x9002;&#x5E94;&#x6700;&#x5927;&#x6C60;&#x5316;&#xFF0C;&#x53C2;&#x6570;&#x4E3A;&#xFF08;H,W&#xFF09;&#x6216;&#x53EA;&#x6709;&#x4E00;&#x4E2A;H&#xFF0C;&#x8868;&#x793A;&#x8F93;&#x51FA;&#x4FE1;&#x53F7;&#x7684;&#x5C3A;&#x5BF8;&#x3002;&#x8F93;&#x51FA;&#x7684;&#x5C3A;&#x5BF8;&#x4E0D;&#x53D8;&#xFF0C;&#x540E;&#x4E24;&#x4E2A;&#x7EF4;&#x5EA6;&#x53D8;&#x4E3A;&#x53C2;&#x6570;&#x5927;&#x5C0F;&#x3002;
                    nn.Sigmoid())

        # initialise weights
        for m in self.modules():
            if isinstance(m, nn.Conv2d):
                init_weights(m, init_type='kaiming')
            elif isinstance(m, nn.BatchNorm2d):
                init_weights(m, init_type='kaiming')

    #&#x5C06;&#x5206;&#x5272;&#x7ED3;&#x679C;&#x4E0E;&#x5206;&#x7C7B;&#x4E8C;&#x7EF4;&#x77E9;&#x9635;&#x8FDB;&#x884C;&#x76F8;&#x4E58;&#xFF0C;&#x8FD4;&#x56DE;&#x56DB;&#x7EF4;&#x4E58;&#x79EF;&#x7ED3;&#x679C;&#x3002;
    def dotProduct(self,seg,cls):
        B, N, H, W = seg.size() #seg&#x662F;&#x4F20;&#x5165;&#x7684;&#x6DF1;&#x5EA6;&#x5377;&#x79EF;&#x7ED3;&#x679C;&#xFF0C;&#x662F;&#x77E9;&#x9635;&#x3002;
        seg = seg.view(B, N, H * W)  #view&#x548C;reshape&#x4F5C;&#x7528;&#x4E00;&#x6837;&#xFF0C;&#x91CD;&#x65B0;&#x5B9A;&#x4E49;&#x77E9;&#x9635;&#x7684;&#x6027;&#x72B6;&#x3002;
        final = torch.einsum("ijk,ij->ijk", [seg, cls]) #&#x5229;&#x7528;&#x7231;&#x56E0;&#x65AF;&#x5766;&#x6C42;&#x548C;&#x7EA6;&#x5B9A;&#x65B9;&#x6CD5;&#x6C42;&#x4E58;&#x79EF;&#x7684;&#x548C;&#x3002;
        final = final.view(B, N, H, W)
        return final

    def forward(self, inputs):
        ## -------------Encoder-------------
        h1 = self.conv1(inputs)  # h1->320*320*64

        h2 = self.maxpool1(h1)
        h2 = self.conv2(h2)  # h2->160*160*128

        h3 = self.maxpool2(h2)
        h3 = self.conv3(h3)  # h3->80*80*256

        h4 = self.maxpool3(h3)
        h4 = self.conv4(h4)  # h4->40*40*512

        h5 = self.maxpool4(h4)
        hd5 = self.conv5(h5)  # h5->20*20*1024

        # -------------Classification-------------
        #&#x5BF9;encoder5&#x505A;&#x5206;&#x7C7B;&#x5904;&#x7406;
        cls_branch = self.cls(hd5).squeeze(3).squeeze(2)  # (B,N,1,1)->(B,N)  #&#x64CD;&#x4F5C;(dropout, conv1*1, adaptiveMaxPool, Sigmoid)&#x540E;&#xFF0C;&#x4EA7;&#x751F;&#x4E00;&#x4E2A;&#x4E8C;&#x7EF4;&#x5F20;&#x91CF;  squeeze(x)&#x53EA;&#x6709;&#x5F53;&#x7EF4;&#x5EA6;x&#x7684;&#x503C;&#x4E3A;1&#x65F6;&#xFF0C;&#x624D;&#x80FD;&#x53BB;&#x6389;&#x8BE5;&#x7EF4;&#x5EA6;&#x3002;
        cls_branch_max = cls_branch.argmax(dim=1) #dim=1&#x5C06;1&#x7EF4;&#x53BB;&#x6389;&#xFF0C;&#x8FD4;&#x56DE;&#x6700;&#x5927;&#x503C;&#x5BF9;&#x5E94;&#x7684;&#x7D22;&#x5F15;&#x3002;  &#x901A;&#x8FC7;argmax&#xFF0C;&#x5206;&#x7C7B;&#x7ED3;&#x679C;&#x88AB;&#x8F6C;&#x4E3A;&#x4E00;&#x4E2A;&#x5355;&#x4E00;&#x6570;&#x5B57;&#x8F93;&#x51FA;&#x3002;  #argmax(a, axis=None, out=Nont):a&#x4E3A;&#x8F93;&#x5165;&#x7684;&#x6570;&#x7EC4;&#xFF1B;axis=0&#x6309;&#x5217;&#x5BFB;&#x627E;&#xFF0C;axis=1&#x6309;&#x884C;&#x5BFB;&#x627E;&#x6700;&#x5927;&#x503C;&#x5BF9;&#x5E94;&#x7684;&#x7D22;&#x5F15;&#xFF1B;out&#x7ED3;&#x679C;&#x5C06;&#x88AB;&#x63D2;&#x5165;&#x5230;a&#x4E2D;&#x3002;
        cls_branch_max = cls_branch_max[:, np.newaxis].float()  #&#x5728;np.newaxis&#x7684;&#x4F4D;&#x7F6E;&#x589E;&#x52A0;&#x4E00;&#x4E2A;&#x7EF4;&#x5EA6;&#xFF0C;&#x6545;&#x6B64;&#x65F6;&#x662F;&#x589E;&#x52A0;&#x4E00;&#x4E2A;&#x5217;&#x7EF4;&#x5EA6;&#x3002;

        ## -------------Decoder-------------
        h1_PT_hd4 = self.h1_PT_hd4_relu(self.h1_PT_hd4_bn(self.h1_PT_hd4_conv(self.h1_PT_hd4(h1))))
        h2_PT_hd4 = self.h2_PT_hd4_relu(self.h2_PT_hd4_bn(self.h2_PT_hd4_conv(self.h2_PT_hd4(h2))))
        h3_PT_hd4 = self.h3_PT_hd4_relu(self.h3_PT_hd4_bn(self.h3_PT_hd4_conv(self.h3_PT_hd4(h3))))
        h4_Cat_hd4 = self.h4_Cat_hd4_relu(self.h4_Cat_hd4_bn(self.h4_Cat_hd4_conv(h4)))
        hd5_UT_hd4 = self.hd5_UT_hd4_relu(self.hd5_UT_hd4_bn(self.hd5_UT_hd4_conv(self.hd5_UT_hd4(hd5))))
        hd4 = self.relu4d_1(self.bn4d_1(self.conv4d_1(
            torch.cat((h1_PT_hd4, h2_PT_hd4, h3_PT_hd4, h4_Cat_hd4, hd5_UT_hd4), 1)))) # hd4->40*40*UpChannels   #1&#x8868;&#x793A;&#x6309;&#x7EF4;&#x5EA6;1&#x62FC;&#x63A5;&#xFF0C;&#x5373;&#x6309;&#x5217;&#x62FC;&#x63A5;&#xFF0C;&#x5373;&#x5217;&#x53D8;&#x591A;&#x3002;

        h1_PT_hd3 = self.h1_PT_hd3_relu(self.h1_PT_hd3_bn(self.h1_PT_hd3_conv(self.h1_PT_hd3(h1))))
        h2_PT_hd3 = self.h2_PT_hd3_relu(self.h2_PT_hd3_bn(self.h2_PT_hd3_conv(self.h2_PT_hd3(h2))))
        h3_Cat_hd3 = self.h3_Cat_hd3_relu(self.h3_Cat_hd3_bn(self.h3_Cat_hd3_conv(h3)))
        hd4_UT_hd3 = self.hd4_UT_hd3_relu(self.hd4_UT_hd3_bn(self.hd4_UT_hd3_conv(self.hd4_UT_hd3(hd4))))
        hd5_UT_hd3 = self.hd5_UT_hd3_relu(self.hd5_UT_hd3_bn(self.hd5_UT_hd3_conv(self.hd5_UT_hd3(hd5))))
        hd3 = self.relu3d_1(self.bn3d_1(self.conv3d_1(
            torch.cat((h1_PT_hd3, h2_PT_hd3, h3_Cat_hd3, hd4_UT_hd3, hd5_UT_hd3), 1)))) # hd3->80*80*UpChannels

        h1_PT_hd2 = self.h1_PT_hd2_relu(self.h1_PT_hd2_bn(self.h1_PT_hd2_conv(self.h1_PT_hd2(h1))))
        h2_Cat_hd2 = self.h2_Cat_hd2_relu(self.h2_Cat_hd2_bn(self.h2_Cat_hd2_conv(h2)))
        hd3_UT_hd2 = self.hd3_UT_hd2_relu(self.hd3_UT_hd2_bn(self.hd3_UT_hd2_conv(self.hd3_UT_hd2(hd3))))
        hd4_UT_hd2 = self.hd4_UT_hd2_relu(self.hd4_UT_hd2_bn(self.hd4_UT_hd2_conv(self.hd4_UT_hd2(hd4))))
        hd5_UT_hd2 = self.hd5_UT_hd2_relu(self.hd5_UT_hd2_bn(self.hd5_UT_hd2_conv(self.hd5_UT_hd2(hd5))))
        hd2 = self.relu2d_1(self.bn2d_1(self.conv2d_1(
            torch.cat((h1_PT_hd2, h2_Cat_hd2, hd3_UT_hd2, hd4_UT_hd2, hd5_UT_hd2), 1)))) # hd2->160*160*UpChannels

        h1_Cat_hd1 = self.h1_Cat_hd1_relu(self.h1_Cat_hd1_bn(self.h1_Cat_hd1_conv(h1)))
        hd2_UT_hd1 = self.hd2_UT_hd1_relu(self.hd2_UT_hd1_bn(self.hd2_UT_hd1_conv(self.hd2_UT_hd1(hd2))))
        hd3_UT_hd1 = self.hd3_UT_hd1_relu(self.hd3_UT_hd1_bn(self.hd3_UT_hd1_conv(self.hd3_UT_hd1(hd3))))
        hd4_UT_hd1 = self.hd4_UT_hd1_relu(self.hd4_UT_hd1_bn(self.hd4_UT_hd1_conv(self.hd4_UT_hd1(hd4))))
        hd5_UT_hd1 = self.hd5_UT_hd1_relu(self.hd5_UT_hd1_bn(self.hd5_UT_hd1_conv(self.hd5_UT_hd1(hd5))))
        hd1 = self.relu1d_1(self.bn1d_1(self.conv1d_1(
            torch.cat((h1_Cat_hd1, hd2_UT_hd1, hd3_UT_hd1, hd4_UT_hd1, hd5_UT_hd1), 1)))) # hd1->320*320*UpChannels

        #&#x505A;&#x6DF1;&#x76D1;&#x7763;&#x5904;&#x7406;
        d5 = self.outconv5(hd5)
        d5 = self.upscore5(d5) # 16->256

        d4 = self.outconv4(hd4)
        d4 = self.upscore4(d4) # 32->256

        d3 = self.outconv3(hd3)
        d3 = self.upscore3(d3) # 64->256

        d2 = self.outconv2(hd2)
        d2 = self.upscore2(d2) # 128->256

        d1 = self.outconv1(hd1) # 256

        #&#x5C06;&#x6BCF;&#x4E2A;decoder&#x7684;&#x5206;&#x5272;&#x7ED3;&#x679C;&#x4E0E;&#x5206;&#x7C7B;&#x7ED3;&#x679C;&#x76F8;&#x4E58;&#xFF0C;&#x8FD4;&#x56DE;&#x8BA1;&#x7B97;&#x540E;&#x7684;&#x56DB;&#x7EF4;&#x77E9;&#x9635;&#x3002;
        d1 = self.dotProduct(d1, cls_branch_max) #d1&#x4E3A;&#x6DF1;&#x76D1;&#x7763;&#x4E2D;&#x7684;&#x5377;&#x79EF;&#x7ED3;&#x679C;&#xFF0C;&#x4E3A;&#x77E9;&#x9635;&#xFF1B;cls_branch_max&#xFF1A;float&#x7C7B;&#x578B;&#xFF0C;&#x4E3A;&#x5206;&#x7C7B;&#x7ED3;&#x679C;&#x3002;
        d2 = self.dotProduct(d2, cls_branch_max)
        d3 = self.dotProduct(d3, cls_branch_max)
        d4 = self.dotProduct(d4, cls_branch_max)
        d5 = self.dotProduct(d5, cls_branch_max)

        return F.sigmoid(d1), F.sigmoid(d2), F.sigmoid(d3), F.sigmoid(d4), F.sigmoid(d5)  #sigmoid&#x8FDB;&#x884C;&#x56DE;&#x5F52;&#xFF0C;&#x5C06;&#x7ED3;&#x679C;&#x56DE;&#x5F52;&#x5230;0-1&#x4E4B;&#x95F4;&#x3002;

(2) layers.py

import torch
import torch.nn as nn
import torch.nn.functional as F
from init_weights import init_weights

#&#x83B7;&#x5F97;&#x4E24;&#x4E2A;&#x5377;&#x79EF;&#x540E;&#x7684;&#x7ED3;&#x679C;
class unetConv2(nn.Module):
    def __init__(self, in_size, out_size, is_batchnorm, n=2, ks=3, stride=1, padding=1):  #is_batchnorm&#x5377;&#x79EF;&#x540E;&#x662F;&#x5426;&#x505A;&#x5F52;&#x4E00;&#x5316;&#x5904;&#x7406;
        super(unetConv2, self).__init__()
        self.n = n
        self.ks = ks
        self.stride = stride
        self.padding = padding
        s = stride
        p = padding
        if is_batchnorm: #&#x5377;&#x79EF;&#x540E;&#xFF0C;&#x505A;&#x6570;&#x636E;&#x5F52;&#x4E00;&#x5316;&#x5904;&#x7406;
            for i in range(1, n + 1):  #range(start, stop, step)&#x8BA1;&#x6570;&#x8FED;&#x4EE3;&#x7684;&#x8FC7;&#x7A0B;&#x4E2D;&#x4E0D;&#x5305;&#x62EC;stop
                conv = nn.Sequential(nn.Conv2d(in_size, out_size, ks, s, p),
                                     nn.BatchNorm2d(out_size),
                                     nn.ReLU(inplace=True), )
                setattr(self, 'conv%d' % i, conv)
                in_size = out_size

        else:  #&#x5377;&#x79EF;&#x540E;&#x4E0D;&#x505A;&#x6570;&#x636E;&#x5F52;&#x4E00;&#x5316;&#x5904;&#x7406;&#xFF0C;&#x53EF;&#x80FD;&#x4F1A;&#x51FA;&#x73B0;&#x5377;&#x79EF;&#x5F97;&#x5230;&#x7684;&#x6570;&#x636E;&#x8FC7;&#x5927;&#xFF0C;&#x5BFC;&#x81F4;ReLU&#x7684;&#x7F51;&#x7EDC;&#x6027;&#x80FD;&#x4E0D;&#x7A33;&#x5B9A;
            for i in range(1, n + 1):
                conv = nn.Sequential(nn.Conv2d(in_size, out_size, ks, s, p),
                                     nn.ReLU(inplace=True), )
                setattr(self, 'conv%d' % i, conv)
                in_size = out_size

        # initialise the blocks  #&#x81EA;&#x5B9A;&#x4E49;&#x53C2;&#x6570;&#x521D;&#x59CB;&#x5316;&#x65B9;&#x6CD5;
        for m in self.children():  #children&#x5305;&#x62EC;net&#x7684;&#x65B9;&#x6CD5;
            init_weights(m, init_type='kaiming')  #&#x610F;&#x601D;&#x662F;&#x7ED9;&#x7F51;&#x7EDC;&#x7684;&#x6BCF;&#x4E00;&#x5C42;&#x8D4B;&#x4E88;&#x6743;&#x91CD;

    def forward(self, inputs): #&#x8FD4;&#x56DE;&#x6BCF;&#x4E00;&#x4E2A;&#x7F16;&#x7801;&#x6A21;&#x5757;&#x7684;&#x7ED3;&#x679C;
        x = inputs
        for i in range(1, self.n + 1):  #n&#x8868;&#x793A;&#x5377;&#x79EF;&#x5C42;&#x7684;&#x4E2A;&#x6570;&#xFF0C;&#x4E3A;2
            conv = getattr(self, 'conv%d' % i)
            x = conv(x)

        return x

class unetUp(nn.Module):
    def __init__(self, in_size, out_size, is_deconv, n_concat=2):
        super(unetUp, self).__init__()
        # self.conv = unetConv2(in_size + (n_concat - 2) * out_size, out_size, False)
        self.conv = unetConv2(out_size*2, out_size, False)
        if is_deconv:
            self.up = nn.ConvTranspose2d(in_size, out_size, kernel_size=4, stride=2, padding=1)
        else:
            self.up = nn.UpsamplingBilinear2d(scale_factor=2)

        # initialise the blocks
        for m in self.children():
            if m.__class__.__name__.find('unetConv2') != -1: continue
            init_weights(m, init_type='kaiming')

    def forward(self, inputs0, *input):
        # print(self.n_concat)
        # print(input)
        outputs0 = self.up(inputs0)
        for i in range(len(input)):
            outputs0 = torch.cat([outputs0, input[i]], 1)
        return self.conv(outputs0)

class unetUp_origin(nn.Module):
    def __init__(self, in_size, out_size, is_deconv, n_concat=2):
        super(unetUp_origin, self).__init__()
        # self.conv = unetConv2(out_size*2, out_size, False)
        if is_deconv:
            self.conv = unetConv2(in_size + (n_concat - 2) * out_size, out_size, False)
            self.up = nn.ConvTranspose2d(in_size, out_size, kernel_size=4, stride=2, padding=1)
        else:
            self.conv = unetConv2(in_size + (n_concat - 2) * out_size, out_size, False)
            self.up = nn.UpsamplingBilinear2d(scale_factor=2)

        # initialise the blocks
        for m in self.children():
            if m.__class__.__name__.find('unetConv2') != -1: continue
            init_weights(m, init_type='kaiming')

    def forward(self, inputs0, *input):
        # print(self.n_concat)
        # print(input)
        outputs0 = self.up(inputs0)
        for i in range(len(input)):
            outputs0 = torch.cat([outputs0, input[i]], 1)
        return self.conv(outputs0)

(3) init_weights.py

此部分本人没再添加多余解释。

import torch
import torch.nn as nn
from torch.nn import init

def weights_init_normal(m):
    classname = m.__class__.__name__
    #print(classname)
    if classname.find('Conv') != -1:
        init.normal_(m.weight.data, 0.0, 0.02)
    elif classname.find('Linear') != -1:
        init.normal_(m.weight.data, 0.0, 0.02)
    elif classname.find('BatchNorm') != -1:
        init.normal_(m.weight.data, 1.0, 0.02)
        init.constant_(m.bias.data, 0.0)

def weights_init_xavier(m):
    classname = m.__class__.__name__
    #print(classname)
    if classname.find('Conv') != -1:
        init.xavier_normal_(m.weight.data, gain=1)
    elif classname.find('Linear') != -1:
        init.xavier_normal_(m.weight.data, gain=1)
    elif classname.find('BatchNorm') != -1:
        init.normal_(m.weight.data, 1.0, 0.02)
        init.constant_(m.bias.data, 0.0)

def weights_init_kaiming(m):
    classname = m.__class__.__name__
    #print(classname)
    if classname.find('Conv') != -1:
        init.kaiming_normal_(m.weight.data, a=0, mode='fan_in')
    elif classname.find('Linear') != -1:
        init.kaiming_normal_(m.weight.data, a=0, mode='fan_in')
    elif classname.find('BatchNorm') != -1:
        init.normal_(m.weight.data, 1.0, 0.02)
        init.constant_(m.bias.data, 0.0)

def weights_init_orthogonal(m):
    classname = m.__class__.__name__
    #print(classname)
    if classname.find('Conv') != -1:
        init.orthogonal_(m.weight.data, gain=1)
    elif classname.find('Linear') != -1:
        init.orthogonal_(m.weight.data, gain=1)
    elif classname.find('BatchNorm') != -1:
        init.normal_(m.weight.data, 1.0, 0.02)
        init.constant_(m.bias.data, 0.0)

def init_weights(net, init_type='normal'):
    #print('initialization method [%s]' % init_type)
    if init_type == 'normal':
        net.apply(weights_init_normal)
    elif init_type == 'xavier':
        net.apply(weights_init_xavier)
    elif init_type == 'kaiming':
        net.apply(weights_init_kaiming)
    elif init_type == 'orthogonal':
        net.apply(weights_init_orthogonal)
    else:
        raise NotImplementedError('initialization method [%s] is not implemented' % init_type)

(4) focalLoss.py

作者虽然提供了bceLoss，但并没有使用该损失函数，而是使用的focalLoss。

class FocalLoss(nn.Module):
    def __init__(self, alpha=0.25, gamma=2):
        super(FocalLoss, self).__init__()
        self.alpha = alpha
        self.gamma = gamma

    def forward(self, pred, mask):
"""
        :param pred: softmax(pred)
        :param mask: one_hot(mask)
        :return:
"""
        eps = 1e-7
        p = pred.view((pred.size()[0], pred.size()[1], -1))
        y = mask.view(p.size())

        ce = -1 * torch.log(p + eps) * y
        floss = torch.pow((1-p), self.gamma) * ce
        floss = torch.mul(floss, self.alpha)
        floss = torch.sum(floss, dim=1)
        return torch.mean(floss)

(5) iouLoss.py

import torch

def _iou(pred, target, size_average = True):

    b = pred.shape[0]
    IoU = 0.0
    for i in range(0,b):
        #compute the IoU of the foreground
        Iand1 = torch.sum(target[i,:,:,:]*pred[i,:,:,:])  #&#x4EA4;
        Ior1 = torch.sum(target[i,:,:,:]) + torch.sum(pred[i,:,:,:])-Iand1  #&#x5E76;
        IoU1 = Iand1/Ior1  #&#x4EA4;&#x5E76;&#x6BD4;

        #IoU loss is (1-IoU1)  #&#x4EA4;&#x5E76;&#x6BD4;&#x7684;&#x635F;&#x5931;&#xFF1A;1-IoU
        IoU = IoU + (1-IoU1)

    return IoU/b #&#x8FD4;&#x56DE;&#x4EA4;&#x5E76;&#x6BD4;&#x7684;&#x5E73;&#x5747;&#x503C;&#x4F5C;&#x4E3A;&#x4EA4;&#x5E76;&#x6BD4;

class IOU(torch.nn.Module):
    def __init__(self, size_average=True):
        super(IOU, self).__init__()
        self.size_average = size_average

    def forward(self, pred, target):

        return _iou(pred, target, self.size_average) #size_average&#x5373;&#x5BF9;&#x7ED3;&#x679C;&#x6C42;&#x5E73;&#x5747;&#x503C;&#x3002;

def IOU_loss(pred,label):
    iou_loss = IOU(size_average=True)
    iou_out = iou_loss(pred, label)
    print("iou_loss:", iou_out.data.cpu().numpy())
    return iou_out

(6) msssimLoss.py

import torch
import torch.nn.functional as F
from math import exp
import numpy as np

#&#x9AD8;&#x65AF;&#x6EE4;&#x6CE2;&#x5668;&#x7684;&#x6B63;&#x6001;&#x5206;&#x5E03;&#xFF0C;window_size&#x662F;&#x4F4D;&#x7F6E;&#x53C2;&#x6570;&#xFF0C;&#x51B3;&#x5B9A;&#x5206;&#x5E03;&#x7684;&#x4F4D;&#x7F6E;&#xFF1B;sigma&#x662F;&#x5C3A;&#x5EA6;&#x53C2;&#x6570;&#xFF0C;&#x51B3;&#x5B9A;&#x5206;&#x5E03;&#x7684;&#x5E45;&#x5EA6;&#x3002;
def gaussian(window_size, sigma):
    gauss = torch.Tensor([exp(-(x - window_size//2)**2/float(2*sigma**2)) for x in range(window_size)])  #&#x53CC;&#x661F;&#x53F7;&#xFF1A;&#x5E42;&#x7684;&#x610F;&#x601D;&#x3002; &#x53CC;//&#xFF1A;&#x8868;&#x793A;&#x5411;&#x4E0B;&#x53D6;&#x6574;&#xFF0C;&#x6709;&#x4E00;&#x65B9;&#x662F;float&#x578B;&#x65F6;&#xFF0C;&#x7ED3;&#x679C;&#x4E3A;float&#x3002;  exp()&#x8FD4;&#x56DE;e&#x7684;x&#x6B21;&#x65B9;&#x3002;
    return gauss/gauss.sum()

def create_window(window_size, channel=1):
    _1D_window = gaussian(window_size, 1.5).unsqueeze(1) #unsqueeze&#xFF08;x&#xFF09;&#x589E;&#x52A0;&#x7EF4;&#x5EA6;x
    _2D_window = _1D_window.mm(_1D_window.t()).float().unsqueeze(0).unsqueeze(0)  #t() &#x5C06;tensor&#x8FDB;&#x884C;&#x8F6C;&#x7F6E;&#x3002;  x.mm(self.y) &#x5C06;x&#x4E0E;y&#x76F8;&#x4E58;&#x3002;
    window = _2D_window.expand(channel, 1, window_size, window_size).contiguous()
    return window

#&#x8FD4;&#x56DE;&#x7684;&#x5747;&#x503C;&#x3002;
def ssim(img1, img2, window_size=11, window=None, size_average=True, full=False, val_range=None):
    #&#x6C42;&#x50CF;&#x7D20;&#x7684;&#x52A8;&#x6001;&#x8303;&#x56F4;
    # Value range can be different from 255. Other common ranges are 1 (sigmoid) and 2 (tanh).

    if val_range is None:
        if torch.max(img1) > 128:
            max_val = 255
        else:
            max_val = 1

        if torch.min(img1) < -0.5:
            min_val = -1
        else:
            min_val = 0
        L = max_val - min_val
    else:
        L = val_range

    #&#x6C42;img1&#xFF0C;img2&#x7684;&#x5747;&#x503C;&#x3002;
    padd = 0
    (_, channel, height, width) = img1.size() # _ &#x4E3A;&#x6279;&#x6B21;batch&#x5927;&#x5C0F;&#x3002;
        #&#x5B9A;&#x4E49;&#x5377;&#x79EF;&#x6838;window
    if window is None:
        real_size = min(window_size, height, width) #&#x6C42;&#x6700;&#x5C0F;&#x503C;&#xFF0C;&#x662F;&#x4E3A;&#x4E86;&#x4FDD;&#x8BC1;&#x5377;&#x79EF;&#x6838;&#x5C3A;&#x5BF8;&#x548C;img1&#xFF0C;img2&#x5C3A;&#x5BF8;&#x76F8;&#x540C;&#x3002;
        window = create_window(real_size, channel=channel).to(img1.device)

        #&#x7A7A;&#x6D1E;&#x5377;&#x79EF;&#xFF1A;&#x6709;groups&#x4EE3;&#x8868;&#x662F;&#x7A7A;&#x6D1E;&#x5377;&#x79EF;&#xFF1B;  F.conv2d(&#x8F93;&#x5165;&#x56FE;&#x50CF;tensor&#xFF0C;&#x5377;&#x79EF;&#x6838;tensor, ...)&#x662F;&#x5377;&#x79EF;&#x64CD;&#x4F5C;&#x3002;
        #mu1&#x4E3A;img1&#x7684;&#x5747;&#x503C;&#xFF1B;mu2&#x4E3A;img2&#x7684;&#x5747;&#x503C;&#x3002;
    mu1 = F.conv2d(img1, window, padding=padd, groups=channel) #groups&#x63A7;&#x5236;&#x5206;&#x7EC4;&#x5377;&#x79EF;&#xFF0C;&#x9ED8;&#x8BA4;&#x4E0D;&#x5206;&#x7EC4;&#xFF0C;&#x5373;&#x4E3A;1.  delition&#x9ED8;&#x8BA4;&#x4E3A;1.

    mu2 = F.conv2d(img2, window, padding=padd, groups=channel) #conv2d&#x8F93;&#x51FA;&#x7684;&#x662F;&#x4E00;&#x4E2A;tensor-&#x65B0;&#x7684;feature map&#x3002;

        #mu1_sq:img1&#x5747;&#x503C;&#x7684;&#x5E73;&#x65B9;&#x3002; mu2_sq:img2&#x5747;&#x503C;&#x7684;&#x5E73;&#x65B9;
    mu1_sq = mu1.pow(2) #&#x5BF9;mu1&#x4E2D;&#x7684;&#x5143;&#x7D20;&#x9010;&#x4E2A;2&#x6B21;&#x5E42;&#x8BA1;&#x7B97;&#x3002;
    mu2_sq = mu2.pow(2)
        #img1,img2&#x5747;&#x503C;&#x7684;&#x4E58;&#x79EF;&#x3002;
    mu1_mu2 = mu1 * mu2

    #x&#x7684;&#x65B9;&#x5DEE;&#x3C3;x&#xB2;
    sigma1_sq = F.conv2d(img1 * img1, window, padding=padd, groups=channel) - mu1_sq
    #y&#x7684;&#x65B9;&#x5DEE;&#x3C3;y&#xB2;
    sigma2_sq = F.conv2d(img2 * img2, window, padding=padd, groups=channel) - mu2_sq
    #&#x6C42;x,y&#x7684;&#x534F;&#x65B9;&#x5DEE;&#x3C3;xy
    sigma12 = F.conv2d(img1 * img2, window, padding=padd, groups=channel) - mu1_mu2

    #&#x7EF4;&#x6301;&#x7A33;&#x5B9A;&#x7684;&#x4E24;&#x4E2A;&#x53D8;&#x91CF;
    C1 = (0.01 * L) ** 2
    C2 = (0.03 * L) ** 2

    #v1:2&#x3C3;xy+C2
    v1 = 2.0 * sigma12 + C2
    #v2:&#x3C3;x&#xB2;+&#x3C3;y&#xB2;+C2
    v2 = sigma1_sq + sigma2_sq + C2
    cs = torch.mean(v1 / v2)  # contrast sensitivity   #&#x5BF9;&#x6BD4;&#x654F;&#x611F;&#x5EA6;

    #ssim_map&#x4E3A;img1,img2&#x7684;&#x76F8;&#x4F3C;&#x6027;&#x6307;&#x6570;&#x3002;
    ssim_map = ((2 * mu1_mu2 + C1) * v1) / ((mu1_sq + mu2_sq + C1) * v2)

    #&#x6C42;&#x5E73;&#x5747;&#x76F8;&#x4F3C;&#x6027;&#x6307;&#x6570;&#x3002; ??

    if size_average: #&#x8981;&#x6C42;&#x5E73;&#x5747;&#x65F6;
        ret = ssim_map.mean()
    else: #&#x4E0D;&#x8981;&#x6C42;&#x5E73;&#x5747;&#x65F6;
        ret = ssim_map.mean(1).mean(1).mean(1) #mean(1) &#x6C42;&#x7EF4;&#x5EA6;1&#x7684;&#x5E73;&#x5747;&#x503C;

    if full:
        return ret, cs
    return ret

def msssim(img1, img2, window_size=11, size_average=True, val_range=None, normalize=False):
    device = img1.device
    weights = torch.FloatTensor([0.0448, 0.2856, 0.3001, 0.2363, 0.1333]).to(device) #to(device)&#x4F7F;&#x7528;GPU&#x8FD0;&#x7B97;
    # weights = torch.FloatTensor([0.0448, 0.2856, 0.3001, 0.2363, 0.1333])
    levels = weights.size()[0]
    mssim = [] #&#x5B58;&#x653E;&#x6BCF;&#x4E00;&#x5C3A;&#x5EA6;&#x7684;ssim&#x7684;&#x5E73;&#x5747;&#x503C;
    mcs = [] #&#x5B58;&#x653E;&#x6BCF;&#x4E00;&#x5C3A;&#x5EA6;&#x7684;cs&#x7684;&#x5E73;&#x5747;&#x503C;
    #&#x5C06;img1&#xFF0C;img2&#x4E24;&#x5F20;&#x56FE;&#x50CF;&#x5206;&#x4E3A;levels&#x4E2A;&#x5C0F;&#x7A97;&#x53E3;&#xFF0C;&#x6C42;&#x6BCF;&#x5BF9;&#x5C0F;&#x7A97;&#x53E3;&#x7684;SSIM
    for _ in range(levels):
        #&#x6C42;&#x6BCF;&#x4E00;&#x5BF9;&#x5C0F;&#x7A97;&#x53E3;&#x7684;&#x7ED3;&#x6784;&#x76F8;&#x4F3C;&#x6027;&#x6307;&#x6570;&#xFF08;SSIM&#xFF09;
        sim, cs = ssim(img1, img2, window_size=window_size, size_average=size_average, full=True, val_range=val_range)
        print("sim", sim)
        mssim.append(sim)
        mcs.append(cs)

        #&#x4EE5;&#x6C42;&#x6700;&#x5927;&#x6C60;&#x7684;&#x65B9;&#x5F0F;&#x79FB;&#x52A8;&#x56FE;&#x50CF;img1, img2&#x7684;&#x4F4D;&#x7F6E;
        img1 = F.avg_pool2d(img1, (2, 2)) #&#x5E73;&#x5747;&#x6C60;&#x5316;&#x3002; &#xFF08;2&#xFF0C;2&#xFF09;&#xFF1A;stride&#x6A2A;&#x5411;&#x3001;&#x7EB5;&#x5411;&#x90FD;&#x6B65;&#x957F;&#x4E3A;2.

        img2 = F.avg_pool2d(img2, (2, 2))

    mssim = torch.stack(mssim) #torch.stack()&#x4FDD;&#x7559;&#x5E8F;&#x5217;&#x3001;&#x5F20;&#x91CF;&#x77E9;&#x9635;&#x4FE1;&#x606F;&#xFF0C;&#x5C06;&#x4E00;&#x4E2A;&#x4E2A;&#x5F20;&#x91CF;&#x6309;&#x7167;&#x65F6;&#x95F4;&#x5E8F;&#x5217;&#x6392;&#x5E8F;&#xFF0C;&#x62FC;&#x63A5;&#x6210;&#x4E00;&#x4E2A;&#x4E09;&#x7EF4;&#x7ACB;&#x4F53;&#x3002;   &#x6269;&#x5F20;&#x7EF4;&#x5EA6;&#x3002;
    mcs = torch.stack(mcs)

    #&#x907F;&#x514D;&#x5F53;&#x4E24;&#x5F20;&#x56FE;&#x50CF;&#x90FD;&#x6709;&#x975E;&#x5E38;&#x5C0F;&#x7684;MS-SSIM&#x65F6;&#xFF0C;&#x65E0;&#x6CD5;&#x7EE7;&#x7EED;&#x8BAD;&#x7EC3;&#x3002;
    # Normalize (to avoid NaNs during training unstable models, not compliant with original definition)
    if normalize:
        mssim = (mssim + 1) / 2 #mssim+1: &#x5C06;mmsim&#x4E2D;&#x7684;&#x6BCF;&#x4E2A;&#x5143;&#x7D20;&#x90FD;&#x52A0;1.

        mcs = (mcs + 1) / 2

    pow1 = mcs ** weights
    pow2 = mssim ** weights
    # From Matlab implementation https://ece.uwaterloo.ca/~z70wang/research/iwssim/
    output = torch.prod(pow1[:-1] * pow2[-1]) #pow1&#x7684;&#x6240;&#x6709;&#x884C;&#x5217; * pow2&#x6539;&#x6210;&#x4E00;&#x4E32;&#x3002;  &#x8FD4;&#x56DE;&#x8F93;&#x5165;tensor&#x7684;&#x6240;&#x6709;&#x539F;&#x59CB;&#x7684;&#x4E58;&#x79EF;
    return output

#Structural similarity index &#x7ED3;&#x6784;&#x76F8;&#x4F3C;&#x6027;&#x6307;&#x6807;
Classes to re-use window
class SSIM(torch.nn.Module):
    def __init__(self, window_size=11, size_average=True, val_range=None):
        super(SSIM, self).__init__()
        self.window_size = window_size
        self.size_average = size_average
        self.val_range = val_range

        # Assume 1 channel for SSIM  #assume&#xFF1A;&#x5047;&#x5B9A;
        self.channel = 1
        self.window = create_window(window_size)

    def forward(self, img1, img2):
        (_, channel, _, _) = img1.size()

        if channel == self.channel and self.window.dtype == img1.dtype:
            window = self.window
        else:
            window = create_window(self.window_size, channel).to(img1.device).type(img1.dtype)
            self.window = window
            self.channel = channel

        return ssim(img1, img2, window=window, window_size=self.window_size, size_average=self.size_average)

#&#x591A;&#x5C3A;&#x5EA6;&#x7ED3;&#x6784;&#x76F8;&#x4F3C;&#x6027;
class MSSSIM(torch.nn.Module):
    def __init__(self, window_size=11, size_average=True, channel=3):  #size_average&#x6C42;&#x51FA;&#x6BCF;&#x4E2A;&#x5C0F;&#x7A97;&#x53E3;&#x7684;&#x76F8;&#x4F3C;&#x6027;&#x540E;&#xFF0C;&#x8981;&#x8BA1;&#x7B97;&#x6240;&#x6709;&#x7A97;&#x53E3;&#x76F8;&#x4F3C;&#x6027;&#x7684;&#x5E73;&#x5747;&#x503C;&#xFF0C;&#x4F5C;&#x4E3A;&#x6574;&#x4E2A;&#x56FE;&#x50CF;&#x7684;&#x76F8;&#x4F3C;&#x6027;&#x6307;&#x6807;&#x3002;
        super(MSSSIM, self).__init__()
        self.window_size = window_size
        self.size_average = size_average
        self.channel = channel

    def forward(self, img1, img2):
        # TODO: store window between calls if possible,
        # return msssim(img1, img2, window_size=self.window_size, size_average=self.size_average)
        return msssim(img1, img2, window_size=self.window_size, size_average=self.size_average, normalize=True)

Original: https://blog.csdn.net/yjysunshine/article/details/125707704
Author: 一只慢慢飞的笨笨鸟
Title: UNet3+详解

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/624976/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

深度学习之卷积

01 卷积卷积是指在滑动中提取特征的过程，可以形象地理解为用放大镜把每步都放大并且拍下来，再把拍下来的图片拼接成一个新的大图片的过程。 2D卷积是一个相当简单的操作：我们先从一个…

人工智能 2023年7月20日
0056
python tabula获取pdf的列表数据

tabula的功能比camelot更加强大，可以同时对多个表格数据进行提取。项目的具体地址请参考：https://github.com/chezou/tabula-py 安装 ta…

人工智能 2023年7月6日
0063
基础篇：一文讲懂树莓派命令行文本编辑工具Vim的使用

简介众所周知，在Linux系统下的命令行调试界面，经常会遇到需要文本编辑的情况，而树莓派官方系统默认自带了Nano编辑器，Nano的操作门槛更低，但却不如Vim编辑器方便。Vim…

人工智能 2023年6月12日
0062
最小二乘法是一种用于估计模型参数的方法，它通过最小化观测值与模型预测值之间的平方误差来拟合数据

问题描述最小二乘法是一种用于估计模型参数的方法，它通过最小化观测值与模型预测值之间的平方误差来拟合数据。本文将详细介绍最小二乘法的原理和计算步骤，并提供一个复杂的Python代码…

人工智能 2023年12月31日
0050
神经网络学习小记录69——Pytorch使用GoogleColab进行深度学习

神经网络学习小记录69——Pytorch 使用Google Colab进行深度学习注意事项学习前言什么是Google Colab 相关链接利用Colab进行训练 * 一、数…

人工智能 2023年7月23日
0046
用 Pandas 做 ETL，不要太快

久违了，朋友们，来篇干货。 ETL 的全称是 extract, transform, load，意思就是：提取、转换、加载。ETL 是数据分析中的基础工作，获取非结构化或难以使用…

人工智能 2023年6月11日
00101
imutils基础（1）安装与简单使用

imutils对一系列OpenCV函数进行二次封装，执行基本任务，如平移、旋转、调整大小和骨架提取。 1.安装这个包假设您已经安装了NumPy和OpenCV(如果您打算使用ope…

人工智能 2023年7月18日
0055
基于OpenCV的Haar与LBP级联分类器

级联分类器原理-AdaBoost ·Viola和Jones – 2001在CVPR提出 ·一种实时对象(人脸)检测框架 ·训练速度非常慢，检测速度非常快 ·5000个正向人脸样…

人工智能 2023年7月10日
0068
安装mongodb-community之后提示command not found: mongo找不到mongo指令

写在前面最近下载mongodb-community之后，试图使用mongo命令行，遇到了一点小问题。因为当前版本较新，用命令行操作mongodb的人也相对较少，互联网上搜索了很久…

人工智能 2023年7月29日
0095
Python 【问题描述】按照世卫组织的标准：男性：（身高cm-80）×70%=标准体重女性：（身高cm-70）×60%=标准体重标准体重正负10%为正常体重(含10%) 标准体重正负1

【问题描述】按照世卫组织的标准：男性：（身高cm-80）×70%=标准体重女性：（身高cm-70）×60%=标准体重标准体重正负10%为正常体重(含10%) 标准体重正负10…

人工智能 2023年7月9日
0086
PMP每日一练 | 考试不迷路-11.12（包含敏捷+多选）

11.27PMP考试倒计时 15天每日5道PMP习题助大家上岸PMP！题目1-2： 1.在项目的中途，产品负责人从发起人那里了解到：有一个主要组件，它已经完成了 20%，但…

人工智能 2023年6月27日
0068
数据分析：数据处理篇2

问题数据的处理空值的删除 * 空值揭秘 notnull方法 dropna方法 drop方法空值的填补 * fillna方法列间运算填充重复数据的处理 * duplicate…

人工智能 2023年7月8日
0098
python实现kd树以及最近邻查找算法

python实现kd树以及最近邻查找算法一、kd树简介二、kd树生成 * 1.确定切分域 2.确定数据域 3.理解递归树 4.python实现递归树代码三、kd树上的最近邻查…

人工智能 2023年7月3日
0093
点云深度学习——点云配准网络DCP复现

点云配准网络DCP复现前言一、效果展示 * 1.1 open3d中效果展示二、复现源码 * 2.1 参考链接 2.2 复现流程 2.3遇到问题：三、模型测试单个数据，并用o…

人工智能 2023年7月28日
0095
pandas 的数据结构（Series， DataFrame）

Pandas 讲解 Python Data Analysis Library 或 pandas 是基于NumPy 的一种工具，该工具是为了解决数据分析任务而创建的。 Pandas …

人工智能 2023年6月2日
00100
数学建模竞赛中必须掌握的10个统计分析方法

无论你在数据科学中是何种立场，你都无法忽视数据的重要性，数据科学家的职责就是分析、组织和应用这些数据。著名求职网站 Glassdoor 根据庞大的就业数据和员工反馈信息，将…

人工智能 2023年6月15日
0068

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

UNet3+详解

（1）UNet

（2）UNet++

(1) UNet_3Plus.py

(2) layers.py

(3) init_weights.py

(4) focalLoss.py

(5) iouLoss.py

(6) msssimLoss.py

大家都在看