Pytorch：卷积神经网络-空洞卷积

2023年5月26日上午9:52 • 人工智能 • 阅读 84

Pytorch: 空洞卷积神经网络

Copyright: Jingmin Wei, Pattern Recognition and Intelligent System, School of Artificial and Intelligence, Huazhong University of Science and Technology

Pytorch教程专栏链接

文章目录

*
–
+ Pytorch: 空洞卷积神经网络
– @[toc]
–
+
* 空洞卷积神经网络搭建
* 数据预处理
* 空洞卷积神经网络的训练和预测

本教程不商用，仅供学习和参考交流使用，如需转载，请联系本人。

相对于普通卷积，空洞卷积通过在卷积核中添加空洞( 0 0 0 元素)，从而增大感受野，获取更多信息。感受野为在卷积神经网络中，决定某一层输出结果中一个元素对应的输入层的区域大小，通俗解释就是特征映射上的一个点对应输入图上的区域大小。

对于一个 3 × 3 3\times3 3 ×3 的 2 2 2-空洞卷积运算，实际的卷积核大小还是 3 × 3 3\times3 3 ×3 。但是空洞为 1 1 1 ，这样卷积核就会扩充一个 7 × 7 7\times7 7 ×7 的图像块，但只有 9 9 9 个红色的点会有权重取值进行卷积操作。也可以理解为卷积核的大小为 7 × 7 7\times7 7 ×7 ，但只有图中的 9 9 9 个点的权重不为 0 0 0 ，其他均为 0 0 0 。实际卷积权重只有 3 × 3 3\times3 3 ×3 ，但感受野实际为 7 × 7 7\times7 7 ×7 。对于 15 × 15 15\times15 1 5 ×1 5 的，实际卷积只有 9 × 9 9\times9 9 ×9 。

在 nn.Conv2d() 函数中，调节 dilation 的取值，即可进行不同大小卷积核的空洞卷积运算。

我们搭建的空洞卷积神经网络有两个空洞卷积层，两个池化层和两个全连接层，分类器依旧包含 10 10 1 0 个神经元，除了卷积方式差异，与前文识别 FashionMNIST 的网络结构完全相同。

空洞卷积神经网络搭建

import numpy as np
import pandas as pd
from sklearn.metrics import accuracy_score, confusion_matrix
import matplotlib.pyplot as plt
import seaborn as sns
import copy
import time
import torch
import torch.nn as nn
from torch.optim import Adam
import torch.utils.data as Data
from torchvision import transforms
from torchvision.datasets import FashionMNIST

class MyConvDilaNet(nn.Module):
    def __init__(self):
        super(MyConvDilaNet, self).__init__()

        self.conv1 = nn.Sequential(
            nn.Conv2d(
                in_channels = 1,
                out_channels = 16,
                kernel_size = 3,
                stride = 1,
                padding = 1,
                dilation = 2,
            ),
            nn.ReLU(),
            nn.AvgPool2d(
                kernel_size = 2,
                stride = 2,
            ),

        )
        self.conv2 = nn.Sequential(
            nn.Conv2d(16, 32, 3, 1, 0, dilation = 2),
            nn.ReLU(),
            nn.AvgPool2d(2, 2),
        )
        self.classifier = nn.Sequential(
            nn.Linear(32 * 4 * 4, 256),
            nn.ReLU(),
            nn.Linear(256, 128),
            nn.ReLU(),
            nn.Linear(128, 10),
        )

    def forward(self, x):

        x = self.conv1(x)
        x = self.conv2(x)
        x = x.view(x.size(0), -1)
        output = self.classifier(x)

        return output

数据预处理

数据预处理部分和上文相同。


train_data = FashionMNIST(
    root = './data/FashionMNIST',
    train = True,
    transform = transforms.ToTensor(),
    download = False
)

train_loader = Data.DataLoader(
    dataset = train_data,
    batch_size = 64,
    shuffle = False,
    num_workers = 2,
)

print(len(train_loader))

for step, (b_x, b_y) in enumerate(train_loader):
    if step > 0:
        break

batch_x = b_x.squeeze().numpy()
batch_y = b_y.numpy()
label = train_data.classes
label[0] = 'T-shirt'

plt.figure(figsize = (12, 5))
for i in np.arange(len(batch_y)):
    plt.subplot(4, 16, i + 1)
    plt.imshow(batch_x[i, :, :], cmap = plt.cm.gray)
    plt.title(label[batch_y[i]], size = 9)
    plt.axis('off')
    plt.subplots_adjust(wspace = 0.05)

test_data = FashionMNIST(
    root = './data/FashionMNIST',
    train = False,
    download = False
)

test_data_x = test_data.data.type(torch.FloatTensor) / 255.0
test_data_x = torch.unsqueeze(test_data_x, dim = 1)
test_data_y = test_data.targets

print(test_data_x.shape)
print(test_data_y.shape)

938
torch.Size([10000, 1, 28, 28])
torch.Size([10000])


myconvdilanet = MyConvDilaNet()

from torchsummary import summary
summary(myconvdilanet, input_size=(1, 28, 28))

Input size (MB): 0.00
Forward/backward pass size (MB): 0.24
Params size (MB): 0.65
Estimated Total Size (MB): 0.89
0 Train Loss: 0.8922 Train Acc: 0.6718
0 Val Loss: 0.6322 Val Acc: 0.7498
Train and Val complete in 1m 2s
Epoch 1/24
2 Train Loss: 0.5331 Train Acc: 0.7948
2 Val Loss: 0.5047 Val Acc: 0.8107
Train and Val complete in 3m 3s
Epoch 3/24
4 Train Loss: 0.4540 Train Acc: 0.8320
4 Val Loss: 0.4491 Val Acc: 0.8369
Train and Val complete in 4m 54s
Epoch 5/24
6 Train Loss: 0.4047 Train Acc: 0.8512
6 Val Loss: 0.4075 Val Acc: 0.8531
Train and Val complete in 6m 45s
Epoch 7/24
8 Train Loss: 0.3690 Train Acc: 0.8655
8 Val Loss: 0.3762 Val Acc: 0.8653
Train and Val complete in 8m 36s
Epoch 9/24
10 Train Loss: 0.3440 Train Acc: 0.8751
10 Val Loss: 0.3552 Val Acc: 0.8710
Train and Val complete in 10m 26s
Epoch 11/24
12 Train Loss: 0.3250 Train Acc: 0.8812
12 Val Loss: 0.3412 Val Acc: 0.8762
Train and Val complete in 12m 21s
Epoch 13/24
14 Train Loss: 0.3092 Train Acc: 0.8870
14 Val Loss: 0.3299 Val Acc: 0.8810
Train and Val complete in 14m 26s
Epoch 15/24
16 Train Loss: 0.2956 Train Acc: 0.8921
16 Val Loss: 0.3182 Val Acc: 0.8838
Train and Val complete in 16m 30s
Epoch 17/24
18 Train Loss: 0.2836 Train Acc: 0.8961
18 Val Loss: 0.3093 Val Acc: 0.8872
Train and Val complete in 18m 33s
Epoch 19/24
20 Train Loss: 0.2728 Train Acc: 0.9004
20 Val Loss: 0.3020 Val Acc: 0.8911
Train and Val complete in 20m 52s
Epoch 21/24
22 Train Loss: 0.2625 Train Acc: 0.9038
22 Val Loss: 0.2960 Val Acc: 0.8942
Train and Val complete in 22m 55s
Epoch 23/24
24 Train Loss: 0.2531 Train Acc: 0.9062
24 Val Loss: 0.2907 Val Acc: 0.8942
Train and Val complete in 24m 58s

使用折线图可视化训练过程：


plt.figure(figsize = (12, 4))
plt.subplot(1, 2, 1)
plt.plot(train_process.epoch, train_process.train_loss_all, 'ro-', label = 'Train loss')
plt.plot(train_process.epoch, train_process.val_loss_all, 'bs-', label = 'Val loss')
plt.legend()
plt.xlabel('epoch')
plt.ylabel('Loss')

plt.subplot(1, 2, 2)
plt.plot(train_process.epoch, train_process.train_acc_all, 'ro-', label = 'Train acc')
plt.plot(train_process.epoch, train_process.val_acc_all, 'bs-', label = 'Val acc')
plt.legend()
plt.xlabel('epoch')
plt.ylabel('Acc')

plt.show()

计算空洞卷积模型的泛化能力：


myconvdilanet.eval()
output = myconvdilanet(test_data_x)
pre_lab = torch.argmax(output, 1)
acc = accuracy_score(test_data_y, pre_lab)
print(test_data_y)
print(pre_lab)
print('测试集上的预测精度为', acc)

tensor([9, 2, 1,  ..., 8, 1, 5])
tensor([9, 2, 1,  ..., 8, 1, 5])
&#x6D4B;&#x8BD5;&#x96C6;&#x4E0A;&#x7684;&#x9884;&#x6D4B;&#x7CBE;&#x5EA6;&#x4E3A; 0.8841

使用热力图，观察每类数据上的预测情况：


conf_mat = confusion_matrix(test_data_y, pre_lab)
df_cm = pd.DataFrame(conf_mat, index = label, columns = label)
heatmap = sns.heatmap(df_cm, annot = True, fmt = 'd', cmap = 'YlGnBu')
heatmap.yaxis.set_ticklabels(heatmap.yaxis.get_ticklabels(), rotation = 0, ha = 'right')
heatmap.xaxis.set_ticklabels(heatmap.xaxis.get_ticklabels(), rotation = 45, ha = 'right')
plt.ylabel('True label')
plt.xlabel('Predicted label')
plt.show()

Original: https://blog.csdn.net/weixin_44979150/article/details/122778696
Author: 宅家的小魏
Title: Pytorch：卷积神经网络-空洞卷积

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/519073/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

基于深度学习的车牌检测识别(Pytorch)(ResNet +Transformer）

车牌识别概述基于深度学习的车牌识别，其中，车辆检测网络直接使用YOLO侦测。而后，才是使用网络侦测车牌与识别车牌号。车牌的侦测网络，采用的是resnet18，网络输出检测边框…

人工智能 2023年7月27日
0070
Pytorch入门教程

👨‍💻 作者简介：大数据专业硕士在读，CSDN人工智能领域博客专家，阿里云专家博主，专注大数据与人工智能知识分享，公众号：GoAI的学习小屋，免费分享书籍、简历、导图等资料，更有…

人工智能 2023年6月16日
0081
Spring Cloud Alibaba —— 服务注册与配置中心

🔎这里是【秒懂·云原生】，关注我学习云原生不迷路👍如果对你有帮助，给博主一个免费的点赞以示鼓励欢迎各位🔎点赞👍评论收藏⭐️ ; 👀专栏介绍【秒懂·云原生】目前主要更新微服务，一…

人工智能 2023年5月30日
0096
基于OpenGL实现PS部分混合模式

混合模式介绍 1.什么是混合模式? 为了让不同色彩的图片叠加后能够实现更多种色彩组合,从而渲染出各式各样的画面,PS 提供了各式各样规则的混合模式(这里就不具体一一介绍了,提供一个…

人工智能 2023年6月20日
00101
MATLAB 基础知识数据类型分组数组对分类数据绘图

本文演示了如何对分类数组中的数据绘图。加载样本数据加载从 100 位患者收集的样本数据。 load patients whos Name Size Bytes Class At…

人工智能 2023年7月3日
0068
Pandas groupby用法

创建数据： #!/usr/bin/python import pandas as pd import numpy as np data = {‘A’: [‘a’, ‘b’, ‘a’…

人工智能 2023年7月8日
0047
Ros入门（一）创建工作空间及功能包

一、创建工作空间二、创建功能包并导入依赖三、在功能包下新建几个文件夹存放数据 mkdir -p ~/ros001/src cd ~/ros001/src catkin_init…

人工智能 2023年6月2日
0058
3D-SIS: 3D Semantic Instance Segmentation of RGB-D Scans学习总结

概述：这篇文章的介绍了一个3D-SIS的新颖卷积神经网络用来对商品的RGB-D扫描图像进行语义实例分割。序言：他们将每个图像中的每个像素通过2D卷积提取特征图谱，之后将结果反投影…

人工智能 2023年6月10日
0081
Paddlenlp之UIE关系抽取模型【高管关系抽取为例】

NLP专栏简介：数据增强、智能标注、意图识别算法|多分类算法、文本信息抽取、多模态信息抽取、可解释性分析、性能调优、模型压缩算法等专栏详细介绍：NLP专栏简介：数据增强、智能标注…

人工智能 2023年6月1日
0069
Pandas缺失值处理

一、什么是稀疏数据？稀疏数据指的是在数据库或者数据集中存在大量缺失数据或者空值，我们把这样的数据集称为稀疏数据集。大致原因由于调查不当产生的稀疏数据；由于天然限制产生的稀疏数…

人工智能 2023年7月6日
0081
[论文][半监督语义分割]Semi-Supervised Semantic Segmentation with Cross Pseudo Supervision

CVPR2021 原文半监督语义分割方法的总结：主要思想： Consistency regularization ：希望不同扰动之下网络的输出结果一致，扰动的加入的位置：（1）…

人工智能 2023年6月6日
00112
【Python机器学习项目】项目一：心脏病二分类问题

使用机器学习预测心脏病根据一些病理学属性预测心脏病特别说明：开新坑啦！本系列共2个项目，难度不大，特别适合新手入坑由于本项目只是系列课程的第一个项目，所以很多细节不深挖，仅…

人工智能 2023年7月2日
00112
【时序列】时序列数据如何一步步分解成趋势（trend）季节性（seasonality）和误差（residual）- 详细理解python sm.tsa.seasonal_decompose

【时序列】时序列数据如何一步步分解成趋势（trend）季节性（seasonality）和误差（residual）- 理解python sm.tsa.seasonal_decompo…

人工智能 2023年7月28日
00145
使用tensorRT C++ API搭建MLP网络详解

本文详细说明，如何使用 tensorrt C++ API搭建MPL网络，实现推理，帮助与我类似的小白更快上手C++ 版本的方法，我将介绍内容为：简单介绍、visual studi…

人工智能 2023年6月4日
0093
MATLAB–二维图像和三维图像的绘制

目录一、基本绘图命令 1、plot绘图命令 ①plot命令的几种不同格式 ②基本线形、标记和颜色 2、fplot绘图命令 3、ezplot绘图命令 4、subplot绘图命令 5…

人工智能 2023年6月29日
0062
双目立体视觉(一) 基本原理和步骤

目录一、双目立体视觉系统的四个基本步骤二、各步骤原理 1、相机标定 2、立体校正 3、立体匹配一、双目立体视觉系统的四个基本步骤相机标定主要包含两部分内容: 单相机的内参标…

人工智能 2023年6月1日
0099

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Pytorch：卷积神经网络-空洞卷积

Pytorch: 空洞卷积神经网络

文章目录

空洞卷积神经网络搭建

数据预处理

大家都在看