Pytorch搭建LeNet5网络

2023年7月22日下午4:08 • 人工智能 • 阅读 51

本讲目标：
介绍Pytorch搭建LeNet5网络的流程。

Pytorch八股法搭建LeNet5网络

1.LeNet5网络介绍
2.Pytorch搭建LeNet5网络
*
2.1搭建LeNet网络
2.2测试LeNet网络输出
2.3下载数据集
2.4加载并配置网络
2.5训练并保存网络
2.6测试图片

1.LeNet5网络介绍

借鉴点：共享卷积核，减少网络参数（通过共享卷积参数，避免了像全连接层那样存在大量参数）。

LeNet由Yann LeCun于1998年提出，是卷积网络的开篇之作。LeNet-5提出以后，卷积神经网络成功的被商用，广泛的应用在邮政编码、支票号码识别等相关任务中。本节用LeNet网络实现Cifar10数据集的识别任务。

在统计卷积神经网络的层数时，一般只统计卷积计算层和全连接层，其余操作可以认为是卷积层的附属。
LeNet共5层网络（如下图），两层卷积层（C1、C2，前面两个卷积层，我们定义卷积层为包含了 conv、 batch normalization、 activation、 pool、 dropout等层的单元，这五部分不一定全部具备）和三层全连接层（D1、D2、D3，最后三个Dense层）。

卷积就是特征提取器：CBAPB

Conv2D、 BatchNormalization、 Activation、 Pooling、 Dropout

特征提取器（卷积层）:
C1:
C(核k：6x5x5，步长s：1，填充p：valid)
B(None)
A(sigmoid)
P(max,核k：2×2，步长s：2，填充p：valid)
D(None)
C2:
C(核k：16x5x5，步长s：1，填充k：valid)
B(None)
A(sigmoid)
P(max,核k：2×2，步长s：2，填充p：valid)
D(None)
分类器（全连接层）:
D1:
Dense(神经元：120，激活：sigmoid)
D2:
Dense(神经元： 84，激活：sigmoid)
D3:
Dense(神经元： 10，激活：softmax)

; 2.Pytorch搭建LeNet5网络

2.1搭建LeNet网络

搭建网络在 model.py文件中。

import torch
import torch.nn as nn
import torch.nn.functional as F

class LeNet(nn.Module):
    def __init__(self):
        super(LeNet,self).__init__()

        self.conv1=nn.Conv2d(in_channels=3,out_channels=16,kernel_size=5)
        self.pool1=nn.MaxPool2d(2,2)
        self.conv2=nn.Conv2d(16,32,5)
        self.pool2=nn.MaxPool2d(2,2)
        self.fc1=nn.Linear(32*5*5,120)
        self.fc2=nn.Linear(120,84)
        self.fc3=nn.Linear(84,10)

    def forward(self,x):
        x=F.relu(self.conv1(x))
        x=self.pool1(x)
        x=F.relu(self.conv2(x))
        x=self.pool2(x)
        x=x.view(-1,32*5*5)
        x=F.relu((self.fc1(x)))
        x=F.relu((self.fc2(x)))
        x=self.fc3(x)
        return x

2.2测试LeNet网络输出

我们输入的数据是一个32×32大小的3通道图像(3,32,32):input(3,32,32)->C1(16,28,28)->P1(16,14,14)->C2(32,10,10)->P2(32,5,5)->reshape(1,32 _5_5)->fc1(120)->fc2(80)->fc3(10)->output
测试网络如下：生成一个[1,3,32,32]大小的张量，其中1为batch_size，3为通道数，32为图像尺寸。观察输出是否为[1,10]。

input=torch.rand([1,3,32,32])
model=LeNet()
output=model(input)
print(model)
print(output.shape)

输出结果如下：

LeNet(
  (conv1): Conv2d(3, 16, kernel_size=(5, 5), stride=(1, 1))
  (pool1): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
  (conv2): Conv2d(16, 32, kernel_size=(5, 5), stride=(1, 1))
  (pool2): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
  (fc1): Linear(in_features=800, out_features=120, bias=True)
  (fc2): Linear(in_features=120, out_features=84, bias=True)
  (fc3): Linear(in_features=84, out_features=10, bias=True)
)
torch.Size([1, 10])

2.3下载数据集

下载数据集在 train.py文件中。

import torch.optim as optim
import torch.utils.data
import torchvision.datasets
import torchvision.transforms as transforms
import matplotlib.pyplot as plt
import numpy as np
import  torch.nn as nn

from model import  LeNet

transform=transforms.Compose(
    [transforms.ToTensor(),
     transforms.Normalize((0.5,0.5,0.5,),(0.5,0.5,0.5))
    ]
)

trainset=torchvision.datasets.CIFAR10(
    root='./data',
    train=True,
    download=True,
    transform=transform
    )

trainloader=torch.utils.data.DataLoader(
    trainset,
    batch_size=36,
    shuffle=True,
    num_workers=0
    )

testset=torchvision.datasets.CIFAR10(
    root='./data',
    train=False,
    download=True,
    transform=transform
    )

testloader=torch.utils.data.DataLoader(
    testset,
    batch_size=10000,
    shuffle=False,
    num_workers=0
    )
test_data_iter=iter(testloader)
test_image,test_label=test_data_iter.next()

2.4加载并配置网络

仍然在 train.py文件中。

net=LeNet()
loss_function=nn.CrossEntropyLoss()
optimizer=optim.Adam(net.parameters(),lr=0.001)

2.5训练并保存网络

仍然在 train.py文件中。
总共训练5个epoch，之前将batch_size设置为32，训练过程中，每100次batch_size，就打印以此在testloader上的损失。最后保存训练好的模型。

for epoch in range(5):
    running_loss=0.0
    for step,data in enumerate(trainloader,start=0):
        inputs,labels=data
        optimizer.zero_grad()
        outputs=net(inputs)
        loss=loss_function(outputs,labels)
        loss.backward()
        optimizer.step()

        running_loss+=loss.item()
        if step % 100==99:
            with torch.no_grad():
                outputs=net(test_image)
                predict_y=torch.max(outputs,dim=1)[1]
                accuracy=torch.eq(predict_y,test_label).sum().item()/test_label.size(0)

                print('[%d,%5d train_loss:%.3f test_accuracy:%.3f'%
                      (epoch+1,step+1,running_loss/500,accuracy))
                running_loss=0.0
print('Finish Training')

save_path= 'LeNet.pth'
torch.save(net.state_dict(),save_path)

最后一次epoch的打印信息如下，可以观察到识别精度已经达到了65.8%，对于一个古老的网络，这个精度已经是非常高了。

[5,  100 train_loss:0.163 test_accuracy:0.649
[5,  200 train_loss:0.171 test_accuracy:0.658
[5,  300 train_loss:0.172 test_accuracy:0.658
[5,  400 train_loss:0.173 test_accuracy:0.654
[5,  500 train_loss:0.170 test_accuracy:0.647
[5,  600 train_loss:0.175 test_accuracy:0.655
[5,  700 train_loss:0.176 test_accuracy:0.668
[5,  800 train_loss:0.172 test_accuracy:0.658
[5,  900 train_loss:0.168 test_accuracy:0.640
[5, 1000 train_loss:0.173 test_accuracy:0.658
[5, 1100 train_loss:0.173 test_accuracy:0.643
[5, 1200 train_loss:0.169 test_accuracy:0.661
[5, 1300 train_loss:0.167 test_accuracy:0.658
Finish Training

2.6测试图片

测试图片程序在 predict.py文件中。

import torch
import torchvision.transforms as transforms
from PIL import  Image
from  model import  LeNet

transform=transforms.Compose(
    [transforms.Resize((32,32)),
     transforms.ToTensor(),
     transforms.Normalize((0.5,0.5,0.5,),(0.5,0.5,0.5))
    ]
)

classes=('plane','car','bird','cat','deer','dog','frog','horse','ship','truck')

net = LeNet()
net.load_state_dict(torch.load('LeNet.pth'))
im=Image.open('1.jpg')
im=transform(im)
im=torch.unsqueeze(im,dim=0)
with torch.no_grad():
    outputs=net(im)
    predict=torch.softmax(outputs,dim=1)
    predict_class=torch.max(outputs,dim=1)[1].numpy()

print(classes[predict_class[0]],predict[0][predict_class[0]].numpy())

输出结果：

plane 0.8187075

Original: https://blog.csdn.net/qq_39400324/article/details/124383550
Author: AI Chen
Title: Pytorch搭建LeNet5网络

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/709218/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

手动绘制R语言Logistic回归模型的外部验证校准曲线（Calibration curve）（2）

校准曲线图表示的是预测值和实际值的差距，作为预测模型的重要部分，目前很多函数能绘制校准曲线。一般分为两种，一种是通过Hosmer-Lemeshow检验，把P值分为10等分，求出每等…

人工智能 2023年6月17日
0099
js获取当前时间

### 回答1： Auto. js_是一款能够模拟人的操作来自动化手机操作的工具。 _获取当前时间_也是Auto. _js_的基本功能之一。可以使用 _JavaScript_中的D…

人工智能 2023年6月29日
0065
AAAI‘22 推荐系统论文梳理

2022推荐系统论文梳理系列推荐系统相关顶会整理 IJCAI’22 推荐系统论文梳理 ICML/ICLR’22 推荐系统论文梳理 WWW’22…

人工智能 2023年6月16日
0059
阿里云机器学习PAI开源中文NLP算法框架EasyNLP，助力NLP大模型落地

作者：临在、岑鸣、熊兮一导读随着 BERT、Megatron、GPT-3 等预训练模型在NLP领域取得瞩目的成果，越来越多团队投身到超大规模训练中，这使得训练模型的规模从亿级…

人工智能 2023年5月27日
0068
解决pycharm安装深度学习pytorch的d2l包失败问题

解决pycharm安装深度学习pytorch的d2l包失败问题 1、首先查看现在pycharm所在的环境 2、打开Anaconda Prompt 3、激活现在的虚拟环境 4、安装d…

人工智能 2023年7月21日
0096
无人机编队的典型应用及控制算法开发流程介绍

随着材料科学、导航定位系统、飞控核心技术的发展和进步，旋翼飞行器在军事和民用应用上越来越广泛，众所周知，微型旋翼飞行器具备价格低廉、操作灵活、控制稳定、适应环境能力强等特点，然而单…

人工智能 2023年6月10日
0096
玩转MySQL：分清回滚、重做、逻辑这些日志很重要！

引言任何项目都会有日志，MySQL也不例外，而且MySQL更是其中的佼佼者，日志种类繁多，而本篇的目的就是全解MySQL中的各类日志，如撤销日志、错误日志、慢查询日志、中继日志、…

人工智能 2023年6月26日
0073
依存句法分析

捂脸欢迎star ^_^ 定义 HanLP的定义依存句法分析，是指识别语句中词与词之间的依存关系，并揭示其句法结构，包括主谓关系、动宾关系、核心关系等。用依存语言学来理解语义，…

人工智能 2023年5月30日
00126
知识图谱学习笔记-知识图谱价值和发展历程

一：知识图谱的价值： 1.辅助搜索传统搜索引擎依靠网页之间的超链接实现网页的搜索，而语义搜索是直接对事物进行搜索，如人物、机构、地点等。这些事物可能来自文本、图片、视频、音频、I…

人工智能 2023年6月10日
0057
将Python文件打包成exe文件（超详细）

首先，我们为什么要把Python文件打包成exe文件？因为，Python文件需要Python IDE打开而exe，就是一个程序，双击就开了！岂不是妙哉？首先，打开终端，我们…

人工智能 2023年7月5日
0062
华为Atlas200DK的环境部署与运行demo（人脸识别）

文章目录前言一、部署准备 * 1.基本准备 2.安全清空sd卡 3.安装摄像头二、环境部署 * 1.运行环境与开发环境合设 – 1.烧录dd镜像 2.开发板启动 …

人工智能 2023年6月16日
0058
多激光雷达标定multi_LiDAR_calibration

多激光雷达标定multi_LiDAR_calibration 对于多激光雷达的标定主要采用ICP、NDT等配准方法进行估计多个激光雷达的外参变换矩阵T T T。在这里先介绍一些先前…

人工智能 2023年5月26日
0069
超详细的OpenCV入门教程，12小时带你吃透OpenCV。

OpenCV简介： OpenCV是一个基于Apache2.0许可（开源）发行的跨平台计算机视觉和机器学习软件库，可以运行在linux、Windows、Android和MAC OS操…

人工智能 2023年6月17日
0065
视觉机器学习20讲-MATLAB源码示例（7）-EM算法

视觉机器学习20讲-MATLAB源码示例（7）-EM算法 1. EM算法 2. Matlab仿真 3. 仿真结果 4. 小结 ; 1. EM算法最大期望算法（Expectatio…

人工智能 2023年5月28日
0060
基于卷积神经网络的图像识别技术从入门到深爱（理论思想与代码实践齐飞）

基于卷积神经网络的图像识别技术从入门到深爱（理论与代码实践齐飞！）零、前言一、手写数字识别入门神经网络（入门篇） * 1. 手写数字数据集及神经网络数据概念介绍 –…

人工智能 2023年5月25日
00104
Python学习记录逻辑回归

什么是逻辑回归逻辑回归，简称LR它可以将我们离散的特征输入集合转换为0和1这两类的概率它只有两种结果的选择比如说购买商品可以选择买或者不买逻辑回归会将特征值转化为0,1它可以…

人工智能 2023年6月18日
0050

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30