365天深度学习训练营-第P1周：实现mnist手写数字识别

2023年6月30日上午7:37 • 人工智能 • 阅读 105

🍨 本文为🔗365天深度学习训练营内部限免文章（版权归 K同学啊所有）
🍦 参考文章地址： 🔗第P1周：实现mnist手写数字识别 | 365天深度学习训练营
🍖 作者：K同学啊 | 接辅导、程序定制

文章目录

我的环境：
一、前期工作
*
1. 设置 GPU
2. 导入数据
3. 数据可视化
二、构建简单的CNN网络
三、训练模型
*
1. 设置超参数
2. 编写训练函数
3. 编写测试函数
4. 正式训练
四、结果可视化
五、用自己制作的图片进行预测

我的环境：

语言环境：Python 3.7.13
编译器：jupyter notebook
深度学习环境：
torch==1.12.1+cu113、cuda==11.3.1
torchvision==0.13.1+cu113、cuda==11.3.1

一、前期工作

1. 设置 GPU

import torch
import torch.nn as nn
import matplotlib.pyplot as plt
import torchvision

device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

device

device(type='cuda')

2. 导入数据

train_ds = torchvision.datasets.MNIST('data',
                                      train=True,
                                      transform=torchvision.transforms.ToTensor(),
                                      download=True)

test_ds  = torchvision.datasets.MNIST('data',
                                      train=False,
                                      transform=torchvision.transforms.ToTensor(),
                                      download=True)

batch_size = 32

train_dl = torch.utils.data.DataLoader(train_ds,
                                       batch_size=batch_size,
                                       shuffle=True)

test_dl  = torch.utils.data.DataLoader(test_ds,
                                       batch_size=batch_size)


imgs, labels = next(iter(train_dl))
imgs.shape

torch.Size([32, 1, 28, 28])

3. 数据可视化

import numpy as np

plt.figure(figsize=(20, 5))
for i, imgs in enumerate(imgs[:20]):

    npimg = np.squeeze(imgs.numpy())

    plt.subplot(2, 10, i+1)
    plt.imshow(npimg, cmap=plt.cm.binary)
    plt.axis('off')

二、构建简单的CNN网络

使用 image_dataset_from_directory 方法将磁盘中的数据加载到 tf.data.Dataset 中

import torch.nn.functional as F

num_classes = 10

class Model(nn.Module):
     def __init__(self):
        super().__init__()

        self.conv1 = nn.Conv2d(1, 32, kernel_size=3)
        self.pool1 = nn.MaxPool2d(2)
        self.conv2 = nn.Conv2d(32, 64, kernel_size=3)
        self.pool2 = nn.MaxPool2d(2)

        self.fc1 = nn.Linear(1600, 64)
        self.fc2 = nn.Linear(64, num_classes)

     def forward(self, x):
        x = self.pool1(F.relu(self.conv1(x)))
        x = self.pool2(F.relu(self.conv2(x)))

        x = torch.flatten(x, start_dim=1)

        x = F.relu(self.fc1(x))
        x = self.fc2(x)

        return x

from torchinfo import summary

model = Model().to(device)

summary(model)

=================================================================
Layer (type:depth-idx)                   Param
=================================================================
Model                                    --
├─Conv2d: 1-1                            320
├─MaxPool2d: 1-2                         --
├─Conv2d: 1-3                            18,496
├─MaxPool2d: 1-4                         --
├─Linear: 1-5                            102,464
├─Linear: 1-6                            650
=================================================================
Total params: 121,930
Trainable params: 121,930
Non-trainable params: 0
=================================================================

三、训练模型

1. 设置超参数

loss_fn    = nn.CrossEntropyLoss()
learn_rate = 1e-2
opt        = torch.optim.SGD(model.parameters(),lr=learn_rate)

2. 编写训练函数


def train(dataloader, model, loss_fn, optimizer):
    size = len(dataloader.dataset)
    num_batches = len(dataloader)

    train_loss, train_acc = 0, 0

    for X, y in dataloader:
        X, y = X.to(device), y.to(device)

        pred = model(X)
        loss = loss_fn(pred, y)

        optimizer.zero_grad()
        loss.backward()
        optimizer.step()

        train_acc  += (pred.argmax(1) == y).type(torch.float).sum().item()
        train_loss += loss.item()

    train_acc  /= size
    train_loss /= num_batches

    return train_acc, train_loss

3. 编写测试函数

def test (dataloader, model, loss_fn):
    size        = len(dataloader.dataset)
    num_batches = len(dataloader)
    test_loss, test_acc = 0, 0

    with torch.no_grad():
        for imgs, target in dataloader:
            imgs, target = imgs.to(device), target.to(device)

            target_pred = model(imgs)
            loss        = loss_fn(target_pred, target)

            test_loss += loss.item()
            test_acc  += (target_pred.argmax(1) == target).type(torch.float).sum().item()

    test_acc  /= size
    test_loss /= num_batches

    return test_acc, test_loss

4. 正式训练

epochs     = 5
train_loss = []
train_acc  = []
test_loss  = []
test_acc   = []

for epoch in range(epochs):
    model.train()
    epoch_train_acc, epoch_train_loss = train(train_dl, model, loss_fn, opt)

    model.eval()
    epoch_test_acc, epoch_test_loss = test(test_dl, model, loss_fn)

    train_acc.append(epoch_train_acc)
    train_loss.append(epoch_train_loss)
    test_acc.append(epoch_test_acc)
    test_loss.append(epoch_test_loss)

    template = ('Epoch:{:2d}, Train_acc:{:.1f}%, Train_loss:{:.3f}, Test_acc:{:.1f}%，Test_loss:{:.3f}')
    print(template.format(epoch+1, epoch_train_acc*100, epoch_train_loss, epoch_test_acc*100, epoch_test_loss))
print('Done')

Epoch: 1, Train_acc:77.6%, Train_loss:0.744, Test_acc:91.1%，Test_loss:0.284
Epoch: 2, Train_acc:94.1%, Train_loss:0.196, Test_acc:96.2%，Test_loss:0.128
Epoch: 3, Train_acc:96.2%, Train_loss:0.123, Test_acc:97.5%，Test_loss:0.089
Epoch: 4, Train_acc:97.1%, Train_loss:0.094, Test_acc:97.4%，Test_loss:0.078
Epoch: 5, Train_acc:97.5%, Train_loss:0.078, Test_acc:98.0%，Test_loss:0.062
Done

四、结果可视化

import matplotlib.pyplot as plt

import warnings
warnings.filterwarnings("ignore")
plt.rcParams['font.sans-serif']    = ['SimHei']
plt.rcParams['axes.unicode_minus'] = False
plt.rcParams['figure.dpi']         = 100

epochs_range = range(epochs)

plt.figure(figsize=(12, 3))
plt.subplot(1, 2, 1)

plt.plot(epochs_range, train_acc, label='Training Accuracy')
plt.plot(epochs_range, test_acc, label='Test Accuracy')
plt.legend(loc='lower right')
plt.title('Training and Validation Accuracy')

plt.subplot(1, 2, 2)
plt.plot(epochs_range, train_loss, label='Training Loss')
plt.plot(epochs_range, test_loss, label='Test Loss')
plt.legend(loc='upper right')
plt.title('Training and Validation Loss')
plt.show()

五、用自己制作的图片进行预测

for i in range(10):
    img_path = 'imgs/no' + str(i) + '.png'
    img = Image.open(img_path)
    img = img.convert('L')
    img = data_transform(img)
    img = torch.unsqueeze(img, dim=0)
    img = img.to(device)

    model.eval()
    with torch.no_grad():
        output = model(img)
        print(output.argmax(1).item())

预测结果：

Original: https://blog.csdn.net/lele_ne/article/details/127801624
Author: lele_ne
Title: 365天深度学习训练营-第P1周：实现mnist手写数字识别

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/660902/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

CVPR2019领域自适应/语义分割：Adapting Structural Information across Domains for Boosting Sema适应结构信息跨领域促进语义分割

CVPR2019 All about Structure: Adapting Structural Information across Domains for Boosting …

人工智能 2023年6月22日
0088
基于猎人猎物算法优化LSTM的碳交易价格预测

目录一、背景及介绍 1.1 长短时记忆网络 1.2 猎人猎物优化算法（HPO）二、HPO-LSTM算法三、模型仿真与结果分析四、总结与展望一、背景及介绍 1.1 长短时记…

人工智能 2023年7月14日
0087
故障分类中的特征提取效果浅见

0 前言首先说明基本情况，本人入坑轴承故障诊断一年多，对于很多问题可定认识还不到位，研究的也不够彻底，所以这篇文章算是自己的阶段性认知，不一定正确，希望各位道友批评指正。当初之…

人工智能 2023年6月30日
0068
自动驾驶轨迹预测论文阅读（二）TPNet: Trajectory Proposal Network for Motion Prediction

论文链接：https://openaccess.thecvf.com/content_CVPR_2020/papers/Fang_TPNet_Trajectory_Proposal…

人工智能 2023年7月28日
0092
opencv改变图片大小，cv2.resize方法详解

cv2.resize可以改变图片的尺寸，方法如下 def resize(src, dsize, dst=None, fx=None, fy=None, interpolation=…

人工智能 2023年6月18日
0061
【sklearn】使用sklearn实现决策树

【sklearn】使用sklearn实现决策树 * – + 1. 决策树介绍 + * 1. 信息熵 * 2. 信息增益 * 3. 信息熵和信息增益 + 2. 使用skl…

人工智能 2023年6月24日
0085
通过深度学习实现对网络异常流量检测

消失了好几个月，突然想起来自己还有这么一个CSDN的账号，趁着这几天有空，总结一下最近这段时间所做的事情。前言：随着网络技术的快速发展，各式各样的新型恶意攻击不断出现。如何改善对…

人工智能 2023年6月17日
0075
【Bug解决】nvcc fatal : Unsupported gpu architecture ‘compute_86‘

报错描述执行 pip install ‘git+https://github.com/facebookresearch/detectron2.git’ 安装 detectron2…

人工智能 2023年6月16日
00172
chrome 开启HEVC硬件解码

chrome 开启HEVC硬件解码文章目录 1. chrome 开启HEVC硬件解码 * 1.1 判断客户机是否支持HEVC硬解码 1.2 chrome浏览器配置 1.3 通过播…

人工智能 2023年5月30日
00146
项目中如何配置 Maven 为国内源

目录 1. 创建出一个 Maven 项目 2. 打开项目配置界面, 检查并配置国内源 2.1 打开配置界面 (当前项目界面和新项目配置界面) 2.2 搜索 “Maven…

人工智能 2023年6月26日
0077
GIt的使用

简介全局设置 Git常用命令远程仓库命令从远程仓库拉取分支标签 idea集合git 加入缓存区推送远程仓库拉取远程仓库分支合并/新增本地推送远程仓库简介 Git…

人工智能 2023年6月27日
0064
python2.7.13安装keras记录

keras给出的版本大多对应的是python3.x版本，但有时一些项目需要用到python2.x的环境，版本找起来很麻烦。所以拉宝要写这篇文章来记录和总结自己的安装过程(并防止下一…

人工智能 2023年5月25日
0082
Python如何优雅地可视化目标检测框

1 引言随着计算机视觉算法工程师的内卷,从事目标检测的小伙伴们越来越多了. 很多时候我们费了九牛二虎之力训练了一版模型,可是可视化出来的效果平淡无奇. 是不是有点太不给力啦,作为…

人工智能 2023年6月22日
0072
Deep Reasoning with Knowledge Graph for Social Relationship Understanding 阅读笔记

0 摘要任务：社交关系推理现状：以往的研究忽略了(i)社交关系之间的相互影响，(ii)人周围的场景信息解决办法：1 提出了一个端到端的可训练图推理模块，探索人与前后物体的交互关系…

人工智能 2023年6月1日
0089
融合transformer和对抗学习的多变量时间序列异常检测算法TranAD论文和代码解读…

一、前言今天的文章来自VLDB TranAD: Deep Transformer Networks for Anomaly Detection in Multivariate T…

人工智能 2023年7月26日
0063
代码随想录训练营day53

题目一：最长公共子序列题目描述：给定两个字符串 text1 和 text2，返回这两个字符串的最长公共子序列的长度。如果不存在公共子序列，返回 0 。一个字符串的子…

人工智能 2023年6月29日
0074

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31