Pytorch加载模型并进行图像分类预测

2023年7月22日下午9:31 • 人工智能 • 阅读 58

1) How can i convert an RGB image into grayscale in Python?

2）PIL 处理图像的基本操作

3）图像通道数的理解

4）Convert 3 channel image to 2 channel

5）图像通道转换

6）将所有的图像合并为一个numpy数组

7）torch.from_numpy VS torch.Tensor

8）torch.squeeze() 和torch.unsqueeze()

3.issue

1）TypeError: ‘module’ object is not callable

2）TypeError: ‘collections.OrderedDict’ object is not callable

3) TypeError: init() missing 1 required positional argument: ‘XX’

4) RuntimeError: Error(s) in loading state_dict for PythonNet: Missing key(s) in state_dict:

5) RuntimeError: Expected 4-dimensional input for 4-dimensional weight [128, 1, 3, 3], but got 2-dimensional input of size [480, 640] instead

6）RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor

1）实例化模型

Assume that the content of YourClass.py is:

class YourClass:
    # ......

If you use:

from YourClassParentDir import YourClass  # means YourClass

from model import PythonNet

net= PythonNet(T=16).eval().cuda()

2）加载模型

import torch

net.load_state_dict(torch.load('checkpoint_max.pth'),False)

3）输入图像

目的：从文件夹中加载所有图像组合为一个 numpy array 作为模型输入

原始图像输入维度：（480,640）

目标图像输入维度：（16,1,128,128）

import glob
from PIL import Image
import numpy as np
from torch.autograd import Variable

#获取图像路径
filelist = glob.glob('./testdata/a/*.jpg')

#打开图像open('frame_path')--》转换为灰度图convert('L')--》缩放图像resize((width, height)) --》合并文件夹中的所有图像为一个numpy array
x = np.array([np.array(Image.open(frame).convert('L').resize((128,128))) for frame in filelist])

#用torch.from_numpy这个方法将numpy类转换成tensor类
x = torch.from_numpy(x).type(torch.FloatTensor).cuda()

#扩充数据维度
x = Variable(torch.unsqueeze(x,dim=1).float(),requires_grad=False)

4）输出分类结果

outputs = net(x)
_, predicted = torch.max(outputs,1)

torch.max()这个函数返回输入张量中所有元素的最大值。

返回一个命名元组(values,indices)，其中values是给定维dim中输入张量的每一行的最大值。indices是找到的每个最大值的索引位置(argmax)。也就是说，返回的第一个值是对应图像在类别中的最大概率值，第二个值是最大概率值的对应类别。

5）完整代码

from PIL import Image
from torch.autograd import Variable
import numpy as np
import torch
import glob
from model import PythonNet

##############处理输入图像#######################################
#获取图像路径
filelist = glob.glob('./testdata/a/*.jpg')

#打开图像open('frame_path')--》转换为灰度图convert('L')--》缩放图像resize((width, height)) --》合并文件夹中的所有图像为一个numpy array
x = np.array([np.array(Image.open(frame).convert('L').resize((128,128))) for frame in filelist])

#用torch.from_numpy这个方法将numpy类转换成tensor类
x = torch.from_numpy(x).type(torch.FloatTensor).cuda()

#扩充数据维度
x = Variable(torch.unsqueeze(x,dim=1).float(),requires_grad=False)

#############定义预测函数#######################################
def predict(x):
    net= PythonNet(T=16).eval().cuda()
    net.load_state_dict(torch.load('checkpoint_max.pth'),False)

    outputs = net(x)
    _, predicted = torch.max(outputs,1)
    print("_:",_)
    print("predicted:",predicted)
    print("outputs:",outputs)

############输入图像进行预测#####################################
predict(x)

1) How can i convert an RGB image into grayscale in Python?

2）PIL 处理图像的基本操作

3）图像通道数的理解

4）Convert 3 channel image to 2 channel

5）图像通道转换

6）将所有的图像合并为一个numpy数组

7）torch.from_numpy VS torch.Tensor

8）torch.squeeze() 和torch.unsqueeze()

1）TypeError: ‘module’ object is not callable

2）TypeError: ‘collections.OrderedDict’ object is not callable

3) TypeError: init() missing 1 required positional argument: ‘XX’

4) RuntimeError: Error(s) in loading state_dict for PythonNet: Missing key(s) in state_dict:

5) RuntimeError: Expected 4-dimensional input for 4-dimensional weight [128, 1, 3, 3], but got 2-dimensional input of size [480, 640] instead

6）RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor

Original: https://blog.csdn.net/June19/article/details/120861316
Author: 无穷QQ君
Title: Pytorch加载模型并进行图像分类预测

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/709635/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

《Generalized Focal Loss V2》论文笔记

参考代码：GFocalV2 概述导读：这篇文章是在之前V1版本的基础上增强了目标检测中定位质量估计能力。在之前的一些网络中会在分类分支和检测分支的特征图基础上实现定位质量的估计（…

人工智能 2023年7月12日
0096
【Python K均值聚类算法】

K均值聚类算法 1. 什么是聚类 * 1.1 聚类概念: 1.2 聚类常用的距离判定: 1.3 聚类目的: 2. K均值算法实现过程 * 2.1 K是什么? Means是什么? 2…

人工智能 2023年6月2日
0088
04查找算法：顺序查找法、二分查找法

开始系统学习算法啦！为后面力扣和蓝桥杯的刷题做准备！这个专栏将记录自己学习算法是的笔记，包括概念，算法运行过程，以及代码实现，希望能给大家带来帮助，感兴趣的小伙伴欢迎评论区留…

人工智能 2023年7月31日
0057
使用SimpleITK进行3D图像连通域分析

一、简介本文叙述了使用SimpleITK进行3D医疗图像连通域分析的方法。（相邻的像素值视为同一个连通域，不区分像素值）非医疗图像需要先封装为SimpleITK.Image，或…

人工智能 2023年5月26日
00103
python | 连接MySQL数据库的简单方法

进行Python连接mysql数据库之前，需要先安装一下pymysql。直接在终端执行下面的命令即可。 pip install PyMySQL 以下代码实现了连接到一个datab…

人工智能 2023年7月4日
0080
用PyTorch搭建卷积神经网络

用PyTorch搭建卷积神经网络本篇是加拿大McGill本科，Waterloo硕士林羿实习时所作的工作。发文共享，主要是面对PyTorch的初学者。本篇文章是一篇基础向的PyT…

人工智能 2023年6月16日
0086
构建卷积神经网络（Convolutional Neural Network,CNN）

本节重点讲解通过代码构建CNN，具体理论知识会在以后专栏讲解，在此不过多赘述。卷积神经网络主要由多个卷积层、池化层、激活函数和全连接层组成。首先输入数据进入卷积层进行特征提取，然…

人工智能 2023年5月25日
0077
TensorFlow推荐系统（二）

人工智能 2023年5月26日
0066
全面解析PaDiM

使用PaDiM网络跑自己的数据集，除去测试时读入dataloader的时间，每张图片测试时间在20-30ms，精度比较高，图像分类准确率99-100，像素分割准确率97以上，但是最…

人工智能 2023年7月28日
00100
调用云服务实现语音识别合成以及感情分析

人工智能 2023年5月23日
00103
多层感知机(MLP)的构建与实现

前面介绍的线性回归与Softmax回归，都属于单层神经网络，而在深度学习领域，主要关注多层模型，这节主要熟悉多层感知机（MultiLayer Perceptron，MLP），因为神…

人工智能 2023年7月14日
0082
头歌平台（EduCoder）—卷积神经网络的实现

什么是卷积层。 import numpy as np from utils import im2col class Convolution: def __init__(self, …

人工智能 2023年7月13日
00102
Pandas中Series和DataFrame的索引

在对Series对象和DataFrame对象进行索引的时候要明确这么一个概念：是使用下标进行索引，还是使用关键字进行索引。比如list进行索引的时候使用的是下标，而dict索引的时…

人工智能 2023年6月2日
0096
PL-Marker(ACL 2022)——信息抽取(NER+RE)新SOTA，论文浅析与代码浏览

文章目录前言：相关工作介绍论文思路整体框架 * 1. NER阶段 2. RE阶段 Train * 1.1 ACEDatasetNER 1.2 for _ in train_i…

人工智能 2023年6月25日
00114
有哪些常见的神经网络架构

人工智能 2024年1月6日
0090
智能家居灯光控制系统

引言智能化家居，或称智能化住宅，在英文中常用Smart Home、Inte1ligent home，与此含义相近的还有家庭自动化（Home Automation）、电子家庭（El…

人工智能 2023年6月27日
0086

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Pytorch加载模型并进行图像分类预测

1）实例化模型

2）加载模型

3）输入图像

4）输出分类结果

5）完整代码

1) How can i convert an RGB image into grayscale in Python?

2）PIL 处理图像的基本操作

3）图像通道数的理解

4）Convert 3 channel image to 2 channel

5）图像通道转换

6）将所有的图像合并为一个numpy数组

7）torch.from_numpy VS torch.Tensor

8）torch.squeeze() 和torch.unsqueeze()

1）TypeError: ‘module’ object is not callable

2）TypeError: ‘collections.OrderedDict’ object is not callable

3) TypeError: init() missing 1 required positional argument: ‘XX’

4) RuntimeError: Error(s) in loading state_dict for PythonNet: Missing key(s) in state_dict:

5) RuntimeError: Expected 4-dimensional input for 4-dimensional weight [128, 1, 3, 3], but got 2-dimensional input of size [480, 640] instead

6）RuntimeError: Input type (torch.FloatTensor) and weight type (torch.cuda.FloatTensor) should be the same or input should be a MKLDNN tensor and weight is a dense tensor

大家都在看