Training SSD-Pytorch on Your Own VOC Dataset: Problems Encountered and How to Fix Them

July 9, 2023, 1:33 PM • Artificial Intelligence • 124 views

SSD

Training
data/__init__.py
data/config.py
data/voc0712.py
layers/modules/multibox_loss.py
ssd.py
train.py
Pretrained file vgg16_reducedfc.pth
eval.py
demo.py
demo/live.py
Problems encountered
train.py
eval.py

Training

Download the SSD source code from GitHub.
Create a VOCdevkit folder and put the VOC2007 dataset in it.

import os
import random

# Paths are relative to the working directory, assumed here to be the VOC2007 folder.
trainval_percent = 0.9  # fraction of all samples that go to trainval
train_percent = 0.8     # fraction of trainval that goes to train
xmlfilepath = './Annotations/'
txtsavepath = './ImageSets/Main/'
total_xml = os.listdir(xmlfilepath)

num = len(total_xml)
indices = range(num)  # named indices so the builtin list is not shadowed
tv = int(num * trainval_percent)
tr = int(tv * train_percent)
trainval = random.sample(indices, tv)
train = random.sample(trainval, tr)

ftrainval = open(os.path.join(txtsavepath, 'trainval.txt'), 'w')
ftest = open(os.path.join(txtsavepath, 'test.txt'), 'w')
ftrain = open(os.path.join(txtsavepath, 'train.txt'), 'w')
fval = open(os.path.join(txtsavepath, 'val.txt'), 'w')

for i in indices:
    # Each line of a split file is the annotation file name without ".xml".
    name = total_xml[i][:-4] + '\n'
    if i in trainval:
        ftrainval.write(name)
        if i in train:
            ftrain.write(name)
        else:
            fval.write(name)
    else:
        ftest.write(name)

ftrainval.close()
ftrain.close()
fval.close()
ftest.close()
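
To see how the two percentages interact, the split logic of the script above can be sketched as a function. The function name, the seed parameter, and the use of plain integers as ids are assumptions made for illustration; they are not part of the original script.

```python
import random


def split_ids(ids, trainval_percent=0.9, train_percent=0.8, seed=0):
    """Split ids into train/val/test with the same arithmetic as the script above."""
    rng = random.Random(seed)  # seed is an added assumption, for repeatable splits
    ids = list(ids)
    tv = int(len(ids) * trainval_percent)  # size of train + val
    tr = int(tv * train_percent)           # size of train alone
    trainval = rng.sample(ids, tv)
    train = set(rng.sample(trainval, tr))
    val = set(trainval) - train
    test = set(ids) - set(trainval)
    return {"train": train, "val": val, "test": test}


splits = split_ids(range(100))
print({name: len(members) for name, members in splits.items()})
```

With 100 annotation files and the default percentages, 90 go to trainval, of which 72 become train and 18 become val; the remaining 10 become test.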

data/__init__.py

Comment out line 3: from .coco import COCODetection, COCOAnnotationTransform, COCO_CLASSES, COCO_ROOT, get_label_map

data/config.py

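On line 15, change num_classes to your own number of classes + 1 (the extra class is the background).
Set max_iter, the maximum number of iterations.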

data/voc0712.py

On line 20, change VOC_CLASSES = to your own class names.
Change line 93 to image_sets=[('2007', 'trainval')]
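
The relationship between the class tuple and num_classes can be sketched as follows. The class names 'fire' and 'smoke' and the helper class_to_index are hypothetical; substitute the labels that appear in your own XML files.

```python
# Hypothetical class names; replace with the labels used in your annotations.
VOC_CLASSES = ('fire', 'smoke')

# Index 0 is reserved for the background, hence the +1.
NUM_CLASSES = len(VOC_CLASSES) + 1


def class_to_index(classes):
    """Map each class name to its position in the tuple."""
    return {name: index for index, name in enumerate(classes)}


# With a single class, the trailing comma is what keeps this a tuple.
# Without it, ('fire') is just the string 'fire'.
SINGLE_CLASS = ('fire',)

print(NUM_CLASSES, class_to_index(VOC_CLASSES))
```

The single-class case is worth checking by hand, because a bare string would be iterated character by character.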

layers/modules/multibox_loss.py

Before loss_c[pos] = 0 on line 97, add the line loss_c = loss_c.view(num, -1)

ssd.py

Change every num_classes value (lines 32 and 198) to your number of classes + 1.

train.py

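In the parser, adjust batch_size and learning-rate to suit your machine (batch_size=16 here).
Also set basenet (the pretrained model), start_iter (the iteration to start from), and save_folder (where models are saved).
Search the file for data[0] and replace every occurrence with item().
Comment out lines 84 and 85.

Line 198, iteration % 5000 == 0, saves the model every 5000 iterations; this can be changed to 200. The two lines after it set the file name of the saved model.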

Around line 195 you can write the loss value to a txt file:

with open('loss.txt', 'a') as f:
    f.write(str(loss.item()) + '\n')
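
Raw per-iteration losses are noisy, so it can help to smooth them before judging progress. The sketch below reads a file in the one-value-per-line format written above and computes a simple moving average; the function names and the window size are assumptions.

```python
def read_losses(path):
    """Read one float per line, skipping blank lines."""
    with open(path) as f:
        return [float(line) for line in f if line.strip()]


def moving_average(values, window=3):
    """Average each run of `window` consecutive values."""
    if window < 1:
        raise ValueError("window must be >= 1")
    return [sum(values[i:i + window]) / window
            for i in range(len(values) - window + 1)]
```

For example, moving_average(read_losses('loss.txt'), window=100) gives a curve that is easier to read than the raw values.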

Change images, targets = next(batch_iterator) on line 165 to:

try:
    images, targets = next(batch_iterator)
except StopIteration:
    batch_iterator = iter(data_loader)
    images, targets = next(batch_iterator)
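
The same restart-on-exhaustion idea can be wrapped in a generator, which keeps the training loop free of try/except. This is a sketch; cycle_batches is a hypothetical helper, and a plain list stands in for the data loader.

```python
def cycle_batches(data_loader):
    """Yield batches forever, restarting the loader each time it runs out."""
    while True:
        produced = False
        for batch in data_loader:
            produced = True
            yield batch
        if not produced:
            # Guard against spinning forever on an empty loader.
            raise ValueError("data_loader produced no batches")


batches = cycle_batches([1, 2, 3])  # a list stands in for the real loader
print([next(batches) for _ in range(7)])
```

Unlike itertools.cycle, which replays a cached copy of the first pass, this re-iterates the loader on every pass, so a shuffling loader would still reshuffle each epoch.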

Pretrained file vgg16_reducedfc.pth

Training needs the pretrained file vgg16_reducedfc.pth to start from.


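After downloading it, put it in a new weights folder under the SSD project; training can then begin.
Note: if loss=nan appears partway through training, find parser.add_argument('--lr', '--learning-rate', default=1e-3, type=float, in train.py and change default=1e-3 to default=1e-4. Train until the loss drops to around 1.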
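
A nan loss is easier to deal with when training stops at the first bad value instead of continuing silently. A minimal sketch, assuming the loss has already been converted to a Python float (for example with loss.item()); check_loss is a hypothetical helper, not part of train.py.

```python
import math


def check_loss(loss_value, iteration):
    """Return the loss unchanged, or raise if it is nan or infinite."""
    if not math.isfinite(loss_value):
        raise FloatingPointError(
            "loss is %r at iteration %d; try a lower learning rate"
            % (loss_value, iteration))
    return loss_value
```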

eval.py

trained_model is the path of the model to evaluate; save_folder is where the evaluation results are saved.

demo.py

Create a test_image folder and put a few images to test in it (four places modified; updated 20220106).

import os
import sys
import torch
from torch.autograd import Variable
import numpy as np
import cv2
from ssd import build_ssd
from data import VOC_CLASSES as labels
from matplotlib import pyplot as plt

os.environ["KMP_DUPLICATE_LIB_OK"] = "TRUE"

module_path = os.path.abspath(os.path.join('..'))
if module_path not in sys.path:
    sys.path.append(module_path)

if torch.cuda.is_available():
    torch.set_default_tensor_type('torch.cuda.FloatTensor')

net = build_ssd('test', 300, 5)

net.load_weights('weights/ssd300_VOC_1995.pth')

imgs = 'test_image/'
img_list = os.listdir(imgs)
for img in img_list:

    current_img = imgs + img
    image = cv2.imread(current_img)
    rgb_image = cv2.cvtColor(image, cv2.COLOR_BGR2RGB)

    x = cv2.resize(image, (300, 300)).astype(np.float32)
    x -= (104.0, 117.0, 123.0)
    x = x.astype(np.float32)
    x = x[:, :, ::-1].copy()
    x = torch.from_numpy(x).permute(2, 0, 1)

    xx = Variable(x.unsqueeze(0))
    if torch.cuda.is_available():
        xx = xx.cuda()
    y = net(xx)

    top_k = 10

    plt.figure(figsize=(6, 6))
    colors = plt.cm.hsv(np.linspace(0, 1, 21)).tolist()
    currentAxis = plt.gca()

    detections = y.data
    scale = torch.Tensor(rgb_image.shape[1::-1]).repeat(2)
    for i in range(detections.size(1)):
        j = 0
        while detections[0, i, j, 0] >= 0.6:
            score = detections[0, i, j, 0]
            label_name = labels[i-1]
            display_txt = '%s: %.2f'%(label_name, score)
            print(display_txt)
            pt = (detections[0,i,j,1:]*scale).cpu().numpy()
            coords = (pt[0], pt[1]), pt[2]-pt[0]+1, pt[3]-pt[1]+1
            color = colors[i]
            currentAxis.add_patch(plt.Rectangle(*coords, fill=False, edgecolor=color, linewidth=2))
            currentAxis.text(pt[0], pt[1], display_txt, bbox={'facecolor':color, 'alpha':0.5})
            j += 1
    plt.imshow(rgb_image)
    plt.show()
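
The box arithmetic in the demo (multiplying by scale, then building the rectangle) can be sketched without torch. The function names are hypothetical; the demo itself does this with tensors.

```python
def scale_box(box, width, height):
    """Convert a normalized (xmin, ymin, xmax, ymax) box to pixel units."""
    xmin, ymin, xmax, ymax = box
    return (xmin * width, ymin * height, xmax * width, ymax * height)


def to_rectangle(pixel_box):
    """Return ((x, y), width, height) in the form plt.Rectangle expects."""
    xmin, ymin, xmax, ymax = pixel_box
    return (xmin, ymin), xmax - xmin + 1, ymax - ymin + 1


pixel_box = scale_box((0.25, 0.5, 0.75, 1.0), width=640, height=480)
print(pixel_box, to_rectangle(pixel_box))
```

The network outputs coordinates in the range 0 to 1, which is why the demo multiplies by the image width and height before drawing.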

demo/live.py

Webcam detection (not tested).
On line 10, use ../ to reach the parent directory:

parser.add_argument('--weights', default='../weights/xxxxxx.pth',

On line 78, use your number of classes + 1.

Problems encountered

I don't remember the order the errors came in; below are most of the ones I ran into.

train.py

TypeError: unsupported operand type(s) for /=: 'Tensor' and 'builtin_function_or_method'…

The statement that fails is loss_l /= N.

This happens because some tutorials also change layers/modules/multibox_loss.py:
line 115, N = num_pos.data.sum(), is changed to

 N = num_pos.data.sum().double
 loss_l = loss_l.double()
 loss_c = loss_c.double()

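which causes this problem: .double is missing its parentheses, so N becomes a method object instead of a tensor. Either call it as .double() or do not make this change.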
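
The difference between a method and a method call can be reproduced without torch. In this sketch, Count is a hypothetical stand-in for a tensor with a .double() method; only the missing parentheses matter.

```python
class Count:
    """Hypothetical stand-in for a tensor that offers a .double() method."""

    def __init__(self, value):
        self.value = value

    def double(self):
        return float(self.value)


def divide(loss, n):
    """Return loss / n, or the TypeError message if n is not a number."""
    try:
        return loss / n
    except TypeError as exc:
        return "TypeError: " + str(exc)


n = Count(4)
print(divide(10.0, n.double))    # n.double is a bound method: division fails
print(divide(10.0, n.double()))  # n.double() is the float 4.0: division works
```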

Folders or files in the dataset cannot be found

The VOC dataset names are wrong; check the names and their letter case.
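
A quick layout check can catch a misnamed folder before training starts. This sketch assumes the usual VOCdevkit/VOC2007 layout with Annotations, JPEGImages, and ImageSets/Main; the function names are hypothetical. Names are compared through os.listdir so that letter case is checked even on a case-insensitive file system.

```python
import os

# Folders expected under the VOCdevkit root (an assumption based on the VOC2007 layout).
EXPECTED_DIRS = (
    ("VOC2007", "Annotations"),
    ("VOC2007", "JPEGImages"),
    ("VOC2007", "ImageSets", "Main"),
)


def has_exact_path(root, parts):
    """True if root/parts... exists with exactly this spelling and case."""
    current = root
    for part in parts:
        if not os.path.isdir(current) or part not in os.listdir(current):
            return False
        current = os.path.join(current, part)
    return os.path.isdir(current)


def missing_voc_dirs(devkit_root):
    """List the expected folders that are absent or spelled differently."""
    return ["/".join(parts) for parts in EXPECTED_DIRS
            if not has_exact_path(devkit_root, parts)]
```

Calling missing_voc_dirs('VOCdevkit') returns an empty list when the layout matches.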

FileNotFoundError: [Errno 2] No such file or directory: 'C:\Users\Administrator\data/coco/coco_labels.txt'

If the second line of train.py is from data.coco import COCO_ROOT, COCODetection, comment it out.

RuntimeError: Legacy autograd function with non-static forward method is deprecated. Please use new-style autograd function with static forward method.

This is a PyTorch version issue.
Change detection.py to the following (the update comments have been translated):

"""
Copyright (c) 2017 Max deGroot, Ellis Brown
Released under the MIT license
https://github.com/amdegroot/ssd.pytorch
Updated by: Takuya Mouri
"""
import torch
from torch.autograd import Function
from ..box_utils import decode, nms
from data import voc as cfg

class Detect(Function):
    """At test time, Detect is the final layer of SSD.  Decode location preds,
    apply non-maximum suppression to location predictions based on conf
    scores and threshold to a top_k number of output predictions for both
    confidence score and locations.

    """

    @staticmethod
    def forward(self, num_classes, bkg_label, top_k, conf_thresh, nms_thresh, loc_data, conf_data, prior_data):
        self.num_classes = num_classes
        self.background_label = bkg_label
        self.top_k = top_k

        self.nms_thresh = nms_thresh
        if nms_thresh <= 0:
            raise ValueError('nms_threshold must be non negative.')
        self.conf_thresh = conf_thresh
        self.variance = cfg['variance']

        """
        Args:
            loc_data: (tensor) Loc preds from loc layers
                Shape: [batch,num_priors*4]
            conf_data: (tensor) Shape: Conf preds from conf layers
                Shape: [batch*num_priors,num_classes]
            prior_data: (tensor) Prior boxes and variances from priorbox layers
                Shape: [1,num_priors,4]
        """
        num = loc_data.size(0)
        num_priors = prior_data.size(0)

        output = torch.zeros(num, self.num_classes, self.top_k, 5)

        conf_preds = conf_data.view(num, num_priors,
                                    self.num_classes).transpose(2, 1)

        for i in range(num):
            decoded_boxes = decode(loc_data[i], prior_data, self.variance)

            conf_scores = conf_preds[i].clone()

            for cl in range(1, self.num_classes):

                c_mask = conf_scores[cl].gt(self.conf_thresh)
                scores = conf_scores[cl][c_mask]

                if scores.size(0) == 0:

                    continue
                l_mask = c_mask.unsqueeze(1).expand_as(decoded_boxes)

                boxes = decoded_boxes[l_mask].view(-1, 4)

                ids, count = nms(boxes, scores, self.nms_thresh, self.top_k)
                output[i, cl, :count] = \
                    torch.cat((scores[ids[:count]].unsqueeze(1),
                               boxes[ids[:count]]), 1)
        flt = output.contiguous().view(num, -1, 5)
        _, idx = flt[:, :, 0].sort(1, descending=True)
        _, rank = idx.sort(1)
        flt[(rank < self.top_k).unsqueeze(-1).expand_as(flt)].fill_(0)
        return output

Around line 99 of ssd.py, change

output = self.detect(

to

output = self.detect.apply(self.num_classes, 0, 200, 0.01, 0.45,

AttributeError: 'NoneType' object has no attribute 'shape'

change coco.py:
from: img=cv2.imread(osp.join(self.root,path))
to: img=cv2.imread(path)

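IndexError: too many indices for array: array is 1-dimensional, but 2 were indexed (updated 20220105)

Some annotations (the XML files) contain no objects (mine had none like that and I still got the error).
Fix or delete the offending XML and JPG files (afterwards, regenerate the four VOC txt files).
Create check.py
and edit root and CLASSES.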

import argparse
import sys
import cv2
import os

import os.path          as osp
import numpy            as np

if sys.version_info[0] == 2:
    import xml.etree.cElementTree as ET
else:
    import xml.etree.ElementTree  as ET

parser    = argparse.ArgumentParser(
            description='Single Shot MultiBox Detector Training With Pytorch')
train_set = parser.add_mutually_exclusive_group()

parser.add_argument('--root', default="xxxxxxxxxxxxxxxxxxxxxxxxxxx", help='Dataset root directory path')

args = parser.parse_args()

CLASSES = (
    'fire', 'xxxxxxxxxxxxxxxxxxxxx')

annopath = osp.join('%s', 'Annotations', '%s.{}'.format("xml"))
imgpath  = osp.join('%s', 'JPEGImages',  '%s.{}'.format("jpg"))

def vocChecker(image_id, width, height, keep_difficult = False):
    target   = ET.parse(annopath % image_id).getroot()
    res      = []

    for obj in target.iter('object'):

        difficult = int(obj.find('difficult').text) == 1

        if not keep_difficult and difficult:
            continue

        name = obj.find('name').text.lower().strip()
        bbox = obj.find('bndbox')

        pts    = ['xmin', 'ymin', 'xmax', 'ymax']
        bndbox = []

        for i, pt in enumerate(pts):

            cur_pt = int(bbox.find(pt).text) - 1

            cur_pt = float(cur_pt) / width if i % 2 == 0 else float(cur_pt) / height

            bndbox.append(cur_pt)

        print(name)
        label_idx =  dict(zip(CLASSES, range(len(CLASSES))))[name]
        bndbox.append(label_idx)
        res += [bndbox]

    print(res)
    try :
        print(np.array(res)[:,4])
        print(np.array(res)[:,:4])
    except IndexError:
        print("\nINDEX ERROR HERE !\n")
        exit(0)
    return res

if __name__ == '__main__' :

    i = 0

    for name in sorted(os.listdir(osp.join(args.root,'Annotations'))):

        i += 1

        img    = cv2.imread(imgpath  % (args.root,name.split('.')[0]))
        height, width, channels = img.shape
        print("path : {}".format(annopath % (args.root,name.split('.')[0])))
        res = vocChecker((args.root, name.split('.')[0]), width, height)
    print("Total of annotations : {}".format(i))
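
If only the annotations without any object are of interest, a narrower check is enough. The sketch below lists XML files that contain no object element; the function name is an assumption, and it covers only the empty-annotation case, not the other problems check.py can reveal.

```python
import os
import xml.etree.ElementTree as ET


def find_empty_annotations(annotation_dir):
    """Return the XML file names in annotation_dir that contain no <object>."""
    empty = []
    for file_name in sorted(os.listdir(annotation_dir)):
        if not file_name.endswith(".xml"):
            continue
        root = ET.parse(os.path.join(annotation_dir, file_name)).getroot()
        # iter() searches at any depth, as the loop in check.py does.
        if next(root.iter("object"), None) is None:
            empty.append(file_name)
    return empty
```

The returned names are the XML files (and matching JPG files) to fix or delete before regenerating the txt files.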

eval.py

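Right-click Run switches to test mode

PyCharm started in test mode, which shows up as "Run 'py.test xxx.py'".
Change this under File > Settings > Python Integrated Tools (top left): select unittest, and remember to click Apply.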

Evaluation suddenly fails on a particular image

Edit test.txt in the Main folder of VOC2007 and delete the line for the failing image.
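
Deleting lines by hand gets tedious when several images fail. A minimal sketch that rewrites an image-set file without the given ids; drop_ids and the example id are hypothetical.

```python
def drop_ids(image_set_path, bad_ids):
    """Rewrite the file without the listed ids; return how many ids remain."""
    bad = set(bad_ids)
    with open(image_set_path) as f:
        kept = [line.strip() for line in f
                if line.strip() and line.strip() not in bad]
    with open(image_set_path, "w") as f:
        f.writelines(name + "\n" for name in kept)
    return len(kept)
```

For example, drop_ids('ImageSets/Main/test.txt', ['000002']) removes the line 000002 and leaves the rest in order.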

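At the very end of the eval run: FileNotFoundError: [Errno 2] No such file or directory: 'test.txt'

This is reportedly just a formatting-syntax issue: in the original implementation, the path that os.path.join builds with the brace placeholder '{:s}.txt' does not resolve as intended. The full path ~/VOC2007/ImageSets/Main/test.txt is ignored and the path is simply taken to be currentpath/test.txt.

Fix the line that defines imgsetpath as follows: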

imgsetpath = os.path.join(args.voc_root, 'VOC2007', 'ImageSets', 'Main', '%s.txt')

In the function do_python_eval, change

filename, annopath, imgsetpath.format(set_type), cls, cachedir,

to

filename, annopath, imgsetpath % set_type, cls, cachedir,
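
How the %s template expands can be checked in isolation. This sketch uses posixpath so the output is the same on every operating system (eval.py itself uses os.path.join), and the root 'VOCdevkit' is a placeholder for args.voc_root.

```python
import posixpath

# 'VOCdevkit' stands in for args.voc_root; it is a placeholder, not a required name.
imgsetpath = posixpath.join('VOCdevkit', 'VOC2007', 'ImageSets', 'Main', '%s.txt')

set_type = 'test'
print(imgsetpath % set_type)
```

The %s placeholder is filled by the % operator at the call site, so the full directory path is kept and only the file name changes with set_type.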


Original: https://blog.csdn.net/zrg_hzr_1/article/details/121661026
Author: 国服最强貂蝉
Title: Training SSD-Pytorch on Your Own VOC Dataset: Problems Encountered and How to Fix Them
