【目标检测】YOLOv5跑xView数据集/小样本检测策略实验

2023年6月25日下午2:38 • 人工智能 • 阅读 77

前言

在YOLOv5的6.1版本新出了 xView.yaml数据配置文件，提供了遥感数据集xView的检测方法。此篇就使用YOLOv5来试跑xView数据集，并对一些小样本检测的策略进行消融实验。

xView数据集下载：https://github.com/zstar1003/Dataset

数据预处理

在YOLOv5的 xView.yaml文件中，提供了xView数据集的预处理方式。
这里单独新建一个脚本文件 xView.py

import json
import os
from pathlib import Path

import numpy as np
import yaml
from PIL import Image
from tqdm import tqdm

from utils.datasets import autosplit
from utils.general import download, xyxy2xywhn

def convert_labels(fname=Path('xView/xView_train.geojson')):

    path = fname.parent
    with open(fname) as f:
        print(f'Loading {fname}...')
        data = json.load(f)

    labels = Path(path / 'labels' / 'train')
    os.system(f'rm -rf {labels}')
    labels.mkdir(parents=True, exist_ok=True)

    xview_class2index = [-1, -1, -1, -1, -1, -1, -1, -1, -1, -1, -1, 0, 1, 2, -1, 3, -1, 4, 5, 6, 7, 8, -1, 9, 10, 11,
                         12, 13, 14, 15, -1, -1, 16, 17, 18, 19, 20, 21, 22, -1, 23, 24, 25, -1, 26, 27, -1, 28, -1,
                         29, 30, 31, 32, 33, 34, 35, 36, 37, -1, 38, 39, 40, 41, 42, 43, 44, 45, -1, -1, -1, -1, 46,
                         47, 48, 49, -1, 50, 51, -1, 52, -1, -1, -1, 53, 54, -1, 55, -1, -1, 56, -1, 57, -1, 58, 59]

    shapes = {}
    for feature in tqdm(data['features'], desc=f'Converting {fname}'):
        p = feature['properties']
        if p['bounds_imcoords']:
            id = p['image_id']
            file = path / 'train_images' / id
            if file.exists():
                try:
                    box = np.array([int(num) for num in p['bounds_imcoords'].split(",")])
                    assert box.shape[0] == 4, f'incorrect box shape {box.shape[0]}'
                    cls = p['type_id']
                    cls = xview_class2index[int(cls)]
                    assert 59 >= cls >= 0, f'incorrect class index {cls}'

                    if id not in shapes:
                        shapes[id] = Image.open(file).size
                    box = xyxy2xywhn(box[None].astype(np.float), w=shapes[id][0], h=shapes[id][1], clip=True)
                    with open((labels / id).with_suffix('.txt'), 'a') as f:
                        f.write(f"{cls}{' '.join(f'{x:.6f}' for x in box[0])}\n")
                except Exception as e:
                    print(f'WARNING: skipping one label for {file}: {e}')

dir = Path('D:/Dataset/Xview')

convert_labels(Path('D:/Dataset/Xview/xView_train.geojson'))

images = Path(dir / 'images')
images.mkdir(parents=True, exist_ok=True)
Path(dir / 'train_images').rename(dir / 'images' / 'train')
Path(dir / 'val_images').rename(dir / 'images' / 'val')

autosplit(dir / 'images' / 'train')

运行之后，在train文件夹里会新增训练集和验证集的划分文件。

注：xView数据集没有提供测试集，并且其验证集没有标签，因此这里在train中划分出训练集和验证集。

训练配置

训练和之前跑VOC的流程类似，首先需要修改配置文件路径 myxView.yaml

train: D:/Dataset/Xview/images/train/autosplit_train.txt
val: D:/Dataset/Xview/images/train/autosplit_val.txt

nc: 60
names: ['Fixed-wing Aircraft', 'Small Aircraft', 'Cargo Plane', 'Helicopter', 'Passenger Vehicle', 'Small Car', 'Bus',
        'Pickup Truck', 'Utility Truck', 'Truck', 'Cargo Truck', 'Truck w/Box', 'Truck Tractor', 'Trailer',
        'Truck w/Flatbed', 'Truck w/Liquid', 'Crane Truck', 'Railway Vehicle', 'Passenger Car', 'Cargo Car',
        'Flat Car', 'Tank car', 'Locomotive', 'Maritime Vessel', 'Motorboat', 'Sailboat', 'Tugboat', 'Barge',
        'Fishing Vessel', 'Ferry', 'Yacht', 'Container Ship', 'Oil Tanker', 'Engineering Vehicle', 'Tower crane',
        'Container Crane', 'Reach Stacker', 'Straddle Carrier', 'Mobile Crane', 'Dump Truck', 'Haul Truck',
        'Scraper/Tractor', 'Front loader/Bulldozer', 'Excavator', 'Cement Mixer', 'Ground Grader', 'Hut/Tent', 'Shed',
        'Building', 'Aircraft Hangar', 'Damaged Building', 'Facility', 'Construction Site', 'Vehicle Lot', 'Helipad',
        'Storage Tank', 'Shipping container lot', 'Shipping Container', 'Pylon', 'Tower']

之后在 train.py中修改对应的 weights、 cfg、 data等参数。

小样本检测策略实验

起初我使用默认的640×640的 img-size，但是在这种小样本的检测中，效果很糟。
于是我将 img-size的尺寸改成1280×1280，使用官方提供的 yolov5l6.pt这个预训练模型训练100个epoch。
测试得到的AP50为0.847%。

下面我输入验证集中的 2618.tif这张图片来进行检测。
我想到了之前学习过的【目标检测】YOLOv5针对小目标检测的改进模型中的小样本检测策略，正好在此次也加入测试。
detect.py中的改进代码如下所示：


t1 = time_synchronized()
pred = model(img, augment=opt.augment)[0]

'''
此处进行分块预测改进
'''
mulpicplus = "3"
assert (int(mulpicplus) >= 1)
if mulpicplus == "1":
    pred = model(img, augment=opt.augment)[0]
else:

    xsz = img.shape[2]
    ysz = img.shape[3]
"""
    输入图片：1400x788
    1400/640 = 2.1875
    788/2.1875 = 360.2285..

    384(32的整数倍)最接近360.2285..

    因此输出：640x384
"""

    mulpicplus = int(mulpicplus)
    x_smalloccur = int(xsz / mulpicplus * 1.2)
    y_smalloccur = int(ysz / mulpicplus * 1.2)

    for i in range(mulpicplus):
        x_startpoint = int(i * (xsz / mulpicplus))
        for j in range(mulpicplus):
            y_startpoint = int(j * (ysz / mulpicplus))
            x_real = min(x_startpoint + x_smalloccur, xsz)
            y_real = min(y_startpoint + y_smalloccur, ysz)
            if (x_real - x_startpoint) % 64 != 0:
                x_real = x_real - (x_real - x_startpoint) % 64
            if (y_real - y_startpoint) % 64 != 0:
                y_real = y_real - (y_real - y_startpoint) % 64
            dicsrc = img[:, :, x_startpoint:x_real, y_startpoint:y_real]

            pred_temp = model(dicsrc, augment=opt.augment)[0]

"""
            pred_temp[..., 0] 取最后一维度的第一个，也就是x
            pred_temp[..., 1] 取最后一维度的第二个，也就是y
            注意这里的y_startpoint和x_startpoint和从原图上看是相反的
"""
            pred_temp[..., 0] = pred_temp[..., 0] + y_startpoint
            pred_temp[..., 1] = pred_temp[..., 1] + x_startpoint
            if i == 0 and j == 0:
                pred = pred_temp
            else:
                pred = torch.cat([pred, pred_temp], dim=1)

pred = non_max_suppression(pred, opt.conf_thres, opt.iou_thres, classes=opt.classes, agnostic=opt.agnostic_nms)

简单来说，这个优化策略就是对图像进行切块分别预测，为了防止切块时会把目标给切开，每块之间有20%的重合度。最后将所得到的预测框全部进行叠加，统一输入到NMS之中进行过滤。

下面是我的实验结果：

可以看到，这个切片检测策略一定程度上确实能够缓解漏检情况，不过对于这幅图来说提升并不显著。同时，我也使用了更大尺寸的输入图片尺寸，结果却使小样本丢失，而大样本检测效果更好。

下面是可视化的展示结果：图一是原图标签可视化；图二是表中第二行结果；图三是表中最后一行结果。

Original: https://blog.csdn.net/qq1198768105/article/details/126283196
Author: zstar-_
Title: 【目标检测】YOLOv5跑xView数据集/小样本检测策略实验

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/651087/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

计算机视觉教程2-6：八大图像特效算法制作你的专属滤镜(附Python代码)

目录 0 写在前面 1 毛玻璃特效 2 浮雕特效 3 油画特效 4 马赛克特效 5 素描特效 6 怀旧特效 7 流年特效 8 卡通特效 0 写在前面图像特效处理是基于图像像素数据…

人工智能 2023年7月27日
0083
stata基础–回归，画散点图，异质性分析

利用stata的内部数据来进行回归代码： sysuse auto sysuse dir /可以看到所有的数据/ su price mpg foreign reg price mp…

人工智能 2023年6月16日
0072
数据挖掘-KNN算法+sklearn代码实现(六)

🤵‍♂️ 个人主页：@Lingxw_w的个人主页✍🏻作者简介：计算机科学与技术研究生在读🐋 希望大家多多支持，我们一起进步！😄如果文章对你有帮助的话，欢迎评论 💬点赞👍🏻 收藏 📂…

人工智能 2023年7月25日
0049
2022年Pycharm+Anaconda安装配置教程

文章目录 1.Anaconda安装流程 * 1.1Anaconda安装包链接知识：Anaconda Prompt是什么？浅析pip与conda的区别 2.Pycharm安装流程…

人工智能 2023年7月4日
00116
2021年中国道路交通事故情况分析（附机动车保有量、交通事故发生数量、死亡人数、受伤人数、直接经济损失）[图]

一、交通事故因素凡在行车工作中，因违反规章制度，违反劳动纪律，技术设备不良及其他原因，在行车中造成人员伤亡、设备损害、经济损失、影响正常行车或危及行车安全的，均构成行车事故。行…

人工智能 2023年6月25日
00205
JZ47 礼物的最大价值

描述在一个m×n的棋盘的每一格都放有一个礼物，每个礼物都有一定的价值（价值大于 0）。你可以从棋盘的左上角开始拿格子里的礼物，并每次向右或者向下移动一格、直到到达棋盘的右下角。给定…

人工智能 2023年6月28日
0058
决策树初探- 决策树的实现与可视化

之前写过基于逻辑回归的鸢尾花分类,这一次我们用决策树来试试分类结果 import sklearn.datasets from pathlib import Path import …

人工智能 2023年7月17日
0072
中科大2021年自然语言理解nlp/nlu期末试题回忆

一、分析句子是否有歧义，并指出是因为句法结构、词义、语义结构因素或者多个因素导致的。（30分） 1）A man stopped at every truck stop. 2）咬死猎…

人工智能 2023年5月31日
0079
一个简单的逻辑回归多分类例子与代码（python-sklearn实现）

目录一.问题二.流程与代码 (一) 流程 (二)代码 (三)模型表达式 sklearn逻辑回归多分类有两种模式： ovr与multinomial。在multi_class设为…

人工智能 2023年6月30日
0067
tf.argmax()的详细用法

tf.argmax(data, axis=None)用tensorflow 做 mnist分类时，用到这个接口，于是就研究了下这个接口的用法：如果是一维数组呢？ data = t…

人工智能 2023年6月15日
0076
python学习 –DataFrame数据清洗（空值、重复值）

目录空值的处理 1、检查是否有空值 2、统计空值的数量 3、删除空值 4、填补空值用value参数替换空值将空值替换成上一列的值将空值替换成上一行的值将空值替换成下一列的…

人工智能 2023年7月6日
0087
微信小程序是什么？如何快速搭建一个微信小程序？

目录 * – 专栏导读 – 一、微信小程序是什么 – 二、安全管理 – 三、微信小程序的功能 – 四、快速开发一个微信小…

人工智能 2023年7月1日
0071
Netty源码阅读(1)之——客户端源码梗概

目录准备开始 NioSocketChannel 的初始化过程指定初始化关于unsafe属性：关于pipeline的初始化小结 EventLoopGroup初始化小结…

人工智能 2023年6月29日
00111
是否有一种统一的解决方案来避免所有类型的过拟合

问题概述过拟合是机器学习中常见的问题之一，它指的是模型在训练集上表现良好，但在测试集或未知数据上表现较差的现象。为了避免过拟合，常常需要采用一些手段来限制模型的复杂度。本文将介绍…

人工智能 2023年12月30日
0055
CUDA11.7版本与pytorch1.12下载（conda安装pytorch出现）相关出错解决 HTTP 000 CONNECTION FAILED for url

. HTTP 000 CONNECTION FAILED for url An HTTP error ocurred when trying to retrieve this UR…

人工智能 2023年7月20日
0047
【渝粤教育】电大中专学前儿童语言教育 (2)作业题库

作业视频教务托管1、3、鲁、鲁、10、0 [En] Homework video educational administration trusteeship, No. 1, 3,…

人工智能 2023年5月27日
0047

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

【目标检测】YOLOv5跑xView数据集/小样本检测策略实验

大家都在看