YOLOV5超参数设置与数据增强解析

2023年7月30日上午9:27 • 人工智能 • 阅读 63

1、YOLOV5的超参数配置文件介绍

YOLOv5有大约30个超参数用于各种训练设置。它们在*xml中定义。/data目录下的Yaml文件。更好的初始猜测将产生更好的最终结果，因此在进化之前正确地初始化这些值是很重要的。如果有疑问，只需使用缺省值，这些缺省值是为YOLOv5 COCO训练从头优化的。

YOLOv5的超参文件见data/hyp.finetune.yaml（适用VOC数据集）或者hyo.scrach.yaml（适用COCO数据集）文件

1、yolov5/data/hyps/hyp.scratch-low.yaml(YOLOv5 COCO训练从头优化，数据增强低)


 lr0: 0.01
 lrf: 0.01
 momentum: 0.937
 weight_decay: 0.0005
 warmup_epochs: 3.0
 warmup_momentum: 0.8
 warmup_bias_lr: 0.1
 box: 0.05
 cls: 0.5
 cls_pw: 1.0
 obj: 1.0
 obj_pw: 1.0
 iou_t: 0.20
 anchor_t: 4.0

 fl_gamma: 0.0

 hsv_h: 0.015
 hsv_s: 0.7
 hsv_v: 0.4

 degrees: 0.0

 translate: 0.1

 scale: 0.5

 shear: 0.0

 perspective: 0.0
 flipud: 0.0
 fliplr: 0.5
 mosaic: 1.0
 mixup: 0.0
 copy_paste: 0.0

2、yolov5/data/hyps/hyp.scratch-mdeia.yaml（数据增强中）


lr0: 0.01
lrf: 0.1
momentum: 0.937
weight_decay: 0.0005
warmup_epochs: 3.0
warmup_momentum: 0.8
warmup_bias_lr: 0.1
box: 0.05
cls: 0.3
cls_pw: 1.0
obj: 0.7
obj_pw: 1.0
iou_t: 0.20
anchor_t: 4.0

fl_gamma: 0.0
hsv_h: 0.015
hsv_s: 0.7
hsv_v: 0.4
degrees: 0.0
translate: 0.1
scale: 0.9
shear: 0.0
perspective: 0.0
flipud: 0.0
fliplr: 0.5
mosaic: 1.0
mixup: 0.1
copy_paste: 0.0

3、hyp.scratch-high.yaml（数据增强高）


lr0: 0.01
lrf: 0.1
momentum: 0.937
weight_decay: 0.0005
warmup_epochs: 3.0
warmup_momentum: 0.8
warmup_bias_lr: 0.1
box: 0.05
cls: 0.3
cls_pw: 1.0
obj: 0.7
obj_pw: 1.0
iou_t: 0.20
anchor_t: 4.0

fl_gamma: 0.0
hsv_h: 0.015
hsv_s: 0.7
hsv_v: 0.4
degrees: 0.0
translate: 0.1
scale: 0.9
shear: 0.0
perspective: 0.0
flipud: 0.0
fliplr: 0.5
mosaic: 1.0
mixup: 0.1
copy_paste: 0.1

2、OneCycleLR学习率

根据”OneCycleLR学习率”策略，设置各参数组的学习率。1cycle策略将学习率从初始学习率退火到最大学习率，然后从最大学习率退火到远低于初始学习率的最小学习率。论文地址

3、Warmup

warmup是一种学习率优化方法，最早出现在resnet论文中，在模型训练初期选用较小的学习率，训练一段时间之后（10epoch 或者 10000steps）使用预设的学习率进行训练

为什么使用

模型训练初期，权重随机化，对数据的理解为0，在第一个epoch中，模型会根据输入的数据进行快速的调参，此时如果采用较大的学习率，有很大的可能使模型学偏，后续需要更多的轮次才能拉回来

当模型训练一段时间之后，对数据有一定的先验知识，此时使用较大的学习率模型不容易学偏，可以使用较大的学习率加速训练。

当模型使用较大的学习率训练一段时间之后，模型的分布相对比较稳定，此时不宜从数据中再学到新的特点，如果继续使用较大的学习率会破坏模型的稳定性，而使用较小的学习率更获得最优。

Pytorch内部并没有warmup的接口，为此需要使用第三方包pytorch_warmup，可以使用命令pip install pytorch_warmup进行安装

1、当学习率计划使用全局迭代数时，未调优的线性预热可以这样使用:

import torch
import pytorch_warmup as warmup

optimizer = torch.optim.AdamW(params, lr=0.001, betas=(0.9, 0.999), weight_decay=0.01)
num_steps = len(dataloader) * num_epochs
lr_scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=num_steps)
warmup_scheduler = warmup.UntunedLinearWarmup(optimizer)
for epoch in range(1,num_epochs+1):
    for batch in dataloader:
        optimizer.zero_grad()
        loss = ...

        loss.backward()
        optimizer.step()
        with warmup_scheduler.dampening():
            lr_scheduler.step()

2、如果你想使用PyTorch 1.4.0或更高版本支持的学习率调度”链接”，你可以简单地给出一组with语句的学习率调度程序代码:

lr_scheduler1 = torch.optim.lr_scheduler.ExponentialLR(optimizer, gamma=0.9)
lr_scheduler2 = torch.optim.lr_scheduler.StepLR(optimizer, step_size=3, gamma=0.1)
warmup_scheduler = warmup.UntunedLinearWarmup(optimizer)
for epoch in range(1,num_epochs+1):
    for batch in dataloader:
        ...

        optimizer.step()
        with warmup_scheduler.dampening():
            lr_scheduler1.step()
            lr_scheduler2.step()

3、当学习率计划使用epoch号时，预热计划可以这样使用:

lr_scheduler = torch.optim.lr_scheduler.MultiStepLR(optimizer, milestones=[num_epochs//3], gamma=0.1)
warmup_scheduler = warmup.UntunedLinearWarmup(optimizer)
for epoch in range(1,num_epochs+1):
    for iter, batch in enumerate(dataloader):
        optimizer.zero_grad()
        loss = ...

        loss.backward()
        optimizer.step()
        if iter < len(dataloader)-1:
            with warmup_scheduler.dampening():
                pass
    with warmup_scheduler.dampening():
        lr_scheduler.step()

4、Warmup Schedules

1、Manual Warmup

预热因子w(t)取决于预热期，必须手动指定线性预热和指数预热。

1、 Linear

w(t) = min(1, t / warmup_period)
warmup_scheduler = warmup.LinearWarmup(optimizer, warmup_period=2000)

2、 Exponential

warmup_period = 1 / (1 - beta2)

warmup_scheduler = warmup.UntunedExponentialWarmup(optimizer)

3、 RAdam Warmup

The warmup factor depends on Adam’s beta2 parameter for RAdamWarmup. Please see the original paper for the details.

warmup_scheduler = warmup.RAdamWarmup(optimizer)

4、 Apex’s Adam

The Apex library provides an Adam optimizer tuned for CUDA devices, FusedAdam. The FusedAdam optimizer can be used with the warmup schedulers. For example:

optimizer = apex.optimizers.FusedAdam(params, lr=0.001, betas=(0.9, 0.999), weight_decay=0.01)
lr_scheduler = torch.optim.lr_scheduler.CosineAnnealingLR(optimizer, T_max=num_steps)
warmup_scheduler = warmup.UntunedLinearWarmup(optimizer)

4、YOLOV5数据增强（yolov5-v6\utils\datasets.py）

目标检测 YOLOv5 – 数据增强
 Yolov5(v6.1)数据增强方式解析
一旦训练开始，您可以在train_batch*.jpg图像中查看增强策略的效果。这些图像将在你的火车日志目录中，通常是yolov5/runs/train/exp:
train_batch0.jpg shows train batch 0 mosaics and labels:

; 5、 YOLOv5集成Albumentations，添加新的数据增强方法

To use albumentations simply pip install -U albumentations and then update the augmentation pipeline as you see fit in the new Albumentations class in yolov5/utils/augmentations.py. Note these Albumentations operations run in addition to the YOLOv5 hyperparameter augmentations, i.e. defined in hyp.scratch.yaml.

Here’s an example that applies Blur, MedianBlur and ToGray albumentations in addition to the YOLOv5 hyperparameter augmentations normally applied to your training mosaics 😃

class Albumentations:

    def __init__(self):
        self.transform = None
        try:
            import albumentations as A
            check_version(A.__version__, '1.0.3')

            self.transform = A.Compose([
                A.Blur(blur_limit=50, p=0.1),
                A.MedianBlur(blur_limit=51, p=0.1),
                A.ToGray(p=0.3)],
                bbox_params=A.BboxParams(format='yolo', label_fields=['class_labels']))

            logging.info(colorstr('albumentations: ') + ', '.join(f'{x}' for x in self.transform.transforms))
        except ImportError:
            pass
        except Exception as e:
            logging.info(colorstr('albumentations: ') + f'{e}')

    def __call__(self, im, labels, p=1.0):
        if self.transform and random.random() < p:
            new = self.transform(image=im, bboxes=labels[:, 1:], class_labels=labels[:, 0])
            im, labels = new['image'], np.array([[c, *b] for c, b in zip(new['class_labels'], new['bboxes'])])
        return im, labels

您可以在YOLOv5数据加载器中集成额外的Albumentations增强功能:

在YOLOv5数据加载器中插入albumentaugment功能的最佳位置是这里:

if self.augment:

     if not mosaic:
         img, labels = random_perspective(img, labels,
                                          degrees=hyp['degrees'],
                                          translate=hyp['translate'],
                                          scale=hyp['scale'],
                                          shear=hyp['shear'],
                                          perspective=hyp['perspective'])

     augment_hsv(img, hgain=hyp['hsv_h'], sgain=hyp['hsv_s'], vgain=hyp['hsv_v'])

其中img为图像，label为边框标签。请注意，您添加的任何albuments增强都将是对超参数文件中定义的现有自动YOLOv5增强的补充:

6、定义评估指标

健康是我们追求的价值最大化。在YOLOv5中，我们将默认适应度函数定义为指标的加权组合:mAP@0.5占权重的10%，mAP@0.5:0.95占剩余的90%，没有Precision P和Recall R。您可以根据自己的需要进行调整，或者使用默认的适合度定义(推荐)。

yolov5/utils/metrics.py

Lines 12 to 16 in 4103ce9

 def fitness(x):

     w = [0.0, 0.0, 0.1, 0.9]
     return (x[:, :4] * w).sum(1)

7、 Evolve（模型参数更新进化）


python train.py --epochs 10 --data coco128.yaml --weights yolov5s.pt --cache --evolve

for i in 0 1 2 3 4 5 6 7; do
  sleep $(expr 30 \* $i) &&
  echo 'Starting GPU '$i'...' &&
  nohup python train.py --epochs 10 --data coco128.yaml --weights yolov5s.pt --cache --device $i --evolve > evolve_gpu_$i.log &
done

for i in 0 1 2 3 4 5 6 7; do
  sleep $(expr 30 \* $i) &&
  echo 'Starting GPU '$i'...' &&
  "$(while true; do nohup python train.py... --device $i --evolve 1 > evolve_gpu_$i.log; done)" &
done


lr0: 0.01
lrf: 0.2
momentum: 0.937
weight_decay: 0.0005
warmup_epochs: 3.0
warmup_momentum: 0.8
warmup_bias_lr: 0.1
box: 0.05
cls: 0.5
cls_pw: 1.0
obj: 1.0
obj_pw: 1.0
iou_t: 0.20
anchor_t: 4.0

fl_gamma: 0.0
hsv_h: 0.015
hsv_s: 0.7
hsv_v: 0.4
degrees: 0.0
translate: 0.1
scale: 0.5
shear: 0.0
perspective: 0.0
flipud: 0.0
fliplr: 0.5
mosaic: 1.0
mixup: 0.0
copy_paste: 0.0

我们建议至少300代的进化才能获得最好的结果。请注意，进化通常是昂贵和耗时的，因为基本场景要训练数百次，可能需要数百或数千个GPU小时。

8、超参数可视化

evolve.csv is plotted as evolve.png by utils.plots.plot_evolve() after evolution finishes with one subplot per hyperparameter showing fitness (y axis) vs hyperparameter values (x axis). Yellow indicates higher concentrations. Vertical distributions indicate that a parameter has been disabled and does not mutate. This is user selectable in the meta dictionary in train.py, and is useful for fixing parameters and preventing them from evolving.

Original: https://blog.csdn.net/qq_41627642/article/details/125420988
Author: qq_41627642
Title: YOLOV5超参数设置与数据增强解析

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/723828/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

目标检测系列算法:YOLOv6代码复现

目录开发环境源码获取与复现 * 训练预测参考 YOLOv6 是一个专用于工业应用的单阶段目标检测框架，具有硬件友好的高效设计和高性能。 YOLOv6-nano 在 COCO…

人工智能 2023年7月9日
0056
SMOTE算法原理易用手搓小白版数据集扩充 python

前言为啥要写这个呢，在做课题的时候想着扩充一下数据集，尝试过这个过采样降采样，交叉采样，我还研究了一周的对抗生成网络，对抗生成网络暂时还解决不了我要生成的信号模式崩塌的问题，然后…

人工智能 2023年7月4日
0074
点云欧式聚类快速了解

点云处理算法快速了解专栏————点云欧式聚类文章目录一、pandas是什么？二、使用步骤 1.引入库 2.读入数据总结一、…

人工智能 2023年6月2日
0081
Remote Photoplethysmograph Signal Measurement from Facial Videos Using Spatio-Temporal Networks

前言前期方法的缺陷早期rPPG研究多数为”提取—分析”的两阶段方法，首先检测或跟踪人脸以提取rPPG信号，然后分析并估计相应的平均HR。缺点：1)基于纯…

人工智能 2023年7月14日
0079
Pandas数据分析去重：去重，真的只是去除一样的行或列吗？

本篇主要讲解的知识：数据分析中数据去重的概念及目标如何借助df.describe()帮助去重 df.drop_duplicates() 如何简单高效去除重复 *列什么是数据去…

人工智能 2023年6月11日
00100
双系统Ubuntu22.04深度学习环境配置与踩坑记录

双系统Ubuntu22.04深度学习环境配置踩坑记录前言目录 * 相关版本主要参考教程 Ubuntu安装 Nvidia和CUDA安装 – 踩坑经历官网安装所遇问…

人工智能 2023年7月21日
0067
faster R-CNN之RPN

目标检测算法依赖于region proposals算法来假设目标位置，随着SPP Net和fast R-CNN的改进，region proposals已经成为目标检测算法的瓶颈。 …

人工智能 2023年7月12日
0081
python利用opencv简单识别红绿灯

#先装包和环境 import cv2 from PIL import Image import numpy as np #导入视频并自定义 cor_x, cor_y = -1, -…

人工智能 2023年7月19日
0057
YOLO V5源码详解

1.数据读取首先读取图片以及标签路径，并将标签存入缓存，对单标签情况、特定类别、以及是否保持长方形等情况分别进行处理。如果需要进行mosaic 数据增强，首先找到中心点，将图片…

人工智能 2023年7月5日
00110
安装tensorflow_gpu的各种问题–总结

前言本文只是作者安装后的一个小总结，文章中的每个链接都是我看过的一些解决方案链接，所以本文不会有具体的解决方案，只会提供相应的解决方案和解决问题的链接。环境问题各不相同，没有统一…

人工智能 2023年5月24日
0082
【理论知识】实际部署中tensorrt的简单理解

搭建tensorrt的基本流程 ➢ 基本流程 ➢ 构建期 ➢ 建立 Builder（引擎构建器） ➢ 创建 Network（计算图内容） ➢ 生成 SerializedNetwor…

人工智能 2023年5月26日
0071
双十一到了，当我用Python采集了电商平台所有商品后发现….

Python采集电商平台写在前面环境及模块案例实现思路代码展示效果展示最后写在前面这不是双十一快到了，为了以最优惠的价格买到自己想买的商品，我不惜用Python把y…

人工智能 2023年6月28日
0086
通俗解读人脸检测框架-RetinaFace

目录一、简介二、模型结构 1.MobileNet-0.25 2.FPN结构 3.SSH结构 4.Head结构三、Anchor的编解码四、Multi-task Loss 一、…

人工智能 2023年7月9日
0093
机器学习系列5 利用Scikit-learn构建回归模型：准备和可视化数据（保姆级教程）

课前测验本文所用数据免费下载：数据科学机器学习系列5利用Scikit-learn构建回归模型：准备和可视化数据.ipynb-机器学习文档类资源-CSDN文库前文提要：在上一篇文章…

人工智能 2023年6月18日
0085
【Matplotlib】plt.figure()、plt.subplot() 、plt.subplots() 、plt.xticks() 、plt.xlim()和 plt.grid() 六个函数的使用

系列文章目录 Python中 matplotlib库的学习目录系列文章目录前言一、 plt.figure() 二、plt.subplot() 三、plt.subplots(…

人工智能 2023年6月13日
0061
基于libmpv内核设计开发的视频播放器-高级版（四）

环境介绍：操作系统: win10 64位 Qt版本 : Qt5.12.6 libmpv: 采用最新版–截止文章编写。mpv-dev-x86_64-v3-20220918-git…

人工智能 2023年6月2日
0098

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31