MMDet逐行解读之AnchorGenerator

2023年7月11日下午11:09 • 人工智能 • 阅读 61

文章目录

前言
1、base_anchors的生成
2、grid_anchors的生成
3、valid_flags介绍
总结

前言

本篇主要介绍mmdet/core/anchor/anchor_generator.py文件下的AnchorGenerator类。以RetinaNet的配置作为说明。

anchor_generator_cfg = dict(
    type='AnchorGenerator',
    octave_base_scale=4,
    scales_per_octave=3,
    ratios=[0.5, 1.0, 2.0],
    strides=[8, 16, 32, 64, 128])

1、base_anchors的生成

所谓base_anchors是在初始化AnchorGenerator类借助gen_base_anchors方法产生了基础的9个anchor，这些anchor是原图上的anchor。

@ANCHOR_GENERATORS.register_module()
class AnchorGenerator(object):
    def __init__(self,
                 strides,
                 ratios,
                 scales=None,
                 base_sizes=None,
                 scale_major=True,
                 octave_base_scale=None,
                 scales_per_octave=None,
                 centers=None,
                 center_offset=0.):

        self.strides = [_pair(stride) for stride in strides]

        self.base_sizes = [min(stride) for stride in self.strides
                           ] if base_sizes is None else base_sizes

        assert ((octave_base_scale is not None
                and scales_per_octave is not None) ^ (scales is not None)), \
            'scales and octave_base_scale with scales_per_octave cannot' \
            ' be set at the same time'
        if scales is not None:
            self.scales = torch.Tensor(scales)

        elif octave_base_scale is not None and scales_per_octave is not None:
            octave_scales = np.array(
                [2**(i / scales_per_octave) for i in range(scales_per_octave)])
            scales = octave_scales * octave_base_scale
            self.scales = torch.Tensor(scales)

        self.base_anchors = self.gen_base_anchors()

现在具体看下gen_base_anchors方法：

    def gen_base_anchors(self):
        """Generate base anchors

        Returns:
            list(torch.Tensor): Base anchors of a feature grid in multiple
                feature levels.

"""
        multi_level_base_anchors = []
        for i, base_size in enumerate(self.base_sizes):
            center = None
            if self.centers is not None:
                center = self.centers[i]
            multi_level_base_anchors.append(
                self.gen_single_level_base_anchors(
                    base_size,
                    scales=self.scales,
                    ratios=self.ratios,
                    center=center))
        return multi_level_base_anchors

    def gen_single_level_base_anchors(self,
                                      base_size,
                                      scales,
                                      ratios,
                                      center=None):

        w = base_size
        h = base_size
        if center is None:
            x_center = self.center_offset * w
            y_center = self.center_offset * h
        else:
            x_center, y_center = center

        h_ratios = torch.sqrt(ratios)
        w_ratios = 1 / h_ratios
        if self.scale_major:

            ws = (w * w_ratios[:, None] * scales[None, :]).view(-1)
            hs = (h * h_ratios[:, None] * scales[None, :]).view(-1)
        else:
            ws = (w * scales[:, None] * w_ratios[None, :]).view(-1)
            hs = (h * scales[:, None] * h_ratios[None, :]).view(-1)

        base_anchors = [
            x_center - 0.5 * ws, y_center - 0.5 * hs, x_center + 0.5 * ws,
            y_center + 0.5 * hs
        ]
        base_anchors = torch.stack(base_anchors, dim=-1)

        return base_anchors

其实上面代码就是下图干的事情：就是stride * scales* ratios = 9

2、grid_anchors的生成

在生成base_anchor基础上，之后需要通过改变每个anchor的中心来广播到整张特征图上面。以grid_anchors方法实现：

    def grid_anchors(self, featmap_sizes, device='cuda'):
        assert self.num_levels == len(featmap_sizes)
        multi_level_anchors = []
        for i in range(self.num_levels):
            anchors = self.single_level_grid_anchors(
                self.base_anchors[i].to(device),
                featmap_sizes[i],
                self.strides[i],
                device=device)
            multi_level_anchors.append(anchors)
        return multi_level_anchors

贴下single_level_grid_anchors方法

    def _meshgrid(self, x, y, row_major=True):
        """Generate mesh grid of x and y

        Args:
            x (torch.Tensor): Grids of x dimension.

            y (torch.Tensor): Grids of y dimension.

            row_major (bool, optional): Whether to return y grids first.

                Defaults to True.

        Returns:
            tuple[torch.Tensor]: The mesh grids of x and y.

"""
        xx = x.repeat(len(y))
        yy = y.view(-1, 1).repeat(1, len(x)).view(-1)
        if row_major:
            return xx, yy
        else:
            return yy, xx

    def single_level_grid_anchors(self,
                                  base_anchors,
                                  featmap_size,
                                  stride=(16, 16),
                                  device='cuda'):
        """Generate grid anchors of a single level.

        Note:
            This function is usually called by method .grid_anchors.

        Args:
            base_anchors (torch.Tensor): The base anchors of a feature grid.

            featmap_size (tuple[int]): Size of the feature maps.

            stride (tuple[int], optional): Stride of the feature map.

                Defaults to (16, 16).

            device (str, optional): Device the tensor will be put on.

                Defaults to 'cuda'.

        Returns:
            torch.Tensor: Anchors in the overall feature maps.

"""
        feat_h, feat_w = featmap_size

        shift_x = torch.arange(0, feat_w, device=device) * stride[0]
        shift_y = torch.arange(0, feat_h, device=device) * stride[1]
        shift_xx, shift_yy = self._meshgrid(shift_x, shift_y)
        shifts = torch.stack([shift_xx, shift_yy, shift_xx, shift_yy], dim=-1)
        shifts = shifts.type_as(base_anchors)

        all_anchors = base_anchors[None, :, :] + shifts[:, None, :]
        all_anchors = all_anchors.view(-1, 4)
        return all_anchors

3、valid_flags介绍

简单说下这个方法作用：在模型批次训练过程中，往往会对图像进行pad，pad会出现黑边，后面撒anchor会在pad部分也回撒上anchor，其实这部分anchor应该忽略掉。故该函数就是赋予每个anchor一个标签，若anchor在有效像素位置上，则Ture；否则赋为FALSE。

    def valid_flags(self, featmap_sizes, pad_shape, device='cuda'):
"""
        输入特征图原始尺寸和pad后尺寸
        Return:
            list(torch.Tensor):返回一个和anchor数量相等的bool型张量
"""
        assert self.num_levels == len(featmap_sizes)
        multi_level_flags = []
        for i in range(self.num_levels):
            anchor_stride = self.strides[i]
            feat_h, feat_w = featmap_sizes[i]
            h, w = pad_shape[:2]
            valid_feat_h = min(int(np.ceil(h / anchor_stride[0])), feat_h)
            valid_feat_w = min(int(np.ceil(w / anchor_stride[1])), feat_w)
            flags = self.single_level_valid_flags((feat_h, feat_w),
                                                  (valid_feat_h, valid_feat_w),
                                                  self.num_base_anchors[i],
                                                  device=device)
            multi_level_flags.append(flags)
        return multi_level_flags

    def single_level_valid_flags(self,
                                 featmap_size,
                                 valid_size,
                                 num_base_anchors,
                                 device='cuda'):
        """Generate the valid flags of anchor in a single feature map

        Args:
            featmap_size (tuple[int]): 原始特征图
            valid_size (tuple[int]): pad后有效尺寸
            num_base_anchors (int): 9
            device (str, optional): Device where the flags will be put on.

                Defaults to 'cuda'.

        Returns:
            torch.Tensor: The valid flags of each anchor in a single level
                feature map.

"""
        feat_h, feat_w = featmap_size
        valid_h, valid_w = valid_size
        assert valid_h  feat_h and valid_w  feat_w
        valid_x = torch.zeros(feat_w, dtype=torch.bool, device=device)
        valid_y = torch.zeros(feat_h, dtype=torch.bool, device=device)
        valid_x[:valid_w] = 1
        valid_y[:valid_h] = 1
        valid_xx, valid_yy = self._meshgrid(valid_x, valid_y)
        valid = valid_xx & valid_yy
        valid = valid[:, None].expand(valid.size(0),
                                      num_base_anchors).contiguous().view(-1)
        return valid

总结

下篇会介绍MaxIOUAssigner，敬请期待。若有问题欢迎+vx：wulele2541612007，拉你进群探讨交流。

Original: https://blog.csdn.net/wulele2/article/details/122409507
Author: 武乐乐~
Title: MMDet逐行解读之AnchorGenerator

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/686470/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

使用Python对数据进行描述性统计（机器学习）

使用Python对数据进行描述性统计数据集：diabetes.csv参考书：《Machine Learning Mastery With Python Understand Yo…

人工智能 2023年6月15日
00116
【YoloV5 6.0|6.1 部署 TensorRT到torchserve】环境搭建|模型转换|engine模型部署（详细的packet文件编写方法）

忽然发现，关于部署TensorRT的文章少的可怜，于是乎，决定分享一下我自己关于这部分内容的一些成功实操和心得。还是希望大家可以分享出去，让更多人看到！！！ QQ: 1757093…

人工智能 2023年7月23日
0073
朴素贝叶斯（Naive Bayes）详解

朴素贝叶斯是贝叶斯分类器中的一种模型，用已知类别的数据集训练模型，从而实现对未知类别数据的类别判断。其理论基础是贝叶斯决策论（Bayesian decision theory）。 …

人工智能 2023年7月25日
0036
Python 数据集：乳腺癌数据集（from sklearn.datasets import load_breast_cancer）。

数据集：乳腺癌数据集（from sklearn.datasets import load_breast_cancer）。（1）将样本集划分为70%的训练集，30%作为测试集，分别…

人工智能 2023年7月5日
0054
YoloV5训练安全帽检测并部署在安卓上

YoloV5训练安全帽检测并实现安卓端部署一.Requirements 本教程使用的环境：u版yolov5，源码下载地址： yolov5 PyTorch:1.8.0 Cuda:1…

人工智能 2023年7月12日
0061
聚类综述-聚类算法综述-图像聚类-clustering

[TencentCloudSDKException] code:FailedOperation.ServiceIsolate message:service is stopped …

人工智能 2023年6月2日
0085
Pytorch输出网络中间层特征可视化

Pytorch输出网络中间层特征可视化本文主要介绍了如何提取特定层的特征，然后对它进行可视化。最后给出了不同网络的应用案例。推荐一个GITHUN实现可视化的工具地址整体步骤加…

人工智能 2023年7月21日
0080
pandas|Task03索引

索引 import numpy as np import pandas as pd 索引器表的列索引一般通过[]来实现，通过[列名]可以从DataFrame中取出相应的列，返回…

人工智能 2023年7月8日
0074
机器学习系列：LightGBM 可视化调参

大家好，在100天搞定机器学习|Day63 彻底掌握 LightGBM一文中，我介绍了LightGBM 的模型原理和一个极简实例。最近我发现Huggingface与Streamli…

人工智能 2023年6月4日
0095
【Matting】MODNet：实时人像抠图模型-onnx C++部署

在线人像抠图体验：CV案例相关链接：【Matting】MODNet：实时人像抠图模型-onnx python部署【Matting】MODNet：实时人像抠图模型-笔记【Ma…

人工智能 2023年5月28日
0098
Yolov5 安装详细教程及目标检测和识别

文章内容：1.在 Anaconda 环境下，进行目标检测程序（Yolov5）的下载及安装，实…

人工智能 2023年5月26日
0080
【机器学习】集成学习——Stacking模型融合（理论+图解）

🌠 『精品学习专栏导航帖』 🐳最适合入门的100个深度学习实战项目 🐳 🐙【PyTorch深度学习项目实战100例目录】项目详解 + 数据集 + 完整源码 🐙 🐶【机器学习入门项目…

人工智能 2023年6月25日
0064
（八）集成学习Bagging之随机森林与python代码实现

不定期持续更新ing~~~~~ 目录一、知识点总结不定期持续更新中： Part1 ：集成算法，bagging，boosting Part2：随机森林 Part3：相关知识点 Pa…

人工智能 2023年7月18日
0047
python学习 –DataFrame数据清洗（空值、重复值）

目录空值的处理 1、检查是否有空值 2、统计空值的数量 3、删除空值 4、填补空值用value参数替换空值将空值替换成上一列的值将空值替换成上一行的值将空值替换成下一列的…

人工智能 2023年7月6日
0087
深度学习基础宝典—激活函数、Batch Size、归一化

🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝🥰 博客首页： knighthood2001😗 欢迎点赞👍评论🗨️❤️ 热爱python，期待与大家一同进步成长！！❤️ 目录👍👍 🕐激活函数常见的激活函…

人工智能 2023年7月29日
0075
论文阅读笔记：Graph Matching Networks for Learning the Similarity of Graph Structured Objects

论文做的是用于图匹配的神经网络研究，作者做出了两点贡献: 证明GNN可以经过训练，产生嵌入graph-leve的向量可以用于相似性计算。作者提出了一种新的基于注意力的跨图匹配机制…

人工智能 2023年6月1日
0071

2024 年 4 月
一	二	三	四	五	六	日
1	2	3	4	5	6	7
8	9	10	11	12	13	14
15	16	17	18	19	20	21
22	23	24	25	26	27	28
29	30

MMDet逐行解读之AnchorGenerator

文章目录

大家都在看