Training Faster-RCNN-TensorFlow-Python3-master with ResNet101 as the Pretrained Model

May 23, 2023, 5:53 PM • Artificial Intelligence

For the detailed steps of training Faster-RCNN-TensorFlow-Python3-master with VGG16 as the pretrained model, see the earlier post: Windows10 + Faster-RCNN-TensorFlow-Python3-master + VOC2007 dataset.

To train Faster-RCNN-TensorFlow-Python3-master with ResNet101 as the pretrained model instead, a few things need to change relative to the VGG16 training steps in that post.

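First, in step 6 of the earlier procedure, download the pretrained ResNet101 model instead. Create a folder named imagenet_weights under ./data, extract the downloaded resnet_v1_101_2016_08_28.tar.gz into ./data/imagenet_weights, and rename resnet_v1_101.ckpt to resnet101.ckpt.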
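The extract-and-rename step can also be scripted. The following is a minimal sketch, assuming the archive has already been downloaded and contains resnet_v1_101.ckpt at its top level; the function name prepare_resnet101_weights is made up for illustration.

```python
import os
import tarfile


def prepare_resnet101_weights(archive_path, weights_dir="./data/imagenet_weights"):
    """Extract the ResNet101 archive and rename the checkpoint to resnet101.ckpt."""
    os.makedirs(weights_dir, exist_ok=True)

    # Unpack the contents of the downloaded archive into the weights folder.
    with tarfile.open(archive_path, "r:gz") as archive:
        archive.extractall(weights_dir)

    src = os.path.join(weights_dir, "resnet_v1_101.ckpt")
    dst = os.path.join(weights_dir, "resnet101.ckpt")
    if not os.path.isfile(src):
        raise FileNotFoundError("expected %s after extraction" % src)

    # os.replace overwrites an existing resnet101.ckpt, so reruns are safe.
    os.replace(src, dst)
    return dst
```

Calling prepare_resnet101_weights("resnet_v1_101_2016_08_28.tar.gz") from the project root should leave ./data/imagenet_weights/resnet101.ckpt in place, which is the path the pretrained_model flag points to.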

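Second, in step 7 of the earlier procedure, besides changing the maximum number of iterations (max_iters) and the snapshot interval (snap_iterations), the following parameters also need to change.
① Change the network parameter from vgg16 to resnet101.

② Change the pretrained_model parameter from ./data/imagenet_weights/vgg16.ckpt to ./data/imagenet_weights/resnet101.ckpt.

③ Add four parameters: pooling_mode, FIXED_BLOCKS, POOLING_SIZE, and MAX_POOL.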

tf.app.flags.DEFINE_string('network', "resnet101", "The network to be used as backbone")

tf.app.flags.DEFINE_string('pretrained_model', "./data/imagenet_weights/resnet101.ckpt", "Pretrained network weights")


tf.app.flags.DEFINE_string('pooling_mode', "crop", "Default pooling mode")
tf.app.flags.DEFINE_integer('FIXED_BLOCKS', 1, "Number of fixed blocks during training")
tf.app.flags.DEFINE_integer('POOLING_SIZE', 7, "Size of the pooled region after RoI pooling")
tf.app.flags.DEFINE_boolean('MAX_POOL', False, "Whether to append max-pooling after crop_and_resize")
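To see what FIXED_BLOCKS controls, it helps to look at how build_network in resnet_v1.py slices the list of four ResNet blocks. The sketch below reproduces only that slicing on plain strings; split_blocks is a hypothetical helper, not part of the repository.

```python
def split_blocks(block_names, fixed_blocks):
    """Mirror how build_network slices the block list by FIXED_BLOCKS."""
    # The slicing below only makes sense for values 0, 1, 2 or 3.
    if not 0 <= fixed_blocks < 4:
        raise ValueError("FIXED_BLOCKS must satisfy 0 <= FIXED_BLOCKS < 4")

    frozen = block_names[0:fixed_blocks]       # built with is_training=False
    trainable = block_names[fixed_blocks:-1]   # built with the caller's is_training
    head = block_names[-1:]                    # applied to the pooled RoI features
    return frozen, trainable, head


blocks = ["block1", "block2", "block3", "block4"]
for n in range(4):
    print(n, split_blocks(blocks, n))
```

With the value 1 used in the flags above, block1 is frozen, block2 and block3 are trained, and block4 runs on the pooled RoI features as the detection head.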

Third, modify resnet_v1.py: replace the code in the original file with the code below.


from __future__ import absolute_import
from __future__ import division
from __future__ import print_function

import tensorflow as tf
import tensorflow.contrib.slim as slim
from tensorflow.contrib.slim import losses
from tensorflow.contrib.slim import arg_scope
from tensorflow.contrib.slim.python.slim.nets import resnet_utils
from tensorflow.contrib.slim.python.slim.nets import resnet_v1
import numpy as np

from lib.nets.network import Network
from tensorflow.python.framework import ops
from tensorflow.contrib.layers.python.layers import regularizers
from tensorflow.python.ops import nn_ops
from tensorflow.contrib.layers.python.layers import initializers
from tensorflow.contrib.layers.python.layers import layers
from lib.config import config as cfg

def resnet_arg_scope(is_training=True,
                     weight_decay=cfg.FLAGS.weight_decay,

                     batch_norm_decay=0.997,
                     batch_norm_epsilon=1e-5,
                     batch_norm_scale=True):
    batch_norm_params = {

        'is_training': False,
        'decay': batch_norm_decay,
        'epsilon': batch_norm_epsilon,
        'scale': batch_norm_scale,
        'trainable': False,
        'updates_collections': ops.GraphKeys.UPDATE_OPS
    }

    with arg_scope(
            [slim.conv2d],
            weights_regularizer=regularizers.l2_regularizer(weight_decay),
            weights_initializer=initializers.variance_scaling_initializer(),
            trainable=is_training,
            activation_fn=nn_ops.relu,
            normalizer_fn=layers.batch_norm,
            normalizer_params=batch_norm_params):
        with arg_scope([layers.batch_norm], **batch_norm_params) as arg_sc:
            return arg_sc

class resnetv1(Network):
    def __init__(self, batch_size=1, num_layers=101):
        Network.__init__(self, batch_size=batch_size)
        self._num_layers = num_layers
        self._resnet_scope = 'resnet_v1_%d' % num_layers

    def _crop_pool_layer(self, bottom, rois, name):
        with tf.variable_scope(name) as scope:
            batch_ids = tf.squeeze(tf.slice(rois, [0, 0], [-1, 1], name="batch_id"), [1])

            bottom_shape = tf.shape(bottom)
            height = (tf.to_float(bottom_shape[1]) - 1.) * np.float32(self._feat_stride[0])
            width = (tf.to_float(bottom_shape[2]) - 1.) * np.float32(self._feat_stride[0])
            x1 = tf.slice(rois, [0, 1], [-1, 1], name="x1") / width
            y1 = tf.slice(rois, [0, 2], [-1, 1], name="y1") / height
            x2 = tf.slice(rois, [0, 3], [-1, 1], name="x2") / width
            y2 = tf.slice(rois, [0, 4], [-1, 1], name="y2") / height

            bboxes = tf.stop_gradient(tf.concat([y1, x1, y2, x2], 1))
            if cfg.FLAGS.MAX_POOL:
                pre_pool_size = cfg.FLAGS.POOLING_SIZE * 2
                crops = tf.image.crop_and_resize(bottom, bboxes, tf.to_int32(batch_ids), [pre_pool_size, pre_pool_size],
                                                 name="crops")
                crops = slim.max_pool2d(crops, [2, 2], padding='SAME')
            else:
                crops = tf.image.crop_and_resize(bottom, bboxes, tf.to_int32(batch_ids),
                                                 [cfg.FLAGS.POOLING_SIZE, cfg.FLAGS.POOLING_SIZE],
                                                 name="crops")
        return crops

    def build_base(self):
        with tf.variable_scope(self._resnet_scope, self._resnet_scope):
            net = resnet_utils.conv2d_same(self._image, 64, 7, stride=2, scope='conv1')
            net = tf.pad(net, [[0, 0], [1, 1], [1, 1], [0, 0]])
            net = slim.max_pool2d(net, [3, 3], stride=2, padding='VALID', scope='pool1')

        return net

    def build_network(self, sess, is_training=True):

        if cfg.FLAGS.initializer == "truncated":
            initializer = tf.truncated_normal_initializer(mean=0.0, stddev=0.01)
            initializer_bbox = tf.truncated_normal_initializer(mean=0.0, stddev=0.001)
        else:
            initializer = tf.random_normal_initializer(mean=0.0, stddev=0.01)
            initializer_bbox = tf.random_normal_initializer(mean=0.0, stddev=0.001)
        bottleneck = resnet_v1.bottleneck

        if self._num_layers == 50:
            blocks = [
                resnet_utils.Block('block1', bottleneck,
                                   [(256, 64, 1)] * 2 + [(256, 64, 2)]),
                resnet_utils.Block('block2', bottleneck,
                                   [(512, 128, 1)] * 3 + [(512, 128, 2)]),

                resnet_utils.Block('block3', bottleneck,
                                   [(1024, 256, 1)] * 5 + [(1024, 256, 1)]),
                resnet_utils.Block('block4', bottleneck, [(2048, 512, 1)] * 3)
            ]
        elif self._num_layers == 101:

            blocks = [
                resnet_v1.resnet_v1_block('block1', base_depth=64, num_units=3, stride=2),
                resnet_v1.resnet_v1_block('block2', base_depth=128, num_units=4, stride=2),
                resnet_v1.resnet_v1_block('block3', base_depth=256, num_units=23, stride=1),
                resnet_v1.resnet_v1_block('block4', base_depth=512, num_units=3, stride=1),
            ]
        elif self._num_layers == 152:
            blocks = [
                resnet_utils.Block('block1', bottleneck,
                                   [(256, 64, 1)] * 2 + [(256, 64, 2)]),
                resnet_utils.Block('block2', bottleneck,
                                   [(512, 128, 1)] * 7 + [(512, 128, 2)]),

                resnet_utils.Block('block3', bottleneck,
                                   [(1024, 256, 1)] * 35 + [(1024, 256, 1)]),
                resnet_utils.Block('block4', bottleneck, [(2048, 512, 1)] * 3)
            ]
        else:

            raise NotImplementedError

        assert (0 <= cfg.FLAGS.FIXED_BLOCKS < 4)
        if cfg.FLAGS.FIXED_BLOCKS == 3:
            with slim.arg_scope(resnet_arg_scope(is_training=False)):
                net = self.build_base()
                net_conv4, _ = resnet_v1.resnet_v1(net,
                                                   blocks[0:cfg.FLAGS.FIXED_BLOCKS],
                                                   global_pool=False,
                                                   include_root_block=False,
                                                   scope=self._resnet_scope)
        elif cfg.FLAGS.FIXED_BLOCKS > 0:
            with slim.arg_scope(resnet_arg_scope(is_training=False)):
                net = self.build_base()
                net, _ = resnet_v1.resnet_v1(net,
                                             blocks[0:cfg.FLAGS.FIXED_BLOCKS],
                                             global_pool=False,
                                             include_root_block=False,
                                             scope=self._resnet_scope)

            with slim.arg_scope(resnet_arg_scope(is_training=is_training)):
                net_conv4, _ = resnet_v1.resnet_v1(net,
                                                   blocks[cfg.FLAGS.FIXED_BLOCKS:-1],
                                                   global_pool=False,
                                                   include_root_block=False,
                                                   scope=self._resnet_scope)
        else:
            with slim.arg_scope(resnet_arg_scope(is_training=is_training)):
                net = self.build_base()
                net_conv4, _ = resnet_v1.resnet_v1(net,
                                                   blocks[0:-1],
                                                   global_pool=False,
                                                   include_root_block=False,
                                                   scope=self._resnet_scope)

        self._act_summaries.append(net_conv4)
        self._layers['head'] = net_conv4
        with tf.variable_scope(self._resnet_scope, self._resnet_scope):

            self._anchor_component()

            rpn = slim.conv2d(net_conv4, 512, [3, 3], trainable=is_training, weights_initializer=initializer,
                              scope="rpn_conv/3x3")
            self._act_summaries.append(rpn)
            rpn_cls_score = slim.conv2d(rpn, self._num_anchors * 2, [1, 1], trainable=is_training,
                                        weights_initializer=initializer,
                                        padding='VALID', activation_fn=None, scope='rpn_cls_score')

            rpn_cls_score_reshape = self._reshape_layer(rpn_cls_score, 2, 'rpn_cls_score_reshape')
            rpn_cls_prob_reshape = self._softmax_layer(rpn_cls_score_reshape, "rpn_cls_prob_reshape")
            rpn_cls_prob = self._reshape_layer(rpn_cls_prob_reshape, self._num_anchors * 2, "rpn_cls_prob")
            rpn_bbox_pred = slim.conv2d(rpn, self._num_anchors * 4, [1, 1], trainable=is_training,
                                        weights_initializer=initializer,
                                        padding='VALID', activation_fn=None, scope='rpn_bbox_pred')
            if is_training:
                rois, roi_scores = self._proposal_layer(rpn_cls_prob, rpn_bbox_pred, "rois")
                rpn_labels = self._anchor_target_layer(rpn_cls_score, "anchor")

                with tf.control_dependencies([rpn_labels]):
                    rois, _ = self._proposal_target_layer(rois, roi_scores, "rpn_rois")
            else:

                if cfg.FLAGS.test_mode == "nms":
                    rois, _ = self._proposal_layer(rpn_cls_prob, rpn_bbox_pred, "rois")

                elif cfg.FLAGS.test_mode == "top":
                    rois, _ = self._proposal_top_layer(rpn_cls_prob, rpn_bbox_pred, "rois")
                else:
                    raise NotImplementedError

            if cfg.FLAGS.pooling_mode == 'crop':
                pool5 = self._crop_pool_layer(net_conv4, rois, "pool5")
            else:
                raise NotImplementedError

        with slim.arg_scope(resnet_arg_scope(is_training=is_training)):
            fc7, _ = resnet_v1.resnet_v1(pool5,
                                         blocks[-1:],
                                         global_pool=False,
                                         include_root_block=False,
                                         scope=self._resnet_scope)

        with tf.variable_scope(self._resnet_scope, self._resnet_scope):

            fc7 = tf.reduce_mean(fc7, axis=[1, 2])
            cls_score = slim.fully_connected(fc7, self._num_classes, weights_initializer=initializer,
                                             trainable=is_training, activation_fn=None, scope='cls_score')
            cls_prob = self._softmax_layer(cls_score, "cls_prob")
            bbox_pred = slim.fully_connected(fc7, self._num_classes * 4, weights_initializer=initializer_bbox,
                                             trainable=is_training,
                                             activation_fn=None, scope='bbox_pred')
        self._predictions["rpn_cls_score"] = rpn_cls_score
        self._predictions["rpn_cls_score_reshape"] = rpn_cls_score_reshape
        self._predictions["rpn_cls_prob"] = rpn_cls_prob
        self._predictions["rpn_bbox_pred"] = rpn_bbox_pred
        self._predictions["cls_score"] = cls_score
        self._predictions["cls_prob"] = cls_prob
        self._predictions["bbox_pred"] = bbox_pred
        self._predictions["rois"] = rois

        self._score_summaries.update(self._predictions)

        return rois, cls_prob, bbox_pred

    def get_variables_to_restore(self, variables, var_keep_dic):
        variables_to_restore = []

        for v in variables:

            if v.name == (self._resnet_scope + '/conv1/weights:0'):
                self._variables_to_fix[v.name] = v
                continue
            if v.name.split(':')[0] in var_keep_dic:
                print('Variables restored: %s' % v.name)
                variables_to_restore.append(v)

        return variables_to_restore

    def fix_variables(self, sess, pretrained_model):
        print('Fix Resnet V1 layers..')
        with tf.variable_scope('Fix_Resnet_V1') as scope:
            with tf.device("/cpu:0"):

                conv1_rgb = tf.get_variable("conv1_rgb", [7, 7, 3, 64], trainable=False)
                restorer_fc = tf.train.Saver({self._resnet_scope + "/conv1/weights": conv1_rgb})
                restorer_fc.restore(sess, pretrained_model)

                sess.run(tf.assign(self._variables_to_fix[self._resnet_scope + '/conv1/weights:0'],
                                   tf.reverse(conv1_rgb, [2])))
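The _crop_pool_layer method above converts each RoI from image pixels into the normalized [y1, x1, y2, x2] box that tf.image.crop_and_resize expects. Here is the same arithmetic in plain Python for a single RoI. The helper name normalize_roi is hypothetical, and the feature stride of 16 is an assumption derived from the four stride-2 stages in this network (conv1, pool1, block1, block2).

```python
def normalize_roi(roi, feat_height, feat_width, feat_stride=16):
    """Convert [batch_id, x1, y1, x2, y2] in image pixels to a crop_and_resize box."""
    batch_id, x1, y1, x2, y2 = roi

    # Same scaling as _crop_pool_layer: (feature map size - 1) * stride.
    height = (feat_height - 1.0) * feat_stride
    width = (feat_width - 1.0) * feat_stride

    # crop_and_resize wants boxes ordered as [y1, x1, y2, x2].
    return int(batch_id), [y1 / height, x1 / width, y2 / height, x2 / width]


# A 38 x 50 feature map gives height = 592 and width = 784.
print(normalize_roi((0, 196, 148, 588, 444), feat_height=38, feat_width=50))
# → (0, [0.25, 0.25, 0.75, 0.75])
```

Note the coordinate swap: RoIs arrive as x before y, while the box passed to crop_and_resize puts y first.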

Fourth, in step 9 of the earlier procedure, before clicking Run 'train' to start training, modify the following places in train.py.


from lib.nets.resnet_v1 import resnetv1


        if cfg.FLAGS.network == 'resnet101':
            self.net = resnetv1(batch_size=cfg.FLAGS.ims_per_batch)


        filename = 'resnet101_faster_rcnn_iter_{:d}'.format(iter) + '.ckpt'
        filename = os.path.join(self.output_dir, filename)
        self.saver.save(sess, filename)
        print('Wrote snapshot to: {:s}'.format(filename))

        nfilename = 'resnet101_faster_rcnn_iter_{:d}'.format(iter) + '.pkl'
        nfilename = os.path.join(self.output_dir, nfilename)

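After the changes above, run train.py to start training the model.
During training, models are saved to ./default/voc_2007_trainval/default, and each snapshot consists of four files, as shown in the figure below.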
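When several snapshots have accumulated, it can be handy to list them by iteration. The sketch below groups file names using the resnet101_faster_rcnn_iter_{:d} pattern from train.py. The post does not list the four file names, so the regular expression simply accepts anything starting with .ckpt, plus .pkl, which is an assumption; group_snapshots and latest_snapshot are hypothetical helpers.

```python
import os
import re
from collections import defaultdict

# Matches e.g. resnet101_faster_rcnn_iter_5000.ckpt.index or ..._5000.pkl
SNAPSHOT_RE = re.compile(r"^resnet101_faster_rcnn_iter_(\d+)\.(ckpt.*|pkl)$")


def group_snapshots(filenames):
    """Group snapshot file names by iteration number."""
    groups = defaultdict(list)
    for name in filenames:
        match = SNAPSHOT_RE.match(name)
        if match:
            groups[int(match.group(1))].append(name)
    return {iteration: sorted(names) for iteration, names in groups.items()}


def latest_snapshot(model_dir="./default/voc_2007_trainval/default"):
    """Return (iteration, file names) for the newest snapshot, or (None, [])."""
    groups = group_snapshots(os.listdir(model_dir))
    if not groups:
        return None, []
    latest = max(groups)
    return latest, groups[latest]
```

Calling latest_snapshot() after training returns the highest iteration found and the files that belong to it.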

Accordingly, several changes also need to be made for testing.

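First, in step 12 of the earlier procedure, create the folder ./output/resnet101/voc_2007_trainval/default instead, copy one set of model files from ./default/voc_2007_trainval/default into it, and rename every file to resnet101.<suffix>, keeping each file's original suffix.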
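The copy-and-rename step is easy to get wrong by hand, so here is a minimal sketch that does it for one iteration. It assumes the snapshot files are named resnet101_faster_rcnn_iter_<N>.<suffix>, as produced by the train.py changes; export_snapshot is a hypothetical helper name.

```python
import os
import shutil


def export_snapshot(iteration,
                    src_dir="./default/voc_2007_trainval/default",
                    dst_dir="./output/resnet101/voc_2007_trainval/default"):
    """Copy one snapshot's files to dst_dir, renaming the stem to 'resnet101'."""
    stem = "resnet101_faster_rcnn_iter_%d" % iteration
    os.makedirs(dst_dir, exist_ok=True)

    copied = []
    for name in sorted(os.listdir(src_dir)):
        # Match "<stem>.<suffix>" only, so iteration 1000 does not pick up 10000.
        if not name.startswith(stem + "."):
            continue
        new_name = "resnet101" + name[len(stem):]
        shutil.copy2(os.path.join(src_dir, name), os.path.join(dst_dir, new_name))
        copied.append(new_name)
    return copied
```

For example, export_snapshot(40000) would copy the files of the iteration-40000 snapshot, if one exists, and return the new file names.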

Second, in step 13 of the earlier procedure, make the following further changes to demo.py.

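After the changes above, run demo.py to test the model.
To output the PR curves and compute the AP values, a few places in test_net.py need the same kind of changes, as shown in the figure below.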


from lib.nets.resnet_v1 import resnetv1


NETS = {'resnet101': ('resnet101.ckpt',)}
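The NETS entry only makes sense together with the folder created earlier for testing. The sketch below shows one plausible way a script could combine the two into the checkpoint path. The DATASETS mapping and the join order are assumptions about the script, not taken from the post, so compare them with your own copy.

```python
import posixpath

NETS = {'resnet101': ('resnet101.ckpt',)}
# Assumed to match the dataset mapping already present in the script.
DATASETS = {'pascal_voc': ('voc_2007_trainval',)}


def checkpoint_path(net_name, dataset, root='./output'):
    """Build the checkpoint path that would be loaded for net_name."""
    return posixpath.join(root, net_name, DATASETS[dataset][0],
                          'default', NETS[net_name][0])


print(checkpoint_path('resnet101', 'pascal_voc'))
# → ./output/resnet101/voc_2007_trainval/default/resnet101.ckpt
```

If the printed path does not point at the renamed files, the model will fail to load, which is why the folder name and the NETS key both have to read resnet101.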

After the changes above, run test_net.py to output the PR curves and compute the AP values.

Original: https://blog.csdn.net/HUAI_BI_TONG/article/details/122630567
Author: 大彤小忆
Title: Training Faster-RCNN-TensorFlow-Python3-master with ResNet101 as the Pretrained Model
