Object Detection with MobileNet_SSD

May 28, 2023, 4:22 PM • Artificial Intelligence • 92 reads

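Contents

- 1. MobileNetV1 lightweight network architecture
- 2. MobileNetV2 lightweight network architecture
- 3. Prerequisites
  + (1) Downloading the MobileNetSSD_300x300.prototxt description file
  + (2) Downloading MobileNet_SSD.caffemodel
- 3. Implementation
  + (1) Initialization
  + (2) Predicted classes
  + (3) Loading the model files
  + (4) Preprocessing the image and setting the network input
  + (5) Further processing of the image
  + (6) Iterating over the predictions
  + (7) Prediction on a single image
  + (8) Real-time detection
  + (9) Complete code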

1. MobileNetV1 lightweight network architecture

https://mydreamambitious.blog.csdn.net/article/details/124560414

2. MobileNetV2 lightweight network architecture

https://mydreamambitious.blog.csdn.net/article/details/124617584

3. Prerequisites

(1) Downloading the MobileNetSSD_300x300.prototxt description file

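Note: although we run MobileNet_SSD object detection through OpenCV in Python, we still need the model's description file (.prototxt). It is included in the OpenCV 3.3.0 archive, which can be downloaded from: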
https://www.raoyunsoft.com/opencv/opencv-3.3.0/

After downloading the OpenCV 3.3.0 archive, extract it and find MobileNetSSD_300x300.prototxt inside the extracted directory.

(2) Downloading MobileNet_SSD.caffemodel

git clone https://github.com/chuanqi305/MobileNet-SSD.git

Alternatively, download it from Baidu Netdisk:
Link: https://pan.baidu.com/s/1S9GrYB-G_iS1wodrYjsdbw
Extraction code: gqpu

3. Implementation

(1) Initialization

import os
import cv2
import cvzone
import numpy as np

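# Width and height of the network input
img_width,img_heigth=300,300
# Width-to-height ratio of the network input
WHRatio=img_width/float(img_heigth)
# Scale factor applied to pixel values (about 1/127.5)
ScaleFactor=0.007843
# Mean value subtracted from each channel
meanVal=127.5
# Confidence threshold (network scores range from 0 to 1)
threshod=0.2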
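
The two constants work together: 0.007843 is approximately 1/127.5, so subtracting the mean and then multiplying by the scale factor maps pixel values from [0, 255] to roughly [-1, 1]. A minimal sketch of that arithmetic in NumPy follows; the helper name normalize_pixels is hypothetical and is not part of OpenCV.

```python
import numpy as np

ScaleFactor = 0.007843  # approximately 1 / 127.5
meanVal = 127.5

def normalize_pixels(pixels):
    """Apply (pixel - mean) * scale, the arithmetic used when building the blob."""
    return (np.asarray(pixels, dtype=np.float32) - meanVal) * ScaleFactor

# Darkest, middle and brightest pixel values land near -1, 0 and 1
print(normalize_pixels([0, 127.5, 255]))
```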

(2) Predicted classes

# MobileNet-SSD detects 21 classes: 20 object classes plus 1 background class
classNames = ['background',
              'aeroplane', 'bicycle', 'bird', 'boat',
              'bottle', 'bus', 'car', 'cat', 'chair',
              'cow', 'diningtable', 'dog', 'horse',
              'motorbike', 'person', 'pottedplant',
              'sheep', 'sofa', 'train', 'tvmonitor']

(3) Loading the model files

# Load the network definition (.prototxt) and the trained weights (.caffemodel)
net=cv2.dnn.readNetFromCaffe(prototxt='modelCaffe//MobileNetSSD_300x300.prototxt',
                             caffeModel='modelCaffe//mobilenet_iter_73000.caffemodel')
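
If either path is wrong, loading the network fails, so it can help to check the files first and report exactly which one is missing. The sketch below assumes the same two paths as above; missing_model_files is a hypothetical helper, not an OpenCV function.

```python
import os

def missing_model_files(paths):
    """Return the subset of paths that do not exist as regular files."""
    return [p for p in paths if not os.path.isfile(p)]

model_files = ['modelCaffe//MobileNetSSD_300x300.prototxt',
               'modelCaffe//mobilenet_iter_73000.caffemodel']

missing = missing_model_files(model_files)
if missing:
    print('Missing model files:', missing)
```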

(4) Preprocessing the image and setting the network input

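# Preprocess the image, set the network input and run a forward pass
def processImage(imgSize):
    # Resize to 300x300, subtract the mean from every channel, then scale.
    # The mean is passed as a 3-tuple so that it is applied to all three channels.
    blob = cv2.dnn.blobFromImage(image=imgSize, scalefactor=ScaleFactor,
                                 size=(img_width, img_heigth),
                                 mean=(meanVal, meanVal, meanVal))
    # Set the network input and run a forward pass
    net.setInput(blob)
    detections = net.forward()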
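
The blob passed to the network is a 4-D float array in NCHW order (batch, channels, height, width). The NumPy-only sketch below illustrates that layout; it assumes the input is already 300x300 and skips resizing, and to_blob is a hypothetical helper rather than OpenCV's implementation.

```python
import numpy as np

def to_blob(img_bgr, scale=0.007843, mean=127.5):
    """Convert an HxWx3 image of the target size into a 1x3xHxW float32 blob."""
    x = (img_bgr.astype(np.float32) - mean) * scale
    x = np.transpose(x, (2, 0, 1))   # HWC -> CHW
    return x[np.newaxis, ...]        # add the batch dimension

dummy = np.full((300, 300, 3), 255, dtype=np.uint8)  # all-white test image
blob = to_blob(dummy)
print(blob.shape)  # (1, 3, 300, 300)
```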

(5) Further processing of the image

    # Center-crop the image to the network's aspect ratio
    height,width,channel=np.shape(imgSize)
    if width/float(height)>WHRatio:
        # Crop the excess width
        cropSize=(int(height*WHRatio),height)
    else:
        # Crop the excess height
        cropSize = (width,int(width / WHRatio))
    y1=int((height-cropSize[1])/2)
    y2=int(y1+cropSize[1])
    x1=int((width-cropSize[0])/2)
    x2=int(x1+cropSize[0])
    imgSize=imgSize[y1:y2,x1:x2]
    height,width,channel=np.shape(imgSize)
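
The crop arithmetic above can be checked in isolation. The sketch below wraps it in a hypothetical helper, center_crop_box, and works through a 640x480 frame: with a target ratio of 1.0 the excess width is removed equally from both sides.

```python
def center_crop_box(width, height, target_ratio=1.0):
    """Return (x1, y1, x2, y2) of the centered crop with the given width/height ratio."""
    if width / float(height) > target_ratio:
        # Image is too wide: keep the full height, trim the width
        crop_w, crop_h = int(height * target_ratio), height
    else:
        # Image is too tall (or already matches): keep the full width, trim the height
        crop_w, crop_h = width, int(width / target_ratio)
    x1 = (width - crop_w) // 2
    y1 = (height - crop_h) // 2
    return x1, y1, x1 + crop_w, y1 + crop_h

print(center_crop_box(640, 480))  # (80, 0, 560, 480)
```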

(6) Iterating over the predictions

Open MobileNetSSD_300x300.prototxt and look at the end of the file. The first parameter shown there is 0, which denotes the background class.

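    # Iterate over the detections
    print('detection.shape: {}'.format(detections.shape))
    print('detection: {}'.format(detections))
    for i in range(detections.shape[2]):
        # Raw score in [0, 1]; compare it with the threshold before converting to a percentage
        score=float(detections[0,0,i,2])
        if score>threshod:
            # Confidence as a percentage, rounded to two decimals
            confidence=round(score*100,2)
            # Predicted class id
            class_id=int(detections[0,0,i,1])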

            xLeftBottom=int(detections[0,0,i,3]*width)
            yLeftBottom=int(detections[0,0,i,4]*height)
            xRightTop=int(detections[0,0,i,5]*width)
            yRightTop=int(detections[0,0,i,6]*height)

            cv2.rectangle(img=imgSize,pt1=(xLeftBottom,yLeftBottom),
                          pt2=(xRightTop,yRightTop),color=(0,255,0),thickness=2)
            label=classNames[class_id]+": "+str(confidence)
            labelSize, baseLine = cv2.getTextSize(label, cv2.FONT_HERSHEY_SIMPLEX, 0.5, 1)

            cvzone.putTextRect(img=imgSize,text=label,pos=(xLeftBottom+9,yLeftBottom-12),
                                scale=1,thickness=1,colorR=(0,255,0))

    return imgSize
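
The loop indexes the output as detections[0,0,i,k], where the last axis holds seven values per detection; index 1 is the class id, index 2 the score, and indices 3 to 6 the box corners as fractions of the image size. The sketch below applies the same indexing to a synthetic array so it runs without the model; parse_detections and the fake data are hypothetical, for illustration only.

```python
import numpy as np

def parse_detections(detections, width, height, threshold=0.2):
    """Turn a (1, 1, N, 7) detection array into a list of (class_id, score, box) tuples."""
    results = []
    for det in detections[0, 0]:
        score = float(det[2])
        if score <= threshold:
            continue  # skip weak detections
        # Scale the normalized corners to pixel coordinates
        box = (int(det[3] * width), int(det[4] * height),
               int(det[5] * width), int(det[6] * height))
        results.append((int(det[1]), score, box))
    return results

# Synthetic output: one confident detection of class 15 and one weak detection of class 7
fake = np.array([[[[0, 15, 0.9, 0.25, 0.25, 0.75, 0.5],
                   [0, 7, 0.05, 0.0, 0.0, 1.0, 1.0]]]], dtype=np.float32)
print(parse_detections(fake, width=300, height=300))
```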

(7) Prediction on a single image

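# Run detection on a single image
def SignalDetect(img_path='images//6.png'):
    imgSize=cv2.imread(img_path)
    # cv2.imread returns None when the file cannot be read
    if imgSize is None:
        print('Could not read image: {}'.format(img_path))
        return
    imgSize=processImage(imgSize)
    cv2.imshow('imgSize', imgSize)
    cv2.waitKey(0)
    cv2.destroyAllWindows()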

(8) Real-time detection

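# Real-time detection from the default camera
def detectTime():
    cap=cv2.VideoCapture(0)
    while cap.isOpened():
        ret,frame=cap.read()
        # Stop if no frame could be read
        if not ret:
            break
        frame=cv2.resize(src=frame,dsize=(520,520))
        # A positive flipCode mirrors the frame horizontally
        frame=cv2.flip(src=frame,flipCode=2)
        frame=processImage(frame)
        cv2.imshow('frame',frame)
        key=cv2.waitKey(1)
        # Exit when Esc is pressed
        if key==27:
            break
    cap.release()
    cv2.destroyAllWindows()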

(9) Complete code

import os
import cv2
import cvzone
import numpy as np

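# Width and height of the network input
img_width,img_heigth=300,300
# Width-to-height ratio of the network input
WHRatio=img_width/float(img_heigth)
# Scale factor applied to pixel values (about 1/127.5)
ScaleFactor=0.007843
# Mean value subtracted from each channel
meanVal=127.5
# Confidence threshold (network scores range from 0 to 1)
threshod=0.2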

# MobileNet-SSD detects 21 classes: 20 object classes plus 1 background class
classNames = ['background',
              'aeroplane', 'bicycle', 'bird', 'boat',
              'bottle', 'bus', 'car', 'cat', 'chair',
              'cow', 'diningtable', 'dog', 'horse',
              'motorbike', 'person', 'pottedplant',
              'sheep', 'sofa', 'train', 'tvmonitor']

# Load the network definition (.prototxt) and the trained weights (.caffemodel)
net=cv2.dnn.readNetFromCaffe(prototxt='modelCaffe//MobileNetSSD_300x300.prototxt',
                             caffeModel='modelCaffe//mobilenet_iter_73000.caffemodel')

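# Preprocess the image, set the network input and run a forward pass
def processImage(imgSize):
    # Resize to 300x300, subtract the mean from every channel, then scale.
    # The mean is passed as a 3-tuple so that it is applied to all three channels.
    blob = cv2.dnn.blobFromImage(image=imgSize, scalefactor=ScaleFactor,
                                 size=(img_width, img_heigth),
                                 mean=(meanVal, meanVal, meanVal))
    # Set the network input and run a forward pass
    net.setInput(blob)
    detections = net.forward()
    # Center-crop the image to the network's aspect ratio
    height,width,channel=np.shape(imgSize)
    if width/float(height)>WHRatio:
        # Crop the excess width
        cropSize=(int(height*WHRatio),height)
    else:
        # Crop the excess height
        cropSize = (width,int(width / WHRatio))
    y1=int((height-cropSize[1])/2)
    y2=int(y1+cropSize[1])
    x1=int((width-cropSize[0])/2)
    x2=int(x1+cropSize[0])
    imgSize=imgSize[y1:y2,x1:x2]
    height,width,channel=np.shape(imgSize)

    # Iterate over the detections
    # print('detection.shape: {}'.format(detections.shape))
    # print('detection: {}'.format(detections))
    for i in range(detections.shape[2]):
        # Raw score in [0, 1]; compare it with the threshold before converting to a percentage
        score=float(detections[0,0,i,2])
        if score>threshod:
            # Confidence as a percentage, rounded to two decimals
            confidence=round(score*100,2)
            # Predicted class id
            class_id=int(detections[0,0,i,1])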

            xLeftBottom=int(detections[0,0,i,3]*width)
            yLeftBottom=int(detections[0,0,i,4]*height)
            xRightTop=int(detections[0,0,i,5]*width)
            yRightTop=int(detections[0,0,i,6]*height)

            cv2.rectangle(img=imgSize,pt1=(xLeftBottom,yLeftBottom),
                          pt2=(xRightTop,yRightTop),color=(0,255,0),thickness=2)
            label=classNames[class_id]+": "+str(confidence)
            labelSize, baseLine = cv2.getTextSize(label, cv2.FONT_HERSHEY_SIMPLEX, 0.5, 1)

            cvzone.putTextRect(img=imgSize,text=label,pos=(xLeftBottom+9,yLeftBottom-12),
                                scale=1,thickness=1,colorR=(0,255,0))
            # cv2.rectangle(imgSize, (xLeftBottom, yLeftBottom - labelSize[1]),
            #               (xLeftBottom + labelSize[0], yLeftBottom + baseLine),
            #               (255, 255, 255), cv2.FILLED)
            # cv2.putText(imgSize, label, (xLeftBottom, yLeftBottom),
            #             cv2.FONT_HERSHEY_SIMPLEX, 0.5, (0, 0, 0))
    return imgSize

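# Run detection on a single image
def SignalDetect(img_path='images//8.png'):
    imgSize=cv2.imread(img_path)
    # cv2.imread returns None when the file cannot be read
    if imgSize is None:
        print('Could not read image: {}'.format(img_path))
        return
    imgSize=processImage(imgSize)
    cv2.imshow('imgSize', imgSize)
    cv2.waitKey(0)
    cv2.destroyAllWindows()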

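# Real-time detection from the default camera
def detectTime():
    cap=cv2.VideoCapture(0)
    while cap.isOpened():
        ret,frame=cap.read()
        # Stop if no frame could be read
        if not ret:
            break
        frame=cv2.resize(src=frame,dsize=(520,520))
        # A positive flipCode mirrors the frame horizontally
        frame=cv2.flip(src=frame,flipCode=2)
        frame=processImage(frame)
        cv2.imshow('frame',frame)
        key=cv2.waitKey(1)
        # Exit when Esc is pressed
        if key==27:
            break
    cap.release()
    cv2.destroyAllWindows()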

if __name__ == '__main__':
    print('Pycharm')
    # SignalDetect()
    detectTime()

Original: https://blog.csdn.net/Keep_Trying_Go/article/details/125789909
Author: Keep_Trying_Go
Title: Object Detection with MobileNet_SSD
