OpenCV—-YOLOv5目标检测模型推理 (兼容YOLACT)

2023年7月9日下午3:15 • 人工智能 • 阅读 95

分析：
1）opencv的DNN模块集成了很多深度学习模型，包括人脸检测、图像分类、分割、目标检测等，集成了Pytorch、tensorflow、paddlepaddle等模型框架（参看代码库OpenCV/dnn）
2）深度学习推理模型一般步骤：加载模型，包括配置文件和权重文件；输入图像预处理，转换成模型可接受的文件类型和尺寸；模型预测后处理，对于实例分割，主要是NMS后处理方法；

结果展示：

main.exe -h

Usage: main.exe [params] image confThreshold nmsThresshold model_name

        -?, -h, --help, --usage (value:true)
                opecv based deep learining demo

        image (value:inference/horses.jpg)
                Image to process
        confThreshold (value:0.5)
                confidence threshold, default 0.5
        nmsThresshold (value:0.5)
                nms threshold, default 0.5
        model_name (value:yolov5)
                dnn model, default yolov5
parse wrong, please check command or type help

 main.exe inference/horses.jpg 0.5 0.5 yolov5

CMakeLists.txt:


SET(CMAKE_BUILD_TYPE "Release")

include_directories(".../opencv/build/include" ".../opencv/build/include/opencv2")
link_directories(".../opencv/build/x64/vc15/lib")

add_executable (main main.cpp)
add_library(yolact yolact.cpp)
add_library(yolov5 yolov5.cpp)
add_library(config config.cpp)
target_link_libraries(main yolact yolov5 config opencv_world460)

代码示例:

1：检测模型配置文件头文件 config.hpp


extern const char* class_names[];
extern const unsigned char colors[81][3];

2: 检测模型配置实现 config.cpp


#pragma once
#include
#include"config.hpp"

extern const char* class_names[] = { "background",
                                        "person", "bicycle", "car", "motorcycle", "airplane", "bus",
                                        "train", "truck", "boat", "traffic light", "fire hydrant",
                                        "stop sign", "parking meter", "bench", "bird", "cat", "dog",
                                        "horse", "sheep", "cow", "elephant", "bear", "zebra", "giraffe",
                                        "backpack", "umbrella", "handbag", "tie", "suitcase", "frisbee",
                                        "skis", "snowboard", "sports ball", "kite", "baseball bat",
                                        "baseball glove", "skateboard", "surfboard", "tennis racket",
                                        "bottle", "wine glass", "cup", "fork", "knife", "spoon", "bowl",
                                        "banana", "apple", "sandwich", "orange", "broccoli", "carrot",
                                        "hot dog", "pizza", "donut", "cake", "chair", "couch",
                                        "potted plant", "bed", "dining table", "toilet", "tv", "laptop",
                                        "mouse", "remote", "keyboard", "cell phone", "microwave", "oven",
                                        "toaster", "sink", "refrigerator", "book", "clock", "vase",
                                        "scissors", "teddy bear", "hair drier", "toothbrush"
};

extern const unsigned char colors[81][3] = {{56, 0, 255}, {226, 255, 0}, {0, 94, 255},
    {0, 37, 255}, {0, 255, 94}, {255, 226, 0}, {0, 18, 255}, {255, 151, 0},
    {170, 0, 255}, {0, 255, 56}, {255, 0, 75}, {0, 75, 255}, {0, 255, 169},
    {255, 0, 207}, {75, 255, 0}, {207, 0, 255}, {37, 0, 255}, {0, 207, 255},
    {94, 0, 255}, {0, 255, 113}, {255, 18, 0}, {255, 0, 56}, {18, 0, 255},
    {0, 255, 226}, {170, 255, 0}, {255, 0, 245}, {151, 255, 0}, {132, 255, 0},
    {75, 0, 255}, {151, 0, 255}, {0, 151, 255}, {132, 0, 255}, {0, 255, 245},
    {255, 132, 0}, {226, 0, 255}, {255, 37, 0}, {207, 255, 0},
    {0, 255, 207}, {94, 255, 0}, {0, 226, 255},
    {56, 255, 0}, {255, 94, 0}, {255, 113, 0},{0, 132, 255}, {255, 0, 132},
    {255, 170, 0}, {255, 0, 188}, {113, 255, 0}, {245, 0, 255}, {113, 0, 255},
    {255, 188, 0}, {0, 113, 255}, {255, 0, 0}, {0, 56, 255}, {255, 0, 113},
    {0, 255, 188}, {255, 0, 94}, {255, 0, 18}, {18, 255, 0}, {0, 255, 132},
    {0, 188, 255}, {0, 245, 255}, {0, 169, 255},{37, 255, 0},
    {255, 0, 151}, {188, 0, 255}, {0, 255, 37}, {0, 255, 0},
    {255, 0, 170}, {255, 0, 37}, {255, 75, 0}, {0, 0, 255}, {255, 207, 0},
    {255, 0, 226}, {255, 245, 0}, {188, 255, 0}, {0, 255, 18}, {0, 255, 75},
    {0, 255, 151}, {255, 56, 0}, {245, 255, 0}
};

extern struct net_config{
    float confThreshold;
    float nmsThreshold;
    std::string model_name;
    int img_size;
    std::string model_path;
};

3: yolov5推理模型


#include
#include
#include
#include
#include
#include

#include "config.cpp"

using namespace cv;
using namespace dnn;
using namespace std;

class yolov5
{
public:

    yolov5(float confThreshold, float nmsThreshold, string model_path = "model/yolov5m.onnx", const int keep_top_k = 200);

    yolov5(net_config& config);

    void detect(Mat& frame);
private:
    const float anchors[3][6] = {{10.0, 13.0, 16.0, 30.0, 33.0, 23.0}, {30.0, 61.0, 62.0, 45.0, 59.0, 119.0},{116.0, 90.0, 156.0, 198.0, 373.0, 326.0}};
    const float stride[3] = { 8.0, 16.0, 32.0 };
    const int inpWidth = 640;
    const int inpHeight = 640;
    float confThreshold = 0.5;
    float nmsThreshold = 0.5;
    float objThreshold = 0.5;

    Net net;

    void drawPred(float conf, int left, int top, int right, int bottom, Mat& frame, int classid);

    void sigmoid(Mat* out, int length){
        float* pdata = (float*)(out->data);
        int i = 0;
        for (i = 0; i < length; i++)
        {
            pdata[i] = 1.0 / (1 + expf(-pdata[i]));
        }
    }
};

yolov5::yolov5(float confThreshold, float nmsThreshold, string model_path, const int keep_top_k)
{
    this->confThreshold = confThreshold;
    this->nmsThreshold = nmsThreshold;
    this->net = readNet(model_path);
}

yolov5::yolov5(net_config& config)
{
    this->confThreshold = config.confThreshold;
    this->nmsThreshold = config.nmsThreshold;
    this->net = readNet(config.model_path);
}

void yolov5::drawPred(float conf, int left, int top, int right, int bottom, Mat& frame, int classid)
{

    rectangle(frame, Point(left, top), Point(right, bottom), Scalar(0, 0, 255), 2);

    string label = format("%.2f", conf);
    label = string(class_names[classid+1]) + ":" + label;

    int baseLine;
    Size labelSize = getTextSize(label, FONT_HERSHEY_SIMPLEX, 0.5, 1, &baseLine);
    top = max(top, labelSize.height);

    putText(frame, label, Point(left, top), FONT_HERSHEY_SIMPLEX, 0.75, Scalar(0, 255, 0), 1);

}

void yolov5::detect(Mat& frame)
{
    Mat blob;
    blobFromImage(frame, blob, 1 / 255.0, Size(this->inpWidth, this->inpHeight), Scalar(0, 0, 0), true, false);
    this->net.setInput(blob);
    vector<Mat> outs;
    this->net.forward(outs, this->net.getUnconnectedOutLayersNames());

    vector<int> classIds;
    vector<float> confidences;
    vector<Rect> boxes;
    float ratioh = (float)frame.rows / this->inpHeight;
    float ratiow = (float)frame.cols / this->inpWidth;

    int n = 0, q = 0, i = 0, j = 0, nout = 80 + 5, c = 0;
    for (n = 0; n < 3; n++)
    {
        int num_grid_x = (int)(this->inpWidth / this->stride[n]);
        int num_grid_y = (int)(this->inpHeight / this->stride[n]);
        int area = num_grid_x * num_grid_y;
        this->sigmoid(&outs[n], 3 * nout * area);
        for (q = 0; q < 3; q++)
        {
            const float anchor_w = this->anchors[n][q * 2];
            const float anchor_h = this->anchors[n][q * 2 + 1];
            float* pdata = (float*)outs[n].data + q * nout * area;
            for (i = 0; i < num_grid_y; i++)
            {
                for (j = 0; j < num_grid_x; j++)
                {
                    float box_score = pdata[4 * area + i * num_grid_x + j];
                    if (box_score > this->objThreshold)
                    {
                        float max_class_socre = 0, class_socre = 0;
                        int max_class_id = 0;
                        for (c = 0; c < 80; c++)
                        {
                            class_socre = pdata[(c + 5) * area + i * num_grid_x + j];
                            if (class_socre > max_class_socre)
                            {
                                max_class_socre = class_socre;
                                max_class_id = c;
                            }
                        }

                        if (max_class_socre > this->confThreshold)
                        {
                            float cx = (pdata[i * num_grid_x + j] * 2.f - 0.5f + j) * this->stride[n];
                            float cy = (pdata[area + i * num_grid_x + j] * 2.f - 0.5f + i) * this->stride[n];
                            float w = powf(pdata[2 * area + i * num_grid_x + j] * 2.f, 2.f) * anchor_w;
                            float h = powf(pdata[3 * area + i * num_grid_x + j] * 2.f, 2.f) * anchor_h;

                            int left = (cx - 0.5*w)*ratiow;
                            int top = (cy - 0.5*h)*ratioh;

                            classIds.push_back(max_class_id);
                            confidences.push_back(max_class_socre);
                            boxes.push_back(Rect(left, top, (int)(w*ratiow), (int)(h*ratioh)));
                        }
                    }
                }
            }
        }
    }

    vector<int> indices;
    NMSBoxes(boxes, confidences, this->confThreshold, this->nmsThreshold, indices);
    for (size_t i = 0; i < indices.size(); ++i)
    {
        int idx = indices[i];
        Rect box = boxes[idx];

        this->drawPred(confidences[idx], box.x, box.y,
            box.x + box.width, box.y + box.height, frame, classIds[idx]);
    }
}

4: 整体代码结构


#define _CRT_SECURE_NO_WARNINGS
#include
#include
#include
#include
#include

#include "config.cpp"
#include "yolact.cpp"
#include "yolov5.cpp"

using namespace cv;
using namespace dnn;
using namespace std;

bool parseParam(int argc, char** argv, const char* keys, Mat& img, net_config& config){
    CommandLineParser parser(argc, argv, keys);
    if(parser.has("help")){
        parser.printMessage();
        return false;
    }
    if(!parser.check()){
        parser.printErrors();
        return false;
    }
    String imgFile = parser.get<String>(0);
    img = imread(imgFile);
    if(img.empty()){
        cout << "wrong image path ! please check again." << endl;
        return false;
    }
    config.confThreshold = parser.get<float>(1);
    config.nmsThreshold = parser.get<float>(2);
    config.model_name = parser.get<string>(3);
    return true;
}

int main(int argc, char** argv)
{
    const char* keys  = {
        "{help h usage ? | | opecv based deep learining demo}"
        "{@image | inference/horses.jpg | Image to process}"
        "{@confThreshold | 0.5 | confidence threshold, default 0.5}"
        "{@nmsThresshold | 0.5 | nms threshold, default 0.5}"
        "{@model_name | yolov5 | dnn model, default yolov5}"
        };

    net_config config;
    Mat srcimg;
    if(!parseParam(argc, argv, keys, srcimg, config)){
        cout << "parse wrong, please check command or type help" << endl;
        return 0;
    }

    if(config.model_name == "yolact"){
        config.model_path = "model/yolact_base_54_800000.onnx";
        yolact model(config);
        model.detect(srcimg);
        static const string kWinName = "Deep learning object detection in OpenCV";
        namedWindow(kWinName, WINDOW_NORMAL);
        imshow(kWinName, srcimg);
        waitKey(0);
        destroyAllWindows();
    }else if(config.model_name == "yolov5"){
        config.model_path = "model/yolov5m.onnx";
        yolov5 model(config);
        model.detect(srcimg);
        static const string kWinName = "Deep learning object detection in OpenCV";
        namedWindow(kWinName, WINDOW_NORMAL);
        imshow(kWinName, srcimg);
        waitKey(0);
        destroyAllWindows();

        }
    }else{
        cout << "model not defined" << endl;
    }
    return 0;
}

Original: https://blog.csdn.net/qq_37172182/article/details/126554052
Author: qq_37172182
Title: OpenCV—-YOLOv5目标检测模型推理 (兼容YOLACT)

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/680922/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

【Python机器学习】回归模型：推土机售价预测

文章目录使用机器学习预测推土机的售价零、导入模块一、EDA * 1.1 查看基本信息 1.2 特征类型转换 1.3 联表+特征初筛 – 1.3.1 删除包含重复信…

人工智能 2023年6月18日
0090
Keras深度学习使用VGG16预训练神经网络实现猫狗分类

Keras深度学习使用VGG16预训练神经网络实现猫狗分类最近刚刚接触深度学习不久，而Keras呢，是在众多的深度学习框架中，最适合上手的，而猫狗的图像分类呢，也算是计算机视觉中…

人工智能 2023年5月23日
0064
定义的评分函数

基于嵌入的方法通常是在向量空间中表示KG，然后对于结果向量应用预先定义的评分函数来进行知识图谱补全。但是这种方法的弊端就是对于训练过程中存在的实体，可以有很好的训练效果，但是对于在…

人工智能 2023年6月4日
0081
基于麻雀算法优化LSTM回归预测（matlab）

基于麻雀算法优化LSTM回归预测（matlab）概述：麻雀算法构思 lstm原理麻雀优化lstm原理代码及结果展示第一部分麻雀算法构思众所周知，麻雀是常见的留鸟而且非…

人工智能 2023年6月17日
0071
少儿编程是什么？要学吗？如何学？

编程是一件很有趣的事情，主要能培养这些能力：一、构思能力编程是一种”先写剧本，后看结果”的活动，这要求孩子先在脑子里进行构思并模拟出结果，然后再实际验证结…

人工智能 2023年6月6日
00102
神经网络中的 Dropout 以及变体方法

Dropout 的学习笔记，主要参考文章： 12种主要的Dropout方法：如何应用于DNNs，CNNs，RNNs中的数学和可视化解释【科普】神经网络中的随机失活方法 1. 简介…

人工智能 2023年7月13日
00115
备战数学建模28 & 科研必备 Python之数据处理神器pandas

目录 1-series和读取外部数据 2-pandas的DataFrame 3-统计方法和字符串离散化 4-数据的合并和分组聚合 1-series和读取外部数据我们知道在pyth…

人工智能 2023年7月7日
0086
cartographer中的反光板定位

简介反光板定位作为cartographer中landmark数据最常用的部分，其特性和AprilTag使用方法类似，在cartographer中， landmark必须是 tra…

人工智能 2023年6月1日
0079
基于Python的抽取式文本自动摘要的实现

资源下载地址：https://download.csdn.net/download/sheziqiong/85736065资源下载地址：https://download.csdn….

人工智能 2023年5月31日
0098
知识图谱可视化技术在美团的实践与探索

知识图谱可视化可以更直观地查看和分析知识图谱的数据。本文主要介绍了美团平台在布局策略、视觉降噪、交互功能、可视化叙事、3D图谱可视化等方面的一些实践和探索，同时沉淀出了uni-gr…

人工智能 2023年6月1日
0082
【数据攻略】字节面试真题（含答案）+100道面试题库

整理了一套字节的面试真题，还有100道PDF版的面试题库一、SQL题面试真题1：抖音电商平台，现有一张订单表（order_info），有以下字段： order_id good…

人工智能 2023年7月16日
0074
【Linux】感性认识冯诺依曼体系结构和操作系统

文章目录 * – 1、冯诺依曼体系结构 – 2、操作系统 – + 2.1 操作系统向下对硬件的管理 + 2.1 操作系统对软件前言：本章都是为…

人工智能 2023年6月29日
0088
神经网络模型的参数量和计算量

其实模型的参数量好算，但浮点运算数并不好确定，我们一般也就根据参数量直接估计计算量了。但是像卷积之类的运算，它的参数量比较小，但是运算量非常大，它是一种计算密集型的操作。反观全连接…

人工智能 2023年6月4日
00114
对比学习（contrastive learning）

什么是自监督学习？举个通俗的例子：即使不记得物体究竟是什么样子，我们也可以在野外识别物体。我们通过记住高阶特征并忽略微观层面的细节来做到这一点。那么，现在的问题是，我们能否构建…

人工智能 2023年5月26日
0085
彻底搞懂dfs与回溯

目录初识dfs 扩展到图深度优先搜索dfs，其过程是对每一个可能的分支路径深入到不能再深入为止，是一种广泛用于树和图中搜索路径，和其他情况下搜索需要的情况的算法初识dfs 彻…

人工智能 2023年6月26日
0073
使用PyTorch进行小样本学习的图像分类

近年来，基于深度学习的模型在目标检测和图像识别等任务中表现出色。像ImageNet这样具有挑战性的图像分类数据集，包含1000种不同的对象分类，现在一些模型已经超过了人类水平上。但…

人工智能 2023年7月21日
0071

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

OpenCV—-YOLOv5目标检测模型推理 (兼容YOLACT)

大家都在看