Object Detection with OpenCV + YOLO + an IP Camera

July 19, 2023, 1:41 AM • Artificial Intelligence • 56 views

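Preface

After studying OpenCV and YOLO, I got object detection working on a local camera with a model I trained myself. I then wondered whether I could capture video remotely, process it on a PC, and send the results back to the device producing the stream. It turns out that an old phone with an IP camera app installed can serve as a remote camera, and it supports RTSP, the network video streaming protocol. Hikvision and Dahua cameras use the same protocol, so the setup should carry over to a later hardware upgrade.

Code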

import time
import torch
import cv2 as cv

class MultipleTarget:

    def __init__(self, url):
        """Initialize the model and open the video stream."""
        # Load the trained model
        self.model = torch.hub.load('./yolov5', 'custom', path='./weight/yolov5s.pt', source='local')
        # Set thresholds
        self.model.conf = 0.52  # confidence threshold (0-1)
        self.model.iou = 0.45  # NMS IoU threshold (0-1)
        # Open the camera stream
        self.url = url
        self.cap = cv.VideoCapture(self.url)
        self.cap.set(cv.CAP_PROP_FOURCC, cv.VideoWriter_fourcc('M', 'J', 'P', 'G'))
        if not self.cap.isOpened():
            # Exit with a message if the stream cannot be opened
            raise SystemExit("Cannot open camera")

    def draw(self, list_temp, image_temp):
        for temp in list_temp:
            name = temp[6]  # label name
            x1, y1, x2, y2 = (int(v) for v in temp[:4])  # box corners as plain ints for drawing
            cv.rectangle(image_temp, (x1, y1), (x2, y2), (0, 0, 255), 3)  # box the detected object
            cv.putText(image_temp, name, (x1 - 10, y1 - 10), cv.FONT_ITALIC, 1, (0, 255, 0), 2)

    def detect(self):
        """Run object detection on the stream until 'q' is pressed."""
        while True:
            ret, frame = self.cap.read()
            # ret is True when a frame was read correctly
            if not ret:
                print("Can't receive frame (stream end?). Exiting ...")
                break
            # frame = cv.flip(frame, 1)

            # Start timing for the FPS estimate
            start_time = time.time()

            # Inference
            results = self.model(frame)
            df = results.pandas().xyxy[0]  # tensor --> pandas DataFrame
            # Select the rows for each label of interest
            person_list = df[df['name'] == 'person'].to_numpy()
            bus_list = df[df['name'] == 'bus'].to_numpy()
            # Draw boxes around the detected objects
            self.draw(person_list, frame)
            self.draw(bus_list, frame)
            # Stop timing
            end_time = time.time()
            fps = 1 / (end_time - start_time)

            # Console output
            # results.print()  # or .show(), .save(), .crop(), .pandas(), etc.

            # print(results.xyxy[0])  # img1 predictions (tensor)
            # print('----------------')
            # print(results.pandas().xyxy[0])  # img1 predictions (pandas)

            # Show FPS on the frame
            cv.putText(frame, 'FPS:' + str(int(fps)), (30, 50), cv.FONT_ITALIC, 1, (0, 255, 0), 2)

            cv.imshow('results', frame)
            # Poll the keyboard once per frame; a second waitKey call would swallow key presses
            if cv.waitKey(10) & 0xFF == ord('q'):
                break

        self.cap.release()
        cv.destroyAllWindows()

url = 'rtsp://admin:admin@192.168.43.229:8554/live'
test = MultipleTarget(url)
test.detect()
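
The rows passed to draw come from results.pandas().xyxy[0], whose columns in YOLOv5 are xmin, ymin, xmax, ymax, confidence, class, name. That is why index 6 is the label and the first four values are the box. Here is a minimal sketch of that selection step, using plain tuples instead of a DataFrame so it runs without a model. The sample rows and the helper name select_boxes are made up for illustration.

```python
# Each row mimics one line of results.pandas().xyxy[0]:
# (xmin, ymin, xmax, ymax, confidence, class, name)
# These values are invented sample data, not real detections.
SAMPLE_ROWS = [
    (10.6, 20.2, 110.9, 220.4, 0.91, 0, "person"),
    (300.0, 40.5, 620.7, 380.1, 0.77, 5, "bus"),
    (50.3, 60.8, 90.2, 120.6, 0.60, 16, "dog"),
]


def select_boxes(rows, wanted):
    """Return (label, (x1, y1, x2, y2)) for rows whose label is in `wanted`.

    Coordinates are truncated to Python ints, as the drawing code does.
    """
    selected = []
    for row in rows:
        label = row[6]
        if label in wanted:
            x1, y1, x2, y2 = (int(v) for v in row[:4])
            selected.append((label, (x1, y1, x2, y2)))
    return selected


boxes = select_boxes(SAMPLE_ROWS, {"person", "bus"})
for label, box in boxes:
    print(label, box)
```

Filtering by a set of labels in one pass avoids building one list per class, which may be convenient if more classes are added later.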

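Problems

Without object detection, the video stream plays smoothly. With detection enabled, it stutters so badly that it is almost unusable.
After a few days of reading and searching, I think the cause is this:
CPU and memory usage max out while the stream is being read and processed.
Task Manager confirmed it while the program was running.
Following advice found online, I tried multiprocessing to fix it, but the result was the same.
I now suspect my computer is simply underpowered (PS: its specs really are poor).

A few more days of work and no progress at all!
I gave up for a while. I tried many approaches and it still stuttered badly, with very high latency. Frustrating.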
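
One plausible reading of the growing delay, stated here as an assumption rather than something the post measured: when reading and inference share one loop, the camera produces frames faster than the loop consumes them, so unread frames queue up and the displayed picture falls further behind live. The calculation below uses made-up rates (a 25 FPS camera and 5 FPS inference). Real capture backends cap their buffers, so treat it as an illustration of the trend, not a measurement.

```python
def backlog_frames(seconds, camera_fps, process_fps):
    """Frames produced but not yet processed after `seconds` (never negative)."""
    return max(0, (camera_fps - process_fps) * seconds)


def display_lag_seconds(seconds, camera_fps, process_fps):
    """How far behind live the next processed frame is, if nothing is dropped."""
    return backlog_frames(seconds, camera_fps, process_fps) / camera_fps


# Hypothetical rates: camera at 25 FPS, detection at 5 FPS, 10 s of running.
print(backlog_frames(10, 25, 5))       # 200 frames waiting
print(display_lag_seconds(10, 25, 5))  # 8.0 seconds behind live
```

Under these assumptions the lag grows linearly with running time, which is why discarding stale frames tends to help more than processing every frame faster.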

Solved (using multiprocessing)
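
The fix rests on a shared buffer used as a stack: the writer appends every frame, the reader always pops the newest one, and the buffer is emptied once it reaches a set capacity so stale frames are discarded. Here is a single-process sketch of that behaviour, with a plain list standing in for Manager().list() and integers standing in for frames. The class name FrameStack and its methods are hypothetical and do not appear in the script.

```python
class FrameStack:
    """List-backed stack that is emptied whenever it reaches `capacity`."""

    def __init__(self, capacity):
        self.capacity = capacity
        self.items = []  # stands in for Manager().list()

    def push(self, frame):
        # Writer side: append, then clear everything once full.
        self.items.append(frame)
        if len(self.items) >= self.capacity:
            del self.items[:]

    def pop_latest(self):
        # Reader side: newest frame first; None when nothing is waiting.
        return self.items.pop() if self.items else None


stack = FrameStack(capacity=5)
for frame_id in range(3):      # writer produces frames 0, 1, 2
    stack.push(frame_id)
latest = stack.pop_latest()    # reader takes frame 2, the newest
print(latest, stack.items)     # 2 [0, 1]

for frame_id in range(3, 6):   # frames 3, 4, 5 arrive; the fifth item triggers a clear
    stack.push(frame_id)
print(stack.items)             # []
```

Note that the clear also discards the frame that triggered it, so the reader occasionally has to wait for the next frame. That is the price of keeping memory bounded with this scheme.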

"""
Multi-process image processing on an RTSP video stream.
Known issue: the laptop lacked compute and the CPU maxed out; the result now runs smoothly for a while, with low latency (solved).

Process 1: read the RTSP video stream.
Frames are stored in a Manager.list.
Process 2: process the frames with YOLO.

RTSP video streaming within the local network.
Fetch the RTSP stream and process it with YOLO to perform object detection.

@author Yuzzz
"""

import os
import cv2 as cv
import gc
from multiprocessing import Process, Manager
import torch
import time

# Write frames from the RTSP video stream into the shared buffer stack
def write(stack, cam, top: int) -> None:
    """
    :param cam: camera source (here, an RTSP URL)
    :param stack: Manager.list object
    :param top: buffer stack capacity
    :return: None
    """
    print('Process to write: %s' % os.getpid())  # PID of the write child process
    cap = cv.VideoCapture(cam)
    while True:
        ret, img = cap.read()
        if ret:
            stack.append(img)
            # Empty the buffer stack each time it reaches capacity,
            # then run gc manually to keep memory from overflowing
            if len(stack) >= top:
                del stack[:]
                gc.collect()

def img_resize(image):
    """Resize the image to fit the target resolution while keeping its aspect ratio."""
    height, width = image.shape[0], image.shape[1]
    # Target resolution; common choices: 640x360, 1280x720, 1920x1080
    width_new = 1280
    height_new = 720
    # Compare aspect ratios to decide which side is the limiting one
    if width / height >= width_new / height_new:
        img_new = cv.resize(image, (width_new, int(height * width_new / width)))
    else:
        img_new = cv.resize(image, (int(width * height_new / height), height_new))
    return img_new

def save_img(yolo_img, pic_number):
    # Save one processed frame; change the directory to suit your machine
    cv.imwrite(r'E:/Pytorch_learning/SaveImg/%d.jpg' % pic_number, yolo_img)

def draw(list_temp, image_temp):
    for temp in list_temp:
        name = temp[6]  # label
        x1, y1, x2, y2 = (int(v) for v in temp[:4])  # box corners as plain ints for drawing
        cv.rectangle(image_temp, (x1, y1), (x2, y2), (0, 0, 255), 3)  # box the detected object
        cv.putText(image_temp, name, (x1 - 10, y1 - 10), cv.FONT_ITALIC, 1, (0, 255, 0), 2)

# Read frames from the buffer stack:
def read(stack) -> None:
    print('Process to read: %s' % os.getpid())  # PID of the read child process
    # Initialize YOLO
    model = torch.hub.load('./yolov5', 'custom', path='./weight/yolov5s.pt', source='local')
    # Set hyperparameters
    model.conf = 0.52  # confidence threshold (0-1)
    model.iou = 0.45  # NMS IoU threshold (0-1)
    while True:
        try:
            value = stack.pop()  # pop the newest frame
        except IndexError:
            # Stack is empty, or the writer cleared it just now; try again
            continue
        # Rescale the captured frame
        img_new = img_resize(value)

        # Start timing for the FPS estimate
        start_time = time.time()
        # Inference
        results = model(img_new)
        df = results.pandas().xyxy[0]  # tensor --> pandas DataFrame
        # Select the rows for each label of interest
        person_list = df[df['name'] == 'person'].to_numpy()
        bus_list = df[df['name'] == 'bus'].to_numpy()
        # Draw boxes around the detected objects
        draw(person_list, img_new)
        draw(bus_list, img_new)
        # Stop timing
        end_time = time.time()
        fps = 1 / (end_time - start_time)
        # Show FPS on the frame
        cv.putText(img_new, 'FPS:' + str(int(fps)), (30, 50), cv.FONT_ITALIC, 1, (0, 255, 0), 2)
        cv.imshow('results', img_new)

        # Optionally store processed frames in a folder
        # pic_number = 0  # image counter
        # pic_number += 1
        # save_img(img_new, pic_number)
        key = cv.waitKey(1) & 0xFF
        if key == ord('q'):
            break
    cv.destroyAllWindows()

if __name__ == '__main__':
    # The parent process creates the buffer stack and passes it to each child process:
    q = Manager().list()
    url = 'rtsp://admin:admin@192.168.43.229:8554/live'     # replace with your own URL

    pw = Process(target=write, args=(q, url, 100))
    pr = Process(target=read, args=(q,))
    # Start child process pw (writer):
    pw.start()
    # Start child process pr (reader):
    pr.start()
    # Wait for pr to finish:
    pr.join()

    # pw runs an infinite loop, so it cannot be joined and has to be terminated:
    pw.terminate()
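
The img_resize helper above fits each frame inside 1280×720 while preserving its aspect ratio. The size arithmetic can be checked on its own, without OpenCV. This is a sketch of the same comparison, with a hypothetical function name fit_within and arbitrary sample input sizes.

```python
def fit_within(width, height, max_width=1280, max_height=720):
    """Return (new_width, new_height) that fits the box and keeps the aspect ratio."""
    if width / height >= max_width / max_height:
        # Relatively wide image: width is the limiting side.
        return max_width, int(height * max_width / width)
    # Relatively tall image: height is the limiting side.
    return int(width * max_height / height), max_height


print(fit_within(1920, 1080))  # 16:9 source -> (1280, 720)
print(fit_within(1000, 1000))  # square source -> (720, 720)
print(fit_within(640, 480))    # 4:3 source, upscaled -> (960, 720)
```

Smaller inputs are scaled up as well as down, so a low-resolution stream costs more per frame after resizing. A lower target such as 640×360 may be worth trying on a slow machine.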

Original: https://blog.csdn.net/qq_43815039/article/details/125515074
Author: Yuzzz.
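Title: Object Detection with OpenCV + YOLO + an IP Camera

Original articles are protected by copyright. When reposting, please cite the source: https://www.johngo689.com/701911/

Reposted articles are protected by the original author's copyright. When reposting, please credit the original author!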
