RealSense D455 Depth Camera + YOLO V5 for Object Detection (Part 2)

June 2, 2023, 12:23 AM • Artificial Intelligence

1. Code source
2. Environment setup
3. Code analysis
3.1 Converting detect.py into realsensedetect.py
3.2 Tools for comparing files or the contents of folders
4. Thoughts and closing remarks

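The first article in this series: RealSense D455 Depth Camera + YOLO V5 for Object Detection (Part 1)

Why a second article on combining the RealSense D455 with YOLO V5? The previous article was written after I found a project on GitHub and got it running. Later I could not get that approach to work with the YOLO V5 code I had cloned myself; something was still missing. After learning from various sources, I applied the unmodified YOLO V5 code from GitHub to the camera, and detection now works well.

This combines a D435 or D455 depth camera with YOLO V5, so that while objects are recognized, their distance from the camera is measured as well.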

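Why do this at all?

1. Why use the RealSense D455 depth camera? It is an ordinary color camera with an infrared ranging module added. Like any 2D camera, it gives the projection of the 3D world onto the 2D pixel plane, that is, an image. That projection discards depth: an apple can appear as large as a football, because we do not know how far each object is from the camera. A depth camera supplies the missing distance.
2. Why use the YOLO algorithm? It does well on both real-time speed and accuracy, which makes it usable in industrial and agricultural production.

That is why combining the two is worthwhile.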
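The apple-versus-football point can be made concrete with the pinhole camera model, in which apparent size in pixels is focal length times real size divided by depth. The sketch below is only an illustration: the focal length, object sizes, and distances are made-up values, not measurements from a D455.

```python
def projected_size_px(real_size_m, depth_m, focal_length_px):
    """Apparent size in pixels of an object under the pinhole camera model."""
    return focal_length_px * real_size_m / depth_m


# Hypothetical values, chosen only for illustration.
FOCAL_LENGTH_PX = 600.0      # assumed focal length in pixels
APPLE_DIAMETER_M = 0.08
FOOTBALL_DIAMETER_M = 0.22

# A near apple and a far football can cover about the same number of pixels.
apple_px = projected_size_px(APPLE_DIAMETER_M, 0.4, FOCAL_LENGTH_PX)
football_px = projected_size_px(FOOTBALL_DIAMETER_M, 1.1, FOCAL_LENGTH_PX)
print(apple_px, football_px)
```

Both values come out at roughly 120 pixels, so the image alone cannot tell the two objects apart by size; the depth reading is what resolves the ambiguity.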

1. Code source

This is the first time I have put modified code on GitHub, and stars are appreciated. The main change is that detect.py has been rewritten as realsensedetect.py. If you want to use the code, git clone it from https://github.com/wenyishengkingkong/realsense-D455-YOLOV5.git.

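2. Environment setup

Set up the environment by following the usual YOLO V5 instructions, or use the simple setup described in the previous article. The script also imports pyrealsense2, so that package has to be available as well.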
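Before running the script, it can help to confirm that the modules it imports are present. The following is a minimal sketch, not part of the repository; the list is taken from the import statements of realsensedetect.py, and the helper name missing_modules is made up for this example.

```python
import importlib.util

# Top-level modules imported by realsensedetect.py. The pip package name in
# the comment is the usual one, listed here as an assumption.
REQUIRED_MODULES = [
    "cv2",           # usually installed as opencv-python
    "torch",
    "numpy",
    "pyrealsense2",
]


def missing_modules(names):
    """Return the names that cannot be found in the current environment."""
    return [name for name in names if importlib.util.find_spec(name) is None]


missing = missing_modules(REQUIRED_MODULES)
if missing:
    print("Missing modules:", ", ".join(missing))
else:
    print("All required modules were found.")
```

This only checks that the modules can be located; it does not verify versions or that a camera is connected.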

Then cd into the project folder and run:

python realsensedetect.py

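The main change is that detect.py has been rewritten as realsensedetect.py. When it runs, a window shows the color image with every detected object boxed and labeled with its class name and its distance in meters.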

3. Code analysis

3.1 Converting detect.py into realsensedetect.py. You can also replace the contents of your own detect.py with the file below and run it directly.

import argparse
import os
import shutil
import time
from pathlib import Path

import cv2
import torch
import torch.backends.cudnn as cudnn
from numpy import random
import numpy as np
import pyrealsense2 as rs

from models.experimental import attempt_load
from utils.general import (
    check_img_size, non_max_suppression, apply_classifier, scale_coords,
    xyxy2xywh, plot_one_box, strip_optimizer, set_logging)
from utils.torch_utils import select_device, load_classifier, time_synchronized
from utils.datasets import letterbox

def detect(save_img=False):
    out, source, weights, view_img, save_txt, imgsz = \
        opt.save_dir, opt.source, opt.weights, opt.view_img, opt.save_txt, opt.img_size
    webcam = source == '0' or source.startswith(('rtsp://', 'rtmp://', 'http://')) or source.endswith('.txt')

    # Initialize
    set_logging()
    device = select_device(opt.device)
    if os.path.exists(out):  # output dir
        shutil.rmtree(out)  # delete dir
    os.makedirs(out)  # make new dir
    half = device.type != 'cpu'  # half precision only supported on CUDA

    # Load model
    model = attempt_load(weights, map_location=device)  # load FP32 model
    imgsz = check_img_size(imgsz, s=model.stride.max())  # check img_size
    if half:
        model.half()  # to FP16
    # Set Dataloader
    vid_path, vid_writer = None, None
    view_img = True
    cudnn.benchmark = True  # set True to speed up constant image size inference
    #dataset = LoadStreams(source, img_size=imgsz)

    # Get names and colors
    names = model.module.names if hasattr(model, 'module') else model.names
    colors = [[random.randint(0, 255) for _ in range(3)] for _ in range(len(names))]

    # Run inference
    t0 = time.time()
    img = torch.zeros((1, 3, imgsz, imgsz), device=device)  # init img
    _ = model(img.half() if half else img) if device.type != 'cpu' else None  # run once
    pipeline = rs.pipeline()
    # Create the config object
    config = rs.config()
    # config.enable_stream(rs.stream.depth, 640, 480, rs.format.z16, 30)
    config.enable_stream(rs.stream.depth, 640, 480, rs.format.z16, 60)
    config.enable_stream(rs.stream.color, 640, 480, rs.format.bgr8, 60)

    # Start streaming
    pipeline.start(config)
    align_to_color = rs.align(rs.stream.color)
    while True:
        start = time.time()
        # Wait for a coherent pair of frames: depth and color
        frames = pipeline.wait_for_frames()
        frames = align_to_color.process(frames)
        # depth_frame = frames.get_depth_frame()
        depth_frame = frames.get_depth_frame()
        color_frame = frames.get_color_frame()
        color_image = np.asanyarray(color_frame.get_data())
        depth_image = np.asanyarray(depth_frame.get_data())
        mask = np.zeros([color_image.shape[0], color_image.shape[1]], dtype=np.uint8)
        mask[0:480, 320:640] = 255

        sources = [source]
        imgs = [None]
        path = sources
        imgs[0] = color_image
        im0s = imgs.copy()
        img = [letterbox(x, new_shape=imgsz)[0] for x in im0s]
        img = np.stack(img, 0)
        img = img[:, :, :, ::-1].transpose(0, 3, 1, 2)  # BGR to RGB, to 3x416x416, uint8 to float32
        img = np.ascontiguousarray(img, dtype=np.float16 if half else np.float32)
        img /= 255.0  # 0 - 255 to 0.0 - 1.0

        # Get detections
        img = torch.from_numpy(img).to(device)
        if img.ndimension() == 3:
            img = img.unsqueeze(0)
        t1 = time_synchronized()
        pred = model(img, augment=opt.augment)[0]

        # Apply NMS
        pred = non_max_suppression(pred, opt.conf_thres, opt.iou_thres, classes=opt.classes, agnostic=opt.agnostic_nms)
        t2 = time_synchronized()

        for i, det in enumerate(pred):  # detections per image
            p, s, im0 = path[i], '%g: ' % i, im0s[i].copy()
            s += '%gx%g ' % img.shape[2:]  # print string
            gn = torch.tensor(im0.shape)[[1, 0, 1, 0]]  # normalization gain whwh
            if det is not None and len(det):
                # Rescale boxes from img_size to im0 size
                det[:, :4] = scale_coords(img.shape[2:], det[:, :4], im0.shape).round()

                # Print results
                for c in det[:, -1].unique():
                    n = (det[:, -1] == c).sum()  # detections per class
                    s += '%g %ss, ' % (n, names[int(c)])  # add to string

                # Write results
                for *xyxy, conf, cls in reversed(det):
                    xywh = (xyxy2xywh(torch.tensor(xyxy).view(1, 4)) / gn).view(-1).tolist()  # normalized xywh
                    line = (cls, conf, *xywh) if opt.save_conf else (cls, *xywh)  # label format
                    distance_list = []
                    # Center pixel of the box: (top-left + bottom-right) / 2
                    mid_pos = [(int(xyxy[0]) + int(xyxy[2])) // 2, (int(xyxy[1]) + int(xyxy[3])) // 2]
                    # Depth search range: based on the shorter side of the box
                    min_val = min(abs(int(xyxy[2]) - int(xyxy[0])), abs(int(xyxy[3]) - int(xyxy[1])))
                    radius = max(min_val // 4, 1)  # avoid an empty range for very small boxes
                    randnum = 40
                    for _ in range(randnum):
                        # Sample along the box diagonal around the center, clamped to the frame
                        bias = random.randint(-radius, radius + 1)
                        px = min(max(mid_pos[0] + bias, 0), color_image.shape[1] - 1)
                        py = min(max(mid_pos[1] + bias, 0), color_image.shape[0] - 1)
                        dist = depth_frame.get_distance(int(px), int(py))
                        if dist:  # 0 means no valid depth reading
                            distance_list.append(dist)
                    # Sort, then keep the middle half of the valid samples
                    distance_list = np.sort(np.array(distance_list))
                    n = len(distance_list)
                    distance_list = distance_list[n // 4:n - n // 4]

                    if n:
                        label = '%s %.2f%s' % (names[int(cls)], np.mean(distance_list), 'm')
                    else:
                        label = '%s n/a' % names[int(cls)]  # no valid depth in the sampled pixels
                    plot_one_box(xyxy, im0, label=label, color=colors[int(cls)], line_thickness=3)

            # Print time (inference + NMS)
            print('%sDone. (%.3fs)' % (s, t2 - t1))

            # Stream results
            if view_img:
                cv2.imshow(p, im0)
                if cv2.waitKey(1) == ord('q'):  # q to quit
                    pipeline.stop()  # release the camera
                    cv2.destroyAllWindows()
                    print('Done. (%.3fs)' % (time.time() - t0))
                    return

if __name__ == '__main__':
    parser = argparse.ArgumentParser()
    parser.add_argument('--weights', nargs='+', type=str, default='yolov5m.pt', help='model.pt path(s)')
    parser.add_argument('--source', type=str, default='inference/images', help='source')  # file/folder, 0 for webcam
    parser.add_argument('--img-size', type=int, default=640, help='inference size (pixels)')
    parser.add_argument('--conf-thres', type=float, default=0.25, help='object confidence threshold')
    parser.add_argument('--iou-thres', type=float, default=0.45, help='IOU threshold for NMS')
    parser.add_argument('--device', default='', help='cuda device, i.e. 0 or 0,1,2,3 or cpu')
    parser.add_argument('--view-img', action='store_true', help='display results')
    parser.add_argument('--save-txt', action='store_true', help='save results to *.txt')
    parser.add_argument('--save-conf', action='store_true', help='save confidences in --save-txt labels')
    parser.add_argument('--save-dir', type=str, default='inference/output', help='directory to save results')
    parser.add_argument('--classes', nargs='+', type=int, help='filter by class: --class 0, or --class 0 2 3')
    parser.add_argument('--agnostic-nms', action='store_true', help='class-agnostic NMS')
    parser.add_argument('--augment', action='store_true', help='augmented inference')
    parser.add_argument('--update', action='store_true', help='update all models')
    opt = parser.parse_args()
    print(opt)

    with torch.no_grad():  # context manager: code inside it does not track gradients
        detect()

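All this code may look daunting, but only a few lines were actually changed; the rest is reordering and moving things around. If comparing by eye is tedious, two tools can help you compare the files. (Note: the code above is based on the v3.1 release of the YOLO V5 code. I expect the same changes to carry over to other versions. I have not tried other object detection algorithms, but the approach should be much the same.)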
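The distance step is the main addition in the listing, and it can be tried without a camera. The sketch below is a variant, not the author's exact method: instead of random samples along the diagonal, it takes the median of all valid readings in a small patch around the box center. It assumes a depth array already converted to meters, with 0 meaning no reading; the function name box_distance_m and the shrink parameter are made up for this example.

```python
import numpy as np


def box_distance_m(depth_m, xyxy, shrink=0.25):
    """Median of the valid depth readings in the central patch of a box.

    depth_m: HxW array of distances in meters, 0 meaning "no reading".
    xyxy:    (x1, y1, x2, y2) box corners in pixel coordinates.
    shrink:  patch half-width as a fraction of the shorter box side.
    Returns None when the patch contains no valid reading.
    """
    h, w = depth_m.shape
    x1, y1, x2, y2 = (int(v) for v in xyxy)
    cx, cy = (x1 + x2) // 2, (y1 + y2) // 2
    r = max(1, int(min(x2 - x1, y2 - y1) * shrink))
    # Clamp the patch to the image so boxes near the border do not wrap around.
    patch = depth_m[max(cy - r, 0):min(cy + r + 1, h),
                    max(cx - r, 0):min(cx + r + 1, w)]
    valid = patch[patch > 0]
    return float(np.median(valid)) if valid.size else None


# Synthetic depth image standing in for a camera frame.
depth = np.full((480, 640), 3.0, dtype=np.float32)  # background 3 m away
depth[100:300, 200:400] = 1.2                        # object 1.2 m away
depth[190:210, 290:310] = 0.0                        # hole with no reading
print(box_distance_m(depth, (200, 100, 400, 300)))   # about 1.2
```

The median ignores the hole at the center and any stray background pixels, which is the same goal as the sort-and-trim step in the listing. With a real camera, the raw z16 depth image would first have to be multiplied by the sensor's depth scale to get meters.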

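3.2 Tools for comparing files or the contents of folders

On both Windows and Ubuntu, PyCharm can do this: select the files or folders, right-click, and choose the Compare With option to see the differences. Comparing realsensedetect.py with detect.py this way shows exactly how much was changed. The second option, on Windows, is Diffinity, which works well.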
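If neither tool is at hand, Python's standard difflib module gives a quick text diff. The sketch below compares two short in-memory line lists that stand in for the two files; the helper name unified_diff_text is made up for this example.

```python
import difflib


def unified_diff_text(old_lines, new_lines,
                      old_name="detect.py", new_name="realsensedetect.py"):
    """Return a unified diff of two lists of lines as a single string."""
    return "".join(
        difflib.unified_diff(old_lines, new_lines, fromfile=old_name, tofile=new_name)
    )


# Stand-ins for the real files. For files on disk, something like
# Path("detect.py").read_text().splitlines(keepends=True) yields such lists.
old = ["import cv2\n", "import torch\n", "from numpy import random\n"]
new = ["import cv2\n", "import torch\n", "from numpy import random\n",
       "import pyrealsense2 as rs\n"]

diff_text = unified_diff_text(old, new)
print(diff_text)
```

Lines that start with a plus sign exist only in the new file, and lines that start with a minus sign exist only in the old one.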

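4. Thoughts and closing remarks

Why use the RealSense depth camera? As the previous article explained, it adds one more dimension: distance. What is that extra dimension good for? First, social-distancing style monitoring. For example, if a person is detected without a mask, you also know how far they are from the camera, so you can remind them to put a mask on before they reach a crowded entrance, reducing the risk of cross-infection. Second, 3D reconstruction: with an object's 2D pixel coordinates and its distance values, the object can be rebuilt in 3D through reconstruction or mathematical modeling. Finally, the same information can feed 3D modeling and more accurate distance computation with the PCL library for real-world applications.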
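Going from a pixel plus a distance to a 3D point can be sketched with the pinhole model. The intrinsics below (FX, FY, CX, CY) are hypothetical values rather than calibration data from a D455, lens distortion is ignored, and the depth value is assumed to be measured along the optical axis. The librealsense Python bindings provide rs2_deproject_pixel_to_point for doing this with the camera's real intrinsics.

```python
import math


def deproject(u, v, depth_m, fx, fy, cx, cy):
    """Pixel (u, v) at depth_m meters -> (X, Y, Z) in the camera frame."""
    x = (u - cx) * depth_m / fx
    y = (v - cy) * depth_m / fy
    return (x, y, depth_m)


# Hypothetical intrinsics for a 640x480 image, for illustration only.
FX = FY = 600.0
CX, CY = 320.0, 240.0

# Box centers of two detected people, both measured at 2.0 m.
person_a = deproject(200, 240, 2.0, FX, FY, CX, CY)
person_b = deproject(440, 240, 2.0, FX, FY, CX, CY)

# Straight-line distance between the two people in meters.
gap_m = math.dist(person_a, person_b)
print(person_a, person_b, gap_m)
```

With these numbers the two people are about 0.8 m apart, a quantity that the 2D image alone cannot provide.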

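This is my first time publishing my own code on GitHub, and I hope it helps. If you are interested in my work, feel free to follow me; it may come in handy some day.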

Original: https://blog.csdn.net/qq_45077256/article/details/120040059
Author: 文一生
Title: RealSense D455 Depth Camera + YOLO V5 for Object Detection (Part 2)

Original articles are protected by copyright. When reposting, please credit the source: https://www.johngo689.com/558979/
