Installing PyTorch + TensorRT + YOLOv5 on the Jetson TX2 for Real-Time Detection

Environment already configured:

After flashing the TX2, I had already built OpenCV 4.5.1: see "Installing OpenCV 4.5.1 + contrib with CUDA acceleration on Ubuntu 18.04 (with prebuilt OpenCV 4.5.1 and missing files)" on Barrymaster's CSDN blog.

My TX2 version and environment: JetPack 4.6, CUDA 10.2, Python 3.6.9, OpenCV 4.5.1 (CUDA-enabled).

I. Changing the sources

The sources are normally switched during flashing; if yours weren't, see the source-changing section of the flashing article below.
(Note: the TX2 and an Ubuntu PC do not share the same sources.)

"Nvidia TX2 flashing, full walkthrough" (Barrymaster's CSDN blog)

II. Installing PyTorch

PyTorch builds are architecture-specific, so the installation differs between a PC and the TX2.

Here I install straight into the system environment with Python 3, using python3 and pip3 throughout. If you need multiple Python versions to coexist, first look up how to use archiconda (the PC equivalent is anaconda) on the TX2.

1. Download the TX2-specific torch wheel (.whl)

The official download page for Jetson builds is below; make sure the wheel matches your JetPack version (the page also has a very clear install guide):

https://forums.developer.nvidia.com/t/pytorch-for-jetson-version-1-10-now-available/72048

First pick a PyTorch version that suits your TX2's CUDA version. Note: the YOLOv5 v5.0 release I use requires PyTorch ≥ 1.7, so check the required PyTorch version in advance. The correspondence is covered in these two links:

"PyTorch and CUDA version correspondence" (代码帮's CSDN blog)

Previous PyTorch Versions | PyTorch

(Don't install PyTorch with the commands shown there; they are only for checking which CUDA version goes with which PyTorch.)

Next, check that your JetPack and Python versions meet the requirements.


My CUDA version is 10.2, Python is 3.6.9 and JetPack is 4.6, so I settled on PyTorch v1.10.0.
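If you want to confirm these on the board itself, a small check like the following works (a sketch assuming a standard JetPack image, where the L4T release string lives in /etc/nv_tegra_release and nvcc is on the PATH):

# Print the Python, L4T (JetPack) and CUDA versions on the TX2.
import platform
import subprocess

print("Python:", platform.python_version())        # e.g. 3.6.9
with open("/etc/nv_tegra_release") as f:
    print("L4T:", f.readline().strip())            # the L4T release maps to a JetPack version
subprocess.run(["nvcc", "--version"])              # CUDA toolkit version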

On Windows, clicking the whl link starts the download, but on Ubuntu clicking did nothing, so I downloaded it on Windows and copied it to the TX2 on a USB stick.

2. Set up the environment and install PyTorch


Follow the steps under Installation on the official page. The first step means: download the whl from the left-hand URL and rename it to the right-hand file name. Since we downloaded it on Windows, just rename it to torch-1.10.0-cp36-cp36m-linux_aarch64.whl.
(Note: the official example is for v1.8.0 and the left-hand URL points to the 1.8.0 package; don't use it as-is.)

# Step 1: download the wheel on Windows and rename it to torch-1.10.0-cp36-cp36m-linux_aarch64.whl
sudo apt-get install python3-pip libopenblas-base libopenmpi-dev libomp-dev
pip3 install Cython
pip3 install numpy torch-1.10.0-cp36-cp36m-linux_aarch64.whl  # add sudo if you get "Permission denied"

3. Install torchvision

The official post likewise gives the install steps; choose the torchvision version that matches your PyTorch version.


Alternatively, pick the version directly in the --branch tag of the clone command:

sudo apt-get install libjpeg-dev zlib1g-dev libpython3-dev libavcodec-dev libavformat-dev libswscale-dev
pip3 install setuptools  # the official guide omits this step
git clone --branch v0.11.1 https://github.com/pytorch/vision torchvision
cd torchvision

python3 setup.py install  # add sudo if you get "Permission denied"

cd ../
pip3 install 'pillow<7'

4. Test

python3
import torch
import torchvision
print(torch.__version__)
print(torchvision.__version__)
print(torch.cuda.is_available())
exit()  # leave the interpreter with exit()

If the install succeeded, the first two prints show version numbers and the third prints True.

III. Setting up the YOLOv5 environment

Anyone running YOLOv5 on a TX2 has presumably already run it on a PC, so I'll skip downloading YOLOv5 itself. I use the v5.0 release.

This part is full of pitfalls: the flashed system ships with many preinstalled libraries, whose versions are wrong and which are hard to uninstall.

1. YOLOv5 requirements (on a PC, install per this step; on the TX2, skip straight to step 2)

Important: the usual approach is to run YOLOv5's requirements.txt and let it install every dependency automatically. But when I did that on my PC it reinstalled my PyTorch and more, so you would have to filter the file and comment out the already-installed packages. Better to skip requirements entirely and just run the detection command:

python3 detect.py --source data/images/ --weights yolov5s.pt

Then install whichever libraries the error messages report as missing.

thop, seaborn, tqdm and pycocotools seem to be needed.
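Rather than reading tracebacks one at a time, a small sketch can list everything that is still missing (the module list mirrors the requirements below; cv2, PIL and yaml are the import names of opencv-python, Pillow and PyYAML):

# Report which YOLOv5 dependencies are still missing.
import importlib

for mod in ["matplotlib", "numpy", "cv2", "PIL", "yaml", "scipy", "tensorboard",
            "torch", "torchvision", "tqdm", "seaborn", "pandas", "thop", "pycocotools"]:
    try:
        importlib.import_module(mod)
    except ImportError:
        print("missing:", mod)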

The requirements file is shown below (if you insist on installing this way, comment out opencv-python, torch and torchvision):

# pip install -r requirements.txt

# base ----------------------------------------
Cython
matplotlib>=3.2.2
numpy>=1.18.5
opencv-python>=4.1.2
Pillow
PyYAML>=5.3.1
scipy>=1.4.1
tensorboard>=2.2
torch>=1.7.0
torchvision>=0.8.1
tqdm>=4.41.0

# logging -------------------------------------
wandb

# plotting ------------------------------------
seaborn>=0.11.0
pandas

# export --------------------------------------
coremltools>=4.1
onnx>=1.8.1
scikit-learn==0.19.2  # for coreml quantization

# extras --------------------------------------
thop  # FLOPS computation
pycocotools>=2.0  # COCO mAP

Note:
When running YOLOv5's detect.py, a function called check_requirements verifies package versions. Many packages would need very new versions, so I recommend commenting the function out; otherwise you will reinstall a lot of things for no real benefit (if it runs, it's fine).
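In the v5.0 detect.py, the call looks roughly like this (the exact exclude list may differ between releases); commenting out this single line disables the check:

# in detect.py: comment out the version check
# check_requirements(exclude=('pycocotools', 'thop'))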


2. Setting up the YOLOv5 environment on the TX2

On the TX2, install as follows:

sudo apt-get install liblapack-dev
sudo apt-get install libblas-dev
sudo apt-get install gfortran
pip3 install scipy

pip3 install matplotlib pillow pyyaml tensorboard tqdm
# installing seaborn pulls in matplotlib and pandas automatically
pip3 install seaborn

python3 detect.py  # see what's still missing

You may run into a ton of problems:

1. When installing scipy, matplotlib, numpy, etc., the install may fail with: pip3 uninstall failed, outside environment /usr.

Cause: the flashed system preinstalls many libraries, often at versions too old to meet the requirements. These preinstalled libraries live in /usr/lib/python3/dist-packages, which pip cannot uninstall from (pip's own packages normally live in site-packages).

Fix: taking scipy as the example, go to /usr/lib/python3/dist-packages/, find the scipy folder and the scipy .egg-info file (exact names vary slightly), delete both, then run the pip3 install again.
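A cautious sketch of that cleanup for scipy (run with sudo python3; the glob pattern is my assumption to catch both the package folder and its .egg-info):

# Delete the preinstalled scipy from dist-packages so pip3 can install a newer one.
import glob
import os
import shutil

for path in glob.glob("/usr/lib/python3/dist-packages/scipy*"):
    print("removing", path)
    if os.path.isdir(path):
        shutil.rmtree(path)   # the scipy package directory
    else:
        os.remove(path)       # the scipy-x.y.z.egg-info file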

2. Installing scipy fails with: error: library dfftpack has Fortran sources but no Fortran compiler found.

Cause: part of the code is written in Fortran, so gfortran is required.

Fix: install a Fortran compiler:

sudo apt-get install gfortran

Note (same as the note in step 1):
When running YOLOv5's detect.py, comment out the check_requirements version-check function; it serves no purpose here.

3. Using the TX2 onboard camera with YOLOv5

Reference: "YOLOv5 fails to open the NVIDIA Jetson TX2 onboard camera" (G果's CSDN blog): https://blog.csdn.net/weixin_42899627/article/details/115129085

Note: cv2.VideoCapture(0) will not open the onboard camera. You have to replace the 0 with a long GStreamer pipeline string, gst_str, beginning with nvarguscamerasrc. That article modifies dataset.py in YOLOv5's utils folder, after which the onboard camera opens successfully. I also hit a strange issue: python3 detect.py --source 1 could not open my USB camera, but after applying the article's modification, --source 1 worked; I never figured out why.
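For reference, a minimal sketch of such a pipeline string (the resolution and framerate values here are illustrative, not the article's exact string; tune them for your sensor):

# Open the TX2 onboard CSI camera through GStreamer instead of a bare index.
import cv2

gst_str = ("nvarguscamerasrc ! "
           "video/x-raw(memory:NVMM), width=1280, height=720, framerate=30/1 ! "
           "nvvidconv ! video/x-raw, format=BGRx ! "
           "videoconvert ! video/x-raw, format=BGR ! appsink")
cap = cv2.VideoCapture(gst_str, cv2.CAP_GSTREAMER)
print(cap.isOpened())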

IV. TensorRT acceleration

1. Download tensorrtx

I use tensorrtx; the download address (yolov5) is below. Under Tags, pick the release matching the YOLOv5 version you trained with; a mismatch will cause errors.

https://github.com/wang-xinyu/tensorrtx.git


tensorrtx supports acceleration for many models, so the download contains many folders; delete what you don't use. I kept only the yolov5 folder.


Download the official yolov5s.pt for testing (again, the version must match exactly), or, if you're confident, use your own trained weights directly.

2. Model conversion (yolov5s.pt → yolov5s.wts)

tensorrtx has its own official tutorial, which I reproduce here:

1. Install pycuda:

pip3 install pycuda

2. Copy tensorrtx/yolov5/gen_wts.py into the YOLOv5 detection folder downloaded in Part III.

3. Put the official weight file yolov5s.pt, or your own trained model, into the same YOLOv5 detection folder.


4. Run gen_wts.py to generate the .wts file: python3 gen_wts.py -w your_weights.pt -o any_name.wts

python3 gen_wts.py -w yolov5s.pt -o yolov5s.wts

5. Go to the yolov5 folder under tensorrtx, create a build directory and generate the Makefile. If you are using your own weights, read step 8 first.

mkdir build
cd build
cmake ..

make


6. Copy the .wts file from step 4 into tensorrtx/yolov5/build and run the command below to generate the .engine file.
I use yolov5s, so the command ends in s; the format is -s xxxx.wts xxxx.engine s/m/l/x/... (choose the letter for your weight type). You can also omit the final letter: running -s xxxx.wts xxxx.engine prints the valid formats. (Weight type, confidence threshold, etc. are set in yolov5.cpp.)

sudo ./yolov5 -s ../yolov5s.wts yolov5s.engine s


7. Test with the bundled images (there are two in samples):

sudo ./yolov5 -d yolov5s.engine ../samples

You can also test with the bundled Python script:

python yolov5_trt.py

8. Using your own trained weights for detection

# 6. Change CLASS_NUM in yololayer.h to your own class count; the official
#    model uses the COCO dataset, so the default is 80.

# 7. Run make again. (Re-run make every time you change CLASS_NUM.)
make

# 8. Copy the .wts file generated in step 4 into tensorrtx/yolov5.

# 9. Generate the .engine file (I use yolov5s, so it ends with s):
sudo ./yolov5 -s ../yolov5s.wts yolov5s.engine s
# If you trained with custom depth_multiple and width_multiple, write it like this:
sudo ./yolov5 -s ../yolov5.wts yolov5.engine c 0.17 0.25
# tensorrtx for yolov5 5.0 also added the YOLOv5 P6 models:
sudo ./yolov5 -s ../yolov5.wts yolov5.engine s6

Alternatively, follow the original official guide:

tensorrtx/yolov5 at master · wang-xinyu/tensorrtx · GitHub

V. TensorRT with a USB camera

Adapted from: "[TensorRT] Accelerating YOLOv5 with TensorRT on Jetson devices (Xavier, TX2, etc.), a detailed walkthrough" (小祥子ovo's CSDN blog): https://blog.csdn.net/weixin_46716951/article/details/123742902

In the script below, search for cap = cv2.VideoCapture to choose the camera: index 0 is usually the onboard camera and 1 the USB camera.
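If you're not sure which index your USB camera landed on, a quick probe (an illustrative sketch, not part of the original script) settles it:

# Probe the first few capture indices to see which cameras actually open.
import cv2

for i in range(4):
    cap = cv2.VideoCapture(i)
    print("index", i, "opened:", cap.isOpened())
    cap.release()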

"""
An example that uses TensorRT's Python api to make inferences.

"""
import ctypes
import os
import shutil
import random
import sys
import threading
import time
import cv2
import numpy as np
import pycuda.autoinit
import pycuda.driver as cuda
import tensorrt as trt
import torch
import torchvision
import argparse

CONF_THRESH = 0.5
IOU_THRESHOLD = 0.4

def get_img_path_batches(batch_size, img_dir):
    ret = []
    batch = []
    for root, dirs, files in os.walk(img_dir):
        for name in files:
            if len(batch) == batch_size:
                ret.append(batch)
                batch = []
            batch.append(os.path.join(root, name))
    if len(batch) > 0:
        ret.append(batch)
    return ret

def plot_one_box(x, img, color=None, label=None, line_thickness=None):
    """
    description: Plots one bounding box on image img;
                 this function comes from the YOLOv5 project.

    param:
        x:      a box like [x1,y1,x2,y2]
        img:    an OpenCV image object
        color:  color to draw the rectangle, such as (0,255,0)
        label:  str
        line_thickness: int
    return:
        no return
    """
    tl = (
            line_thickness or round(0.002 * (img.shape[0] + img.shape[1]) / 2) + 1
    )  # line/font thickness
    color = color or [random.randint(0, 255) for _ in range(3)]
    c1, c2 = (int(x[0]), int(x[1])), (int(x[2]), int(x[3]))
    cv2.rectangle(img, c1, c2, color, thickness=tl, lineType=cv2.LINE_AA)
    if label:
        tf = max(tl - 1, 1)  # font thickness
        t_size = cv2.getTextSize(label, 0, fontScale=tl / 3, thickness=tf)[0]
        c2 = c1[0] + t_size[0], c1[1] - t_size[1] - 3
        cv2.rectangle(img, c1, c2, color, -1, cv2.LINE_AA)  # filled
        cv2.putText(
            img,
            label,
            (c1[0], c1[1] - 2),
            0,
            tl / 3,
            [225, 255, 255],
            thickness=tf,
            lineType=cv2.LINE_AA,
        )

class YoLov5TRT(object):
    """
    description: A YOLOv5 class that wraps TensorRT ops, preprocess and postprocess ops.
    """

    def __init__(self, engine_file_path):
        # Create a Context on this device,
        self.ctx = cuda.Device(0).make_context()
        stream = cuda.Stream()
        TRT_LOGGER = trt.Logger(trt.Logger.INFO)
        runtime = trt.Runtime(TRT_LOGGER)

        # Deserialize the engine from file
        with open(engine_file_path, "rb") as f:
            engine = runtime.deserialize_cuda_engine(f.read())
        context = engine.create_execution_context()

        host_inputs = []
        cuda_inputs = []
        host_outputs = []
        cuda_outputs = []
        bindings = []

        for binding in engine:
            print('binding:', binding, engine.get_binding_shape(binding))
            size = trt.volume(engine.get_binding_shape(binding)) * engine.max_batch_size
            dtype = trt.nptype(engine.get_binding_dtype(binding))
            # Allocate host and device buffers
            host_mem = cuda.pagelocked_empty(size, dtype)
            cuda_mem = cuda.mem_alloc(host_mem.nbytes)
            # Append the device buffer to device bindings.

            bindings.append(int(cuda_mem))
            # Append to the appropriate list.

            if engine.binding_is_input(binding):
                self.input_w = engine.get_binding_shape(binding)[-1]
                self.input_h = engine.get_binding_shape(binding)[-2]
                host_inputs.append(host_mem)
                cuda_inputs.append(cuda_mem)
            else:
                host_outputs.append(host_mem)
                cuda_outputs.append(cuda_mem)

        # Store
        self.stream = stream
        self.context = context
        self.engine = engine
        self.host_inputs = host_inputs
        self.cuda_inputs = cuda_inputs
        self.host_outputs = host_outputs
        self.cuda_outputs = cuda_outputs
        self.bindings = bindings
        self.batch_size = engine.max_batch_size

    def infer(self, input_image_path):
        threading.Thread.__init__(self)
        # Make self the active context, pushing it on top of the context stack.

        self.ctx.push()
        self.input_image_path = input_image_path
        # Restore
        stream = self.stream
        context = self.context
        engine = self.engine
        host_inputs = self.host_inputs
        cuda_inputs = self.cuda_inputs
        host_outputs = self.host_outputs
        cuda_outputs = self.cuda_outputs
        bindings = self.bindings
        # Do image preprocess
        batch_image_raw = []
        batch_origin_h = []
        batch_origin_w = []
        batch_input_image = np.empty(shape=[self.batch_size, 3, self.input_h, self.input_w])

        input_image, image_raw, origin_h, origin_w = self.preprocess_image(input_image_path)

        batch_origin_h.append(origin_h)
        batch_origin_w.append(origin_w)
        np.copyto(batch_input_image, input_image)
        batch_input_image = np.ascontiguousarray(batch_input_image)

        # Copy input image to host buffer
        np.copyto(host_inputs[0], batch_input_image.ravel())
        start = time.time()
        # Transfer input data  to the GPU.

        cuda.memcpy_htod_async(cuda_inputs[0], host_inputs[0], stream)
        # Run inference.

        context.execute_async(batch_size=self.batch_size, bindings=bindings, stream_handle=stream.handle)
        # Transfer predictions back from the GPU.

        cuda.memcpy_dtoh_async(host_outputs[0], cuda_outputs[0], stream)
        # Synchronize the stream
        stream.synchronize()
        end = time.time()
        # Remove any context from the top of the context stack, deactivating it.

        self.ctx.pop()
        # Here we use the first row of output in that batch_size = 1
        output = host_outputs[0]
        # Do postprocess
        result_boxes, result_scores, result_classid = self.post_process(
            output, origin_h, origin_w)
        # Draw rectangles and labels on the original image
        for j in range(len(result_boxes)):
            box = result_boxes[j]
            plot_one_box(
                box,
                image_raw,
                label="{}:{:.2f}".format(
                    categories[int(result_classid[j])], result_scores[j]
                ),
            )
        return image_raw, end - start

    def destroy(self):
        # Remove any context from the top of the context stack, deactivating it.

        self.ctx.pop()

    def get_raw_image(self, image_path_batch):
        """
        description: Read an image from image path
        """
        for img_path in image_path_batch:
            yield cv2.imread(img_path)

    def get_raw_image_zeros(self, image_path_batch=None):
        """
        description: Ready data for warmup
        """
        for _ in range(self.batch_size):
            yield np.zeros([self.input_h, self.input_w, 3], dtype=np.uint8)

    def preprocess_image(self, input_image_path):
        """
        description: Convert BGR image to RGB,
                     resize and pad it to target size, normalize to [0,1],
                     transform to NCHW format.

        param:
            input_image_path: the input frame (despite the name, an OpenCV image here)
        return:
            image:  the processed image
            image_raw: the original image
            h: original height
            w: original width
        """
        image_raw = input_image_path
        h, w, c = image_raw.shape
        image = cv2.cvtColor(image_raw, cv2.COLOR_BGR2RGB)
        # Calculate width, height and paddings
        r_w = self.input_w / w
        r_h = self.input_h / h
        if r_h > r_w:
            tw = self.input_w
            th = int(r_w * h)
            tx1 = tx2 = 0
            ty1 = int((self.input_h - th) / 2)
            ty2 = self.input_h - th - ty1
        else:
            tw = int(r_h * w)
            th = self.input_h
            tx1 = int((self.input_w - tw) / 2)
            tx2 = self.input_w - tw - tx1
            ty1 = ty2 = 0
        # Resize the image with long side while maintaining ratio
        image = cv2.resize(image, (tw, th))
        # Pad the short side with (128,128,128)
        image = cv2.copyMakeBorder(
            image, ty1, ty2, tx1, tx2, cv2.BORDER_CONSTANT, (128, 128, 128)
        )
        image = image.astype(np.float32)
        # Normalize to [0,1]
        image /= 255.0
        # HWC to CHW format:
        image = np.transpose(image, [2, 0, 1])
        # CHW to NCHW format
        image = np.expand_dims(image, axis=0)
        # Convert the image to row-major order, also known as "C order":
        image = np.ascontiguousarray(image)
        return image, image_raw, h, w

    def xywh2xyxy(self, origin_h, origin_w, x):
        """
        description:    Convert nx4 boxes from [x, y, w, h] to [x1, y1, x2, y2] where xy1=top-left, xy2=bottom-right
        param:
            origin_h:   height of original image
            origin_w:   width of original image
            x:          A boxes tensor, each row is a box [center_x, center_y, w, h]
        return:
            y:          A boxes tensor, each row is a box [x1, y1, x2, y2]
        """
        y = torch.zeros_like(x) if isinstance(x, torch.Tensor) else np.zeros_like(x)
        r_w = self.input_w / origin_w
        r_h = self.input_h / origin_h
        if r_h > r_w:
            y[:, 0] = x[:, 0] - x[:, 2] / 2
            y[:, 2] = x[:, 0] + x[:, 2] / 2
            y[:, 1] = x[:, 1] - x[:, 3] / 2 - (self.input_h - r_w * origin_h) / 2
            y[:, 3] = x[:, 1] + x[:, 3] / 2 - (self.input_h - r_w * origin_h) / 2
            y /= r_w
        else:
            y[:, 0] = x[:, 0] - x[:, 2] / 2 - (self.input_w - r_h * origin_w) / 2
            y[:, 2] = x[:, 0] + x[:, 2] / 2 - (self.input_w - r_h * origin_w) / 2
            y[:, 1] = x[:, 1] - x[:, 3] / 2
            y[:, 3] = x[:, 1] + x[:, 3] / 2
            y /= r_h

        return y

    def post_process(self, output, origin_h, origin_w):
        """
        description: postprocess the prediction
        param:
            output:     A flat tensor like [num_boxes, cx,cy,w,h,conf,cls_id, cx,cy,w,h,conf,cls_id, ...]
            origin_h:   height of original image
            origin_w:   width of original image
        return:
            result_boxes: final boxes, a boxes tensor, each row is a box [x1, y1, x2, y2]
            result_scores: final scores, a tensor, each element is the score corresponding to a box
            result_classid: final classid, a tensor, each element is the classid corresponding to a box
        """
        # Get the number of boxes detected
        num = int(output[0])
        # Reshape to a two-dimensional ndarray
        pred = np.reshape(output[1:], (-1, 6))[:num, :]
        # to a torch Tensor
        pred = torch.Tensor(pred).cuda()
        # Get the boxes
        boxes = pred[:, :4]
        # Get the scores
        scores = pred[:, 4]
        # Get the classid
        classid = pred[:, 5]
        # Choose those boxes that score > CONF_THRESH
        si = scores > CONF_THRESH
        boxes = boxes[si, :]
        scores = scores[si]
        classid = classid[si]
        # Transform bbox from [center_x, center_y, w, h] to [x1, y1, x2, y2]
        boxes = self.xywh2xyxy(origin_h, origin_w, boxes)
        # Do nms
        indices = torchvision.ops.nms(boxes, scores, iou_threshold=IOU_THRESHOLD).cpu()
        result_boxes = boxes[indices, :].cpu()
        result_scores = scores[indices].cpu()
        result_classid = classid[indices].cpu()
        return result_boxes, result_scores, result_classid

class inferThread(threading.Thread):
    def __init__(self, yolov5_wrapper):
        threading.Thread.__init__(self)
        self.yolov5_wrapper = yolov5_wrapper

    def infer(self, frame):
        batch_image_raw, use_time = self.yolov5_wrapper.infer(frame)

        # for i, img_path in enumerate(self.image_path_batch):
        #     parent, filename = os.path.split(img_path)
        #     save_name = os.path.join('output', filename)
        #     # Save image
        #     cv2.imwrite(save_name, batch_image_raw[i])
        # print('input->{}, time->{:.2f}ms, saving into output/'.format(self.image_path_batch, use_time * 1000))
        return batch_image_raw, use_time

class warmUpThread(threading.Thread):
    def __init__(self, yolov5_wrapper):
        threading.Thread.__init__(self)
        self.yolov5_wrapper = yolov5_wrapper

    def run(self):
        batch_image_raw, use_time = self.yolov5_wrapper.infer(self.yolov5_wrapper.get_raw_image_zeros())
        print('warm_up->{}, time->{:.2f}ms'.format(batch_image_raw[0].shape, use_time * 1000))

if __name__ == "__main__":
    # load custom plugins
    parser = argparse.ArgumentParser()
    parser.add_argument('--engine', nargs='+', type=str, default="build/yolov5s.engine", help='.engine path(s)')
    parser.add_argument('--save', type=int, default=0, help='save?')
    opt = parser.parse_args()
    PLUGIN_LIBRARY = "build/libmyplugins.so"
    engine_file_path = opt.engine

    ctypes.CDLL(PLUGIN_LIBRARY)

    # load coco labels

    categories = ["person", "bicycle", "car", "motorcycle", "airplane", "bus", "train", "truck", "boat",
                  "traffic light",
                  "fire hydrant", "stop sign", "parking meter", "bench", "bird", "cat", "dog", "horse", "sheep", "cow",
                  "elephant", "bear", "zebra", "giraffe", "backpack", "umbrella", "handbag", "tie", "suitcase",
                  "frisbee",
                  "skis", "snowboard", "sports ball", "kite", "baseball bat", "baseball glove", "skateboard",
                  "surfboard",
                  "tennis racket", "bottle", "wine glass", "cup", "fork", "knife", "spoon", "bowl", "banana", "apple",
                  "sandwich", "orange", "broccoli", "carrot", "hot dog", "pizza", "donut", "cake", "chair", "couch",
                  "potted plant", "bed", "dining table", "toilet", "tv", "laptop", "mouse", "remote", "keyboard",
                  "cell phone",
                  "microwave", "oven", "toaster", "sink", "refrigerator", "book", "clock", "vase", "scissors",
                  "teddy bear",
                  "hair drier", "toothbrush"]
    # a YoLov5TRT instance
    yolov5_wrapper = YoLov5TRT(engine_file_path)
    cap = cv2.VideoCapture(1)   # select the camera (1 = USB here; the onboard camera needs the gst_str pipeline)
    try:
        thread1 = inferThread(yolov5_wrapper)
        thread1.start()
        thread1.join()
        while 1:
            _, frame = cap.read()
            img, t = thread1.infer(frame)
            fps = 1/t
            imgout = cv2.putText(img, "FPS= %.2f" % (fps), (0, 40), cv2.FONT_HERSHEY_SIMPLEX, 1, (0, 255, 0), 2)
            cv2.imshow("result", imgout)
            if cv2.waitKey(1) & 0XFF == ord('q'):  # 1 millisecond
                break

    finally:
        # destroy the instance
        cap.release()
        cv2.destroyAllWindows()
        yolov5_wrapper.destroy()
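To run the script, save it under any name (e.g. yolov5_trt_cam.py; the file name is my own choice, not part of tensorrtx) into the tensorrtx/yolov5 folder so that build/libmyplugins.so and the default engine path resolve, then launch python3 yolov5_trt_cam.py --engine build/yolov5s.engine and press q in the result window to quit.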

For the onboard camera, see the article below:

"YOLOv5 fails to open the NVIDIA Jetson TX2 onboard camera" (G果's CSDN blog)

cv2.VideoCapture(0) cannot open the TX2 onboard camera, so some modification is required.

References:

"Setting up the YOLOv5 environment on the Jetson Xavier AGX" (航天城拖地的's CSDN blog)

"Jetson Nano + YOLOv5 + TensorRT acceleration + USB camera" (無證騎士's CSDN blog)

"Real-time detection on the Jetson AGX Xavier with TensorRT-accelerated YOLOv5" (围白的尾巴's CSDN blog)

"YOLOv5 notes (3): deploying your own model on edge devices (updated for 5.0)" (Pangaroo's CSDN blog)

"[TensorRT] Accelerating YOLOv5 with TensorRT on Jetson devices (Xavier, TX2, etc.), a detailed walkthrough" (小祥子ovo's CSDN blog)

Original: https://blog.csdn.net/m0_62013374/article/details/125736316
Author: Barrymaster
Title: TX2安装pytorch+TensorRT+yolov5实现实时检测
