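Building your own tusimple-format dataset

May 25, 2023, 1:52 PM • Artificial Intelligence

Notes on creating and annotating a tusimple-format dataset

I have been reading about LaneNet recently, so, following several blog posts found online, I am writing down how I built my own tusimple-format dataset.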

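1. Preparation

Tools: labelme 3.6.12 + Windows 10

(1) First create a virtual environment from the console:

conda create -n yourenvname python==3.6.0

(2) Activate the environment and install labelme from the console (a domestic PyPI mirror is recommended):

conda activate yourenvname
pip install labelme==3.6.12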

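2. Annotation

(1) Launch the tool from the console:

With the environment activated, simply type labelme in the console.

(2) Annotating with points (Point) (tested, works):

Start labelme from the console, choose Edit → Create Point, then click on the image to annotate. Double-click to finish the shape and type in its label.

(3) Annotating with line strips (LineStrip) (tested, works):

Start labelme from the console, choose Edit → Create LineStrip, then click on the image to annotate. Double-click to finish the shape and type in its label.

You end up with a set of .json files, one per annotated image.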
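Before converting anything, it can help to check what each annotation file actually contains. The following is a minimal sketch, not part of the original workflow. It assumes the usual labelme JSON layout (a top-level `shapes` list whose entries carry `label`, `points` and usually `shape_type`); the helper name `summarize_shapes` and the sample data are made up for illustration.

```python
import json
from collections import Counter


def summarize_shapes(annotation):
    """Count annotated shapes per (label, shape_type) pair.

    `annotation` is the dict obtained from json.load() on a labelme file.
    """
    counts = Counter()
    for shape in annotation.get("shapes", []):
        # Assumption: files without shape_type are treated as polygons.
        shape_type = shape.get("shape_type", "polygon")
        counts[(shape["label"], shape_type)] += 1
    return dict(counts)


# Hypothetical annotation with two lanes drawn as line strips.
sample = json.loads("""
{
  "shapes": [
    {"label": "lane1", "shape_type": "linestrip",
     "points": [[100, 700], [400, 400]]},
    {"label": "lane2", "shape_type": "linestrip",
     "points": [[900, 700], [650, 400]]}
  ],
  "imagePath": "0001.png",
  "imageData": null
}
""")

print(summarize_shapes(sample))
```

Running this over every .json file is a cheap way to catch a mistyped label or a shape drawn with the wrong tool before the conversion step.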

PS: So far I have only verified that LaneNet trains normally on data made this way; I do not know yet whether the results are any good.

3. Data format conversion

(1) Convert the json files into a dataset

import argparse
import base64
import json
import os
import os.path as osp
import warnings

import PIL.Image
import yaml
from labelme import utils


def main():
    warnings.warn("This script is aimed to demonstrate how to convert the\n"
                  "JSON file to a single image dataset, and not to handle\n"
                  "multiple JSON files to generate a real-use dataset.")

    parser = argparse.ArgumentParser()
    # Directory that holds the labelme .json files (not a single file).
    parser.add_argument('--json_file', default='D:\\Study\\Video_Frame\\Line\\')
    # Root directory for the results; defaults to the JSON directory.
    parser.add_argument('--out', default=None)
    args = parser.parse_args()

    json_dir = args.json_file
    out_root = args.out if args.out is not None else json_dir
    os.makedirs(out_root, exist_ok=True)

    for file_name in sorted(os.listdir(json_dir)):
        path = osp.join(json_dir, file_name)
        # Skip sub-directories and anything that is not an annotation file.
        if not (osp.isfile(path) and file_name.lower().endswith('.json')):
            continue

        with open(path) as f:
            data = json.load(f)

        if data.get('imageData'):
            imageData = data['imageData']
        else:
            # The image was not embedded, so read it from imagePath.
            imagePath = osp.join(osp.dirname(path), data['imagePath'])
            with open(imagePath, 'rb') as f:
                imageData = base64.b64encode(f.read()).decode('utf-8')
        img = utils.img_b64_to_arr(imageData)

        # Give every label a consecutive integer id; 0 is the background.
        label_name_to_value = {'_background_': 0}
        for shape in data['shapes']:
            label_name = shape['label']
            if label_name not in label_name_to_value:
                label_name_to_value[label_name] = len(label_name_to_value)

        label_values, label_names = [], []
        for ln, lv in sorted(label_name_to_value.items(), key=lambda x: x[1]):
            label_values.append(lv)
            label_names.append(ln)
        assert label_values == list(range(len(label_values)))

        lbl = utils.shapes_to_label(img.shape, data['shapes'], label_name_to_value)
        captions = ['{}: {}'.format(lv, ln)
                    for ln, lv in label_name_to_value.items()]
        lbl_viz = utils.draw_label(lbl, img, captions)

        # One output directory per JSON file, e.g. 0001.json -> 0001_json,
        # created under out_root rather than the current working directory.
        out_dir = osp.join(out_root, file_name.replace('.', '_'))
        os.makedirs(out_dir, exist_ok=True)

        PIL.Image.fromarray(img).save(osp.join(out_dir, 'img.png'))
        utils.lblsave(osp.join(out_dir, 'label.png'), lbl)
        PIL.Image.fromarray(lbl_viz).save(osp.join(out_dir, 'label_viz.png'))

        with open(osp.join(out_dir, 'label_names.txt'), 'w') as f:
            for lbl_name in label_names:
                f.write(lbl_name + '\n')

        warnings.warn('info.yaml is being replaced by label_names.txt')
        info = dict(label_names=label_names)
        with open(osp.join(out_dir, 'info.yaml'), 'w') as f:
            yaml.safe_dump(info, f, default_flow_style=False)

        print('Saved to: %s' % out_dir)


if __name__ == '__main__':
    main()

The result for a dataset annotated with Create LineStrip is shown below; the steps for a dataset annotated with Create Point are exactly the same.
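The conversion script writes five files per annotation: img.png, label.png, label_viz.png, label_names.txt and info.yaml. As a rough sanity check before moving on, something like the sketch below can report which of them are missing from an output directory. The helper name `missing_outputs` is hypothetical, and the demo runs on a throwaway temporary directory rather than real data.

```python
import os
import tempfile

# File names taken from the save calls in the conversion script.
EXPECTED_FILES = ("img.png", "label.png", "label_viz.png",
                  "label_names.txt", "info.yaml")


def missing_outputs(out_dir):
    """Return the expected files that are absent from out_dir."""
    return [name for name in EXPECTED_FILES
            if not os.path.isfile(os.path.join(out_dir, name))]


# Demo on a temporary directory that only contains two of the files.
with tempfile.TemporaryDirectory() as demo_dir:
    for name in ("img.png", "label.png"):
        open(os.path.join(demo_dir, name), "w").close()
    demo_missing = missing_outputs(demo_dir)

print(demo_missing)
```

An empty list means the directory is complete and ready for the next step.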

(2) Convert the dataset to the tusimple dataset format


import os

import cv2
import numpy as np
from skimage import measure, color


def skimageFilter(gray):
    # Binary mask: every labelled (non-zero) pixel becomes 255.
    binary_warped = gray.copy()
    binary_warped[binary_warped > 0] = 255
    # Instance mask: label each 4-connected region of equal value, colour
    # the regions, then collapse the colours back to gray levels.
    labels = measure.label(gray, connectivity=1)
    dst = color.label2rgb(labels, bg_label=0, bg_color=(0, 0, 0))
    instance = cv2.cvtColor(np.uint8(dst * 255), cv2.COLOR_RGB2GRAY)
    return binary_warped, instance


def moveImageTodir(path, targetPath, name):
    if not os.path.isdir(path):
        return None
    image_name = "gt_image/" + str(name) + ".png"
    binary_name = "gt_binary_image/" + str(name) + ".png"
    instance_name = "gt_instance_image/" + str(name) + ".png"
    train_rows = image_name + " " + binary_name + " " + instance_name + "\n"

    # cv2.imwrite does not create missing directories, so make them first.
    for sub_dir in ("gt_image", "gt_binary_image", "gt_instance_image"):
        os.makedirs(os.path.join(targetPath, sub_dir), exist_ok=True)

    origin_img = cv2.imread(os.path.join(path, "img.png"))
    origin_img = cv2.resize(origin_img, (1280, 720))
    cv2.imwrite(os.path.join(targetPath, image_name), origin_img)

    img = cv2.imread(os.path.join(path, "label.png"))
    # Nearest-neighbour keeps the label colours intact; the default bilinear
    # interpolation would blend them along lane edges.
    img = cv2.resize(img, (1280, 720), interpolation=cv2.INTER_NEAREST)
    gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)
    binary_warped, instance = skimageFilter(gray)

    cv2.imwrite(os.path.join(targetPath, binary_name), binary_warped)
    cv2.imwrite(os.path.join(targetPath, instance_name), instance)

    print("success create data name is : ", train_rows)
    return train_rows


if __name__ == "__main__":
    print('-------------- start ----------------')

    with open("./train.txt", 'w') as file:
        for images_dir in os.listdir("./images_line"):
            dir_name = os.path.join("./images_line", images_dir, "annotations")
            if not os.path.isdir(dir_name):
                continue
            for annotations_dir in os.listdir(dir_name):
                json_dir = os.path.join(dir_name, annotations_dir)
                if os.path.isdir(json_dir):
                    # basename works on both Windows and POSIX paths.
                    train_rows = moveImageTodir(json_dir, "./", os.path.basename(json_dir))
                    file.write(train_rows)

The generated result looks like this:

Taking gt_binary_image as an example:
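Each image in this layout is paired with a binary mask (lane versus background) and an instance mask (lanes told apart by gray level). To make the difference concrete, the sketch below builds both from a tiny label array using NumPy only. It is a simplification and not the conversion script itself: it derives gray levels directly from the label ids instead of running connected-component labelling, and the function name `masks_from_labels` and the even spacing of gray levels are assumptions made for this example.

```python
import numpy as np


def masks_from_labels(label_map):
    """Build (binary, instance) uint8 masks from an integer label map.

    label_map: 2-D array where 0 is background and 1..N are lane ids.
    """
    label_map = np.asarray(label_map)
    # Binary mask: any lane pixel is white.
    binary = np.where(label_map > 0, 255, 0).astype(np.uint8)
    # Instance mask: spread lane ids evenly over the gray range so each
    # lane gets its own gray level (assumed scheme, for illustration only).
    n_lanes = int(label_map.max())
    step = 255 // n_lanes if n_lanes > 0 else 0
    instance = (label_map * step).astype(np.uint8)
    return binary, instance


# Toy 3x4 label map with two lanes.
demo = np.array([[0, 1, 0, 2],
                 [0, 1, 0, 2],
                 [1, 0, 0, 2]])
binary, instance = masks_from_labels(demo)
print(binary)
print(instance)
```

In the binary mask both lanes are 255, while in the instance mask lane 1 and lane 2 end up with different gray values, which is what lets the network separate them.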

With the steps above you get your own dataset in the tusimple format and can feed it to LaneNet for training. For that part, see my other post: LaneNet debugging notes.
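Each row of train.txt holds three space-separated paths: the image, its binary mask and its instance mask. A quick check before training is to parse the file and confirm every row has that shape. The sketch below does this on an in-memory string; `parse_train_rows` is a hypothetical helper and the sample paths are invented. It assumes the paths themselves contain no spaces.

```python
def parse_train_rows(text):
    """Parse train.txt content into (image, binary, instance) tuples.

    Raises ValueError when a non-empty row does not have three fields.
    """
    rows = []
    for line_no, line in enumerate(text.splitlines(), start=1):
        if not line.strip():
            continue  # ignore blank lines
        fields = line.split()
        if len(fields) != 3:
            raise ValueError("line %d: expected 3 paths, got %d"
                             % (line_no, len(fields)))
        rows.append(tuple(fields))
    return rows


# Invented sample rows in the same layout the script writes.
demo_text = (
    "gt_image/0001_json.png gt_binary_image/0001_json.png "
    "gt_instance_image/0001_json.png\n"
    "gt_image/0002_json.png gt_binary_image/0002_json.png "
    "gt_instance_image/0002_json.png\n"
)
rows = parse_train_rows(demo_text)
print(len(rows), rows[0][1])
```

For a real file, pass the result of `open("train.txt").read()` instead of `demo_text`.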

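Reference posts:

Creating your own tusimple-format dataset

Producing data in the tusimple dataset format

tusimple lane detection: processing your own dataset and training a model on it

Lane line recognition: an introduction to the tusimple dataset

Converting Labelme annotations to the tusimple dataset format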

This blog is for learning and exchange only. If it infringes on anything, please contact me and I will delete it. Thank you.

Original: https://blog.csdn.net/yyq_163/article/details/119882892
Author: 原点哈哈哈
Title: 制作自己的 tusimple 格式数据集 (Building your own tusimple-format dataset)

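Original articles are protected by copyright. Please cite the source when reposting: https://www.johngo689.com/514438/

Reposted articles are protected by the original author's copyright. Please credit the original author when reposting!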
