Opencv-Python数据增强

2023年7月19日下午5:27 • 人工智能 • 阅读 69

Opencv-Python数据增强

常见的数据增强操作有：按比例放大或缩小图片、旋转、平移、水平翻转、改变图像通道等。

1.按比例放大和缩小

扩展缩放只是改变图像的尺寸大小。OpenCV 提供的函数 cv2.resize()可以实现这个功能。图像的尺寸可以自己手动设置，也可以指定缩放因子。可以选择使用不同的插值方法。在缩放时我们推荐使用 cv2.INTER_AREA，在扩展时我们推荐使用 v2.INTER_CUBIC（慢) 和 v2.INTER_LINEAR。默认情况下所有改变图像尺寸大小的操作使用的插值方法都是 cv2.INTER_LINEAR。


def zoom_down(img,scale):
    img = cv2.resize(img,None,fx= scale,fy= scale,interpolation=cv2.INTER_CUBIC)
    return img

def zoom_up(img,scale):
    img = cv2.resize(img,None,fx= scale,fy= scale,interpolation=cv2.INTER_CUBIC)
    return img

resize库中第二个参数是目标大小，例如如果我想把图片resize成300*300大小的，可以这么写：

img = cv2.resize(img,(300,300))

2.平移图像

可以使用 Numpy 数组构建这个矩阵（数据类型是 np.float32），然后把它传给函数 cv2.warpAffine()。

mat_translation = np.float32([[1, 0, 20], [0, 1, 30]])

例如上面是的矩阵是将图像往水平方向上移动20个像素点，竖直方向上移动30个像素点。

实例：


def translation(img,tx,ty):
    height = img.shape[0]
    width = img.shape[1]
    mat_translation = np.float32([[1, 0, tx], [0, 1, ty]])
    img = cv2.warpAffine(img, mat_translation, (width + tx, height + ty))
    return img

我这里封装的tx和ty分别为水平和竖直方向需要移动的像素点数。

3.旋转图像

OpenCV 提供了一个函数： cv2.getRotationMatrix2D


def rotation(img,angle,scale):
    rows = img.shape[0]
    cols = img.shape[1]

    M = cv2.getRotationMatrix2D((cols / 2, rows / 2), angle, scale)
    img = cv2.warpAffine(img, M, (cols, rows))
    return img

4.镜像变换

Opencv提供了cv2.flip()函数，可以第二个参数为1时为水平翻转，为0时垂直翻转。为了后面调用方便，我还是自己封装了一下。


def mirror(img,mode):
    img = cv2.flip(img, mode)
    return img

5.添加椒盐噪声

椒盐噪声为纯黑或纯白的像素点，随机生成。


def spiced_salt_noise(img,prob):
    output = np.zeros(img.shape,np.uint8)
    thres = 1 - prob
    for i in range(img.shape[0]):
        for j in range(img.shape[1]):
            rdn = random.random()
            if rdn < prob:
                output[i][j] = 0
            elif rdn > thres:
                output[i][j] = 255
            else:
                output[i][j] = img[i][j]
    return output

6.添加高斯噪声

与椒盐噪声不同，高斯噪声是彩色的，方差越大时噪声越大。


def gasuss_noise(image, mean = 0, var = 0.01):
    '''
        添加高斯噪声
        mean : 均值
        var : 方差，方差越大越模糊
    '''
    image = np.array(image/255, dtype=float)
    noise = np.random.normal(mean, var ** 0.5, image.shape)
    out = image + noise
    if out.min() < 0:
        low_clip = -1.

    else:
        low_clip = 0.

    out = np.clip(out, low_clip, 1.0)
    out = np.uint8(out*255)
    return out

7.模糊化

将图片模糊或平滑有多种算法，例如高斯模糊、中值模糊、均值模糊等，我这里使用一个比较普通的cv2.blur()实现。同样也是先封装方便我后面调用。


def blur(img,scale):
    img = cv2.blur(img,(scale,scale))
    return img

这里的scale其实就是滤波器的尺寸，一般取奇数，scale越大越模糊，

8.重新组合颜色通道

在opencv中，图像的通道顺序为BGR，也就是蓝绿红，可以改变成其他顺序以得到不同的效果。


def change_channel(img):
    b = cv2.split(img)[0]
    g = cv2.split(img)[1]
    r = cv2.split(img)[2]
    brg = cv2.merge([b, r, g])
    return brg

实例

我有以下几张测试图片：

我希望随机地对这些图片进行一些变换，最终执行结果如下：

可以看到程序对我的图片随机进行了各种变换，我这里只是一次变换，读者也可以尝试对图片同时进行多种变换。

本次程序如下：


import numpy as np
import cv2
import random
import os
import sys

def zoom_down(img, scale):
    img = cv2.resize(img, None, fx=scale, fy=scale, interpolation=cv2.INTER_CUBIC)
    return img

def zoom_up(img, scale):
    img = cv2.resize(img, None, fx=scale, fy=scale, interpolation=cv2.INTER_CUBIC)
    return img

def translation(img, tx, ty):
    height = img.shape[0]
    width = img.shape[1]
    mat_translation = np.float32([[1, 0, tx], [0, 1, ty]])
    img = cv2.warpAffine(img, mat_translation, (width + tx, height + ty))
    return img

def rotation(img, angle, scale):
    rows = img.shape[0]
    cols = img.shape[1]

    M = cv2.getRotationMatrix2D((cols / 2, rows / 2), angle, scale)
    img = cv2.warpAffine(img, M, (cols, rows))
    return img

def mirror(img, mode):
    img = cv2.flip(img, mode)
    return img

def spiced_salt_noise(img, prob):
    output = np.zeros(img.shape, np.uint8)
    thres = 1 - prob
    for i in range(img.shape[0]):
        for j in range(img.shape[1]):
            rdn = random.random()
            if rdn < prob:
                output[i][j] = 0
            elif rdn > thres:
                output[i][j] = 255
            else:
                output[i][j] = img[i][j]
    return output

def blur(img, scale):
    img = cv2.blur(img, (scale, scale))
    return img

def gasuss_noise(image, mean=0, var=0.01):
    '''
        添加高斯噪声
        mean : 均值
        var : 方差，方差越大越模糊
    '''
    image = np.array(image / 255, dtype=float)
    noise = np.random.normal(mean, var ** 0.5, image.shape)
    out = image + noise
    if out.min() < 0:
        low_clip = -1.

    else:
        low_clip = 0.

    out = np.clip(out, low_clip, 1.0)
    out = np.uint8(out * 255)
    return out

def change_channel(img):
    b = cv2.split(img)[0]
    g = cv2.split(img)[1]
    r = cv2.split(img)[2]
    brg = cv2.merge([b, r, g])
    return brg

def Data_Augument():
    for i in images_list:
        img = cv2.imread(image_dir+i)
        cv2.imshow('img',img)
        functions = [('zoom_down', [img, 0.8]),
                     ('zoom_up', [img, 1.2]),
                     ('translation', [img, 20, 30]),
                     ('rotation', [img, 15, 0.9]),
                     ('mirror', [img, 1]),
                     ('spiced_salt_noise', [img, 0.01]),
                     ('blur', [img, 5]),
                     ('gasuss_noise', [img, 0, 0.01]),
                     ('change_channel', [img])]
        choice = random.choice(functions)
        this_module = sys.modules[__name__]

        res = getattr(this_module, choice[0])(*choice[1])
        cv2.imwrite(output_dir + i, res)

if __name__ == '__main__':
    image_dir = './test/'
    images_list = os.listdir(image_dir)
    nums = len(os.listdir(image_dir))
    print('found %d pictures' % nums)
    output_dir = './output/'
    Data_Augument()
    print('finished!')

总结

还有其他很多的数据增强操作，例如随机裁剪图像、添加颜色扰动等等。另外也有其他库可以进行这些操作，例如Keras中的图片预处理process库。我这种是离线式的，希望能将变换后的图片保存下来。

参考文献

1.Opencv中文官方文档：http://woshicver.com/

2.https://www.cnblogs.com/lfri/p/10627595.html

Original: https://blog.csdn.net/Aiden_yan/article/details/123004345
Author: 三个臭皮姜
Title: Opencv-Python数据增强

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/703291/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

测试面试被问“期望薪资多少”，不要傻傻直接报价，高情商都这样说

对于软件测试从业者而言，面试很重要，因为那是拿到薪资报酬丰厚程度的关键，你的理论及实操经验确实都很棒，那就尽量别让自己的面试表现拖自己的后腿，否则大概率会让你的薪水大打折扣。你在…

人工智能 2023年7月29日
0060
python连接并简单操作SQLserver数据库

python连接并简单操作SQLserver数据库实验环境： python版本3.9 Python 3.9.7 (tags/v3.9.7:1016ef3, Aug 30 2021…

人工智能 2023年7月4日
0071
darknet_ros部署yolov3

darknet_ros部署yolov3 简单记录一下基于ros运行yolov3做交通标志(LISA数据集)识别的历程 1.创建工作空间 $ mkdir –p catkin_work…

人工智能 2023年7月20日
0059
3.搭建分类器-pytorch与自然语言处理

课程链接：Python人工智能20个小时玩转NLP自然语言处理【黑马程序员】_哔哩哔哩_bilibili 目的：对不同的输入图像进行识别并分类。采用CIFAR10数据集，进行单分类…

人工智能 2023年5月30日
0073
C++继承关系和复合关系

我们今天来讲一下类和类之间的关系,在类里面,分为了三种关系: 没有任何关系继承关系(派生) 复合关系(类似于封闭类) 继承：”是”关系。 – 基类 A…

人工智能 2023年6月28日
0097
python库安装中Microsoft Visual C++ is required解决方法

在用pycharm过程中，用pip去安装一些第三方包的时候会出现如下错误，缺少C++编译器，因为有些程序需要使用，没有C++接口会报错，查阅相关资料及自己的解决方案 error: …

人工智能 2023年6月4日
0087
YOLOv7 Tensorrt Python部署教程

B站教学视频 https://www.bilibili.com/video/BV1q34y1n7Bw/ Github仓库地址 https://github.com/Monday-L…

人工智能 2023年7月5日
0069
matlab 回归

我发现这两天写题目，回归真的是个万能方法，但是我只会最简单的线性回归，为此特地记录一下以下几种方法： 1）：regress 简单线性回归，可以是一元，也可以是多元，具体用法可以看这…

人工智能 2023年6月18日
0087
复现KGAT: Knowledge Graph Attention Network for Recommendation（一）

复现KGAT: Knowledge Graph Attention Network for Recommendation（一）该系列博客应该会有两部分，一部分是读论文，一部分是代…

人工智能 2023年6月10日
0092
Pandas知识点-连接操作concat

Pandas知识点-连接操作concat Pandas提供了多种将Series、DataFrame对象合并的功能，有concat(), merge(), append(), joi…

人工智能 2023年7月7日
00102
bert 句向量的各向异性问题及与对比学习的联系

本文主要介绍了为什么基于bert产出的句向量，在语义相似相关的任务上表现较差的原因及相关解释（各向异性，表示退化，锥形空间），另外介绍了simcse 中论述的对比学习与各…

人工智能 2023年5月28日
0084
CMeKG代码解读(以项目为导向从零开始学习知识图谱)（三）

evaluate(): run_train(): load_model(): get_triples(): evaluate(): def evaluate(data, is_pr…

人工智能 2023年6月1日
0079
创建一个像人类一样的神经网络来诊断肺癌

肺癌是人类的主要癌症杀手，它带走的生命甚至超过乳腺癌、结肠癌和前列腺癌的总和，和其他所有种类的癌症一样，肺部结节是癌症的征兆。医生需通过CT扫描肺部来确定肺中是否有结节。单个患者…

人工智能 2023年7月14日
0067
分类模型效果评价

通常使用的分类模型包括Rpart决策树、Ctree决策树、Random Forest随机森林、Logistics回归等。这些模型通常利用准确率、精确率、召回率、F值和ROC面积等…

人工智能 2023年7月2日
0085
softmax回归的简洁实现

我们发现(通过深度学习框架的高级API能够使实现) (softmax) 线性(回归变得更加容易)。同样，通过深度学习框架的高级API也能更方便地实现softmax回归模型。本节…

人工智能 2023年6月17日
0075
安装PyTorch详细过程

安装PyTorch过程安装anaconda 环境管理 PyTorch安装检验安装安装anaconda 登录anaconda的官网下载，anaconda是一个集成的工具软件不需…

人工智能 2023年7月4日
0091

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31

Opencv-Python数据增强

1.按比例放大和缩小

2.平移图像

3.旋转图像

4.镜像变换

5.添加椒盐噪声

6.添加高斯噪声

7.模糊化

8.重新组合颜色通道

实例

总结

参考文献

大家都在看