Python Image Skew Correction

May 26, 2023, 6:06 AM • Artificial Intelligence

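Preface

Image correction here means restoring a photo that was taken at a tilt to a level orientation. The general approach is:

Use the Canny operator to detect the edge contours in the image;
Use the Hough line transform to detect all straight lines in the image;
Keep the lines that are close to horizontal and compute the average of their tilt angles;
Rotate the image by that tilt angle to correct it;
Output the image.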
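
Step 3 of the list above can be sketched in plain Python. This is a minimal illustration rather than the article's own code: the function name mean_skew_degrees and the max_abs_deg cutoff are assumptions made for the example. It relies on the fact that the Hough transform reports theta as the angle of a line's normal, so the line's tilt from horizontal is theta minus 90 degrees.

```python
import math

def mean_skew_degrees(lines, max_abs_deg=30.0):
    """Average tilt (in degrees) of the near-horizontal lines.

    `lines` is an iterable of (rho, theta) pairs with theta in radians,
    the parameterization used by the Hough line transform.
    max_abs_deg is a hypothetical cutoff chosen for this sketch.
    """
    angles = []
    for _rho, theta in lines:
        # theta is the normal's angle, so the line itself is tilted by theta - 90 degrees.
        tilt = math.degrees(theta) - 90.0
        if abs(tilt) <= max_abs_deg:
            angles.append(tilt)
    # With no usable line, report zero tilt so the image is left unrotated.
    return sum(angles) / len(angles) if angles else 0.0

sample = [(100.0, math.radians(92)), (50.0, math.radians(94)), (10.0, math.radians(10))]
print(mean_skew_degrees(sample))  # the third line is far from horizontal and is ignored
```

Returning zero when no line qualifies mirrors the count == 0 branch in the full code later in the article.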

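This involves several topics:
Canny operator
Theory: Digital Image Processing (20): Edge Detection Operators (Canny Operator)
The cv2.Canny function: OpenCV-Python Tutorial (8. Canny Edge Detection)
edge = cv2.Canny(image, threshold1, threshold2[, edges[, apertureSize[, L2gradient ]]])

Parameter image: the image in which edges are detected.

Smaller values of threshold1 and threshold2 capture more edge information. The canny_threshold(self, img_path) function shown later visualizes the effect of different thresholds.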

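Hough transform
Theory: The Hough Transform: A Remarkable Feature Extraction Algorithm
The cv2.HoughLines function: Daily Practice P9: Image Processing with Python and OpenCV (HoughLines)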
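
The Hough line transform describes each line by (rho, theta): rho is the distance from the origin to the line, and theta is the angle of the line's normal. The sketch below shows how such a pair maps back to two points on the line. The function name polar_to_endpoints and the half_length parameter are assumptions for illustration, and it uses round where the article's code uses int.

```python
import math

def polar_to_endpoints(rho, theta, half_length=1000):
    """Convert a Hough line (rho, theta) into two points on that line."""
    a, b = math.cos(theta), math.sin(theta)
    # (x0, y0) is the foot of the perpendicular from the origin to the line.
    x0, y0 = a * rho, b * rho
    # (-b, a) is a unit vector along the line; step half_length pixels each way.
    p1 = (round(x0 - half_length * b), round(y0 + half_length * a))
    p2 = (round(x0 + half_length * b), round(y0 - half_length * a))
    return p1, p2

# theta = 90 degrees: the normal lies along the y axis, so the line is horizontal at y = rho.
print(polar_to_endpoints(100, math.pi / 2))
# theta = 0: the normal lies along the x axis, so the line is vertical at x = rho.
print(polar_to_endpoints(50, 0.0))
```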

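Other
Python 2 math.degrees() function
Python scipy.ndimage.rotate usage and code examples (this function rotates counterclockwise)
Deriving the coordinate rotation formula with vectors (Approach 1)
arctan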
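
The math helpers listed above fit together as follows. This is a small sketch under stated assumptions: rotate_vector is a hypothetical helper written for this example, and it applies the standard 2D rotation formula in a coordinate system whose y axis points up.

```python
import math

def rotate_vector(vec, angle_deg):
    """Rotate a 2D vector counterclockwise (y axis up) by angle_deg degrees."""
    theta = math.radians(angle_deg)  # trig functions take radians, not degrees
    x, y = vec
    return (x * math.cos(theta) - y * math.sin(theta),
            x * math.sin(theta) + y * math.cos(theta))

# math.atan returns radians; math.degrees converts the result to degrees.
slope = 1.0
angle = math.degrees(math.atan(slope))
print(angle)                            # tilt of a line with slope 1, in degrees
print(rotate_vector((0.0, 1.0), 90.0))  # approximately (-1, 0)
```

Forgetting the degree-to-radian conversion is an easy mistake here, because math.degrees produces degrees while math.cos and math.sin expect radians.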

Code

Before using the code, be sure to adjust the Canny thresholds to suit your actual images!

import cv2
import math
import numpy as np
from scipy import ndimage

class HorizontalCorrection:
    def __init__(self):
        self.rotate_vector = np.array([0, 1])
        self.rotate_theta = 0

    def process(self, img):
        img = cv2.imread(img)
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

        edges = cv2.Canny(gray, 350, 400, apertureSize=3)
        cv2.imwrite('./test result/edges.png', edges)

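        # Standard Hough transform: each detected line is returned as (rho, theta).
        lines = cv2.HoughLines(edges, 1, np.pi / 180, 120)
        angle_sum = 0
        count = 0
        # cv2.HoughLines returns None when no line reaches the vote threshold.
        if lines is None:
            lines = []
        for i in range(len(lines)):
            for rho, theta in lines[i]:
                a = np.cos(theta)
                b = np.sin(theta)
                # (x0, y0) is the point on the line closest to the origin.
                x0 = a * rho
                y0 = b * rho
                # Extend 1000 px along the line in both directions to get two endpoints.
                x1 = int(x0 + 1000 * (-b))
                y1 = int(y0 + 1000 * (a))
                x2 = int(x0 - 1000 * (-b))
                y2 = int(y0 - 1000 * (a))

                # Skip vertical lines, whose slope is undefined.
                if x2 != x1:
                    t = float(y2 - y1) / (x2 - x1)
                    # Keep near-horizontal lines only (|slope| <= pi / 5, about 32 degrees).
                    if t <= np.pi / 5 and t >= - np.pi / 5:
                        rotate_angle = math.degrees(math.atan(t))
                        angle_sum += rotate_angle
                        count += 1
                        # Draw the accepted line in red; it also shows up in the rotated output.
                        cv2.line(img, (x1, y1), (x2, y2), (0, 0, 255), 2)

        # Average tilt of the accepted lines; leave the image unrotated if none were found.
        if count == 0:
            avg_rotate_angle = 0
        else:
            avg_rotate_angle = angle_sum / count
        self.rotate_img = ndimage.rotate(img, avg_rotate_angle)

        self.rotate_theta = avg_rotate_angle
        self.count_rotate_vector()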

    def count_rotate_vector(self):
        # rotate_theta is stored in degrees; NumPy trig functions expect radians.
        theta = np.radians(self.rotate_theta)
        v1_new = (self.rotate_vector[0] * np.cos(theta)) - \
                 (self.rotate_vector[1] * np.sin(theta))
        v2_new = (self.rotate_vector[1] * np.cos(theta)) + \
                 (self.rotate_vector[0] * np.sin(theta))
        self.rotate_vector = np.array([v1_new, v2_new])

    def manual_set_rotate_vector(self, rotate_theta):
        self.rotate_theta = rotate_theta
        self.count_rotate_vector()

    def canny_threshold(self, img_path):
        img_original = cv2.imread(img_path)

        cv2.namedWindow('Canny')

        def nothing(x):
            pass

        cv2.createTrackbar('threshold1','Canny',50,400,nothing)
        cv2.createTrackbar('threshold2','Canny',100,400,nothing)
        while(1):

            threshold1=cv2.getTrackbarPos('threshold1','Canny')
            threshold2=cv2.getTrackbarPos('threshold2','Canny')

            img_edges=cv2.Canny(img_original,threshold1,threshold2)

            cv2.imshow('original',img_original)
            cv2.imshow('Canny',img_edges)
            if cv2.waitKey(1)==ord('q'):
                break
        cv2.destroyAllWindows()

if __name__ == '__main__':
    horizontal_correction = HorizontalCorrection()

    horizontal_correction.process(r'./test image/IMG_6386.JPG')
    print(horizontal_correction.rotate_theta)
    cv2.imwrite('./test result/1.png', horizontal_correction.rotate_img)
    cv2.imshow('rotate', horizontal_correction.rotate_img)
    cv2.waitKey()

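Result image

As the figure shows, the Hough transform corrects the image using the horizontal lines of the railing.
Easter egg: friends from Qianyuan, let me see your hands.

Supplement: adjusting the Canny thresholds with trackbars

I saw this on a blog a while ago but can no longer find it, so here is the code for now.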

    def canny_threshold(self, img_path):
        img_original = cv2.imread(img_path)

        cv2.namedWindow('Canny')

        def nothing(x):
            pass

        cv2.createTrackbar('threshold1', 'Canny', 50, 400, nothing)
        cv2.createTrackbar('threshold2', 'Canny', 100, 400, nothing)
        while True:

            threshold1 = cv2.getTrackbarPos('threshold1', 'Canny')
            threshold2 = cv2.getTrackbarPos('threshold2', 'Canny')

            img_edges = cv2.Canny(img_original, threshold1, threshold2)

            cv2.imshow('original', img_original)
            cv2.imshow('Canny', img_edges)
            if cv2.waitKey(1) == ord('q'):
                break
        cv2.destroyAllWindows()

Other

The way the return value of cv2.HoughLines is handled has been changed.
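
For context, in recent OpenCV versions cv2.HoughLines returns an array of shape (N, 1, 2), or None when no line is found, so each entry wraps a single (rho, theta) row. The sketch below shows one way to iterate over such a result. The helper name iter_rho_theta and the hand-made array are assumptions for illustration, not part of OpenCV or of the article's code.

```python
import numpy as np

def iter_rho_theta(lines):
    """Yield (rho, theta) as Python floats from a HoughLines-style result."""
    if lines is None:  # nothing was detected
        return
    # Flatten the (N, 1, 2) array into N rows of (rho, theta).
    for rho, theta in np.asarray(lines, dtype=np.float64).reshape(-1, 2):
        yield float(rho), float(theta)

# Hand-made stand-in for a cv2.HoughLines result (values are illustrative).
fake_lines = np.array([[[120.0, 1.5]], [[80.0, 1.625]]], dtype=np.float32)
print(fake_lines.shape)
print(list(iter_rho_theta(fake_lines)))
print(list(iter_rho_theta(None)))
```

Handling None explicitly avoids a TypeError when the edge image contains no line that reaches the vote threshold.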

import cv2
import math
import numpy as np
from scipy import ndimage

class HorizontalCorrection:
    def __init__(self):
        self.rotate_vector = np.array([0, 1])
        self.rotate_theta = 0

    def process(self, img):
        img = cv2.imread(img)
        gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)

        edges = cv2.Canny(gray, 350, 400, apertureSize=3)
        cv2.imwrite('./test result/edges.png', edges)

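        # Standard Hough transform: each detected line is returned as (rho, theta).
        lines = cv2.HoughLines(edges, 1, np.pi / 180, 120)
        angle_sum = 0
        count = 0
        # cv2.HoughLines returns None when no line reaches the vote threshold.
        if lines is None:
            lines = []
        for r_theta in lines:
            # Each entry has shape (1, 2); unpack its single (rho, theta) row.
            arr = np.array(r_theta[0], dtype=np.float64)
            rho, theta = arr
            a = np.cos(theta)
            b = np.sin(theta)
            # (x0, y0) is the point on the line closest to the origin.
            x0 = a * rho
            y0 = b * rho
            # Extend 1000 px along the line in both directions to get two endpoints.
            x1 = int(x0 + 1000 * (-b))
            y1 = int(y0 + 1000 * (a))
            x2 = int(x0 - 1000 * (-b))
            y2 = int(y0 - 1000 * (a))

            # Skip vertical lines, whose slope is undefined.
            if x2 != x1:
                t = float(y2 - y1) / (x2 - x1)
                # Keep near-horizontal lines only (|slope| <= pi / 5, about 32 degrees).
                if t <= np.pi / 5 and t >= - np.pi / 5:
                    rotate_angle = math.degrees(math.atan(t))
                    angle_sum += rotate_angle
                    count += 1
                    # Draw the accepted line in red; it also shows up in the rotated output.
                    cv2.line(img, (x1, y1), (x2, y2), (0, 0, 255), 2)

        # Average tilt of the accepted lines; leave the image unrotated if none were found.
        if count == 0:
            avg_rotate_angle = 0
        else:
            avg_rotate_angle = angle_sum / count
        self.rotate_img = ndimage.rotate(img, avg_rotate_angle)

        self.rotate_theta = avg_rotate_angle
        self.count_rotate_vector()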

    def count_rotate_vector(self):
        # rotate_theta is stored in degrees; NumPy trig functions expect radians.
        theta = np.radians(self.rotate_theta)
        v1_new = (self.rotate_vector[0] * np.cos(theta)) - \
                 (self.rotate_vector[1] * np.sin(theta))
        v2_new = (self.rotate_vector[1] * np.cos(theta)) + \
                 (self.rotate_vector[0] * np.sin(theta))
        self.rotate_vector = np.array([v1_new, v2_new])

    def manual_set_rotate_vector(self, rotate_theta):
        self.rotate_theta = rotate_theta
        self.count_rotate_vector()

    def canny_threshold(self, img_path):
        img_original = cv2.imread(img_path)

        cv2.namedWindow('Canny')

        def nothing(x):
            pass

        cv2.createTrackbar('threshold1','Canny',50,400,nothing)
        cv2.createTrackbar('threshold2','Canny',100,400,nothing)
        while(1):

            threshold1=cv2.getTrackbarPos('threshold1','Canny')
            threshold2=cv2.getTrackbarPos('threshold2','Canny')

            img_edges=cv2.Canny(img_original,threshold1,threshold2)

            cv2.imshow('original',img_original)
            cv2.imshow('Canny',img_edges)
            if cv2.waitKey(1)==ord('q'):
                break
        cv2.destroyAllWindows()

if __name__ == '__main__':
    horizontal_correction = HorizontalCorrection()

    horizontal_correction.process(r'./test image/IMG_6386.JPG')
    print(horizontal_correction.rotate_theta)
    cv2.imwrite('./test result/1.png', horizontal_correction.rotate_img)
    cv2.imshow('rotate', horizontal_correction.rotate_img)
    cv2.waitKey()

Original: https://blog.csdn.net/weixin_42442319/article/details/124596103
Author: 橙橙小狸猫
Title: Python Image Skew Correction

