Model Regularization in Machine Learning: Ridge Regression and LASSO Regression

June 17, 2023, 1:39 PM • Artificial Intelligence • 94 reads

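Contents

Model Regularization
Ridge Regression
  Using a Polynomial Regression Model
  Using a Ridge Regression Model
LASSO Regression
  Differences Between Ridge Regression and LASSO Regression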

Model Regularization

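A common remedy for excessive variance in machine learning is model regularization. The idea is to constrain the feature coefficients $\theta$ of the polynomial model so that they cannot grow large enough to cause overfitting.
In a linear regression model, the goal is to make the loss function as small as possible:
$$J(\theta)=\sum_{i=1}^{m}\left(y^{(i)}-\theta_0-\theta_1X_1^{(i)}-\dots-\theta_nX_n^{(i)}\right)^2$$
When the model overfits, the coefficients $\theta$ tend to become very large. Suppose we add the following term to the loss function $J(\theta)$:
$$\alpha\frac{1}{2}\sum_{i=1}^{n}\theta_i^2$$
Minimizing the loss now also requires keeping $\theta$ as small as possible. Adding $\alpha\frac{1}{2}\sum_{i=1}^{n}\theta_i^2$ to the loss function is what is meant by adding a regularization term. The sum starts at $i=1$, so the intercept $\theta_0$ is not penalized, and the hyperparameter $\alpha$ controls how strongly large coefficients are penalized.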
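The regularized loss described above can be written as a small function. This is a minimal sketch for illustration only: the name `ridge_loss` and the toy data are assumptions, not part of the original article.

```python
import numpy as np

def ridge_loss(theta, X, y, alpha):
    """Sum of squared errors plus alpha/2 * sum(theta_i^2) for i >= 1.

    theta[0] is the intercept and is left out of the penalty,
    matching the formula in the text.
    """
    residuals = y - (theta[0] + X @ theta[1:])
    data_term = np.sum(residuals ** 2)
    penalty = 0.5 * alpha * np.sum(theta[1:] ** 2)
    return data_term + penalty

# Toy data (assumed): y = 2x exactly, so theta = [0, 2] fits perfectly
X_demo = np.array([[1.0], [2.0], [3.0]])
y_demo = np.array([2.0, 4.0, 6.0])
theta_demo = np.array([0.0, 2.0])

loss_unreg = ridge_loss(theta_demo, X_demo, y_demo, alpha=0.0)  # data term only
loss_reg = ridge_loss(theta_demo, X_demo, y_demo, alpha=1.0)    # adds 0.5 * 1 * 2^2
print(loss_unreg, loss_reg)
```

Even with a perfect fit, the regularized loss is nonzero because the coefficient itself is charged for, which is what pushes the optimizer toward smaller $\theta$.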

Ridge Regression

A model whose loss function includes the regularization term $\alpha\frac{1}{2}\sum_{i=1}^{n}\theta_i^2$ is called ridge regression.
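To see why this penalty keeps the coefficients small, it can help to write out the full ridge objective and its gradient. The derivation below is a sketch in the article's notation; the symbol $\hat{y}^{(i)}$ for the prediction is introduced here only for brevity.

$$J_{\text{ridge}}(\theta)=\sum_{i=1}^{m}\left(y^{(i)}-\hat{y}^{(i)}\right)^2+\frac{\alpha}{2}\sum_{j=1}^{n}\theta_j^2,\qquad \hat{y}^{(i)}=\theta_0+\sum_{j=1}^{n}\theta_jX_j^{(i)}$$

$$\frac{\partial J_{\text{ridge}}}{\partial\theta_j}=-2\sum_{i=1}^{m}\left(y^{(i)}-\hat{y}^{(i)}\right)X_j^{(i)}+\alpha\theta_j,\qquad j=1,\dots,n$$

The extra term $\alpha\theta_j$ pulls each coefficient toward 0 with a strength proportional to its current size, so a larger $\alpha$ means stronger shrinkage. The intercept $\theta_0$ receives no such term.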

Using a Polynomial Regression Model

import numpy as np
import matplotlib.pyplot as plt
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.preprocessing import StandardScaler

# Noisy samples drawn around the line y = 0.5x + 3
x = np.random.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + np.random.normal(0, 1, size=100)
X = x.reshape(-1, 1)

plt.scatter(x, y)
plt.show()

x_train, x_test, y_train, y_test = train_test_split(X, y)

def plt_model(model):
    # Draw the fitted curve on top of the raw data
    plt_x = np.linspace(-3, 3, 100).reshape(100, 1)
    y_predict = model.predict(plt_x)
    plt.scatter(x, y)
    plt.plot(plt_x[:, 0], y_predict)
    plt.show()

def PolynomialRegression(degree):
    # Expand to polynomial features, standardize, then fit ordinary least squares
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("linear_reg", LinearRegression())
    ])

# A degree-20 polynomial is far more flexible than this linear data needs
poly_reg = PolynomialRegression(degree=20)
poly_reg.fit(x_train, y_train)
plt_model(poly_reg)
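One way to put a number on the overfitting visible in the plot is to compare the error on the training set with the error on the held-out test set. The sketch below is an illustration rather than part of the original code: the fixed seed, the helper name `make_poly_pipeline`, and the use of `mean_squared_error` are assumptions added here.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.linear_model import LinearRegression
from sklearn.metrics import mean_squared_error

# Same data-generating process as the article, with a fixed seed (assumed)
rng = np.random.RandomState(666)
x = rng.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + rng.normal(0, 1, size=100)
X = x.reshape(-1, 1)
x_train, x_test, y_train, y_test = train_test_split(X, y, random_state=666)

def make_poly_pipeline(degree):
    # Hypothetical helper mirroring the article's polynomial pipeline
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("linear_reg", LinearRegression()),
    ])

model = make_poly_pipeline(20)
model.fit(x_train, y_train)

# Error on data the model has seen versus data it has not
train_mse = mean_squared_error(y_train, model.predict(x_train))
test_mse = mean_squared_error(y_test, model.predict(x_test))
print(f"train MSE: {train_mse:.3f}, test MSE: {test_mse:.3f}")
```

A test error that is much larger than the training error is the usual symptom of high variance.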

Using a Ridge Regression Model

import numpy as np
import matplotlib.pyplot as plt
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import Ridge

# Noisy samples drawn around the line y = 0.5x + 3
x = np.random.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + np.random.normal(0, 1, size=100)
X = x.reshape(-1, 1)

plt.scatter(x, y)
plt.show()

x_train, x_test, y_train, y_test = train_test_split(X, y)

def plt_model(model):
    # Draw the fitted curve on top of the raw data
    plt_x = np.linspace(-3, 3, 100).reshape(100, 1)
    y_predict = model.predict(plt_x)
    plt.scatter(x, y)
    plt.plot(plt_x[:, 0], y_predict)
    plt.show()

def RidgeRegression(degree, alpha):
    # alpha sets the strength of the L2 penalty and must be passed to Ridge
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("ridge", Ridge(alpha=alpha))
    ])

# alpha=10000 is a very strong penalty; smaller values such as 1 or 100
# are worth trying for comparison
ridge = RidgeRegression(20, 10000)
ridge.fit(x_train, y_train)
plt_model(ridge)

The results show that the choice of the hyperparameter alpha matters a great deal when training a ridge regression model: with a suitable alpha, ridge regression reduces variance very effectively.
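Since the right alpha has to be found by trial, a simple sweep makes the effect visible. The following is a sketch under assumptions: the candidate values in `alphas`, the fixed seed, and the helper name `make_ridge_pipeline` are chosen here for illustration and do not come from the original article.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.linear_model import Ridge
from sklearn.metrics import mean_squared_error

# Same data-generating process as the article, with a fixed seed (assumed)
rng = np.random.RandomState(666)
x = rng.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + rng.normal(0, 1, size=100)
X = x.reshape(-1, 1)
x_train, x_test, y_train, y_test = train_test_split(X, y, random_state=666)

def make_ridge_pipeline(degree, alpha):
    # Hypothetical helper mirroring the article's ridge pipeline
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("ridge", Ridge(alpha=alpha)),
    ])

alphas = [0.01, 1, 100, 10000]  # candidate penalty strengths (assumed)
test_mse = {}
coef_norm = {}
for alpha in alphas:
    model = make_ridge_pipeline(20, alpha)
    model.fit(x_train, y_train)
    test_mse[alpha] = mean_squared_error(y_test, model.predict(x_test))
    # Length of the coefficient vector: shrinks as the penalty grows
    coef_norm[alpha] = np.linalg.norm(model.named_steps["ridge"].coef_)
    print(f"alpha={alpha}: test MSE={test_mse[alpha]:.3f}, "
          f"||theta||={coef_norm[alpha]:.3f}")
```

The coefficient norm falls as alpha rises. The test error typically improves at first and then degrades once the penalty is strong enough to flatten the curve, which is why alpha is usually chosen on held-out data.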

LASSO Regression

LASSO regression differs from ridge regression in that it minimizes the following objective function, which penalizes the absolute values of the coefficients instead of their squares:
$$J(\theta)=\sum_{i=1}^{m}\left(y^{(i)}-\theta_0-\theta_1X_1^{(i)}-\dots-\theta_nX_n^{(i)}\right)^2+\alpha\sum_{i=1}^{n}|\theta_i|$$

import numpy as np
import matplotlib.pyplot as plt
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures
from sklearn.model_selection import train_test_split
from sklearn.preprocessing import StandardScaler
from sklearn.linear_model import Lasso

# Noisy samples drawn around the line y = 0.5x + 3
x = np.random.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + np.random.normal(0, 1, size=100)
X = x.reshape(-1, 1)

x_train, x_test, y_train, y_test = train_test_split(X, y)

def plt_model(model):
    # Draw the fitted curve (in red) on top of the raw data
    plt_x = np.linspace(-3, 3, 100).reshape(100, 1)
    y_predict = model.predict(plt_x)
    plt.scatter(x, y)
    plt.plot(plt_x[:, 0], y_predict, color='r')
    plt.show()

def LassoRegression(degree, alpha):
    # alpha sets the strength of the L1 penalty
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("lasso", Lasso(alpha=alpha))
    ])

lasso = LassoRegression(20, 0.1)
lasso.fit(x_train, y_train)
plt_model(lasso)
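Beyond the plot, the fitted LASSO coefficients can be inspected directly to see how many features survive at a given alpha. The sketch below is illustrative: the candidate values in `alphas`, the fixed seed, the raised `max_iter`, and the helper name `make_lasso_pipeline` are assumptions, not part of the original article.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.linear_model import Lasso

# Same data-generating process as the article, with a fixed seed (assumed)
rng = np.random.RandomState(666)
x = rng.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + rng.normal(0, 1, size=100)
X = x.reshape(-1, 1)
x_train, x_test, y_train, y_test = train_test_split(X, y, random_state=666)

def make_lasso_pipeline(degree, alpha):
    # Hypothetical helper mirroring the article's LASSO pipeline;
    # max_iter is raised so small alphas have room to converge
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree)),
        ("std", StandardScaler()),
        ("lasso", Lasso(alpha=alpha, max_iter=100000)),
    ])

alphas = [0.01, 0.1, 1.0, 10.0]  # candidate penalty strengths (assumed)
nonzero_counts = {}
for alpha in alphas:
    model = make_lasso_pipeline(20, alpha)
    model.fit(x_train, y_train)
    coef = model.named_steps["lasso"].coef_
    # Features whose coefficient was not driven to exactly zero
    nonzero_counts[alpha] = int(np.count_nonzero(coef))
    print(f"alpha={alpha}: {nonzero_counts[alpha]} nonzero coefficients")
```

Once alpha is large enough, every coefficient is set to zero and the model predicts only the intercept, so alpha again has to be tuned rather than simply maximized.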

Differences Between Ridge Regression and LASSO Regression

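With ridge regression the fitted curve rarely becomes a straight line and tends to stay curved, whereas the curve fitted by LASSO regression tends toward a straight line.
Ridge regression pushes every $\theta$ toward 0 without making it exactly 0, so all the higher-order terms keep contributing and the curve stays curved. LASSO tends to set some of the $\theta$ values exactly to 0, which removes those terms from the model, so the fitted curve tends toward a straight line.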
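The difference can be checked by counting how many coefficients each model sets exactly to zero. This is a sketch under assumptions: the alpha values, the fixed seed, and the helper name `make_pipeline_with` are illustrative. It also passes `include_bias=False` to `PolynomialFeatures`, unlike the article's pipelines, so that the constant column (which standardizes to all zeros and always receives a zero coefficient) does not distort the count.

```python
import numpy as np
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import PolynomialFeatures, StandardScaler
from sklearn.model_selection import train_test_split
from sklearn.linear_model import Ridge, Lasso

# Same data-generating process as the article, with a fixed seed (assumed)
rng = np.random.RandomState(666)
x = rng.uniform(-3, 3, size=100)
y = 0.5 * x + 3 + rng.normal(0, 1, size=100)
X = x.reshape(-1, 1)
x_train, x_test, y_train, y_test = train_test_split(X, y, random_state=666)

def make_pipeline_with(regressor, degree=20):
    # Hypothetical helper: same preprocessing for both models,
    # without the constant bias column
    return Pipeline([
        ("poly_reg", PolynomialFeatures(degree, include_bias=False)),
        ("std", StandardScaler()),
        ("reg", regressor),
    ])

ridge_model = make_pipeline_with(Ridge(alpha=1.0)).fit(x_train, y_train)
lasso_model = make_pipeline_with(Lasso(alpha=0.1, max_iter=100000)).fit(x_train, y_train)

ridge_coef = ridge_model.named_steps["reg"].coef_
lasso_coef = lasso_model.named_steps["reg"].coef_

# Count coefficients that are exactly zero in each model
ridge_zeros = int(np.sum(ridge_coef == 0))
lasso_zeros = int(np.sum(lasso_coef == 0))
print(f"ridge: {ridge_zeros} of {ridge_coef.size} coefficients are exactly 0")
print(f"lasso: {lasso_zeros} of {lasso_coef.size} coefficients are exactly 0")
```

Because LASSO discards features in this way, it is sometimes used as a form of feature selection, while ridge keeps every feature with a reduced weight.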

Original: https://blog.csdn.net/weixin_45137294/article/details/124227823
Author: 德乌大青蛙
Title: Model Regularization in Machine Learning: Ridge Regression and LASSO Regression

