In logistic regression, the sigmoid function is typically used to map input data to a probability value. The commonly used sigmoid function is

December 31, 2023, 4:14 AM • Artificial Intelligence

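Introduction

Logistic regression typically uses the sigmoid function to map input data to a probability value. The sigmoid function is a common activation function that maps any real number to a value between 0 and 1. In logistic regression, this value is interpreted as the predicted probability of the positive class in a binary classification problem.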

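Algorithm Principle

Logistic regression computes the same linear combination of input features used in linear regression, then passes the result through the sigmoid function to obtain a probability. The sigmoid function is defined as: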
$$
\sigma(z) = \frac{1}{1+e^{-z}}
$$
where $z$ is the result of the linear combination.
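As a quick check of the definition above, the following minimal sketch evaluates the sigmoid at a few sample points. The input values are chosen only for illustration.

```python
import numpy as np

def sigmoid(z):
    """Logistic sigmoid: 1 / (1 + exp(-z))."""
    return 1.0 / (1.0 + np.exp(-z))

# Illustrative inputs (not from the article)
z = np.array([-5.0, -1.0, 0.0, 1.0, 5.0])
p = sigmoid(z)

print(p)                                   # values strictly between 0 and 1, increasing with z
print(sigmoid(0.0))                        # 0.5
print(np.allclose(sigmoid(-z), 1.0 - p))   # True: sigma(-z) = 1 - sigma(z)
```

Large negative inputs give values close to 0, large positive inputs give values close to 1, and $z = 0$ gives exactly 0.5, which is why 0.5 is a natural decision threshold.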

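Formula Derivation

In logistic regression, we start from the following linear model:
$$
z = w_0 + w_1x_1 + w_2x_2 + \dots + w_nx_n
$$
where $z$ is the result of the linear combination, $w_0, w_1, w_2, \dots, w_n$ are the model parameters, and $x_1, x_2, \dots, x_n$ are the input features.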

We want to convert the output of the linear model into a probability between 0 and 1. To do this, we apply the sigmoid function:
$$
\hat{y} = \sigma(z) = \frac{1}{1+e^{-z}}
$$
where $\hat{y}$ is the predicted probability.
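The two formulas above can be traced for a single sample. This is a minimal sketch with two features; the parameter and feature values are hypothetical and chosen only so the arithmetic is easy to follow.

```python
import numpy as np

# Hypothetical parameters and one input sample (illustrative values only)
w0 = -1.0                     # intercept w_0
w = np.array([2.0, -0.5])     # weights w_1, w_2
x = np.array([1.5, 2.0])      # features x_1, x_2

z = w0 + np.dot(w, x)               # z = w_0 + w_1*x_1 + w_2*x_2
y_hat = 1.0 / (1.0 + np.exp(-z))    # predicted probability of class 1

print(z)        # 1.0
print(y_hat)    # roughly 0.73
```

Because $\hat{y}$ is above 0.5 here, a classifier using the usual 0.5 threshold would assign this sample to class 1.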

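Computation Steps

1. Initialize the model parameters $w_0, w_1, w_2, \dots, w_n$.
2. For each training sample $(x_1, x_2, \dots, x_n)$, compute the linear combination $z$.
3. Convert $z$ into a probability $\hat{y}$ with the sigmoid function.
4. Compute the loss from the predicted probability $\hat{y}$ and the true label $y$.
5. Update the parameters $w_0, w_1, w_2, \dots, w_n$ using gradient descent or another optimization algorithm.
6. Repeat steps 2-5 until a convergence criterion is met or the maximum number of iterations is reached.
7. Use the trained model to make predictions.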
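The steps above do not spell out the loss or the update rule. The code example in this article uses the binary cross-entropy loss with plain gradient descent, which can be written as follows. The symbols $m$ (number of training samples), the superscript $(i)$ (sample index), and $\alpha$ (learning rate) are notation introduced here for illustration.

$$
J(w) = -\frac{1}{m}\sum_{i=1}^{m}\left[y^{(i)}\log\hat{y}^{(i)} + \left(1-y^{(i)}\right)\log\left(1-\hat{y}^{(i)}\right)\right]
$$
$$
\frac{\partial J}{\partial w_j} = \frac{1}{m}\sum_{i=1}^{m}\left(\hat{y}^{(i)} - y^{(i)}\right)x_j^{(i)}, \qquad
\frac{\partial J}{\partial w_0} = \frac{1}{m}\sum_{i=1}^{m}\left(\hat{y}^{(i)} - y^{(i)}\right)
$$
$$
w_j \leftarrow w_j - \alpha\,\frac{\partial J}{\partial w_j}
$$

In the code, the intercept $w_0$ is stored separately as `b`, and the two gradients correspond to `dw` and `db`.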

Full Python Code Example

Below is an example implementation of a logistic regression model in Python:

import numpy as np
import matplotlib.pyplot as plt

# Define the sigmoid function
def sigmoid(z):
    return 1 / (1 + np.exp(-z))

# Initialize the model parameters
def initialize_parameters(dim):
    w = np.zeros((dim, 1))
    b = 0
    return w, b

# Forward and backward propagation
def propagate(w, b, X, Y):
    m = X.shape[1]  # number of samples (one per column)

    # Linear combination and predicted probabilities
    Z = np.dot(w.T, X) + b
    A = sigmoid(Z)

    # Binary cross-entropy loss
    cost = -1 / m * np.sum(Y * np.log(A) + (1 - Y) * np.log(1 - A))

    # Backward propagation: gradients of the loss with respect to w and b
    dw = 1 / m * np.dot(X, (A - Y).T)
    db = 1 / m * np.sum(A - Y)

    cost = np.squeeze(cost)
    return dw, db, cost

# Update the parameters with gradient descent
def optimize(w, b, X, Y, num_iterations, learning_rate, print_cost=False):
    costs = []

    for i in range(num_iterations):
        dw, db, cost = propagate(w, b, X, Y)

        w -= learning_rate * dw
        b -= learning_rate * db

        # Record the cost every 100 iterations
        if i % 100 == 0:
            costs.append(cost)

        # Print the cost
        if print_cost and i % 100 == 0:
            print("Cost after iteration %i: %f" % (i, cost))

    return w, b, costs

# Predict
def predict(w, b, X):
    Z = np.dot(w.T, X) + b
    A = sigmoid(Z)

    # Predict 1 where the probability is greater than 0.5, otherwise 0
    predictions = (A > 0.5).astype(int)
    return predictions

# Complete logistic regression model
def logistic_regression(X_train, Y_train, X_test, Y_test, num_iterations=2000, learning_rate=0.5, print_cost=False):
    # Initialize the parameters
    w, b = initialize_parameters(X_train.shape[0])

    # Update the parameters with gradient descent
    w, b, costs = optimize(w, b, X_train, Y_train, num_iterations, learning_rate, print_cost=print_cost)

    # Predict
    train_predictions = predict(w, b, X_train)
    test_predictions = predict(w, b, X_test)

    # Compute the accuracy
    train_accuracy = 100 - np.mean(np.abs(train_predictions - Y_train)) * 100
    test_accuracy = 100 - np.mean(np.abs(test_predictions - Y_test)) * 100

    print("Training accuracy: {} %".format(train_accuracy))
    print("Test accuracy: {} %".format(test_accuracy))

    # Plot the cost curve
    plt.plot(costs)
    plt.xlabel("Iterations (per hundred)")
    plt.ylabel("Cost")
    plt.title("Gradient descent")
    plt.show()

    return train_accuracy, test_accuracy

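# Example dataset: random features and random labels.
# The labels are drawn independently of X, so accuracy near 50 % is expected.
np.random.seed(0)
m = 1000
X = np.random.randn(2, m)
Y = np.random.randint(0, 2, (1, m))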

X_train = X[:, :800]
Y_train = Y[:, :800]
X_test = X[:, 800:]
Y_test = Y[:, 800:]

train_accuracy, test_accuracy = logistic_regression(X_train, Y_train, X_test, Y_test, num_iterations=2000, learning_rate=0.05, print_cost=True)

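Code Details Explained

The sigmoid function computes the sigmoid, converting its input into a value between 0 and 1.
The initialize_parameters function initializes the model parameters $w$ and $b$ as a zero vector and a zero scalar.
The propagate function implements the forward pass (computing the linear combination $Z$, the predicted probabilities $A$ via the sigmoid function, and the loss) as well as the backward pass that produces the gradients dw and db.
The optimize function updates the parameters $w$ and $b$ with gradient descent and returns the updated parameters together with the cost values recorded every 100 iterations.
The predict function converts the predicted probabilities, computed from the linear combination $Z$, into 0 or 1 predictions using a threshold of 0.5.
The logistic_regression function is the complete model: it initializes the parameters, updates them with gradient descent, makes predictions, and computes the accuracy.
The training and test sets are assumed to hold two input features per sample in $X$ (shape (2, m), one column per sample), with binary class labels in $Y$.
The example uses a randomly generated dataset and prints the training and test accuracy. Because the labels are generated independently of the features, there is no relationship for the model to learn, and both accuracies should be close to 50%.
Finally, the cost curve is plotted to visualize how the optimization progresses.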
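To see the model learn something, the labels need to depend on the features. The following self-contained sketch repeats the same gradient descent update on synthetic data where, as an assumption made here for illustration, a sample belongs to class 1 when $x_1 + x_2 > 0$. The labeling rule, the random generator, and the variable names are not from the original example.

```python
import numpy as np

rng = np.random.default_rng(0)
m = 1000
X = rng.standard_normal((2, m))   # features, shape (2, m), one column per sample
# Assumed labeling rule (illustrative): class 1 when x1 + x2 > 0
Y = (X[0] + X[1] > 0).astype(float).reshape(1, m)

X_train, Y_train = X[:, :800], Y[:, :800]
X_test, Y_test = X[:, 800:], Y[:, 800:]

w = np.zeros((2, 1))
b = 0.0
learning_rate = 0.05
for _ in range(2000):
    A = 1.0 / (1.0 + np.exp(-(w.T @ X_train + b)))      # predicted probabilities
    dw = X_train @ (A - Y_train).T / X_train.shape[1]   # gradient with respect to w
    db = np.mean(A - Y_train)                           # gradient with respect to b
    w -= learning_rate * dw
    b -= learning_rate * db

# Threshold the test probabilities at 0.5 and measure accuracy
A_test = 1.0 / (1.0 + np.exp(-(w.T @ X_test + b)))
pred = (A_test > 0.5).astype(float)
test_accuracy = 100.0 * np.mean(pred == Y_test)
print("Test accuracy: {:.1f} %".format(test_accuracy))
```

Since these classes are linearly separable, the test accuracy should be far above the roughly 50% obtained with random labels.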

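The above is an example of implementing a logistic regression model in Python, covering the underlying principle, the formula derivation, the algorithm steps, a full code example, and an explanation of the code details.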

Original articles are protected by copyright. Please credit the source when reposting: https://www.johngo689.com/821810/
