吴恩达机器学习(五)逻辑回归练习-二分类练习

2023年7月2日下午12:58 • 人工智能 • 阅读 86

1、基础内容

（1）公式总结:

; （2）内容回归:

逻辑回归主要是进行二分类和多分类。

对于线性回归模型，我们定义的代价函数是所有模型误差的平方和。理论上来说，我们也可以对逻辑回归模型沿用这个定义，但是问题在于，当我们将h θ ( x ) {h_\theta}(x)h θ(x )带入到这样定义了的代价函数中时，我们得到的代价函数将是一个非凸函数（ non-convexfunction）。

这意味着我们的代价函数有许多局部最小值，这将影响梯度下降算法寻找全局最小值。

线性回归的代价函数为：J ( θ ) = 1 m ∑ i = 1 m 1 2 ( h θ ( x ( i ) ) − y ( i ) ) 2 J\left( \theta \right)=\frac{1}{m}\sum\limits_{i=1}^{m}{\frac{1}{2}{{\left( {h_\theta}\left({x}^{\left( i \right)} \right)-{y}^{\left( i \right)} \right)}^{2}}}J (θ)=m 1 i =1 ∑m 2 1 (h θ(x (i ))−y (i ))2 。
我们重新定义逻辑回归的代价函数为：J ( θ ) = 1 m ∑ i = 1 m C o s t ( h θ ( x ( i ) ) , y ( i ) ) J\left( \theta \right)=\frac{1}{m}\sum\limits_{i=1}^{m}{{Cost}\left( {h_\theta}\left( {x}^{\left( i \right)} \right),{y}^{\left( i \right)} \right)}J (θ)=m 1 i =1 ∑m C o s t (h θ(x (i )),y (i ))，其中

h θ ( x ) {h_\theta}\left( x \right)h θ(x )与 C o s t ( h θ ( x ) , y ) Cost\left( {h_\theta}\left( x \right),y \right)C o s t (h θ(x ),y )之间的关系如下图所示：

这样构建的C o s t ( h θ ( x ) , y ) Cost\left( {h_\theta}\left( x \right),y \right)C o s t (h θ(x ),y )函数的特点是：当实际的 y = 1 y=1 y =1 且h θ ( x ) {h_\theta}\left( x \right)h θ(x )也为 1 时误差为 0，当 y = 1 y=1 y =1 但h θ ( x ) {h_\theta}\left( x \right)h θ(x )不为1时误差随着h θ ( x ) {h_\theta}\left( x \right)h θ(x )变小而变大；当实际的 y = 0 y=0 y =0 且h θ ( x ) {h_\theta}\left( x \right)h θ(x )也为 0 时代价为 0，当y = 0 y=0 y =0 但h θ ( x ) {h_\theta}\left( x \right)h θ(x )不为 0时误差随着 h θ ( x ) {h_\theta}\left( x \right)h θ(x )的变大而变大。
将构建的 C o s t ( h θ ( x ) , y ) Cost\left( {h_\theta}\left( x \right),y \right)C o s t (h θ(x ),y )简化如下：
C o s t ( h θ ( x ) , y ) = − y × l o g ( h θ ( x ) ) − ( 1 − y ) × l o g ( 1 − h θ ( x ) ) Cost\left( {h_\theta}\left( x \right),y \right)=-y\times log\left( {h_\theta}\left( x \right) \right)-(1-y)\times log\left( 1-{h_\theta}\left( x \right) \right)C o s t (h θ(x ),y )=−y ×l o g (h θ(x ))−(1 −y )×l o g (1 −h θ(x ))
带入代价函数得到：
J ( θ ) = 1 m ∑ i = 1 m [ − y ( i ) log ⁡ ( h θ ( x ( i ) ) ) − ( 1 − y ( i ) ) log ⁡ ( 1 − h θ ( x ( i ) ) ) ] J\left( \theta \right)=\frac{1}{m}\sum\limits_{i=1}^{m}{[-{{y}^{(i)}}\log \left( {h_\theta}\left( {{x}^{(i)}} \right) \right)-\left( 1-{{y}^{(i)}} \right)\log \left( 1-{h_\theta}\left( {{x}^{(i)}} \right) \right)]}J (θ)=m 1 i =1 ∑m [−y (i )lo g (h θ(x (i )))−(1 −y (i ))lo g (1 −h θ(x (i )))]
即：J ( θ ) = − 1 m ∑ i = 1 m [ y ( i ) log ⁡ ( h θ ( x ( i ) ) ) + ( 1 − y ( i ) ) log ⁡ ( 1 − h θ ( x ( i ) ) ) ] J\left( \theta \right)=-\frac{1}{m}\sum\limits_{i=1}^{m}{[{{y}^{(i)}}\log \left( {h_\theta}\left( {{x}^{(i)}} \right) \right)+\left( 1-{{y}^{(i)}} \right)\log \left( 1-{h_\theta}\left( {{x}^{(i)}} \right) \right)]}J (θ)=−m 1 i =1 ∑m [y (i )lo g (h θ(x (i )))+(1 −y (i ))lo g (1 −h θ(x (i )))]

进行向量化表示;

梯度下降和线性回归思路一样:
吴恩达机器学习(五)逻辑回归练习-二分类练习

2、二分类案例(线性可分)___依据两次测试的成绩，预测是否被大学录取

（1）读取数据、绘制图像

"""
  二分类案例:
  依据两次测试的成绩，预测是否被大学录取
"""
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv('ex2data1.txt',header=None,names=['exam1','exam2','accepted'])
print(df.head())

fig,ax = plt.subplots()
ax.scatter(df[df['accepted'] == 0]['exam1'],df[df['accepted'] == 0]['exam2'],c = 'red',marker='x' ,label='y=0')
ax.scatter(df[df['accepted'] == 1]['exam1'],df[df['accepted'] == 1]['exam2'],c = 'green',marker='o',label='y=1' )
ax.legend()
ax.set(xlabel='exam1',ylabel='exam2',title='Fig')
plt.show()

可以看出一个二分类问题。

(2)计算theta_final


def getX_y(df):

    df.insert(0,'const',1)

    X = df.iloc[:,0:-1]
    y = df.iloc[:, -1]

    X = X.values
    y = y.values
    y = y.reshape(len(y),1)
    return X,y

X,y = getX_y(df)

def sigmod(z):
    return 1 / (1 + np.exp(-z))

def costFunction(X, y, theta):
    A = sigmod(X @ theta)
    first = y * np.log(A)
    second = (1 - y) * np.log(1 - A)
    return -np.sum(first + second) / len(y)

def gradientDescent(X, y, theta, alpha, iters):
    costs = []
    for i in range(iters):
        A = sigmod(X @ theta)
        theta = theta - (alpha * X.T @ (A - y)) / (len(y))
        cost = costFunction(X, y, theta)
        costs.append(cost)

        if i % 1000 == 0:
            print(cost)
    return theta, costs

alpha = 0.004
iters = 200000

theta = np.zeros((3,1))

theta_final,costs = gradientDescent(X,y,theta,alpha,iters)
print(theta_final)

（3）计算预测准确率，绘制决策边界


def predict(X, theta):
    p = sigmod(X @ theta)
    return [1 if x >= 0.5 else 0 for x in p]

y_ = np.array(predict(X,theta_final))
y_pre = y_.reshape(len(y_),1)
acc = np.mean(y_pre == y)
print(acc)

x = np.linspace(20,100,100)
f = - theta_final[0,0] / theta_final[2,0] - theta_final[1,0] / theta_final[2,0] * x

fig,ax = plt.subplots()
ax.scatter(df[df['accepted'] == 0]['exam1'],df[df['accepted'] == 0]['exam2'],c = 'red',marker='x' ,label='y=0')
ax.scatter(df[df['accepted'] == 1]['exam1'],df[df['accepted'] == 1]['exam2'],c = 'green',marker='o',label='y=1' )
ax.plot(x,f,c = 'blue',label='border' )
ax.legend()
ax.set(xlabel='exam1',ylabel='exam2',title='Fig')
plt.show()

3、二分类案例(线性不可分)___依据两次测试的成绩，决定芯片要被抛弃还是接受

没有办法用一条直线进行切分。

需要特征映射:

为了防止过拟合，需要加上正则项：

(1)读取原始数据，画图

"""
  逻辑回归练习（线性不可分）:
    决定芯片要被抛弃还是接受
    数据集： 芯片在两次测试中的测试结果
"""
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt

df = pd.read_csv('ex2data2.txt',header=None,names=['test1','test2','accepted'])
print(df.head())

fig,ax = plt.subplots()
ax.scatter(df[df['accepted'] == 0]['test1'],df[df['accepted'] == 0]['test2'],c = 'red',marker='x' ,label='y=0')
ax.scatter(df[df['accepted'] == 1]['test1'],df[df['accepted'] == 1]['test2'],c = 'green',marker='o',label='y=1' )
ax.legend()
ax.set(xlabel='test1',ylabel='test2',title='Fig')
plt.show()

(2)使用特征映射，定义函数计算theta


def feature_mapping(x1, x2, power):
    data = {}
    for i in np.arange(power + 1):
        for j in np.arange(i + 1):
            data['F{}{}'.format(i - j, j)] = np.power(x1, i - j) * np.power(x2, j)
    return pd.DataFrame(data)
x1 = df['test1']
x2 = df['test2']

mdf = feature_mapping(x1,x2,6)

y = df.iloc[:, -1]

X = mdf.values
y = y.values
y = y.reshape(len(y),1)

def sigmod(z):
    return 1 / (1 + np.exp(-z))

def costFunction(X, y, theta, lamda):
    A = sigmod(X @ theta)
    first = y * np.log(A)
    second = (1 - y) * np.log(1 - A)

    reg = np.sum( np.power(theta[1:],2) ) * (lamda / (2 * len(y)) )
    return -np.sum(first + second) / len(y) + reg

def gradientDescent(X, y, theta, alpha, iters, lamda):
    costs = []
    for i in range(iters):
        reg = theta[1:] * (lamda / len(y))
        reg = np.insert(reg, 0, values=0, axis=0)

        A = sigmod(X @ theta)
        theta = theta - (alpha * X.T @ (A - y)) / (len(y)) - alpha * reg
        cost = costFunction(X, y, theta, lamda)
        costs.append(cost)
        if i % 1000 == 0:
            print(cost)
    return theta, costs

alpha = 0.001
iters = 20000

lamda = 0.0001
theta = np.zeros((28,1))

theta_final,costs = gradientDescent(X,y,theta,alpha,iters,lamda)
print(theta_final)

(3)计算预测准确率，画出决策边界


def predict(X, theta):
    p = sigmod(X @ theta)

    return [1 if x >= 0.5 else 0 for x in p]

y_ = np.array(predict(X,theta_final))
y_pre = y_.reshape(len(y_),1)
acc = np.mean(y_pre == y)
print(acc)

x = np.linspace(-1.2,1.2,200)
xx,yy = np.meshgrid(x,x)
print(xx.shape)
z = feature_mapping(xx.ravel(),yy.ravel(),6).values
zz = z @ theta_final
zz = zz.reshape(200,200)

fig,ax = plt.subplots()
ax.scatter(df[df['accepted'] == 0]['test1'],df[df['accepted'] == 0]['test2'],c = 'red',marker='x' ,label='y=0')
ax.scatter(df[df['accepted'] == 1]['test1'],df[df['accepted'] == 1]['test2'],c = 'blue',marker='o',label='y=1' )
ax.legend()
ax.set(xlabel='test1',ylabel='test2',title='Fig')
plt.contour(xx,yy,zz,0)
plt.show()

Original: https://blog.csdn.net/qq_44665283/article/details/123028916
Author: undo_try
Title: 吴恩达机器学习(五)逻辑回归练习-二分类练习

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/665474/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

聚类算法有哪些？又是如何分类？

想要了解聚类算法并对其进行区别与比较的话，最好能把聚类的具体算法放到整个聚类分析的语境中理解。聚类分析是一个较为严密的数据分析过程。从聚类对象数据源开始到得到聚类结果的知识存档，…

人工智能 2023年7月1日
00107
NumPy:高性能科学计算&数据分析的基础包

numpy不仅是 Python 中使用最多的第三方库，而且还是 SciPy、Pandas 等数据科学的基础库。它所提供的数据结构比 Python 自身的”更高级、更高效…

人工智能 2023年6月11日
0094
2000字精华总结，安利一个超好用的 Python 数据分析神器

大家好，今天我来给大家介绍一款用于做EDA(探索性数据分析)的利器，并且可以自动生成代码，帮助大家极大节省工作时间与提升工作效率的利器。这款神器就是 Bamboolib，可以将其…

人工智能 2023年7月6日
0096
【Jetson目标检测SSD-MobileNet应用实例】（五）根据输出的检测结果，使用串口和STM32配合进行电机控制

【Jetson目标检测SSD-MobileNet应用实例】（一）win11中配置SSD-MobileNet网络训练境搭建【Jetson目标检测SSD-MobileNet应用实例】（…

人工智能 2023年7月10日
0090
聚类分析

通俗的讲，聚类分析它是根据研究对象的特征按照一定的标准，对研究对象来进行分类的一种分析方法，它使分成后的每一种类的数据对象具有较高的相似度，而不同类的对象有比较大的差异性，聚类分析…

人工智能 2023年6月2日
00118
基于svm机器学习的手写数字识别

机器学习入门来说，手写数字识别是个很不错的练习项目而我们这里基于svm练习我们的所学习的机器学习。而我们选择的训练集是MNIST，这个训练集量大，好用，有几万张纯手写28*28的…

人工智能 2023年7月26日
0075
ARMA模型的性质之ARMA模型

目录一、ARMA模型的定义二、平稳条件与可逆条件三、传递形式与逆转形式四、ARMA(p,q)模型的统计性质 1.均值 2.自协方差函数 3.自相关系数 4.ARMA(p,q…

人工智能 2023年7月26日
00139
UnicodeDecodeError: ‘utf-8‘ codec can‘t decode byte 0xc0 in position 0: invalid start byte报错解决

UnicodeDecodeError: ‘utf-8’ codec can’t decode byte 0xc0 in position 0: …

人工智能 2023年7月29日
0082
基于深度学习的自动调制识别（含代码链接）

AMR领域具有代表性的新模型在四个不同的数据集（RML2016.10a, RML2016.10b, RML2018.01a, HisarMod2019.1）上的实现，为感兴趣的研究…

人工智能 2023年7月26日
0090
pandas处理Excel基本方法

学习总结主要参考了视频内容 https://www.bilibili.com/video/BV1hk4y1C73S?p=2&vd_source=7771577bd8c0c6…

人工智能 2023年7月14日
0065
LSTM学习记录

文章目录前言一、为什么要用LSTM？二、LSTM结构介绍 * 1.大体结构 2.三个门结构总结前言一些关于LSTM的学习记录。一、为什么要用LSTM？ LSTM（Lo…

人工智能 2023年7月14日
0073
VS+Opencv出现：xxx处有未经处理的异常: Microsoft C++ 异常: cv::Exception，位于内存位置xxx处。

问题描述 opencv配置运行时报错代码如下： #include #include #include using namespace cv; int main() { Mat im…

人工智能 2023年6月18日
00121
图像特征点提取及匹配的几种方法总结——基于C++和OPENCV实现SIFT、SURF、ORB、FAST

啊哦~你想找的内容离你而去了哦内容不存在，可能为如下原因导致： ① 内容还在审核中 ② 内容以前存在，但是由于不符合新的规定而被删除 ③ 内容地址错误 ④ 作者删除了内容。可…

人工智能 2023年6月18日
00104
如何选择逻辑回归模型中的特征

如何选择逻辑回归模型中的特征在逻辑回归模型中，一个重要的任务是选择最合适的特征来训练我们的模型，以提高模型的性能和预测准确率。在本文中，我们将详细介绍如何选择逻辑回归模型中的特征…

人工智能 2023年12月31日
0058
MachineLearning入门—第2章—神经网络的数学基础

神经网络的数学基础神经网络的核心组件是层（layer），它是一种数据处理模块,可以看作是数据过滤器，进去一些数据，出来的数据变得更加有用。大多数深度学习都是将简单的层链接起来，从…

人工智能 2023年7月18日
0078
新手树莓派3B——人脸识别门禁管理系统

文章目录前言一、怎么用树莓派设计人脸识别门禁系统？二、树莓派系统的安装三、远程连接四、人脸识别总结前言树莓派，自问世以来，其”麻雀虽小，五脏俱全&#82…

人工智能 2023年7月18日
00115

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31