构建逻辑回归模型识别MNIST手写字——单个神经元

2023年5月23日下午7:00 • 人工智能 • 阅读 84

实验步骤

1、导入库

import tensorflow as tf
import numpy as np
import matplotlib.pyplot as plt
%matplotlib inline
print("Tensorflow&#x7248;&#x672C;&#x662F;:",tf.__version__)

2、数据获取

MNIST 数据集可在http://yann.lecun.com/exdb/mnist/获取

TensorFlow提供了数据集读取方法(1.x和2.0版本提供的方法不同)

mnist = tf.keras.datasets.mnist
(train_images,train_labels),(test_images,test_labels)=mnist.load_data()

MNIST数据集文件在读取时如果指定目录下不存在，则会自动去下载，需等待一定时间；如果已经存在了，则直接读取

3、数据集划分

total_num = len(train_images)
valid_split = 0.2
train_num = int(total_num*(1-valid_split))

train_x = train_images[:train_num]
train_y = train_labels[:train_num]

valid_x = train_images[train_num:]
valid_y = train_labels[train_num:]

test_x = test_images
test_y = test_labels

valid_x.shape

4、数据塑形

train_x = train_x.reshape(-1,784)
valid_x = valid_x.reshape(-1,784)
test_x = test_x.reshape(-1,784)

5、特征数据归一化

train_x = tf.cast(train_x/255.0,tf.float32)
valid_x = tf.cast(valid_x/255.0,tf.float32)
test_x = tf.cast(test_x/255.0,tf.float32)

train_x[1]

6、标签数据独热编码

独热编码常用于表示拥有有限个可能值的字符串或标识符

train_y = tf.one_hot(train_y,depth=10)
valid_y = tf.one_hot(valid_y,depth=10)
test_y = tf.one_hot(test_y,depth=10)

train_y

7、构建模型

def model(x,w,b):
    pred = tf.matmul(x,w)+b
    return tf.nn.softmax(pred)

8、定义模型变量

W=tf.Variable(tf.random.normal([784,10],mean=0.0,stddev=1.0,dtype=tf.float32))

B=tf.Variable(tf.zeros([10]),dtype=tf.float32)

9、定义交叉熵损失函数

在自定义的损失函数loss中直接调用了TensorFlow提供的交叉熵函数。

def loss(x,y,w,b):
    pred = model(x,w,b)
    loss_ = tf.keras.losses.categorical_crossentropy(y_true=y,y_pred = pred)
    return tf.reduce_mean(loss_)

10、定义训练参数

training_epochs=20
batch_size=50
learning_rate=0.001

11、定义梯度计算函数

def grad(x,y,w,b):
    with tf.GradientTape() as tape:
        loss_=loss(x,y,w,b)
        return tape.gradient(loss_,[w,b])

12、选择优化器

常用优化器： SGD、Adagrad、Adadelta、RMSprop、Adam

optimizer=tf.keras.optimizers.Adam(learning_rate=learning_rate)

13、定义准确率

def accuracy(x,y,w,b):
    pred=model(x,w,b)
    correct_prediction = tf.equal(tf.argmax(pred,1),tf.argmax(y,1))
    return tf.reduce_mean(tf.cast(correct_prediction,tf.float32))

14、训练模型

total_step = int(train_num/batch_size)

loss_list_train = []
loss_list_valid = []
acc_list_train = []
acc_list_valid = []

for epoch in range (training_epochs):
    for step in range(total_step):
        xs = train_x[step*batch_size:(step+1)*batch_size]
        ys = train_y[step*batch_size:(step+1)*batch_size]

        grads = grad(xs,ys,W,B)
        optimizer.apply_gradients(zip(grads,[W,B]))

    loss_train = loss(train_x,train_y,W,B).numpy()
    loss_valid = loss(valid_x,valid_y,W,B).numpy()
    acc_train = accuracy(train_x,train_y,W,B).numpy()
    acc_valid = accuracy(valid_x,valid_y,W,B).numpy()
    loss_list_train.append(loss_train)
    loss_list_valid.append(loss_valid)
    acc_list_train.append(acc_train)
    acc_list_valid.append(acc_valid)
    print("epoch={:3d},train_loss={:.4f},train_acc={:.4f},val_loss={:.4f},val_lacc={:.4f}".format(epoch+1,loss_train,acc_train,loss_valid,acc_valid))

15、显示训练过程数据

plt.xlabel("Epochs")
plt.ylabel("Loss")
plt.plot(loss_list_train,'blue',label="Train Loss")
plt.plot(loss_list_valid,'red',label='Valid Loss')
plt.legend(loc=1)

plt.xlabel("Epochs")
plt.ylabel("Accuracy")
plt.plot(acc_list_train,'blue',label="Train Loss")
plt.plot(acc_list_valid,'red',label='Valid Loss')
plt.legend(loc=1)

16、评估模型

acc_test = accuracy(test_x,test_y,W,B).numpy()
print("Test accuracy:",acc_test)

17、模型应用与可视化

应用模型

def predict(x,w,b):
    pred=model(x,w,b)
    result=tf.argmax(pred,1).numpy()
    return result

pred_test=predict(test_x,W,B)

pred_test[0]

2. 定义可视化函数

import matplotlib.pyplot as plt
import numpy as np
def plot_images_label_prediction(images,
                                 labels,
                                 preds,
                                 index=0,
                                 num=10
                                ):
    fig = plt.gcf()
    fig.set_size_inches(10,4)
    if num > 10:
        num = 10
    for i in range(0,num):
        ax = plt.subplot(2,5,i+1)

        ax.imshow(np.reshape(images[index],(28,28)),cmap='binary')

        title = "label=" + str(labels[index])
        if len(preds)>0:
            title +=",predict=" + str(labels[index])

        ax.set_title(title,fontsize=10)
        ax.set_xticks([]);
        ax.set_yticks([])
        index = index + 1

    plt.show()

可视化预测结果

plot_images_label_prediction(test_images,test_labels,pred_test,10,10)

Original: https://blog.csdn.net/m0_59324564/article/details/124474111
Author: 小洁酱
Title: 构建逻辑回归模型识别MNIST手写字——单个神经元

原创文章受到原创版权保护。转载请注明出处：https://www.johngo689.com/497490/

转载文章受原作者版权保护。转载请注明原作者出处！

人工智能

【自取】最近整理的，有需要可以领取学习：

Linux核心资料大放送~

全栈面试题汇总（持续更新&可下载）

一个提高学习100%效率的工具！

【超详细】深度学习面试题目！

LeetCode Python刷题答案下载！

LeetCode Java版刷题答案下载！

LeetCode C++ 版本，抓紧保存！

LeetCode GO语言刷题答案下载！

【图像分割】基于matlab遗传算法优化K聚类图像分割【含Matlab源码 1605期】

⛄一、遗传算法优化K聚类简介文中提出基于优化遗传算法的模糊聚类图像分割算法, 是在上述对遗传算法进行了优化的基础上形成的。不仅根据个体适应度大小和变化快慢自适应调节变异率和交叉率…

人工智能 2023年6月2日
0087
Vue中播放音频和语音合成

利用audio标签播放音频 1,把音频文件notify.mp3放到public目录, mp3 wav ogg 都放上兼不同的浏览器2,添加如下标签 <audio contro…

人工智能 2023年5月27日
0066
学习CANopen — [10] 汽车外接OBD模块原理

在某宝上搜索汽车OBD，可以发现很多卖OBD模块的，通过接入OBD模块可以增加车子本身没有的功能，如锁车升窗，行车自动落锁和后视镜折叠等，那么其实现原理是什么呢？使用时会造成亏电吗…

人工智能 2023年6月27日
0086
iOS苹果开发者账号(公司账号)申请流程详解

前言近期由于公司项目的需要，全程参与了公司一款iOS原生应用的开发与上线。其中最让我印象深刻的还是申请苹果开发者账号以及app的上架审核。申请苹果开发者账号一波三折，又是邮件又…

人工智能 2023年5月30日
0085
python基础练习（学python的多多少少听说过）

🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝🔝 🥰 博客首页： knighthood2001😗 欢迎点赞👍评论🗨️❤️ 热爱python，期待与大家一同进步成长！！❤️👀给大家推荐一款很火爆的刷题、面试…

人工智能 2023年7月6日
00118
误差反向传播算法中的权重和偏差是如何更新的

问题介绍误差反向传播算法（Backpropagation）是一种常用的用于训练深度神经网络的算法。该算法通过将训练样本输入到神经网络中，并计算输出结果与真实结果之间的误差，然后根…

人工智能 2024年1月4日
0054
openCV——梯度运算

梯度运算 import cv2 import numpy as np import matplotlib.pyplot as plt %matplotlib inline def …

人工智能 2023年6月22日
0076
Python_时间序列处理及可视化_美国能源消耗数据集分析

数据集(PJME_hourly.csv，PJMW_hourly.csv),可以在Kaggle官网下载。这里列出几个基本的任务。任务：读取数据，创建以时间列为索引的时间序列，截取…

人工智能 2023年7月7日
00131
【Machine Learning】5.特征工程和多项式回归

特征工程和多项式回归 1. 导入 2.多项式特征 3.特征选择 4.多项式特征与线性特征的关联 5. 特征缩放 Scaling features 6.复杂函数的拟合 7.课后题特…

人工智能 2023年6月17日
0093
常用数据特征提取，时域特征、频域特征、小波特征提取汇总；特征提取；有效matlab代码

clc;clear%% 导入数据load(‘ct.mat’)Fs=12800;a=[];c=[];w=[];%% 1时域特征提取for i=2:8y=ct(…

人工智能 2023年6月19日
00127
AI实现艺术品自动生成？太牛了

CSDN话题挑战赛第1期活动详情地址：https://marketing.csdn.net/p/bb5081d88a77db8d6ef45bb7b6ef3d7f参赛话题：哪项人工智…

人工智能 2023年6月23日
0088
中兴c600olt数据配置_OLT(ONU)语音业务数据标准配置指导-zte

中兴语音业务配置流程： (本数据规范以常见开局方式为例) 假设在 OLT 口下注册一个 onu onu 的语音 vlan 3000, IP:10.65.3.22, 语音网关为： 1…

人工智能 2023年5月27日
00166
深度学习之concatenate和elementwise操作（二）

一、深度学习里面的element-wise特征相乘和相加到底有什么区别？很多深度学习模型在设计时，中间特征在分支处理后，然后可能会采用element-wise相乘或相加，不知道这…

人工智能 2023年7月27日
0078
智源社区AI周刊No.109：ChatGPT预示大模型取代搜索引擎；Stable Diffusion2.1发布，8k高清图像生成…

啊哦~你想找的内容离你而去了哦内容不存在，可能为如下原因导致： ① 内容还在审核中 ② 内容以前存在，但是由于不符合新的规定而被删除 ③ 内容地址错误 ④ 作者删除了内容。可…

人工智能 2023年7月30日
0067
深度学习第3章线性分类实验四 pytorch实现 Softmax回归鸢尾花分类任务下篇

目录：第3章线性分类 * 3.3 实践：基于Softmax回归完成鸢尾花分类任务 – 3.3.1 数据处理 + 3.3.1.1 数据集介绍 3.3.1.2 数据清洗…

人工智能 2023年6月16日
00102
深度学习环境搭建超级详解（Miniconda、pytorch安装）

小白刚开始学习《动手学深度学习》，第一次发文，本文主要是为了记录在环境搭建过程中遇到的问题和疑惑，以及解决方法，同时希望能帮到遇到相同问题的小伙伴。在学习中遇到的疑惑和最后搜索得…

人工智能 2023年7月23日
0074

2024 年 5 月
一	二	三	四	五	六	日
		1	2	3	4	5
6	7	8	9	10	11	12
13	14	15	16	17	18	19
20	21	22	23	24	25	26
27	28	29	30	31