Rock-Paper-Scissors Hand Gesture Recognition with a Convolutional Neural Network

May 25, 2023, 3:03 AM • Artificial Intelligence

Rock-paper-scissors gesture recognition

1. Load and extract the data

(1) Download the compressed training and test sets with wget

!wget  https://storage.googleapis.com/laurencemoroney-blog.appspot.com/rps.zip

!wget https://storage.googleapis.com/laurencemoroney-blog.appspot.com/rps-test-set.zip

(2) Extract the archives with the zipfile module

import os
import zipfile

# Extract the training archive; the with-block closes the file automatically.
local_zip = 'C:\\Users\\......\\tmp\\rps.zip'
with zipfile.ZipFile(local_zip, 'r') as zip_ref:
    zip_ref.extractall('\\tmp\\')

# Extract the test archive into the same directory.
local_zip = 'C:\\Users\\......\\tmp\\rps-test-set.zip'
with zipfile.ZipFile(local_zip, 'r') as zip_ref:
    zip_ref.extractall('\\tmp\\')

Note: adjust the paths above to match your own machine.
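Because the paths above are tied to one machine, a more portable variant may be useful. The following is a minimal sketch, not the author's code: the helper name `extract_archive` and the demo file names are hypothetical, and the demo builds a tiny stand-in archive instead of touching the real rps.zip.

```python
import tempfile
import zipfile
from pathlib import Path


def extract_archive(zip_path, dest_dir):
    """Extract zip_path into dest_dir and return the sorted member names."""
    dest = Path(dest_dir)
    dest.mkdir(parents=True, exist_ok=True)
    # The context manager closes the archive even if extraction fails.
    with zipfile.ZipFile(zip_path, "r") as zip_ref:
        zip_ref.extractall(dest)
        return sorted(zip_ref.namelist())


# Demo on a tiny stand-in archive (hypothetical contents, not the dataset).
work_dir = Path(tempfile.mkdtemp())
demo_zip = work_dir / "demo.zip"
with zipfile.ZipFile(demo_zip, "w") as zf:
    zf.writestr("rps/rock/rock01.png", b"placeholder")
    zf.writestr("rps/paper/paper01.png", b"placeholder")

members = extract_archive(demo_zip, work_dir / "out")
print(members)
```

With the real data, the same helper would be called once per downloaded archive, passing whatever download and target directories apply on your system.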

(3) Inspect the sample data and list the first 10 file names of each class

import os

# One sub-directory per gesture class inside the extracted training set.
rock_dir = os.path.join('/tmp/rps', 'rock')
paper_dir = os.path.join('/tmp/rps', 'paper')
scissors_dir = os.path.join('/tmp/rps', 'scissors')

print('total training rock images:', len(os.listdir(rock_dir)))
print('total training paper images:', len(os.listdir(paper_dir)))
print('total training scissors images:', len(os.listdir(scissors_dir)))

rock_files = os.listdir(rock_dir)
print(rock_files[:10])

paper_files = os.listdir(paper_dir)
print(paper_files[:10])

scissors_files = os.listdir(scissors_dir)
print(scissors_files[:10])

Output (screenshot not shown):
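The per-class counts printed above are worth checking for imbalance before training. As a rough sketch of the same check in reusable form, assuming one sub-directory per class, the hypothetical helper `count_images` below counts image files per class; the demo runs on a throwaway directory tree rather than the real dataset.

```python
import tempfile
from pathlib import Path

IMAGE_SUFFIXES = {".png", ".jpg", ".jpeg"}


def count_images(root_dir):
    """Map each class sub-directory name to its number of image files."""
    counts = {}
    for class_dir in sorted(Path(root_dir).iterdir()):
        if class_dir.is_dir():
            counts[class_dir.name] = sum(
                1 for f in class_dir.iterdir()
                if f.suffix.lower() in IMAGE_SUFFIXES
            )
    return counts


# Demo on a temporary tree that mimics the rps/ layout (file counts made up).
root = Path(tempfile.mkdtemp()) / "rps"
for name, n in [("rock", 3), ("paper", 2), ("scissors", 1)]:
    (root / name).mkdir(parents=True)
    for i in range(n):
        (root / name / f"{name}{i:02d}.png").touch()

print(count_images(root))
```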

(4) Visualize some sample images

%matplotlib inline

import matplotlib.pyplot as plt
import matplotlib.image as mpimg

pic_index = 2

next_rock = [os.path.join(rock_dir, fname)
                for fname in rock_files[pic_index-2:pic_index]]
next_paper = [os.path.join(paper_dir, fname)
                for fname in paper_files[pic_index-2:pic_index]]
next_scissors = [os.path.join(scissors_dir, fname)
                for fname in scissors_files[pic_index-2:pic_index]]

for i, img_path in enumerate(next_rock+next_paper+next_scissors):
  print(i,img_path)
  img = mpimg.imread(img_path)
  plt.imshow(img)
  plt.axis('Off')
  plt.show()

Output (figures not shown):

2. Data preprocessing and model construction

Data preprocessing
Both the training and test samples are first normalized by rescaling pixel values to the range [0, 1]. In addition, the training samples go through a series of augmentations (rotation, translation, shearing, zooming, and horizontal flipping) that increase the variety of samples the network sees. This improves the network's ability to generalize and lowers the risk of overfitting.
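To make the two ideas concrete, here is a minimal pure-Python sketch of rescaling and of one augmentation (horizontal flipping) on a made-up grayscale image. It is only an illustration of the arithmetic: the function names are hypothetical, and ImageDataGenerator itself applies its transformations randomly and on the fly as batches are generated, rather than storing extra copies on disk.

```python
def rescale(image, factor=1.0 / 255):
    """Multiply every pixel by factor, mapping 0..255 into 0..1."""
    return [[pixel * factor for pixel in row] for row in image]


def horizontal_flip(image):
    """Mirror the image left to right by reversing each row."""
    return [row[::-1] for row in image]


# A made-up 2x3 grayscale image with raw 8-bit intensities.
raw = [
    [0, 51, 255],
    [102, 204, 153],
]

normalized = rescale(raw)
flipped = horizontal_flip(normalized)
print(flipped)
```

A flipped hand is still a valid example of the same gesture, which is why this kind of transformation can add variety without changing the label.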

import tensorflow as tf
# ImageDataGenerator ships with TensorFlow's bundled Keras, so the separate
# keras_preprocessing package is not needed.
from tensorflow.keras.preprocessing.image import ImageDataGenerator

TRAINING_DIR = "/tmp/rps/"
training_datagen = ImageDataGenerator(
      rescale = 1./255,
      rotation_range=40,
      width_shift_range=0.2,
      height_shift_range=0.2,
      shear_range=0.2,
      zoom_range=0.2,
      horizontal_flip=True,
      fill_mode='nearest')

VALIDATION_DIR = "/tmp/rps-test-set/"
validation_datagen = ImageDataGenerator(rescale = 1./255)

train_generator = training_datagen.flow_from_directory(
    TRAINING_DIR,
    target_size=(150,150),
    class_mode='categorical'
)

validation_generator = validation_datagen.flow_from_directory(
    VALIDATION_DIR,
    target_size=(150,150),
    class_mode='categorical'
)

model = tf.keras.models.Sequential([

    tf.keras.layers.Conv2D(64, (3,3), activation='relu', input_shape=(150, 150, 3)),
    tf.keras.layers.MaxPooling2D(2, 2),

    tf.keras.layers.Conv2D(64, (3,3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2,2),

    tf.keras.layers.Conv2D(128, (3,3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2,2),

    tf.keras.layers.Conv2D(128, (3,3), activation='relu'),
    tf.keras.layers.MaxPooling2D(2,2),

    tf.keras.layers.Flatten(),
    tf.keras.layers.Dropout(0.5),

    tf.keras.layers.Dense(512, activation='relu'),
    tf.keras.layers.Dense(3, activation='softmax')
])

model.summary()

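Model structure:

The network has four convolution + pooling pairs, followed by one fully connected hidden layer and an output layer.
Every convolution uses a 3×3 kernel and every pooling layer a 2×2 window; max pooling compresses the feature maps. All hidden layers use the ReLU activation.
The input is a 150×150 three-channel color image.
Of the four convolutional layers, the first two have 64 channels and the last two have 128.
Before the fully connected layer, a flatten operation unrolls the output of the last pooling layer into a vector, joining all 128 channels into one longer vector (7×7×128 = 6272 values); a dropout layer with rate 0.5 follows.
This vector feeds the fully connected hidden layer, which has 512 neurons and a ReLU activation.
The output layer has 3 neurons, because rock-paper-scissors is a three-class problem, and uses the softmax activation so that the three outputs sum to 1.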
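The layer sizes described above can be checked by hand. The sketch below walks through the arithmetic, assuming Keras defaults for the layers as written in the model code (unpadded 'valid' convolutions with stride 1, and non-overlapping 2×2 pooling that rounds down); the helper names are hypothetical.

```python
def conv_out(size, kernel=3):
    """Output width of an unpadded ('valid') convolution with stride 1."""
    return size - kernel + 1


def pool_out(size, pool=2):
    """Output width of non-overlapping max pooling (rounds down)."""
    return size // pool


def conv_params(in_channels, out_channels, kernel=3):
    """Kernel weights plus one bias per output channel."""
    return kernel * kernel * in_channels * out_channels + out_channels


size, channels = 150, 3
total_params = 0
for filters in (64, 64, 128, 128):
    total_params += conv_params(channels, filters)
    size = pool_out(conv_out(size))
    channels = filters
    print(f"after conv+pool with {filters} filters: {size}x{size}x{channels}")

flat = size * size * channels
total_params += flat * 512 + 512  # Dense(512): weights + biases
total_params += 512 * 3 + 3       # Dense(3): weights + biases
print("flattened length:", flat)
print("total trainable parameters:", total_params)
```

Most of the parameters sit in the first dense layer, which is one reason the dropout layer is placed right before it.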

3. Model training and optimization

model.compile(loss='categorical_crossentropy', optimizer='rmsprop', metrics=['accuracy'])

# fit_generator is deprecated in TensorFlow 2.x; model.fit accepts generators directly.
history = model.fit(train_generator, epochs=25, validation_data=validation_generator, verbose=1)

model.save("rps.h5")
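The training step pairs a softmax output layer with the categorical cross-entropy loss. As a rough illustration of what those two pieces compute for a single image, here is a pure-Python sketch; the raw scores are made up, and the function names are hypothetical stand-ins rather than the Keras implementations.

```python
import math


def softmax(logits):
    """Turn raw scores into probabilities that sum to 1."""
    shift = max(logits)  # subtracting the max avoids overflow in exp
    exps = [math.exp(z - shift) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def categorical_crossentropy(one_hot, probs, eps=1e-7):
    """Negative log-probability assigned to the true class."""
    return -sum(t * math.log(max(p, eps)) for t, p in zip(one_hot, probs))


# Made-up raw scores for one image; the true class is index 1.
probs = softmax([1.0, 3.0, 0.5])
loss = categorical_crossentropy([0, 1, 0], probs)
print([round(p, 3) for p in probs], round(loss, 3))
```

The loss shrinks toward 0 as the probability assigned to the true class approaches 1, which is the quantity the optimizer drives down during training.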

Output (screenshot not shown):

4. Model evaluation

Training is fairly stable overall. At times the accuracy on the test samples is higher than on the training samples, which may point to too few training samples; note also that data augmentation and dropout are active only during training, which by itself lowers the training accuracy relative to validation. To improve the model, we can add training samples, tune the model, or train for more epochs. Tuning can also mean changing the network structure, for example adding layers or increasing the number of neurons in the hidden layer.
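One simple way to read the training curves is to find the epoch with the best validation accuracy and look at the gap to the training accuracy there. The sketch below does this on a dictionary shaped like `history.history`; the helper name `summarize_history` is hypothetical, and the numbers are illustrative only, not results from the run in this article.

```python
def summarize_history(history):
    """Return the best validation epoch and the train/validation gap there."""
    val_acc = history["val_accuracy"]
    best_epoch = max(range(len(val_acc)), key=lambda i: val_acc[i])
    gap = history["accuracy"][best_epoch] - val_acc[best_epoch]
    return best_epoch, gap


# Illustrative numbers only, not measurements from this model.
toy_history = {
    "accuracy":     [0.40, 0.70, 0.85, 0.92, 0.95],
    "val_accuracy": [0.50, 0.80, 0.93, 0.90, 0.91],
}

best_epoch, gap = summarize_history(toy_history)
print(f"best epoch: {best_epoch}, train - val accuracy: {gap:+.2f}")
```

In the notebook, the same function could be called as `summarize_history(history.history)`. A negative gap means validation accuracy is ahead of training accuracy, while a large positive gap is the usual sign of overfitting.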

import matplotlib.pyplot as plt
acc = history.history['accuracy']
val_acc = history.history['val_accuracy']
loss = history.history['loss']
val_loss = history.history['val_loss']

epochs = range(len(acc))

plt.plot(epochs, acc, 'r', label='Training accuracy')
plt.plot(epochs, val_acc, 'b', label='Validation accuracy')
plt.title('Training and validation accuracy')
plt.legend(loc=0)

plt.show()

Accuracy plot (figure not shown):

5. Putting the model to use

import numpy as np
from google.colab import files  # only available inside Google Colab
from tensorflow.keras.preprocessing import image

uploaded = files.upload()

for fn in uploaded.keys():

  path = fn
  img = image.load_img(path, target_size=(150, 150))
  x = image.img_to_array(img)
  x = x / 255.0  # match the 1/255 rescaling used during training
  x = np.expand_dims(x, axis=0)

  images = np.vstack([x])
  classes = model.predict(images, batch_size=10)
  print(fn)
  print(classes)

Output (screenshot not shown):
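The prediction step prints a row of three probabilities, which still has to be mapped back to a gesture name. A minimal sketch of that mapping follows, assuming the labels follow the alphabetical order of the class folders, which is how flow_from_directory assigns indices by default; the helper name `decode_prediction` and the probability row are made up for illustration.

```python
# Assumption: class indices follow the sorted folder names
# (paper = 0, rock = 1, scissors = 2).
CLASS_NAMES = sorted(["rock", "paper", "scissors"])


def decode_prediction(probabilities, class_names=CLASS_NAMES):
    """Return (label, confidence) for one row of softmax output."""
    best = max(range(len(probabilities)), key=lambda i: probabilities[i])
    return class_names[best], probabilities[best]


# A made-up probability row standing in for one row of `classes`.
label, confidence = decode_prediction([0.05, 0.90, 0.05])
print(label, confidence)
```

In the notebook, `train_generator.class_indices` shows the mapping actually used, and a call such as `decode_prediction(classes[0].tolist())` would decode the first uploaded image.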

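Through this exercise we learned how to load data, split the dataset, preprocess it, build and train a network, tune and test it, and finally put the model to use.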

Original: https://blog.csdn.net/fencecat/article/details/124059492
Author: Zkaisen
Title: Rock-Paper-Scissors Hand Gesture Recognition with a Convolutional Neural Network

