Cat vs. Dog Classification in Keras with the Pretrained VGG16 Network

May 23, 2023, 4:50 PM • Artificial Intelligence

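I have only recently started with deep learning, and Keras is one of the easiest of the many deep learning frameworks to get started with. Cat vs. dog image classification is also a classic computer vision exercise. The implementation is walked through below.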

Prerequisites

Python 3.6 and pip3
Keras, with TensorFlow as the backend
NumPy
Matplotlib
OpenCV

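Implementation

Model training

First, import the required packages. This model is built on the pretrained VGG16 network. For anyone without a GPU, the many pretrained networks available are a real boon: they both speed up training and substantially improve accuracy on small datasets.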

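import os
import shutil

import numpy as np
import matplotlib.pyplot as plt
import tensorflow as tf
# Import Keras through TensorFlow everywhere so that layers, optimizers and models come from the same package
from tensorflow import keras
from tensorflow.keras import layers
from tensorflow.keras.preprocessing.image import ImageDataGenerator
from tensorflow.keras.applications import VGG16

# Jupyter Notebook magic; remove this line when running as a plain script
%matplotlib inline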

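The cat and dog dataset was found online. Load it and split it into training (train) and test (test) data. I have uploaded the original dataset to Baidu Netdisk; the link is at the end of the article.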


BASE_DIR = './cat_dog'
train_dir = os.path.join(BASE_DIR, 'train')
train_dir_dog = os.path.join(train_dir, 'dog')
train_dir_cat = os.path.join(train_dir, 'cat')

test_dir = os.path.join(BASE_DIR, 'test')
test_dir_dog = os.path.join(test_dir, 'dog')
test_dir_cat = os.path.join(test_dir, 'cat')

# Create the directory tree; exist_ok=True lets this cell be rerun without errors
for directory in (train_dir_dog, train_dir_cat, test_dir_dog, test_dir_cat):
    os.makedirs(directory, exist_ok=True)

source_dir = './source_data/train'

def copy_images(label, index_range, target_dir):
    # Copy files named like 'cat.0.jpg' from source_dir into target_dir
    for i in index_range:
        fname = '{}.{}.jpg'.format(label, i)
        shutil.copyfile(os.path.join(source_dir, fname), os.path.join(target_dir, fname))

# The first 1000 images of each class are for training, the next 500 for testing
copy_images('cat', range(1000), train_dir_cat)
copy_images('dog', range(1000), train_dir_dog)
copy_images('dog', range(1000, 1500), test_dir_dog)
copy_images('cat', range(1000, 1500), test_dir_cat)
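
Before moving on, it can be worth checking that the copy step produced the expected layout. The sketch below is not part of the original post: count_images is a hypothetical helper that counts the .jpg files in each class subfolder of a split directory. It is demonstrated on a throwaway temporary directory so that it runs without the real dataset.

```python
import os
import tempfile


def count_images(split_dir):
    """Return {class_name: number of .jpg files} for each subfolder of split_dir."""
    counts = {}
    for class_name in sorted(os.listdir(split_dir)):
        class_dir = os.path.join(split_dir, class_name)
        if os.path.isdir(class_dir):
            counts[class_name] = sum(
                1 for name in os.listdir(class_dir) if name.lower().endswith('.jpg')
            )
    return counts


# Demo on a temporary directory that mimics the cat/ and dog/ subfolder layout.
with tempfile.TemporaryDirectory() as tmp:
    for class_name, n_files in (('cat', 3), ('dog', 2)):
        class_dir = os.path.join(tmp, class_name)
        os.makedirs(class_dir)
        for i in range(n_files):
            # Create empty placeholder files named like the dataset images.
            open(os.path.join(class_dir, '{}.{}.jpg'.format(class_name, i)), 'w').close()
    demo_counts = count_images(tmp)

print(demo_counts)
```

On the real data, count_images(train_dir) should report 1000 images per class and count_images(test_dir) 500 per class, given the index ranges used above.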

Create the image data iterators and normalize the raw images by rescaling pixel values from [0, 255] to [0, 1].

train_datagen = ImageDataGenerator(rescale=1 / 255)
test_datagen = ImageDataGenerator(rescale=1 / 255)

train_generator = train_datagen.flow_from_directory(
    train_dir, target_size=(200, 200), batch_size=20, class_mode='binary')

test_generator = test_datagen.flow_from_directory(
    test_dir, target_size=(200, 200), batch_size=20, class_mode='binary')
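
Two things happen inside these generators that are easy to miss: pixel values are multiplied by the rescale factor, and labels are derived from the subfolder names in alphanumeric order, so cat becomes 0 and dog becomes 1. The following NumPy sketch mimics both steps on made-up pixel values. It illustrates the idea only and is not Keras's actual implementation.

```python
import numpy as np

# Labels: class indices follow the sorted subfolder names.
class_names = ['dog', 'cat']
class_indices = {name: index for index, name in enumerate(sorted(class_names))}
print(class_indices)

# Rescaling: made-up uint8 pixel values mapped from [0, 255] to [0, 1].
pixels = np.array([0, 51, 255], dtype=np.uint8)
rescaled = pixels * (1 / 255)
print(rescaled)
```

With the real generators, the mapping can be inspected through train_generator.class_indices.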

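With Matplotlib we can display an image. Image data is essentially the color values of three channels, i.e. the RGB values. The code below shows the first image of the first batch and prints its label.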


plt.imshow(train_generator[0][0][0])
print(train_generator[0][1][0])

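Initialize the pretrained VGG16 network with the ImageNet weights. The include_top argument controls whether the final fully connected layers and the output layer are included; it is set to False here so that only the convolutional base is used.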

covn_base = VGG16(weights='imagenet', include_top=False, input_shape=(200,200,3))

summary() shows the structure of the network: VGG16 is a stack of Conv2D (convolution) and MaxPooling2D (pooling) layers.

covn_base.summary()
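
The last line of the summary reports an output shape of (6, 6, 512) for a 200×200 input. A small sketch of where the 6 comes from, assuming the standard VGG16 layout: the convolutions use same padding and keep the spatial size, while each of the five 2×2 max-pooling layers halves it and rounds down. The function name is made up for this illustration.

```python
def vgg16_feature_map_size(input_size, num_pool_layers=5):
    """Spatial size after VGG16's five 2x2, stride-2 max-pooling layers."""
    size = input_size
    for _ in range(num_pool_layers):
        size //= 2  # each pooling layer halves the size, rounding down
    return size


print(vgg16_feature_map_size(200))  # 200 -> 100 -> 50 -> 25 -> 12 -> 6
print(vgg16_feature_map_size(224))  # 224 -> 112 -> 56 -> 28 -> 14 -> 7
```

The 512 is the number of filters in the last convolutional block, so every 200×200 image becomes a 6×6×512 feature map.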

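Use the VGG network to extract the image features once, then feed them into a small fully connected network for training. Because the convolutional base only has to run a single pass over the data, this speeds training up considerably.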

batch_size = 20  # must match the batch_size of the generators

def extract_features(data_generator, sample_count):
    i = 0
    # VGG16 without the top maps a 200x200 image to a 6x6x512 feature map
    features = np.zeros(shape=(sample_count, 6, 6, 512))
    labels = np.zeros(shape=(sample_count))
    for inputs_batch, labels_batch in data_generator:
        features_batch = covn_base.predict(inputs_batch)
        features[i * batch_size:(i + 1) * batch_size] = features_batch
        labels[i * batch_size:(i + 1) * batch_size] = labels_batch
        i += 1
        # The generator loops forever, so stop once every sample has been seen
        if i * batch_size >= sample_count:
            break
    return features, labels

train_featrues, train_labels = extract_features(train_generator, 2000)
test_featrues, test_labels = extract_features(test_generator, 1000)
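
The batching logic can be tried out without VGG16 or the dataset. The sketch below is a generic variant under stated assumptions: extract_in_batches and fake_batches are hypothetical names, a lambda that doubles its input stands in for the network, and the slicing is clamped so that a final batch that overshoots sample_count cannot write past the end of the arrays.

```python
import numpy as np


def extract_in_batches(batches, feature_fn, sample_count, feature_shape):
    """Fill preallocated arrays batch by batch until sample_count rows are written."""
    features = np.zeros((sample_count,) + feature_shape)
    labels = np.zeros(sample_count)
    start = 0
    for inputs_batch, labels_batch in batches:
        # Never write past the end, even if the last batch overshoots.
        end = min(start + len(inputs_batch), sample_count)
        features[start:end] = feature_fn(inputs_batch)[:end - start]
        labels[start:end] = labels_batch[:end - start]
        start = end
        if start >= sample_count:
            break
    return features, labels


def fake_batches(batch_size):
    """Endless stand-in for a Keras generator: yields (inputs, labels) forever."""
    while True:
        yield np.ones((batch_size, 4)), np.ones(batch_size)


# Stand-in "network": doubles the inputs instead of running VGG16.
demo_features, demo_labels = extract_in_batches(
    fake_batches(batch_size=3), lambda x: x * 2, sample_count=7, feature_shape=(4,)
)
print(demo_features.shape, demo_labels.sum())
```

In the article's setup the sample counts (2000 and 1000) are exact multiples of the batch size of 20, so the clamping never triggers there; it only matters for counts that do not divide evenly.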

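Build the fully connected (Dense) layers of our own model to produce the output. GlobalAveragePooling2D averages each 6×6 feature map from VGG16 down to a single value, turning the features into a one-dimensional vector of 512 values, so the rest of the problem reduces to the form y = w1x1 + w2x2 + … + b. The hidden layer uses the relu activation, and Dropout suppresses overfitting. Because this is binary classification (0 is cat, 1 is dog), there is a single output, and the sigmoid function produces it.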

model = keras.Sequential()
model.add(layers.GlobalAveragePooling2D(input_shape=(6, 6, 512)))
model.add(layers.Dense(512, activation='relu'))
model.add(layers.Dropout(0.5))
model.add(layers.Dense(1, activation='sigmoid'))
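
To make the first and last layers concrete, here is a NumPy sketch of what global average pooling and the sigmoid compute. It uses made-up all-ones features rather than real VGG16 output, and the function names are chosen for this illustration only.

```python
import numpy as np


def global_average_pooling_2d(feature_maps):
    """Average over height and width: (N, H, W, C) -> (N, C)."""
    return feature_maps.mean(axis=(1, 2))


def sigmoid(z):
    """Squash a real-valued score into (0, 1)."""
    return 1.0 / (1.0 + np.exp(-z))


# Two made-up "images" of VGG16-shaped features.
feature_maps = np.ones((2, 6, 6, 512))
pooled = global_average_pooling_2d(feature_maps)
print(pooled.shape)   # (2, 512)
print(sigmoid(0.0))   # 0.5
```

A sigmoid output near 0 means the model leans towards cat, and an output near 1 means it leans towards dog.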

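Compile the model. Use the Adam optimizer with a reduced learning rate; because this is a binary classification problem, the loss function is binary_crossentropy.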

model.compile(optimizer=tf.keras.optimizers.Adam(learning_rate=0.0005 / 10), loss='binary_crossentropy', metrics=['acc'])
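
For intuition about the loss, binary cross-entropy can be computed by hand. This is a plain-Python sketch of the formula, not the Keras implementation; the clipping constant eps is an assumption added here to avoid taking log(0).

```python
import math


def binary_crossentropy(y_true, y_pred, eps=1e-7):
    """Mean binary cross-entropy over paired labels (0 or 1) and probabilities."""
    total = 0.0
    for y, p in zip(y_true, y_pred):
        p = min(max(p, eps), 1 - eps)  # clip to avoid log(0)
        total += -(y * math.log(p) + (1 - y) * math.log(1 - p))
    return total / len(y_true)


# Same labels, once with confident correct predictions and once with confident wrong ones.
confident_right = binary_crossentropy([1, 0], [0.9, 0.1])
confident_wrong = binary_crossentropy([1, 0], [0.1, 0.9])
print(confident_right, confident_wrong)
```

Confidently wrong predictions are penalized far more heavily than confidently right ones are rewarded, which is what pushes the sigmoid output towards the correct label during training.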

Start training. The test set is evaluated at the end of every epoch during training, for 50 epochs in total.

history = model.fit(train_featrues, train_labels, epochs=50,
                    batch_size=50, validation_data=(test_featrues, test_labels))

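The training results follow. loss is the training loss and acc the training accuracy; val_loss is the test loss and val_acc the test accuracy. The results are reasonably good: accuracy reaches about 90% on both the training and test sets, and the two stay close to each other, so the model fits well.

Plotting the training and test accuracy curves with Matplotlib shows how they change over the course of training more clearly.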

plt.plot(range(50), history.history.get('val_acc'), c='r', label='val_acc')
plt.plot(range(50), history.history.get('acc'), c='b', label='acc')
plt.legend()  # the parentheses are required; without them the legend is never drawn
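
Besides plotting, the history dictionary can be queried directly, for example to find the epoch at which test accuracy peaked. In the sketch below, best_epoch is a hypothetical helper and the numbers in example_history are placeholders for illustration only, not results from this article.

```python
def best_epoch(history_dict, metric='val_acc'):
    """Return (1-based epoch, value) where the given metric peaks."""
    values = history_dict[metric]
    index = max(range(len(values)), key=values.__getitem__)
    return index + 1, values[index]


# Placeholder numbers for illustration only; not results from the article.
example_history = {'acc': [0.70, 0.82, 0.88, 0.91], 'val_acc': [0.72, 0.85, 0.89, 0.88]}
print(best_epoch(example_history))
```

After a real run, the call would be best_epoch(history.history).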

Save the trained model to a local HDF5 (.h5) file.

model.save('cat_dog_model.h5')

With training finished, use the saved model to test on real images.

Model testing

For convenience, model testing uses OpenCV to resize images obtained from the web and to display the results.
Import the required packages.

import tensorflow as tf
import numpy as np
# Load through tensorflow.keras so the model and VGG16 come from the same package
from tensorflow.keras.models import load_model
import cv2

Define a function that displays an image with OpenCV.

def show(image):
    cv2.namedWindow('test', 0)
    cv2.imshow('test', image)

    cv2.waitKey(0)
    cv2.destroyAllWindows()

Load the VGG16 weights and the saved trained model.

covn_base = tf.keras.applications.VGG16(weights='imagenet', include_top=False, input_shape=(200, 200, 3))
cat_dog_model = load_model('./cat_dog_model.h5')

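Read the image with OpenCV, resize it to 200×200, and convert it into the input format VGG16 expects. OpenCV loads images as BGR with values in [0, 255], so convert to RGB and scale to [0, 1] to match the training data.

image = cv2.imread('cat1.jpeg')
resize_image = cv2.resize(image, (200, 200), interpolation=cv2.INTER_AREA)
# Match the training preprocessing: RGB channel order, pixel values scaled to [0, 1]
rgb_image = cv2.cvtColor(resize_image, cv2.COLOR_BGR2RGB)
input_data = np.expand_dims(rgb_image.astype('float32') / 255, axis=0)

Run the image through VGG16 and then through our trained model to get the prediction. The sigmoid output is a probability, so it is thresholded at 0.5 to obtain the class.

result = int(cat_dog_model.predict(covn_base.predict(input_data))[0][0] > 0.5)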
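
The network was trained on RGB images scaled to [0, 1], whereas cv2.imread returns BGR pixel values in [0, 255], so inference inputs should go through the same preprocessing as the training data. The NumPy-only sketch below shows that preprocessing on a made-up 2×2 image; prepare_for_model is a hypothetical helper name.

```python
import numpy as np


def prepare_for_model(bgr_image):
    """Convert an OpenCV-style BGR uint8 image into a (1, H, W, 3) float batch in [0, 1]."""
    rgb = bgr_image[..., ::-1]                # reverse the channel axis: BGR -> RGB
    scaled = rgb.astype('float32') / 255.0    # same scaling as rescale=1/255
    return np.expand_dims(scaled, axis=0)     # add the batch dimension


# A made-up 2x2 image that is pure blue in BGR order.
blue_bgr = np.zeros((2, 2, 3), dtype=np.uint8)
blue_bgr[..., 0] = 255
batch = prepare_for_model(blue_bgr)
print(batch.shape)      # (1, 2, 2, 3)
print(batch[0, 0, 0])   # [0. 0. 1.]
```

After the conversion the blue value sits in the last channel, as expected for RGB order.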

Print the recognition result and display the input image.

if result == 1:
    print("dog")
else:
    print("cat")
show(resize_image)
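
A sigmoid output is a probability between 0 and 1, so it is usually compared against a threshold rather than truncated with int(), which turns every value below 1.0 into 0. A minimal sketch, where label_from_probability is a hypothetical helper and 0.5 is the conventional default threshold:

```python
def label_from_probability(probability, threshold=0.5):
    """Map a sigmoid output to a class name; 0 is cat and 1 is dog."""
    return 'dog' if probability > threshold else 'cat'


print(label_from_probability(0.93))  # dog
print(int(0.93))                     # 0: plain truncation would wrongly say "cat"
```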

For the cat image, the recognized result is cat, which is correct.

For a dog image, the result is also correct.

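That concludes the article. This example is a first small exercise for getting started with deep learning in Keras, and I hope it can serve as one of your first deep learning examples too.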

Baidu Netdisk link to the cat and dog dataset

Link: https://pan.baidu.com/s/16K4P5Nb1k5_sfFml-qEF2g Extraction code: mchl

Original: https://blog.csdn.net/wFitting/article/details/123921832
Author: wFitting
Title: Keras深度学习使用VGG16预训练神经网络实现猫狗分类
