A small example comparing TensorFlow GPU and CPU compute time


Example 1

Setup

NVIDIA RTX 3070, CUDA 11.2, cuDNN 8.1.0, tensorflow 2.5.0, tensorflow-gpu 2.5.0
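
Before running either example, it is worth confirming that this TensorFlow build actually sees the GPU; otherwise the script silently measures the CPU twice. A minimal check (my addition, not part of the original post):

```python
import tensorflow as tf

print(tf.__version__)                          # expected: 2.5.0
print(tf.test.is_built_with_cuda())            # True for a CUDA-enabled build
print(tf.config.list_physical_devices('GPU'))  # should list the RTX 3070
```

If the GPU list is empty, the ~3 s per epoch figure below will not be reproducible.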

Training one epoch takes about 80 s on the CPU, but only about 3 s on the GPU.

```python
# -*- coding: utf-8 -*-
# @Time    : 2022/6/11 16:03
# @Author  : chuqianyu
# @FileName: testtt2tt.py
# @Software: PyCharm

# To pin training to a specific GPU, set this before importing TensorFlow:
# import os
# os.environ["CUDA_VISIBLE_DEVICES"] = "0"  # use the GPU with index 0
import numpy as np
from tensorflow.keras.models import Sequential  # sequential model
from tensorflow.keras.layers import Dense, Dropout, Conv2D, MaxPool2D, Flatten
from tensorflow.keras.datasets import mnist
from tensorflow.keras.utils import to_categorical
from tensorflow.keras.callbacks import TensorBoard
import time


def create_model():
    model = Sequential()
    model.add(Conv2D(32, (5, 5), activation='relu', input_shape=[28, 28, 1]))  # first convolutional layer
    model.add(Conv2D(64, (5, 5), activation='relu'))  # second convolutional layer
    model.add(MaxPool2D(pool_size=(2, 2)))  # pooling layer
    model.add(Flatten())  # flatten layer
    model.add(Dropout(0.5))
    model.add(Dense(128, activation='relu'))
    model.add(Dropout(0.5))
    model.add(Dense(10, activation='softmax'))
    return model


def compile_model(model):
    model.compile(loss='categorical_crossentropy', optimizer="adam", metrics=['acc'])
    return model


def train_model(model, x_train, y_train, batch_size=128, epochs=10):
    # write_grads is ignored by the TF 2.x TensorBoard callback, so it is dropped here
    tbCallBack = TensorBoard(log_dir="model", histogram_freq=1)
    history = model.fit(x_train, y_train, batch_size=batch_size, epochs=epochs, shuffle=True, verbose=2,
                        validation_split=0.2, callbacks=[tbCallBack])
    return history, model


if __name__ == "__main__":
    import tensorflow as tf
    print(tf.__version__)

    # NVIDIA RTX 3070, CUDA 11.2, cuDNN 8.1.0, tensorflow 2.5.0, tensorflow-gpu 2.5.0
    # CPU: ~80 s per epoch; GPU: ~3 s per epoch
    with tf.device("/gpu:0"):
        from tensorflow.python.client import device_lib
        print(device_lib.list_local_devices())

        (x_train, y_train), (x_test, y_test) = mnist.load_data()  # MNIST data already downloaded locally
        print(np.shape(x_train), np.shape(y_train), np.shape(x_test), np.shape(y_test))
        x_train = np.expand_dims(x_train, axis=3)
        x_test = np.expand_dims(x_test, axis=3)
        y_train = to_categorical(y_train, num_classes=10)
        y_test = to_categorical(y_test, num_classes=10)
        print(np.shape(x_train), np.shape(y_train), np.shape(x_test), np.shape(y_test))

        model = create_model()
        model = compile_model(model)
        print("start training")
        ts = time.time()
        history, model = train_model(model, x_train, y_train, epochs=20)
    print("training time:", time.time() - ts)
```
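
The post quotes both a CPU and a GPU time, but the script above only shows the GPU run. One way to obtain the CPU number (my assumption; the original does not show this step) is to hide the GPU before TensorFlow initializes and rerun the same script:

```python
# Hide every GPU so the identical training script runs on the CPU.
# This must run before TensorFlow touches the devices.
import os
os.environ["CUDA_VISIBLE_DEVICES"] = "-1"

import tensorflow as tf
print(tf.config.list_physical_devices('GPU'))  # now prints []
```

Alternatively, calling `tf.config.set_visible_devices([], 'GPU')` right after importing TensorFlow has the same effect.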

[Screenshot: GPU training log, about 3 s per epoch]

[Screenshot: CPU training log, about 80 s per epoch]

Example 2

```python
# -*- coding: utf-8 -*-
# @Time    : 2022/6/11 20:32
# @Author  : chuqianyu
# @FileName: testtt3tt.py
# @Software: PyCharm
import tensorflow as tf
import time

tf.config.set_soft_device_placement(True)    # fall back to another device if an op has no kernel
tf.debugging.set_log_device_placement(True)  # log which device each op runs on
gpus = tf.config.experimental.list_physical_devices('GPU')
print(gpus)
tf.config.experimental.set_visible_devices(gpus[0], 'GPU')
tf.config.experimental.set_memory_growth(gpus[0], True)  # allocate GPU memory on demand

# time a large matmul plus reduction on the GPU
t = time.time()
with tf.device("/gpu:0"):
    tf.random.set_seed(0)
    a = tf.random.uniform((10000, 10000), minval=0, maxval=3.0)
    c = tf.matmul(a, tf.transpose(a))
    d = tf.reduce_sum(c)
print('gpu: ', time.time() - t)

# the same computation on the CPU
t = time.time()
with tf.device("/cpu:0"):
    tf.random.set_seed(0)
    a = tf.random.uniform((10000, 10000), minval=0, maxval=3.0)
    c = tf.matmul(a, tf.transpose(a))
    d = tf.reduce_sum(c)
print('cpu: ', time.time() - t)
```
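
One caveat about this kind of micro-benchmark (my note, not in the original): the first operation on the GPU pays a one-time CUDA initialization cost, and the timer can stop before the result has actually been materialized. A sketch that warms up first and forces the scalar back to the host before reading the clock:

```python
import time
import tensorflow as tf

def bench(device, n=10000, repeats=3):
    """Average time of an n x n matmul + reduce_sum on one device."""
    with tf.device(device):
        a = tf.random.uniform((n, n), minval=0, maxval=3.0)
        _ = tf.reduce_sum(tf.matmul(a, tf.transpose(a))).numpy()  # warm-up run
        t0 = time.perf_counter()
        for _ in range(repeats):
            # .numpy() copies the scalar to the host, so the computation
            # must finish before the clock is read.
            _ = tf.reduce_sum(tf.matmul(a, tf.transpose(a))).numpy()
    return (time.perf_counter() - t0) / repeats

print('gpu:', bench('/gpu:0'))
print('cpu:', bench('/cpu:0'))
```

With the warm-up excluded, the remaining gap reflects the actual compute-throughput difference between the two devices.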



Original: https://blog.51cto.com/u_15240054/5548981
Author: 楚千羽
Title: A small example comparing TensorFlow GPU and CPU compute time

