TensorFlow – Dataset: What repeat(), shuffle(), and batch() Do

May 24, 2023, 8:56 PM • Artificial Intelligence

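repeat() – sets how many times the dataset is repeated. With no argument, the dataset repeats indefinitely, so elements can be fetched without limit.

shuffle() – randomly shuffles the order of the elements in the dataset, drawing from a buffer of buffer_size elements.

batch() – sets how many elements are returned by each fetch.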
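As a minimal sketch of how the three calls compose, the snippet below chains them on a small integer dataset. It assumes TensorFlow 2.x with eager execution. tf.data.Dataset.range and as_numpy_iterator() are used here only to keep the output deterministic and easy to read, and the names ds, pipeline_batches, and shuffled_sorted are illustrative, not part of the original example.

```python
import tensorflow as tf

# Five elements: 0, 1, 2, 3, 4.
ds = tf.data.Dataset.range(5)

# repeat(2): two passes over the data, 10 elements in total.
# batch(3): group consecutive elements into batches of up to 3.
pipeline = ds.repeat(2).batch(3)
pipeline_batches = [b.tolist() for b in pipeline.as_numpy_iterator()]
print(pipeline_batches)
# [[0, 1, 2], [3, 4, 0], [1, 2, 3], [4]]

# shuffle() changes the order of the elements but not the elements themselves,
# so sorting the shuffled values recovers the original sequence.
shuffled_sorted = sorted(int(v) for v in ds.shuffle(buffer_size=5).as_numpy_iterator())
print(shuffled_sorted)
# [0, 1, 2, 3, 4]
```

The last batch holds a single element because 10 is not divisible by 3. Passing drop_remainder=True to batch() would discard that short batch instead.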

A complete example:

import tensorflow as tf
import numpy as np

feature = np.array([1, 2, 3, 4, 5, 6, 7, 8, 9], np.float32)
label = np.array([0, 0, 0, 0, 1, 1, 1, 1, 1])

# Define a dataset with 9 elements.
train_data = tf.data.Dataset.from_tensor_slices((feature, label))

def print_train_data(data, cnt):
    # Print the first cnt (x, y) pairs of the dataset.
    it = iter(data)
    for i in range(cnt):
        x, y = next(it)
        print(x, y)

print_train_data(train_data, 9)

print("=== after repeat ====")
# With no argument, repeat() lets the dataset be read indefinitely.
train_data = train_data.repeat()
print_train_data(train_data, 9)

print("=== after shuffle ====")
# Shuffle the order of the elements using a 5-element buffer.
train_data = train_data.shuffle(buffer_size=5)
print_train_data(train_data, 9)

print("=== after batch ====")
# Set the number of elements returned by each fetch.
dataset_batch = train_data.batch(batch_size=3)
it = iter(dataset_batch)

# Fetch 20 batches. Because repeat() was called with no argument, the dataset
# never runs out, so the argument to range() can be any value. With repeat(2),
# at most 9 * 2 elements would be available, and fetching past the end with
# next() would raise StopIteration.
for i in range(20):
    x, y = next(it)
    # batch() returns 3 elements per fetch, so x and y are each arrays of
    # 3 values. This loop reads 20 * 3 elements in total.
    print(x, y)

The program prints:

tf.Tensor(1.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(2.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(3.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(4.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(5.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(6.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(7.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(8.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(9.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
=== after repeat ====
tf.Tensor(1.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(2.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(3.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(4.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(5.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(6.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(7.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(8.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(9.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
=== after shuffle ====
tf.Tensor(5.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(3.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(6.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(1.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(4.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(2.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(8.0, shape=(), dtype=float32) tf.Tensor(1, shape=(), dtype=int64)
tf.Tensor(1.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
tf.Tensor(2.0, shape=(), dtype=float32) tf.Tensor(0, shape=(), dtype=int64)
=== after batch ====
tf.Tensor([2. 4. 5.], shape=(3,), dtype=float32) tf.Tensor([0 0 1], shape=(3,), dtype=int64)
tf.Tensor([1. 7. 3.], shape=(3,), dtype=float32) tf.Tensor([0 1 0], shape=(3,), dtype=int64)
tf.Tensor([8. 6. 1.], shape=(3,), dtype=float32) tf.Tensor([1 1 0], shape=(3,), dtype=int64)
tf.Tensor([3. 6. 4.], shape=(3,), dtype=float32) tf.Tensor([0 1 0], shape=(3,), dtype=int64)
tf.Tensor([7. 8. 1.], shape=(3,), dtype=float32) tf.Tensor([1 1 0], shape=(3,), dtype=int64)
tf.Tensor([5. 2. 4.], shape=(3,), dtype=float32) tf.Tensor([1 0 0], shape=(3,), dtype=int64)
tf.Tensor([9. 5. 3.], shape=(3,), dtype=float32) tf.Tensor([1 1 0], shape=(3,), dtype=int64)
tf.Tensor([9. 9. 7.], shape=(3,), dtype=float32) tf.Tensor([1 1 1], shape=(3,), dtype=int64)
tf.Tensor([8. 2. 6.], shape=(3,), dtype=float32) tf.Tensor([1 0 1], shape=(3,), dtype=int64)
tf.Tensor([3. 1. 2.], shape=(3,), dtype=float32) tf.Tensor([0 0 0], shape=(3,), dtype=int64)
tf.Tensor([8. 9. 4.], shape=(3,), dtype=float32) tf.Tensor([1 1 0], shape=(3,), dtype=int64)
tf.Tensor([1. 5. 7.], shape=(3,), dtype=float32) tf.Tensor([0 1 1], shape=(3,), dtype=int64)
tf.Tensor([5. 6. 4.], shape=(3,), dtype=float32) tf.Tensor([1 1 0], shape=(3,), dtype=int64)
tf.Tensor([3. 6. 9.], shape=(3,), dtype=float32) tf.Tensor([0 1 1], shape=(3,), dtype=int64)
tf.Tensor([1. 7. 4.], shape=(3,), dtype=float32) tf.Tensor([0 1 0], shape=(3,), dtype=int64)
tf.Tensor([2. 2. 3.], shape=(3,), dtype=float32) tf.Tensor([0 0 0], shape=(3,), dtype=int64)
tf.Tensor([5. 8. 8.], shape=(3,), dtype=float32) tf.Tensor([1 1 1], shape=(3,), dtype=int64)
tf.Tensor([2. 6. 4.], shape=(3,), dtype=float32) tf.Tensor([0 1 0], shape=(3,), dtype=int64)
tf.Tensor([1. 9. 7.], shape=(3,), dtype=float32) tf.Tensor([0 1 1], shape=(3,), dtype=int64)
tf.Tensor([6. 3. 5.], shape=(3,), dtype=float32) tf.Tensor([1 0 1], shape=(3,), dtype=int64)
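
Two details of this output can be checked with a short sketch, assuming TensorFlow 2.x with eager execution (the names numbers, finite, and epochs are illustrative and do not appear in the original example). First, repeat(2) yields exactly 9 * 2 elements, after which the iterator is exhausted. Second, because the example calls shuffle() after repeat(), elements from different passes can mix, which is why 1.0 and 2.0 each appear twice among the nine shuffled values printed above. Calling shuffle() before repeat() keeps each pass a full permutation of the data.

```python
import tensorflow as tf

numbers = tf.data.Dataset.from_tensor_slices([1, 2, 3, 4, 5, 6, 7, 8, 9])

# repeat(2) is finite: two passes over 9 elements, then the iterator ends.
finite = numbers.repeat(2)
it = iter(finite)
count = 0
while True:
    try:
        next(it)
    except StopIteration:
        # Raised by next() once all 18 elements have been consumed.
        break
    count += 1
print(count)
# 18

# shuffle() before repeat(): with a buffer as large as the dataset, every
# pass is a complete permutation of 1..9 and passes do not mix.
epochs = numbers.shuffle(buffer_size=9).repeat(2)
values = [int(v) for v in epochs.as_numpy_iterator()]
first_pass, second_pass = values[:9], values[9:]
print(sorted(first_pass) == list(range(1, 10)))
print(sorted(second_pass) == list(range(1, 10)))
```

Both comparisons print True. The order inside each pass is random, so only the sorted contents are compared.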

Original: https://blog.csdn.net/aaronychen/article/details/122879141
Author: aaronychen
Title: TensorFlow – Dataset: What repeat(), shuffle(), and batch() Do
