keras_cv进行数据增强

2023-10-26

使用keras_cv来做分类数据增强

以下直接上流程，具体的原理和代码上github查看源码及配合tensorflow官网及keras官网来做处理。当前（2022.10.8)这些文档还不是很全。

import os
import numpy as np
import tensorflow as tf 
from tensorflow import keras
import keras_cv
import matplotlib.pyplot as plt
from PIL import Image,ImageEnhance,ImageOps
#多GPU,只使用第二，三个
gpu = tf.config.list_physical_devices('GPU')
tf.config.set_visible_devices(gpu[1:3],'GPU')
tf.config.experimental.set_memory_growth(gpu[1],True)
tf.config.experimental.set_memory_growth(gpu[2],True)


print(keras_cv.__version__) #当前keras_cv 的版本

2022-10-08 09:54:14.941532: I tensorflow/core/util/util.cc:169] oneDNN custom operations are on. You may see slightly different numerical results due to floating-point round-off errors from different computation orders. To turn them off, set the environment variable `TF_ENABLE_ONEDNN_OPTS=0`.


0.3.4

image = tf.io.read_file('imagenet/n02106662/n02106662_38997.JPEG')
image = tf.image.decode_image(image,channels=3,expand_animations=False)
# image = tf.cast(image,tf.float32)
# image = tf.image.convert_image_dtype(image,tf.float32)

image.dtype

tf.uint8

plt.imshow(image)

<matplotlib.image.AxesImage at 0x7f74fb181b50>

在这里插入图片描述



augmenter = keras_cv.layers.preprocessing.Augmenter(
    layers=[
        # keras_cv.layers.preprocessing.RandomCropAndResize(target_size=(224,224),crop_area_factor=(0.8,1.0), aspect_ratio_factor=(3/4.0,4/3.0)),
        # keras_cv.layers.preprocessing.RandomFlip(mode='horizontal'), #
        # keras_cv.layers.preprocessing.MaybeApply(layer=keras_cv.layers.preprocessing.ChannelShuffle(),rate=0.5),
        # keras_cv.layers.preprocessing.MaybeApply(layer=keras_cv.layers.preprocessing.Grayscale(output_channels=3),rate=0.2),
        # keras_cv.layers.preprocessing.Equalization(value_range=[0,255]),
        # keras_cv.layers.preprocessing.RandomJpegQuality(factor=(75,100)),
        # keras_cv.layers.preprocessing.MaybeApply(keras_cv.layers.preprocessing.RandomGaussianBlur(kernel_size=3,factor=(0.,5.0)),0.2),
        # keras_cv.layers.preprocessing.MaybeApply(keras_cv.layers.preprocessing.RandomRotation(factor=0.08),0.5)
        # keras_cv.layers.preprocessing.FourierMix(alpha=0.5) # this need batchsize data
        # keras_cv.layers.preprocessing.AugMix(value_range=[0,255],severity=0.3,num_chains=3,chain_depth=[1,3],alpha=1.0)
        # keras_cv.layers.preprocessing.RandomCutout(height_factor=(0.0,0.5),weight_factor=(0.0,0.5),fill_mode="gaussian_noise"),
        # keras_cv.layers.preprocessing.GridMask(ratio_factor=(0.0,0.3),rotation_factor=(0.0,0.1),fill_mode="gaussian_noise"),
        # keras_cv.layers.preprocessing.RandomBrightness(factor=0.5),
        # keras_cv.layers.preprocessing.RandomContrast(factor=0.5),
        # keras_cv.layers.preprocessing.RandomSaturation(factor=(0.1,0.9))
        # keras_cv.layers.preprocessing.RandomHue(factor=0.2,value_range=[0,255]),
        # keras_cv.layers.preprocessing.RandomAugmentationPipeline(layers=[
        #     keras_cv.layers.preprocessing.Augmenter(layers=[
        #         keras_cv.layers.preprocessing.RandomAugmentationPipeline(layers=[keras_cv.layers.preprocessing.RandomBrightness(factor=0.5),keras_cv.layers.preprocessing.RandomContrast(factor=0.5), \
        #             keras_cv.layers.RandomSaturation(factor=(0.1,0.9)),keras_cv.layers.RandomHue(factor=0.2,value_range=[0,255])], augmentations_per_image=1,rate=1.0),
        #         keras_cv.layers.preprocessing.AugMix(value_range=[0,255],severity=0.3,num_chains=3,chain_depth=[1,3],alpha=1.0) # augmix不含颜色的处理
        #     ]),
        #     keras_cv.layers.preprocessing.RandAugment(value_range=(0, 255),magnitude=0.3,magnitude_stddev=0.1)],augmentations_per_image=1,rate=1.0)
        # keras_cv.layers.preprocessing.RandomAugmentationPipeline(layers=[keras_cv.layers.preprocessing.RandomBrightness(factor=0.5),keras_cv.layers.preprocessing.RandomContrast(factor=0.5), \
        #     keras_cv.layers.preprocessing.RandomSaturation(factor=(0.1,0.9)),keras_cv.layers.preprocessing.RandomHue(factor=0.2,value_range=[0,255])], \
        #         augmentations_per_image=1,rate=1.0),
        # keras_cv.layers.preprocessing.RandomColorJitter(value_range=[0,255],brightness_factor=0.5,contrast_factor=0.5,saturation_factor=(0.1,0.9),hue_factor=0.2),
        # keras_cv.layers.preprocessing.AugMix(value_range=[0,255],severity=0.3,num_chains=3,chain_depth=[1,3],alpha=1.0)
        # keras_cv.layers.preprocessing.RandAugment(value_range=(0, 255)),
        # keras_cv.layers.preprocessing.CutMix(),
        # keras_cv.layers.preprocessing.preprocessing.MixUp()
        ]
)

plt.figure(figsize=(60,60))
for i in range(30):
    plt.subplot(6,5,i+1)
    im = augmenter(image)
    print(im.dtype)
    print(tf.reduce_max(im))
    im1 = tf.cast(im,tf.uint8)
    plt.imshow(im1)

<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(254.99686, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(204.13892, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(251.00674, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(241.88087, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(246.78668, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(190.48203, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(132.34175, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(167.4629, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(199.27293, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(165.47491, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(206.75331, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(253.02667, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(235.15588, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(184.35695, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(254.46008, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(225.19394, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(249.16089, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(227.499, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(169.73618, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(191.14867, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(243.1288, shape=(), dtype=float32)
<dtype: 'float32'>
tf.Tensor(255.0, shape=(), dtype=float32)

在这里插入图片描述
以上的各种增强对于分类特别有用，有些需要是batch size的data才可以。
有几个特别需要注意：

1、keras_cv.layers.preprocessing.Augmenter(layers=[])

会把layers中的各层依次执行

2、keras_cv.layers.preprocessing.MaybeApply(layer=,rate=)

rate=0到1 ，表明一个layer在执行时的百分比，rate=1表示一定执行

3、keras_cv.layers.preprocessing.RandomArgumentationPipeline(layers=[],augmentations_per_image=,rate=)

augmentations_per_image可以理解为从layers中选几个层来执行，rate则是每个层执行的可能性

4、keras_cv.layers.preprocessing.RandomChoice(layers=[])

这个和3中当augmentations_per_image=1,rate=1.0时是一样的，从layers中选一个出来进行执行。

还要注意的是以上各层可以嵌套。
处理batch size的数据增强如下：

batch_augmenter = keras_cv.layers.preprocessing.Augmenter(
    layers=[

        keras_cv.layers.preprocessing.FourierMix(alpha=0.5) # this need 
        # keras_cv.layers.preprocessing.CutMix(),
        # keras_cv.layers.preprocessing.MixUp()
        ]
)

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

Tensorflow

图像处理

Keras

tensorflow

深度学习

keras_cv进行数据增强的相关文章

如何在google colab中降级到tensorflow-gpu版本1.12

我正在运行一个仅与旧版本的tensorflow GPU兼容的GAN 因此我需要将google colab中的tensorflow gpu从1 15降级到1 12 我尝试使用本中建议的以下命令thread https stackoverflo
预训练 inception v3 模型的层名称（tensorflow）[重复]

这个问题在这里已经有答案了任务是获取a的每层输出预训练的 cnn inceptionv3 https www tensorflow org versions master tutorials image recognition index
安装tensorflow的正确命令

当尝试在 Anaconda 上安装 Tensorflow 时我尝试了两种类型的命令 conda install tensorflow gpu工作得很好然而当尝试conda install c anaconda tensorflow g
为什么不使用均方误差来解决分类问题？

我正在尝试使用 LSTM 解决一个简单的二元分类问题我正在尝试找出网络的正确损失函数问题是当我使用二元交叉熵作为损失函数时与使用均方误差 MSE 函数相比训练和测试的损失值相对较高经过研究我发现二元交叉熵应该用于分类问题 MS
scikit-learn 和tensorflow 有什么区别？可以一起使用它们吗？

对于这个问题我无法得到满意的答案据我了解 TensorFlow是一个数值计算库经常用于深度学习应用而Scikit learn是一个通用机器学习框架但它们之间的确切区别是什么 TensorFlow 的目的和功能是什么我可以一起使用它
如何在Tensorflow中保存估计器以供以后使用？

我按照教程 TF Layers 指南构建卷积神经网络以下是代码 https github com tensorflow tensorflow blob r1 1 tensorflow examples tutorials layers
将 Dropout 与 Keras 和 LSTM/GRU 单元结合使用

在 Keras 中您可以像这样指定 dropout 层 model add Dropout 0 5 但对于 GRU 单元您可以将 dropout 指定为构造函数中的参数 model add GRU units 512 return se
可视化 TFLite 图并获取特定节点的中间值？

我想知道是否有办法知道 tflite 中特定节点的输入和输出列表我知道我可以获得输入输出详细信息但这不允许我重建发生在Interpreter 所以我要做的是 interpreter tf lite Interpreter model
张量流如何处理无法存储在一个盒子中的大变量

我想通过训练超过十亿特征维度的数据来训练 DNN 模型因此第一层权重矩阵的形状将为 1 000 000 000 512 这个权重矩阵太大无法存储在一个盒子中目前有没有什么解决方案来处理这么大的变量例如将大的权重矩阵划分为多个框 Up
对于只有 10000 个单词的字典来说，真正需要什么嵌入层 output_dim？

我正在训练一个 RNN 其单词特征集非常少大约 10 000 个我计划在添加 RNN 之前从嵌入层开始但我不清楚真正需要什么维度我知道我可以尝试不同的值 32 64 等但我宁愿先有一些直觉例如如果我使用 32 维嵌入向量则每
无法加载动态库“libcudart.so.11.0”；

我尝试将 Tensorflow 2 7 0 与 GPU 结合使用但我不断遇到同样的问题 2022 02 03 08 32 31 822484 W tensorflow stream executor platform default ds
Tensorflow 中的自定义资源

由于某些原因我需要为 Tensorflow 实现自定义资源我试图从查找表实现中获得灵感如果我理解得好的话我需要实现3个TF操作创建我的资源资源的初始化例如在查找表的情况下填充哈希表执行查找查找查询步骤为了促进实施我
张量流中的复杂卷积

我正在尝试运行一个简单的卷积但包含复数 r np random random 1 10 10 10 i np random random 1 10 10 10 x tf complex r i conv layer tf layers c
tf.gather_nd 直观上是做什么的？

你能直观地解释一下或者举更多例子吗tf gather nd用于在 Tensorflow 中索引和切片为高维张量我读了API https www tensorflow org api docs python tf gather nd 但它保
如何将神经网络的输出限制在特定范围内？

我正在使用 Keras 进行回归任务并希望将输出限制在一个范围内例如 1 到 10 之间有没有办法保证这一点像这样编写自定义激活函数 a simple custom activation from keras import back
使用预训练的 word2vec 初始化 Seq2seq 嵌入

我对使用预训练的 word2vec 初始化tensorflow seq2seq 实现感兴趣我已经看过代码了嵌入似乎已初始化 with tf variable scope scope or embedding attention deco
验证 Transformer 中多头注意力的实现

我已经实施了MultiAttention head in Transformers 周围有太多的实现所以很混乱有人可以验证我的实施是否正确 DotProductAttention 引用自 https www tensorflow org
TensorFlow 无法编译

尝试从源代码编译 TensorFlow 时出现以下错误任何想法都会有帮助 bazel out host bin solib local U S Stensorflow Spython Cgen Unn Uops Upy Uwrappers
错误：分配具有形状的张量时出现 OOM

在使用 Apache JMeter 进行性能测试期间我面临着初始模型的问题错误分配形状为 800 1280 3 和类型的张量时出现 OOM 通过分配器浮动在 job localhost replica 0 task 0 device
TensorFlow的./configure在哪里以及如何启用GPU支持？

在我的 Ubuntu 上安装 TensorFlow 时我想将 GPU 与 CUDA 结合使用但我却停在了这一步官方教程 http www tensorflow org get started os setup md 这到底是哪里 con

随机推荐

Maven(六) eclipse 使用Maven deploy命令部署构建到Nexus

转载于 http blog csdn net jun55xiu article details 43051627 1 应用场景 SYS UTIL 系统工具项目部署构建成JAR包 SYS UTIL XXX jar 存储到Nexus私服上
spring boot 使用application.properties 进行外部配置

application properties大家都不陌生我们在开发的时候经常使用它来配置一些可以手动修改而且不用编译的变量这样的作用在于打成war包或者jar用于生产环境时我们可以手动修改环境变量而不用再重新编译 spring b
python里的pypi是干什么用的_【python工具篇】pip和pypi

PyPI the Python Package Index The Python Package Index is a repository of software for the Python programming language T
HTTP中GET，POST和PUT的区别

一 HTTP中定义了以下几种请求方法 1 GET 2 POST 3 PUT 4 DELETE 5 HEAD 6 TRACE 7 OPTIONS 二各个方法介绍 1 GET方法对这个资源的查操作 2 DELETE方法对这个资源的删操作
电脑检测不到第二个显示器的解决方法

一般是因为显示适配器被失效了右击开始菜单选择设备管理器再选择显示适配器这时图标上一般会带上感叹号右击后选择禁用再选择启用就能检测到第二个显示器
第一个跑马灯实验

如何新建一个工程 1 打开工程模板删除其他不重要的库文件把main 函数里的内容删除不用的外设固件库文件可以删掉节省编译时间 rcc 时钟使能 usart 串口复用映射 setbits 设置高电平 resetbits 低电平 2
「PAT甲级真题解析」Advanced Level 1006 Sign In and Sign Out

PAT Advanced Level Practice 1006 Sign In and Sign Out 如果对你有帮助要点个赞让我知道喔文章目录问题分析完整描述步骤伪代码描述完整提交代码问题分析题目给出一组学生进入机房的
计量经济学及Stata应用陈强第八章自相关习题8.3

8 3使用数据集gasoline dta估计美国1953 2004年的汽油需求函数考虑如下回归其中被解释变量lgasq为人均汽油消费量的对数解释变量lincome为人均收入的对数 lgasp为汽油价格指数的对数 lpnc为新车价格指
ROS建模仿真(1)-创建机器人模型

ROS建模仿真 1 创建机器人模型创建catkin creat pkg功能包创建机器人描述文件创建launch文件创建catkin creat pkg功能包创建机器人描述文件创建launch文件创建catkin creat p
Mac OS上使用ffmpeg的“血泪”总结

标题真不是夸张这几天在整理视频相关的处理流程为了获得一些性能数据打算在自己的MacBook Pro 上面装ffmepg 这一折腾4 5天就过去了有些问题在解决之后就豁然开朗了没有解决之前真的是百思不得其解中间就好像隔着一层纱
AF_INET和AF_PACKET区别

http blog csdn net kzm2008 article details 5372834 man 7 ip man 7 packet Packet sockets are used to receive or send raw
单片机蓝桥杯——定时中断实现数码管显示、按键判断

1 1ms定时中断T0 控制数码管显示 1 关于中断关于定时中断的初始化函数可直接在STC ISP软件上生成如下图所示注意初始化函数中并没有打开EA和ET0 需要自己加上 2 关于数码管显示数码管段码 segCode 0 segC
Flutter提供者模式说明

在本文中我们将介绍Flutter中的Provider模式 Google的工作小组建议使用提供程序模式他们还在Flutter的Pragmatic State Management中的 Google I O 2019上进行了介绍其他一些模
Nginx的Gzip压缩

Nginx的Gzip压缩 Nginx开启Gzip压缩功能可以使网站的css js xml html 文件在传输时进行压缩提高访问速度进而优化Nginx性能在Nginx配置文件中可以配置Gzip的使用相关指令可以在http区域 se
Java流程控制--分支结构

Java流程控制分支结构 if 单分支结构 if 条件表达式这个表达式的结果是布尔值要么是false 要么是true 如果上面中的表达式返回结果是true 那么执行中代码如果上面中的表达式返回结果是false 那么不执行中
Unity的Audio组件命令有哪些

Unity 的 Audio 组件命令有以下几种 Play 播放音频 Pause 暂停音频 UnPause 取消暂停音频 Stop 停止播放音频 SetScheduledStartTime 设置音频开始播放的时间 SetScheduledEn
SpringBoot使用Redisson做延迟队列案列(超详细)

背景有些场景下需要延迟触发一些任务比如延迟几秒钟发送短信或者邮件某些业务系统回调需要延时几秒钟后回调当然实现延时触发的方式有很多我这里采用 redisson 的 RDelayedQueue 一是因为接入简单二是没有分布式
post使用form-data和x-www-form-urlencoded的本质区别

一是数据包格式的区别二是数据包中非ANSCII字符怎么编码是百分号转码发送还是直接发送一 application x www form urlencoded 1 它是post的默认格式使用js中URLencode转码方法包括将na
修改onnx模型输出示例

前言如图是netron github链接软件中打开的onnx模型可以看到右边模型的最终输出结果是分类值predict 0而非概率值那么如何获取中间过程的概率值或者说怎么把右边的图砍掉一截变成左边的图呢代码读入模型 import
keras_cv进行数据增强

使用keras cv来做分类数据增强以下直接上流程具体的原理和代码上github查看源码及配合tensorflow官网及keras官网来做处理当前 2022 10 8 这些文档还不是很全 import os import numpy