Tensorflow numpy image reshape [grayscale images]

2023-12-25

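I am trying to run the Tensorflow "object_detection_tutorial.py" in a Jupyter notebook with my own trained neural network data, but it throws a ValueError. The file mentioned above is part of Sentdex's Tensorflow object detection tutorial on YouTube.

You can find it here: ()

My image size: 490x704. That gives an array of 344960 elements.

But it says: ValueError: cannot reshape array of size 344960 into shape (490,704,3)

What am I doing wrong?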

Code:

Imports

import numpy as np
import os
import six.moves.urllib as urllib
import sys
import tarfile
import tensorflow as tf
import zipfile

from collections import defaultdict
from io import StringIO
from matplotlib import pyplot as plt
from PIL import Image

Env setup

# This is needed to display the images.
%matplotlib inline

# This is needed since the notebook is stored in the object_detection folder.
sys.path.append("..")

Object detection imports

from utils import label_map_util

from utils import visualization_utils as vis_util

Variables

# What model to download.
MODEL_NAME = 'shard_graph'

# Path to frozen detection graph. This is the actual model that is used for the object detection.
PATH_TO_CKPT = MODEL_NAME + '/frozen_inference_graph.pb'

# List of the strings that is used to add correct label for each box.
PATH_TO_LABELS = os.path.join('training', 'object-detection.pbtxt')

NUM_CLASSES = 90

Load a (frozen) Tensorflow model into memory.

detection_graph = tf.Graph()
with detection_graph.as_default():
  od_graph_def = tf.GraphDef()
  with tf.gfile.GFile(PATH_TO_CKPT, 'rb') as fid:
    serialized_graph = fid.read()
    od_graph_def.ParseFromString(serialized_graph)
    tf.import_graph_def(od_graph_def, name='')

Loading label map

label_map = label_map_util.load_labelmap(PATH_TO_LABELS)
categories = label_map_util.convert_label_map_to_categories(label_map, max_num_classes=NUM_CLASSES, use_display_name=True)
category_index = label_map_util.create_category_index(categories)

Helper code

def load_image_into_numpy_array(image):
  (im_width, im_height) = image.size
  return np.array(image.getdata()).reshape(
      (im_height, im_width, 3)).astype(np.uint8)

Detection

# For the sake of simplicity we will use only 2 images:
# image1.jpg
# image2.jpg
# If you want to test the code with your images, just add path to the images to the TEST_IMAGE_PATHS.
PATH_TO_TEST_IMAGES_DIR = 'test_images'
TEST_IMAGE_PATHS = [ os.path.join(PATH_TO_TEST_IMAGES_DIR, 'frame_{}.png'.format(i)) for i in range(0, 2) ]

# Size, in inches, of the output images.
IMAGE_SIZE = (12, 8)

with detection_graph.as_default():
  with tf.Session(graph=detection_graph) as sess:
    # Definite input and output Tensors for detection_graph
    image_tensor = detection_graph.get_tensor_by_name('image_tensor:0')
    # Each box represents a part of the image where a particular object was detected.
    detection_boxes = detection_graph.get_tensor_by_name('detection_boxes:0')
    # Each score represent how level of confidence for each of the objects.
    # Score is shown on the result image, together with the class label.
    detection_scores = detection_graph.get_tensor_by_name('detection_scores:0')
    detection_classes = detection_graph.get_tensor_by_name('detection_classes:0')
    num_detections = detection_graph.get_tensor_by_name('num_detections:0')
    for image_path in TEST_IMAGE_PATHS:
      image = Image.open(image_path)
      # the array based representation of the image will be used later in order to prepare the
      # result image with boxes and labels on it.
      image_np = load_image_into_numpy_array(image)
      # Expand dimensions since the model expects images to have shape: [1, None, None, 3]
      image_np_expanded = np.expand_dims(image_np, axis=0)
      # Actual detection.
      (boxes, scores, classes, num) = sess.run(
          [detection_boxes, detection_scores, detection_classes, num_detections],
          feed_dict={image_tensor: image_np_expanded})
      # Visualization of the results of a detection.
      vis_util.visualize_boxes_and_labels_on_image_array(
          image_np,
          np.squeeze(boxes),
          np.squeeze(classes).astype(np.int32),
          np.squeeze(scores),
          category_index,
          use_normalized_coordinates=True,
          line_thickness=8)
      plt.figure(figsize=IMAGE_SIZE)
      plt.imshow(image_np)

The last part of the script throws the error:

----------------------------------------------------------------------
ValueError                           Traceback (most recent call last)
<ipython-input-62-7493eea60222> in <module>()
     14       # the array based representation of the image will be used later in order to prepare the
     15       # result image with boxes and labels on it.
---> 16       image_np = load_image_into_numpy_array(image)
     17       # Expand dimensions since the model expects images to have shape: [1, None, None, 3]
     18       image_np_expanded = np.expand_dims(image_np, axis=0)

<ipython-input-60-af094dcdd84a> in load_image_into_numpy_array(image)
      2   (im_width, im_height) = image.size
      3   return np.array(image.getdata()).reshape(
----> 4       (im_height, im_width, 3)).astype(np.uint8)

ValueError: cannot reshape array of size 344960 into shape (490,704,3)
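The sizes in the error message can be checked by hand: 490 x 704 pixels with one value per pixel gives 344960 elements, while the target shape (490, 704, 3) needs three times as many. A minimal sketch with a synthetic array; the zero-filled `flat` stands in for `np.array(image.getdata())` on a single-channel image, which is an assumption about the input files:

```python
import numpy as np

height, width = 490, 704

# Stand-in for np.array(image.getdata()) on a single-channel image:
# one value per pixel instead of three.
flat = np.zeros(height * width, dtype=np.uint8)

print(flat.size)             # 344960
print(height * width * 3)    # 1034880, what reshape((490, 704, 3)) needs

try:
    flat.reshape((height, width, 3))
    reshape_failed = False
except ValueError as err:
    # Same kind of error as in the traceback above.
    reshape_failed = True
    print(err)
```

The element count matching height x width exactly is a strong hint that the images were saved with a single channel.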

Edit:

So I changed the last line of this function:

def load_image_into_numpy_array(image):
  (im_width, im_height) = image.size
  return np.array(image.getdata()).reshape(
      (im_height, im_width, 3)).astype(np.uint8)

to:

(im_height, im_width)).astype(np.uint8)

That ValueError is resolved. But now another ValueError is raised, this time about the array shape:

----------------------------------------------------------------------
ValueError                           Traceback (most recent call last)
<ipython-input-107-7493eea60222> in <module>()
     20       (boxes, scores, classes, num) = sess.run(
     21           [detection_boxes, detection_scores, detection_classes, num_detections],
---> 22           feed_dict={image_tensor: image_np_expanded})
     23       # Visualization of the results of a detection.
     24       vis_util.visualize_boxes_and_labels_on_image_array(

~/.local/lib/python3.6/site-packages/tensorflow/python/client/session.py in run(self, fetches, feed_dict, options, run_metadata)
    898     try:
    899       result = self._run(None, fetches, feed_dict, options_ptr,
--> 900                          run_metadata_ptr)
    901       if run_metadata:
    902         proto_data = tf_session.TF_GetBuffer(run_metadata_ptr)

~/.local/lib/python3.6/site-packages/tensorflow/python/client/session.py in _run(self, handle, fetches, feed_dict, options, run_metadata)
   1109                              'which has shape %r' %
   1110                              (np_val.shape, subfeed_t.name,
-> 1111                               str(subfeed_t.get_shape())))
   1112           if not self.graph.is_feedable(subfeed_t):
   1113             raise ValueError('Tensor %s may not be fed.' % subfeed_t)

ValueError: Cannot feed value of shape (1, 490, 704) for Tensor 'image_tensor:0', which has shape '(?, ?, ?, 3)'

Does this mean that this Tensorflow model is not designed for grayscale images? Is there a way to make it work?

SOLUTION

Thanks to Matan Hugi, it works fine now. All I had to do was change this function to:

def load_image_into_numpy_array(image):
    # The function supports only grayscale images
    last_axis = -1
    dim_to_repeat = 2
    repeats = 3
    grscale_img_3dims = np.expand_dims(image, last_axis)
    training_image = np.repeat(grscale_img_3dims, repeats, dim_to_repeat).astype('uint8')
    assert len(training_image.shape) == 3
    assert training_image.shape[-1] == 3
    return training_image

Tensorflow expects the input in NHWC format, which means: (batch, height, width, channels).
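As a rough illustration of that layout, the sketch below compares the batch built from a 2-D grayscale array with the batch built from a 3-channel array. `matches_nhwc_rgb` is a hypothetical helper written for this example, not part of Tensorflow; it only mirrors the `(?, ?, ?, 3)` shape of `image_tensor:0` reported in the error.

```python
import numpy as np

def matches_nhwc_rgb(batch):
    """Hypothetical check: True if `batch` has shape (N, H, W, 3)."""
    return batch.ndim == 4 and batch.shape[-1] == 3

# Grayscale image reshaped to (height, width): the channel axis is missing.
gray = np.zeros((490, 704), dtype=np.uint8)
gray_batch = np.expand_dims(gray, axis=0)

# Three-channel image: adding the batch axis gives NHWC.
rgb = np.zeros((490, 704, 3), dtype=np.uint8)
rgb_batch = np.expand_dims(rgb, axis=0)

print(gray_batch.shape, matches_nhwc_rgb(gray_batch))  # (1, 490, 704) False
print(rgb_batch.shape, matches_nhwc_rgb(rgb_batch))    # (1, 490, 704, 3) True
```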

Step 1 - add a last dimension:

last_axis = -1
grscale_img_3dims = np.expand_dims(image, last_axis)

Step 2 - repeat the last dimension 3 times:

dim_to_repeat = 2
repeats = 3
np.repeat(grscale_img_3dims, repeats, dim_to_repeat)
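To see what the two steps do to the shape, here they are applied to a tiny 2x2 array. The pixel values are made up for illustration:

```python
import numpy as np

# Tiny made-up grayscale "image" of shape (2, 2).
gray = np.array([[10, 20],
                 [30, 40]], dtype=np.uint8)

step1 = np.expand_dims(gray, -1)   # add a channel axis: (2, 2, 1)
step2 = np.repeat(step1, 3, 2)     # copy it 3 times along axis 2: (2, 2, 3)

print(step1.shape)   # (2, 2, 1)
print(step2.shape)   # (2, 2, 3)
print(step2[0, 1])   # [20 20 20]
```

Each pixel ends up with three identical channel values, so the image still looks gray but has the shape the model expects.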

So your function should be:

def load_image_into_numpy_array(image):
    # The function supports only grayscale images
    # A PIL image has no .shape attribute, so convert it to an array first.
    image = np.asarray(image)
    assert len(image.shape) == 2, "Not a grayscale input image"
    last_axis = -1
    dim_to_repeat = 2
    repeats = 3
    grscale_img_3dims = np.expand_dims(image, last_axis)
    training_image = np.repeat(grscale_img_3dims, repeats, dim_to_repeat).astype('uint8')
    assert len(training_image.shape) == 3
    assert training_image.shape[-1] == 3
    return training_image
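A short usage sketch on a synthetic 490x704 array, for anyone who wants to check the shapes without loading a file. `gray_to_3_channels` is a hypothetical compact variant that uses `np.stack` instead of `expand_dims` plus `repeat`. It is not the code from the answer, only an equivalent way to copy the channel under the same grayscale assumption.

```python
import numpy as np

def gray_to_3_channels(gray):
    # Hypothetical variant: stack the single channel three times
    # along a new last axis.
    gray = np.asarray(gray)
    assert gray.ndim == 2, "Not a grayscale input image"
    return np.stack([gray, gray, gray], axis=-1).astype(np.uint8)

# Synthetic grayscale data with values in 0..255.
gray = (np.arange(490 * 704).reshape(490, 704) % 256).astype(np.uint8)

via_stack = gray_to_3_channels(gray)
via_repeat = np.repeat(np.expand_dims(gray, -1), 3, 2).astype('uint8')

# Batch axis added the same way as in the detection loop.
batch = np.expand_dims(via_stack, axis=0)

print(via_stack.shape)                        # (490, 704, 3)
print(np.array_equal(via_stack, via_repeat))  # True
print(batch.shape)                            # (1, 490, 704, 3)
```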


python

NumPy

tensorflow
