pytorch: grad can be implicitly created only for scalar outputs

2023-05-16

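I ran into this error a long time ago but never found a clear explanation of it online, so here is a short write-up.
For reference, this is the docstring of autograd.grad():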

grad(outputs, inputs, grad_outputs=None, retain_graph=None, create_graph=False, only_inputs=True, allow_unused=False)
    Computes and returns the sum of gradients of outputs w.r.t. the inputs.
    ``grad_outputs`` should be a sequence of length matching ``output``
    containing the pre-computed gradients w.r.t. each of the outputs. If an
    output doesn't require_grad, then the gradient can be ``None``).
    If ``only_inputs`` is ``True``, the function will only return a list of gradients
    w.r.t the specified inputs. If it's ``False``, then gradient w.r.t. all remaining
    leaves will still be computed, and will be accumulated into their ``.grad``
    attribute.
    
    Arguments:
        outputs (sequence of Tensor): outputs of the differentiated function.
        inputs (sequence of Tensor): Inputs w.r.t. which the gradient will be
            returned (and not accumulated into ``.grad``).
        grad_outputs (sequence of Tensor): Gradients w.r.t. each output.
            None values can be specified for scalar Tensors or ones that don't require
            grad. If a None value would be acceptable for all grad_tensors, then this
            argument is optional. Default: None.
        retain_graph (bool, optional): If ``False``, the graph used to compute the grad
            will be freed. Note that in nearly all cases setting this option to ``True``
            is not needed and often can be worked around in a much more efficient
            way. Defaults to the value of ``create_graph``.
        create_graph (bool, optional): If ``True``, graph of the derivative will
            be constructed, allowing to compute higher order derivative products.
            Default: ``False``.
        allow_unused (bool, optional): If ``False``, specifying inputs that were not
            used when computing outputs (and therefore their grad is always zero)
            is an error. Defaults to ``False``.

Consider the following code:

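>>> import torch
>>> from torch import autograd
>>> from torch.autograd import Variable
>>> a = Variable(torch.FloatTensor([1, 2, 3]), requires_grad=True)
>>> b = 3 * a
>>> autograd.grad(outputs=b, inputs=a)  # b is a vector here
RuntimeError: grad can be implicitly created only for scalar outputs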

When grad_outputs is omitted or None, outputs must be a scalar for the gradient to be computed, so the code above raises an error. The following code, by contrast, runs fine:

>>> a = Variable(torch.FloatTensor([1, 2, 3]), requires_grad=True)
>>> b = 3 * a
>>> z = b.sum()
>>> autograd.grad(outputs=z, inputs=a)  # z is a scalar here
(tensor([ 3.,  3.,  3.]),)
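
The two cases can be put side by side in a small script. This is only a sketch, assuming a recent PyTorch release in which plain tensors created with torch.tensor(..., requires_grad=True) take the place of the deprecated Variable wrapper. The variable names mirror the sessions above, and error_message is a name introduced here for illustration.

```python
import torch

a = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
b = 3 * a  # vector output, shape (3,)

# Without grad_outputs, autograd can only seed the backward pass for a scalar,
# so asking for the gradient of a vector raises a RuntimeError.
try:
    torch.autograd.grad(outputs=b, inputs=a)
    error_message = None
except RuntimeError as exc:
    error_message = str(exc)

# Reducing to a scalar first makes the implicit seed (a single 1.0) well defined.
z = b.sum()
(grad_a,) = torch.autograd.grad(outputs=z, inputs=a)

print(error_message)
print(grad_a)
```

The failed call is rejected before any backward computation starts, so the same graph can still be used for the scalar case afterwards.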

Alternatively, pass grad_outputs explicitly. In that case outputs no longer has to be a scalar:

>>> a = Variable(torch.FloatTensor([1, 2, 3]), requires_grad=True)
>>> b = 3 * a
>>> # grad_outputs must have the same shape as outputs (here b)
>>> autograd.grad(outputs=b, inputs=a, grad_outputs=torch.ones_like(b))
(tensor([ 3.,  3.,  3.]),)
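
What grad_outputs supplies is the vector v in the vector-Jacobian product v^T J, where J is the Jacobian of outputs with respect to inputs. Passing all ones is therefore the same as differentiating outputs.sum(). The sketch below checks this and also tries non-uniform weights. The function b = a ** 2 and the weight vector v are illustrative choices that do not appear in the original post.

```python
import torch

a = torch.tensor([1.0, 2.0, 3.0], requires_grad=True)
b = a ** 2  # elementwise square, so the Jacobian is diag(2 * a)

# All-ones grad_outputs: equivalent to differentiating b.sum().
(g_ones,) = torch.autograd.grad(
    b, a, grad_outputs=torch.ones_like(b), retain_graph=True
)
(g_sum,) = torch.autograd.grad(b.sum(), a, retain_graph=True)

# Arbitrary weights v: the result is v^T J, here v * 2 * a elementwise.
v = torch.tensor([1.0, 0.5, 0.0])
(g_weighted,) = torch.autograd.grad(b, a, grad_outputs=v)

print(g_ones)
print(g_sum)
print(g_weighted)
```

retain_graph=True is used on the first two calls only because the same graph is differentiated three times in a row; a single call does not need it.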

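On the GPU, grad_outputs has to live on the same device as outputs. torch.ones_like already returns a tensor with the same device and dtype as its argument and with requires_grad=False, so no Variable or torch.Tensor wrapper is needed:

    grad_outputs = torch.ones_like(b)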
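
A device-agnostic version could look like the sketch below. It picks CUDA when available and falls back to the CPU otherwise, so it should run on either. The device variable is a name introduced here for illustration, and the code assumes a recent PyTorch release.

```python
import torch

# Use the GPU if one is available, otherwise stay on the CPU.
device = torch.device("cuda" if torch.cuda.is_available() else "cpu")

a = torch.tensor([1.0, 2.0, 3.0], device=device, requires_grad=True)
b = 3 * a

# ones_like inherits device and dtype from b and does not require grad.
grad_outputs = torch.ones_like(b)
(grad_a,) = torch.autograd.grad(outputs=b, inputs=a, grad_outputs=grad_outputs)

print(grad_outputs.device == b.device, grad_outputs.requires_grad)
print(grad_a.cpu())
```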
