Pytorch：获取最终层的正确尺寸

2024-02-28

Pytorch 新手来了！我正在尝试微调 VGG16 模型来预测 3 个不同的类别。我的部分工作涉及将 FC 层转换为 CONV 层。但是，我的预测值不会落在 0 到 2（3 个类别）之间。

有人可以向我指出有关如何计算最后一层的正确尺寸的好资源吗？

以下是 VGG16 的原始 fC 层：

(classifier): Sequential(
    (0): Linear(in_features=25088, out_features=4096, bias=True)
    (1): ReLU(inplace)
    (2): Dropout(p=0.5)
    (3): Linear(in_features=4096, out_features=4096, bias=True)
    (4): ReLU(inplace)
    (5): Dropout(p=0.5)
    (6): Linear(in_features=4096, out_features=1000, bias=True)
  )

我将 FC 层转换为 CONV 的代码：

 def convert_fc_to_conv(self, fc_layers):
        # Replace first FC layer with CONV layer
        fc = fc_layers[0].state_dict()
        in_ch = 512
        out_ch = fc["weight"].size(0)
        first_conv = nn.Conv2d(512, out_ch, kernel_size=(1, 1), stride=(1, 1))

        conv_list = [first_conv]
        for idx, layer in enumerate(fc_layers[1:]):
            if isinstance(layer, nn.Linear):
                fc = layer.state_dict()
                in_ch = fc["weight"].size(1)
                out_ch = fc["weight"].size(0)
                if idx == len(fc_layers)-4:
                    in_ch = 3
                conv = nn.Conv2d(out_ch, in_ch, kernel_size=(1, 1), stride=(1, 1))
                conv_list += [conv]
            else:
                conv_list += [layer]
            gc.collect()

        avg_pool = nn.AvgPool2d(kernel_size=2, stride=1, ceil_mode=False)
        conv_list += [avg_pool, nn.Softmax()]
        top_layers = nn.Sequential(*conv_list)  
        return top_layers

最终模型架构：

    Model(
    (features): Sequential(
    (0): Conv2d(3, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (1): ReLU(inplace)
    (2): Conv2d(64, 64, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (3): ReLU(inplace)
    (4): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (5): Conv2d(64, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (6): ReLU(inplace)
    (7): Conv2d(128, 128, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (8): ReLU(inplace)
    (9): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (10): Conv2d(128, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (11): ReLU(inplace)
    (12): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (13): ReLU(inplace)
    (14): Conv2d(256, 256, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (15): ReLU(inplace)
    (16): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (17): Conv2d(256, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (18): ReLU(inplace)
    (19): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (20): ReLU(inplace)
    (21): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (22): ReLU(inplace)
    (23): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False)
    (24): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (25): ReLU(inplace)
    (26): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (27): ReLU(inplace)
    (28): Conv2d(512, 512, kernel_size=(3, 3), stride=(1, 1), padding=(1, 1))
    (29): ReLU(inplace)
    (30): MaxPool2d(kernel_size=2, stride=2, padding=0, dilation=1, ceil_mode=False))

    (classifier): Sequential(
    (0): Conv2d(512, 4096, kernel_size=(1, 1), stride=(1, 1))
    (1): ReLU(inplace)
    (2): Dropout(p=0.5)
    (3): Conv2d(4096, 3, kernel_size=(1, 1), stride=(1, 1))
    (4): ReLU(inplace)
    (5): Dropout(p=0.5)
    (6): AvgPool2d(kernel_size=2, stride=1, padding=0)
    (7): Softmax()
  )
)

模型总结：

            Conv2d-1         [-1, 64, 224, 224]           1,792
              ReLU-2         [-1, 64, 224, 224]               0
            Conv2d-3         [-1, 64, 224, 224]          36,928
              ReLU-4         [-1, 64, 224, 224]               0
         MaxPool2d-5         [-1, 64, 112, 112]               0
            Conv2d-6        [-1, 128, 112, 112]          73,856
              ReLU-7        [-1, 128, 112, 112]               0
            Conv2d-8        [-1, 128, 112, 112]         147,584
              ReLU-9        [-1, 128, 112, 112]               0
        MaxPool2d-10          [-1, 128, 56, 56]               0
           Conv2d-11          [-1, 256, 56, 56]         295,168
             ReLU-12          [-1, 256, 56, 56]               0
           Conv2d-13          [-1, 256, 56, 56]         590,080
             ReLU-14          [-1, 256, 56, 56]               0
           Conv2d-15          [-1, 256, 56, 56]         590,080
             ReLU-16          [-1, 256, 56, 56]               0
        MaxPool2d-17          [-1, 256, 28, 28]               0
           Conv2d-18          [-1, 512, 28, 28]       1,180,160
             ReLU-19          [-1, 512, 28, 28]               0
           Conv2d-20          [-1, 512, 28, 28]       2,359,808
             ReLU-21          [-1, 512, 28, 28]               0
           Conv2d-22          [-1, 512, 28, 28]       2,359,808
             ReLU-23          [-1, 512, 28, 28]               0
        MaxPool2d-24          [-1, 512, 14, 14]               0
           Conv2d-25          [-1, 512, 14, 14]       2,359,808
             ReLU-26          [-1, 512, 14, 14]               0
           Conv2d-27          [-1, 512, 14, 14]       2,359,808
             ReLU-28          [-1, 512, 14, 14]               0
           Conv2d-29          [-1, 512, 14, 14]       2,359,808
             ReLU-30          [-1, 512, 14, 14]               0
        MaxPool2d-31            [-1, 512, 7, 7]               0
           Conv2d-32           [-1, 4096, 7, 7]       2,101,248
             ReLU-33           [-1, 4096, 7, 7]               0
          Dropout-34           [-1, 4096, 7, 7]               0
           Conv2d-35              [-1, 3, 7, 7]          12,291
             ReLU-36              [-1, 3, 7, 7]               0
          Dropout-37              [-1, 3, 7, 7]               0
        AvgPool2d-38              [-1, 3, 6, 6]               0
          Softmax-39              [-1, 3, 6, 6]               0

我编写了一个函数，以 Pytorch 模型作为输入，并将分类层转换为卷积层。目前它适用于 VGG 和 Alexnet，但您也可以将其扩展到其他模型。

import torch
import torch.nn as nn
from torchvision.models import alexnet, vgg16

def convolutionize(model, num_classes, input_size=(3, 224, 224)):
    '''Converts the classification layers of VGG & Alexnet to convolutions

    Input:
        model: torch.models
        num_classes: number of output classes
        input_size: size of input tensor to the model

    Returns:
        model: converted model with convolutions
    '''
    features = model.features
    classifier = model.classifier

    # create a dummy input tensor and add a dim for batch-size
    x = torch.zeros(input_size).unsqueeze_(dim=0)

    # change the last layer output to the num_classes
    classifier[-1] = nn.Linear(in_features=classifier[-1].in_features,
                               out_features=num_classes)

    # pass the dummy input tensor through the features layer to compute the output size
    for layer in features:
        x = layer(x)

    conv_classifier = []
    for layer in classifier:
        if isinstance(layer, nn.Linear):
            # create a convolution equivalent of linear layer
            conv_layer = nn.Conv2d(in_channels=x.size(1),
                                   out_channels=layer.weight.size(0),
                                   kernel_size=(x.size(2), x.size(3)))

            # transfer the weights
            conv_layer.weight.data.view(-1).copy_(layer.weight.data.view(-1))
            conv_layer.bias.data.view(-1).copy_(layer.bias.data.view(-1))
            layer = conv_layer

        x = layer(x)
        conv_classifier.append(layer)

    # replace the model.classifier with newly created convolution layers
    model.classifier = nn.Sequential(*conv_classifier)

    return model

def visualize(model, input_size=(3, 224, 224)):
    '''Visualize the input size though the layers of the model'''
    x = torch.zeros(input_size).unsqueeze_(dim=0)
    print(x.size())
    for layer in list(model.features) + list(model.classifier):
        x = layer(x)
        print(x.size())

这是输入通过模型时的样子

_vgg = vgg16()
vgg = convolutionize(_vgg, 100)
print('\n\nVGG')
visualize(vgg)

...

VGG
torch.Size([1, 3, 224, 224])
torch.Size([1, 64, 224, 224])
torch.Size([1, 64, 224, 224])
torch.Size([1, 64, 224, 224])
torch.Size([1, 64, 224, 224])
torch.Size([1, 64, 112, 112])
torch.Size([1, 128, 112, 112])
torch.Size([1, 128, 112, 112])
torch.Size([1, 128, 112, 112])
torch.Size([1, 128, 112, 112])
torch.Size([1, 128, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 56, 56])
torch.Size([1, 256, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 28, 28])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 14, 14])
torch.Size([1, 512, 7, 7])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 4096, 1, 1])
torch.Size([1, 100, 1, 1])

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

python

machinelearning

Pytorch

convneuralnetwork

Pytorch：获取最终层的正确尺寸的相关文章

最近的 AWS 区域的客户端 IP 地址

Question 我想从客户端设备将一些数据上传到 AWS 但我想上传到最近的 AWS 区域的 S3 存储桶同样我希望能够从最近的区域下载当然我会在每个区域设置一个存储桶我可以使用一个系统它可以获取客户端的 IP 地址然后确定
使用 Tkinter 进行多线程 Python

我用这些函数在画布上画小圆圈这是绘制圆圈的函数 class Fourmis def init self can posx posy name radius self can can self largeur can int self ca
uwsgi + Django REST框架：空闲时间后很少有缓慢的请求

我正在运行 Django REST 框架白天每分钟的请求率相当低我注意到一个我无法解释或重现的问题每天在夜间或清晨当我的 RPM 接近于零时我会收到 1 10 个超慢的请求我的平均响应时间100 到 200 毫秒之间但是这个
解析器生成

我正在做一个项目软件抄袭检测我打算用C语言来做这件事因为我应该创建一个令牌生成器和一个解析器但我不知道从哪里开始任何人都可以帮助我解决这个问题我创建了一个令牌数据库并将令牌与我的程序分开接下来我想做的就是比较两个程序以查明它是
将这个使用 lambda 解包的元组从 Python 2 移植到 Python 3 的最 Pythonic 方法

我有以下 Python 2 代码它在 lambda 中解压元组该 lambda 包含在 for 循环内 for lab lab pred length in zip labels labels pred sequence lengths
Python Subversion 包装器库

在颠覆的文档 http svnbook red bean com en 1 7 svn developer usingapi html svn developer usingapi otherlangs有一个从 Python 使用 Subv
cx_freeze：QODBC 驱动程序未加载

我的 python 应用程序如下所示 test py from PyQt4 import QtCore from PyQt4 import QtGui from PyQt4 import QtSql import sys import at
使用 string.whitespace 删除 Python 中的空格

Python 的 string whitespace 很棒 gt gt gt string whitespace t n x0b x0c r 如何在不手动输入 t n 等正则表达式的情况下将其与字符串一起使用例如它应该能够转动请不要伤
Seaborn 条形图条之间没有空格

我使用下面的代码创建了一个 Seaborn 条形图它来自https www machinelearningplus com plots top 50 matplotlib visualizations the master plots p
在 Qt Creator 中相互公开 QML 组件

我正在使用 Qt Quick 和 PySide2 开发仪表板应用程序但在 Qt Creator 的设计模式中公开我的 QML 组件时遇到问题我的文件夹结构如下所示 myapp mycomponents component1 qml co
如何判断Python对象是否是字符串？

如何检查 Python 对象是否是字符串常规字符串或 Unicode Python 2 Use isinstance obj basestring 对于要测试的对象obj Docs https docs python org 2 7 li
Django：通过外键将两个表连接到第三个表？

我有三个型号 class A Model class B Model id IntegerField a ForeignKey A class C Model id IntegerField a ForeignKey A 我想要得到 B i
使用每日频率格式化 x 轴

我正在尝试获取每日数据图我有 3 个月的数据每天都很难指出如何格式化 x 轴以便我可以获得每个日期可以使用以下命令更改主要刻度的频率set major locator mdates DayLocator interval 5 如下
Python docker 容器在完成运行应用程序后立即关闭，即使指定保留在 -d -t 中

我有一个 dockerfile FROM python 3 WORKDIR app ADD venv venv ADD data file1 csv gz data file1 csv gz ADD data file2 csv gz da
将 numpy 记录数组转换为字典列表的有效方法

如何转换下面的 numpy 记录数组 recs Bill 31 260 0 Fred 15 145 0 r rec fromrecords recs names name age weight formats S30 i2 f4 到字典列表
在 Mac OS x 10.7.5 中运行 Scrapy 所需的文件，使用 Python 2.7.3 IEPD_free（32 位）

我是第一次测试 scrapy 使用命令安装后 sudo easy install U scrapy 一切似乎都运行正常但是当我运行时 scrapy startproject tutorial 我得到以下信息 luismacbookpro
Twitter 不再使用请求库 python

我有一个 python 函数它使用 requests 库和 BeautifulSoup 来抓取特定用户的推文 import requests from bs4 import BeautifulSoup contents requests
numpy 中的分层抽样

在 numpy 中我有一个这样的数据集前两列是索引我可以通过索引将数据集分成多个块即第一个块是 0 0 第二个块是 0 1 第三个块 0 2 然后是 1 0 1 1 1 2 等等每个块至少有两个元素索引列中的数字可能会有所不同我
Python 中的数据可用性图表

我想知道Python是否有一些东西可以绘制具有多个变量的时间序列的数据可用性下面显示了一个示例取自Visavail js 时间数据可用性图表 https github com flrs visavail 1 description 以下
Python：ConfigParser.NoSectionError：没有部分：“TestInformation”

我使用上面的代码收到 ConfigParser NoSectionError No section TestInformation 错误 def LoadTestInformation self config ConfigParser Co

随机推荐

通过更短的拖动使 ViewPager 对齐

有什么办法可以让支持包ViewPager用更短的拖动来捕捉到下一页吗默认行为似乎是即使我拖动近 75 当我放开时页面仍然会弹回到上一页我想缩短捕捉阈值并使 ViewPager 捕捉到下一页请注意这适用于拖动手势猛击手势已经需要
管理频繁数据库轮询的良好 C#.NET 解决方案

我目前正在开发一个 c NET 桌面应用程序该应用程序将通过 WCF 和 WCF 数据服务通过互联网与数据库进行通信应用程序中有许多地方可能需要每隔一段时间刷新一次最简单的解决方案是将这些区域放在计时器上并重新查询数据库然而由于有
为另一个分区/目录运行 apt-get？

我已经从 Live Ubuntu CD 启动了系统并且需要修复一些软件包问题我已经安装了硬盘现在我想像正常启动一样运行 apt get 即更改 apt get 的工作目录以便它可以在我的硬盘上工作我以前做过这个但我不记得语法了
如何在 CURL 重定向上传递 cookie？

想象以下场景我打开一个 CURL 连接并通过 POST 传递一些 XML Logindata 服务器以 302 重定向进行响应其中设置会话 cookie 并将我重定向到以下欢迎页面如果我启用 FOLLOWLOCATION 则重定向
TypeScript：从装饰器推断返回类型？

当装饰器更改其返回类型时如何让 TypeScript 推断装饰方法的类型在下面的基本示例中我装饰一个方法以返回字符串化对象 function jsonStringify return function target decorated
AutoMapper 展平相同类型的复杂对象

我在映射以下复杂类型时遇到问题 RequestDTO int OldUserId string OldUsername int NewUserId string NewUsername Request User OldUser User N
React内联样式中的CSS伪代码“li::before”

我有一个名为 ExplanationLists 的 React 组件我想将动态内联样式添加到li带有 css 伪代码的 html 元素li after 这样我可以更好地用图形来设计要点例如 li before content dynam
如何使用具有多个输入参数的 HttpGet 属性？（并大摇大摆地工作）

它与下面的代码配合得很好我只有一个参数但如何处理两个输入参数如果我只使用 HttpGet 则不会发送任何参数尽管它在 Swagger 之外工作正常帮助 HttpGet Consumes application json HttpG
使用 angular2 进行服务器端渲染是什么？

我知道 angular2 用于服务器端渲染所以我想了解更多我对这种现象有以下疑问 1 什么是服务端渲染 2 它解决什么问题 3 它的应用有哪些 4 为什么使用服务端渲染 5 支持服务端渲染的技术有哪些 6 在Angular2中服务器端
如何在 git 上恢复旧提交中的特定文件

我想我的问题很接近这个one https stackoverflow com questions 20971306 hg how do i revert a single file several commits back 但我正在使用 g
使用 SQL Developer 或 Toad 等 IDE 工具的 Oracle 并行查询行为

一段时间以来我一直在努力抽出时间来写这个问题并尽可能地解释这个问题所以请提前原谅我的长文我的环境 Oracle Database 12 2 在 Red Hat 7 R A C 2 个节点上运行每个节点 16CPU 和 64GB R
在一个 SELECT 语句中设置两个标量变量？

我想做这个 Declare a int Declare b int SET a b SELECT StartNum EndNum FROM Users Where UserId 1223 PRINT a PRINT b 但这是无效的语法如
如何在 Gatsby URL 中添加发布日期？

All the Gatsby 入门演示 https github com gatsbyjs gatsby gatsby starters有一条像这样的路径 gatsby starter blog hi folks 我该如何设置 2015 0
Cron 作业 + Twitter

从 12 30 开始一直到 1 30 2 30 等我的应用程序每小时都会发布一条静态推文我目前正在使用 themattharris 的 twitter API 我也有一个 cron 工作 30 php q home1 USER NAM
PyQt5 和 datetime.datetime.strptime 之间的冲突

所以我正在编写一个工具可以使用基于 python 3 52 和 Qt5 的图形用户界面从文件中读取时间最少的操作 datetime datetime strptime Tue a 在隔离环境中工作输出 1900 01 01 00 00
在 php 5.5 中使用什么来代替 apc 用户数据缓存？

PHP 5 5 默认包含 zend opcache 这基本上意味着几乎没有人会使用 APC 但是用什么来代替 APC 的用户数据缓存部分 apc store apc fetch 类似呢我真正喜欢使用 APC 用户数据缓存的一个用例是静态
如何在vba中另存为.txt

我希望让我的宏将我创建的新工作表保存为 txt 文件这是我到目前为止的代码 Sub Move Move Macro Keyboard Shortcut Ctrl m Sheets Sheet1 Select Range A1 Select
绘制完成后清除CGPath路径

我已经在 iOS 中编写了一个在 TouchMoved 方法中绘图的程序 CGContextAddPath UIGraphicsGetCurrentContext path CGPathMoveToPoint path NULL lastP
OpenCV - Java：inRange 函数

我有我的形象mRgba当我这样做时 Core inRange mRgba B1 B2 mRgba 我得到了我期望的结果我的所有 RGBA 图像的阈值都在 B1 和 B2 之间现在我想这样做 Mat roi mRgba submat re
Pytorch：获取最终层的正确尺寸

Pytorch 新手来了我正在尝试微调 VGG16 模型来预测 3 个不同的类别我的部分工作涉及将 FC 层转换为 CONV 层但是我的预测值不会落在 0 到 2 3 个类别之间有人可以向我指出有关如何计算最后一层的正确尺寸的好

Pytorch：获取最终层的正确尺寸

Pytorch：获取最终层的正确尺寸 的相关文章

随机推荐

热门标签

Pytorch：获取最终层的正确尺寸的相关文章