[YOLOv3 decode] Understanding decoding in YOLOv3: decode_box

2023-05-16

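Contents

1 What decoding means
2 Code walkthrough
3 Generating the grid centers: code details
4 Generating the prior-box widths and heights in grid layout: code details
5 Acknowledgements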

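1 What decoding means

After the YOLOv3 network produces its three outputs out0, out1 and out2, every grid cell at each scale carries a set of prior boxes (anchors). The network learns to predict adjustment parameters for each prior box, and applying them yields the predicted box. Decoding is the process of mapping the predicted boxes at each scale back onto the input image, together with the prediction for the object inside each box (box position, confidence score, class probabilities).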
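To make the adjustment concrete, here is a minimal sketch that decodes a single prediction by hand. It follows the same arithmetic as the class in Section 2 (sigmoid offset plus grid index for the center, exponential times anchor size for width and height), but the helper names `sigmoid` and `decode_one` and the sample values are assumptions made for illustration, not part of the original code.

```python
import math

def sigmoid(v):
    """Map a raw value into (0, 1)."""
    return 1.0 / (1.0 + math.exp(-v))

def decode_one(tx, ty, tw, th, cx, cy, anchor_w, anchor_h, stride):
    """Decode one raw prediction into (center_x, center_y, width, height) in input-image pixels.

    (cx, cy) is the index of the grid cell, (anchor_w, anchor_h) is the prior box
    in pixels, and stride is the number of input pixels covered by one cell.
    """
    bx = (sigmoid(tx) + cx) * stride   # the center stays inside the cell at (cx, cy)
    by = (sigmoid(ty) + cy) * stride
    bw = anchor_w * math.exp(tw)       # width and height rescale the prior box
    bh = anchor_h * math.exp(th)
    return bx, by, bw, bh

# All-zero raw outputs at cell (6, 6) of the 13x13 layer (stride 32) with anchor [116, 90].
box = decode_one(0.0, 0.0, 0.0, 0.0, cx=6, cy=6, anchor_w=116.0, anchor_h=90.0, stride=32)
print(box)  # (208.0, 208.0, 116.0, 90.0)
```

With all raw outputs at zero the box sits at the middle of its cell and keeps the anchor's size. Multiplying by the stride gives pixels directly; the class in Section 2 instead divides by the feature-map size to get fractions, which differs only by a constant factor.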

2 Code walkthrough

The comments below assume the VOC dataset and explain how the final-layer outputs of the YOLOv3 network are decoded.

import torch
import numpy as np

class DecodeBox():
    def __init__(self, anchors, num_classes, input_shape, anchors_mask = [[6,7,8], [3,4,5], [0,1,2]]):
        super(DecodeBox, self).__init__()
        self.anchors        = anchors           # ndarray of shape (9, 2): (width, height) in input-image pixels
        self.num_classes    = num_classes       # int, 20 for VOC
        self.bbox_attrs     = 5 + num_classes   # int, 25 = x, y, w, h, conf + 20 classes
        self.input_shape    = input_shape       # tuple (416, 416)
        #-----------------------------------------------------------#
        #   Anchors for the 13x13 feature layer: [116,90],[156,198],[373,326]
        #   Anchors for the 26x26 feature layer: [30,61],[62,45],[59,119]
        #   Anchors for the 52x52 feature layer: [10,13],[16,30],[33,23]
        #-----------------------------------------------------------#
        self.anchors_mask   = anchors_mask

    # ----------------------------------------------#
    #   Decode out0, out1 and out2: for every grid cell at each scale, return
    #   the predicted box position, confidence score and class probabilities
    # ----------------------------------------------#
    def decode_box(self, inputs):   # inputs holds three tensors: out0, out1, out2
        outputs = []
        for i, input in enumerate(inputs):      # decode one feature layer per iteration
            # -----------------------------------------------#
            #   The three inputs have the following shapes (VOC dataset):
            #   batch_size, 75, 13, 13          batch_size, channels, height, width
            #   batch_size, 75, 26, 26
            #   batch_size, 75, 52, 52
            # -----------------------------------------------#
            batch_size      = input.size(0)
            input_height    = input.size(2)
            input_width     = input.size(3)

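            # -----------------------------------------------#
            #   For a 416x416 input:
            #   stride_h = stride_w = 32, 16, 8
            #   i.e. how many input-image pixels one feature-map cell covers
            # -----------------------------------------------#
            stride_h = self.input_shape[0] / input_height       # step between the feature map and the resized input image, used to map back
            stride_w = self.input_shape[1] / input_width
            #-------------------------------------------------#
            #   Rescale the prior boxes to feature-map units so that their widths
            #   and heights are comparable with the grid.
            #   scaled_anchors is therefore relative to the feature layer; the
            #   anchors themselves come from k-means clustering over a dataset.
            #   The smaller the feature map (out0), the larger the stride, and the
            #   larger the objects it is used to detect.
            #-------------------------------------------------#
            scaled_anchors = [(anchor_width / stride_w, anchor_height / stride_h) for anchor_width, anchor_height in self.anchors[self.anchors_mask[i]]]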

            #-----------------------------------------------#
            #   After the reshape below, the three inputs have the shapes
            #   batch_size, 3, 13, 13, 25
            #   batch_size, 3, 26, 26, 25
            #   batch_size, 3, 52, 52, 25
            #   batch_size,3*(5+num_classes),13,13 -> batch_size,3,5+num_classes,13,13 -> batch_size, 3, 13, 13, 25
            #   Reference: https://blog.csdn.net/weixin_45377629/article/details/124028098
            #-----------------------------------------------#
            prediction = input.view(batch_size, len(self.anchors_mask[i]),
                                    self.bbox_attrs, input_height, input_width).permute(0, 1, 3, 4, 2).contiguous()

            #-----------------------------------------------#
            #   Adjustment parameters for the prior-box center
            #   x shape: torch.Size([batch_size,3,13,13])
            #   y shape: torch.Size([batch_size,3,13,13])
            #-----------------------------------------------#
            x = torch.sigmoid(prediction[..., 0])  # sigmoid maps the output into (0, 1)
            y = torch.sigmoid(prediction[..., 1])   # so the center can only move within the cell to the lower right of its grid point
            #-----------------------------------------------#
            #   Adjustment parameters for the prior-box width and height
            #-----------------------------------------------#
            w = prediction[..., 2]
            h = prediction[..., 3]
            #-----------------------------------------------#
            #   Objectness confidence: the probability that the box contains an object
            #-----------------------------------------------#
            conf        = torch.sigmoid(prediction[..., 4])
            #-----------------------------------------------#
            #   Class confidence: the probability of each class
            #-----------------------------------------------#
            pred_cls    = torch.sigmoid(prediction[..., 5:])

            FloatTensor = torch.cuda.FloatTensor if x.is_cuda else torch.FloatTensor
            LongTensor  = torch.cuda.LongTensor if x.is_cuda else torch.LongTensor

            #----------------------------------------------------------#
            #   Build the grid; each prior box is centered on the top-left corner of its grid cell
            #   grid_x shape: torch.Size([batch_size,3,13,13])
            #   grid_y shape: torch.Size([batch_size,3,13,13])
            #   See Section 3 of this article for a step-by-step explanation
            #----------------------------------------------------------#
            grid_x = torch.linspace(0, input_width - 1, input_width).repeat(input_height, 1).repeat(
                batch_size * len(self.anchors_mask[i]), 1, 1).view(x.shape).type(FloatTensor)
            grid_y = torch.linspace(0, input_height - 1, input_height).repeat(input_width, 1).t().repeat(
                batch_size * len(self.anchors_mask[i]), 1, 1).view(y.shape).type(FloatTensor)

            #----------------------------------------------------------#
            #   Build the prior-box widths and heights in grid layout
            #   shape: batch_size,3,13,13
            #   See Section 4 of this article for a step-by-step explanation
            #----------------------------------------------------------#
            anchor_w = FloatTensor(scaled_anchors).index_select(1, LongTensor([0]))
            anchor_h = FloatTensor(scaled_anchors).index_select(1, LongTensor([1]))
            anchor_w = anchor_w.repeat(batch_size, 1).repeat(1, 1, input_height * input_width).view(w.shape)
            anchor_h = anchor_h.repeat(batch_size, 1).repeat(1, 1, input_height * input_width).view(h.shape)

            #----------------------------------------------------------#
            #   Adjust the prior boxes with the predictions:
            #   first shift the center from the grid point toward the lower right,
            #   then rescale the width and height.
            #----------------------------------------------------------#
            pred_boxes          = FloatTensor(prediction[..., :4].shape)
            pred_boxes[..., 0]  = x.data + grid_x
            pred_boxes[..., 1]  = y.data + grid_y
            pred_boxes[..., 2]  = torch.exp(w.data) * anchor_w
            pred_boxes[..., 3]  = torch.exp(h.data) * anchor_h

            #----------------------------------------------------------#
            #   Normalize the boxes to fractions of the feature-map size
            #----------------------------------------------------------#
            _scale = torch.Tensor([input_width, input_height, input_width, input_height]).type(FloatTensor)
            output = torch.cat((pred_boxes.view(batch_size, -1, 4) / _scale,
                                conf.view(batch_size, -1, 1), pred_cls.view(batch_size, -1, self.num_classes)), -1)
            outputs.append(output.data)
        return outputs      # per-scale predictions for out0, out1, out2: box position, confidence score, class probabilities

if __name__ == '__main__':
    anchors = [10.0, 13.0, 16.0, 30.0, 33.0, 23.0, 30.0, 61.0, 62.0, 45.0, 59.0, 119.0, 116.0, 90.0, 156.0, 198.0, 373.0, 326.0]
    # anchors: ndarray of shape (9, 2)
    anchors = np.array(anchors).reshape(-1,2)
    num_classes = 20    # number of VOC classes
    anchors_mask = [[6, 7, 8], [3, 4, 5], [0, 1, 2]]
    input_shape = [416,416]
    bbox_util = DecodeBox(anchors, num_classes, (input_shape[0], input_shape[1]), anchors_mask)

    # ---------------------------------------------------------#
    #   Feed the image through the network to get predictions.
    #   YoloBody and images are not defined in this file.
    # ---------------------------------------------------------#
    net = YoloBody(anchors_mask, num_classes)       # for YoloBody see https://www.jianshu.com/p/27f3b967646c
    outputs = net(images)                           # images is the input image batch; outputs holds out0, out1, out2
    outputs = bbox_util.decode_box(outputs)         # per-scale predictions for out0, out1, out2: box position, confidence score, class probabilities
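The script above needs a trained YoloBody and real images to run. As a rough way to check the shapes and the arithmetic without them, the sketch below decodes one feature layer from a dummy tensor. It is not the author's code: `decode_layer` is a hypothetical helper, it uses broadcasting in place of the repeat/view calls, and `torch.zeros` stands in for a real network output.

```python
import torch

def decode_layer(feat, anchors_px, num_classes, input_shape):
    """Decode one raw feature layer of shape (B, A*(5+C), H, W).

    anchors_px: list of (width, height) pairs in input-image pixels.
    Returns a tensor of shape (B, A*H*W, 5+C) whose first four values are
    (center_x, center_y, width, height) divided by the feature-map size.
    """
    b, _, h, w = feat.shape
    a = len(anchors_px)
    stride_h = input_shape[0] / h
    stride_w = input_shape[1] / w
    pred = feat.view(b, a, 5 + num_classes, h, w).permute(0, 1, 3, 4, 2)

    # Grid indices, broadcast over the batch and anchor dimensions.
    grid_x = torch.arange(w, dtype=feat.dtype, device=feat.device).view(1, 1, 1, w)
    grid_y = torch.arange(h, dtype=feat.dtype, device=feat.device).view(1, 1, h, 1)

    # Anchors rescaled from pixels to feature-map units.
    anchors = torch.tensor(anchors_px, dtype=feat.dtype, device=feat.device)
    anchor_w = (anchors[:, 0] / stride_w).view(1, a, 1, 1)
    anchor_h = (anchors[:, 1] / stride_h).view(1, a, 1, 1)

    cx = (torch.sigmoid(pred[..., 0]) + grid_x) / w
    cy = (torch.sigmoid(pred[..., 1]) + grid_y) / h
    bw = torch.exp(pred[..., 2]) * anchor_w / w
    bh = torch.exp(pred[..., 3]) * anchor_h / h
    rest = torch.sigmoid(pred[..., 4:])            # confidence followed by class scores

    boxes = torch.stack((cx, cy, bw, bh), dim=-1)
    return torch.cat((boxes, rest), dim=-1).reshape(b, -1, 5 + num_classes)

anchors_13 = [(116.0, 90.0), (156.0, 198.0), (373.0, 326.0)]
fake_out0 = torch.zeros(2, 75, 13, 13)             # dummy stand-in for out0
decoded = decode_layer(fake_out0, anchors_13, num_classes=20, input_shape=(416, 416))
print(decoded.shape)   # torch.Size([2, 507, 25])
```

The 507 rows per image are 3 anchors times 13 x 13 cells. Multiplying the first four columns by the input size (416 here) would give the boxes in input-image pixels.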

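3 Generating the grid centers: code details

Each prior box is centered on the top-left corner of its grid cell. How should the following line of code be read?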

grid_x = torch.linspace(0, input_width - 1, input_width).repeat(input_height, 1).repeat(
                batch_size * len(self.anchors_mask[i]), 1, 1).view(x.shape).type(FloatTensor)

Taking a width of 5, a height of 5 and a batch_size of 1 as an example, the code and outputs below walk through it step by step.

import torch

if __name__ == "__main__":
    input_width = 5
    input_height = 5
    batch_size = 1
    anchors_mask = [[6,7,8], [3,4,5], [0,1,2]]
    
    a = torch.linspace(0, input_width - 1, input_width)     # torch.linspace includes both endpoints
    print(a)    # prints a 1-D tensor
    """
    tensor([0., 1., 2., 3., 4.])
    """
    
    b = a.repeat(input_height, 1)
    print(b)
    """
    tensor([[0., 1., 2., 3., 4.],
            [0., 1., 2., 3., 4.],
            [0., 1., 2., 3., 4.],
            [0., 1., 2., 3., 4.],
            [0., 1., 2., 3., 4.]])
    """
    c = b.repeat(batch_size * 3, 1, 1)         # len(anchors_mask[i]) = 3
    print(c)
    """
    tensor([[[0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.]],

        [[0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.]],

        [[0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.],
         [0., 1., 2., 3., 4.]]])
    """
    d = c.view(batch_size, 3, input_height, input_width)         # reshape to match x.shape
    print(d)
    """
    tensor([[[[0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.]],

         [[0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.]],

         [[0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.],
          [0., 1., 2., 3., 4.]]]])
    """
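    e = d.type(torch.FloatTensor)     # cast to the target tensor type; decode_box uses torch.cuda.FloatTensor instead when x is on the GPU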
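The walkthrough above covers grid_x only. The sketch below builds grid_y the same way and compares both against `torch.meshgrid`, which is one common alternative for building such grids. The non-square size and the variable names `mesh_x` and `mesh_y` are assumptions chosen for illustration; a non-square grid makes a swapped width and height show up immediately.

```python
import torch

input_width, input_height = 5, 4      # deliberately non-square
batch_size, num_anchors = 1, 3
shape = (batch_size, num_anchors, input_height, input_width)

# The repeat-based construction used in decode_box.
grid_x = torch.linspace(0, input_width - 1, input_width).repeat(input_height, 1).repeat(
    batch_size * num_anchors, 1, 1).view(shape)
grid_y = torch.linspace(0, input_height - 1, input_height).repeat(input_width, 1).t().repeat(
    batch_size * num_anchors, 1, 1).view(shape)

# Alternative: build both index grids at once, then expand without copying.
ys, xs = torch.meshgrid(
    torch.arange(input_height, dtype=torch.float32),
    torch.arange(input_width, dtype=torch.float32),
    indexing="ij",
)
mesh_x = xs.expand(shape)
mesh_y = ys.expand(shape)

print(torch.equal(grid_x, mesh_x), torch.equal(grid_y, mesh_y))  # True True
```

In grid_x every row reads 0, 1, 2, ... (the column index), while in grid_y every row is constant and equal to the row index; the `.t()` call in the original line is what turns the repeated row vector into that layout.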

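4 Generating the prior-box widths and heights in grid layout: code details

The code that generates the prior-box widths and heights in grid layout is:

#----------------------------------------------------------#
#   Build the prior-box widths and heights in grid layout
#   shape: batch_size,3,13,13
#----------------------------------------------------------#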
anchor_w = FloatTensor(scaled_anchors).index_select(1, LongTensor([0]))
anchor_h = FloatTensor(scaled_anchors).index_select(1, LongTensor([1]))
anchor_w = anchor_w.repeat(batch_size, 1).repeat(1, 1, input_height * input_width).view(w.shape)
anchor_h = anchor_h.repeat(batch_size, 1).repeat(1, 1, input_height * input_width).view(h.shape)

Taking the smallest feature layer as an example, the four lines above can be understood step by step:

import torch

if __name__ == "__main__":
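    #-----------------------------------------------------------------------------#
    #   Rescale the prior boxes to feature-map units so that their widths and
    #   heights are comparable with the grid.
    #   scaled_anchors is relative to the feature layer; the anchors themselves
    #   come from k-means clustering over a dataset.
    #   The smaller the feature map (out0), the larger the stride, and the larger
    #   the objects it is used to detect.
    #   This example uses the smallest feature layer: batch_size, 75, 13, 13
    #-----------------------------------------------------------------------------#
    scaled_anchors = [(3.625,2.8125), (4.875,6.1875), (11.65625, 10.1875)]

    x_is_cuda = False   # stands in for x.is_cuda; False means CUDA is not used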
    FloatTensor = torch.cuda.FloatTensor if x_is_cuda else torch.FloatTensor
    LongTensor  = torch.cuda.LongTensor if x_is_cuda else torch.LongTensor

    # ------------------------------#
    #   Line 1: anchor_w
    # ------------------------------#
    a = LongTensor([0])
    print(a)    # tensor([0])

    b = FloatTensor(scaled_anchors)
    print(b)    # printed with 4 decimal places, so 11.65625 is displayed as 11.6562
    """
    tensor([[ 3.6250,  2.8125],
        [ 4.8750,  6.1875],
        [11.6562, 10.1875]])
    """
    # ----------------------------------------------------------#
    #   tensor.index_select(dim, index)
    #       dim  : the dimension to select along; for a 2-D tensor, 0 selects rows and 1 selects columns
    #       index: the indices to select, given as a tensor
    #   a = tensor([0]) selects the widths
    #   a = tensor([1]) selects the heights
    # ----------------------------------------------------------#
    anchor_w = b.index_select(1, a)
    print(anchor_w)     # anchor_w shape: torch.Size([3, 1])
    """
    tensor([[ 3.6250],
        [ 4.8750],
        [11.6562]])
    """
    
    # ------------------------------#
    #   Line 2: anchor_h
    #       same as above
    # ------------------------------#
    anchor_h = b.index_select(1, LongTensor([1]))
    print(anchor_h)
    """
    tensor([[ 2.8125],
        [ 6.1875],
        [10.1875]])
    """
    
    # ----------------------------------------------------#
    #   Line 3: anchor_w
    #       w.shape and h.shape: torch.Size([1, 3, 13, 13])
    # ----------------------------------------------------#
    batch_size = 1      # use batch_size = 1 as the example
    input_height = 13   # smallest feature layer: height and width are both 13
    input_width = 13

    # ------------------------------------#
    #   tensor.repeat(dim1, dim2, ...)
    #   tiles the tensor the given number of times along each dimension
    # ------------------------------------#
    c = anchor_w.repeat(batch_size, 1)
    print(c)
    """
    tensor([[ 3.6250],
        [ 4.8750],
        [11.6562]])
    With batch_size = 2, c would be:
    tensor([[ 3.6250],
        [ 4.8750],
        [11.6562],
        [ 3.6250],
        [ 4.8750],
        [11.6562]])
    Each image in the batch needs its own copy of the prior-box widths, so the number of entries scales with batch_size.
    """
    d = c.repeat(1, 1, input_height * input_width)
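    print(d.shape)          # torch.Size([1, 3, 169])

    # ---------------------------------------------------#
    #   Every grid cell has three prior boxes, and each prior box has its own width,
    #   so every position gets its own copy of the value.
    # ---------------------------------------------------#
    anchor_w = d.view(1,3,13,13)
    print(anchor_w.shape)   # torch.Size([1, 3, 13, 13]); all prior-box widths are now generated, and the heights work the same way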
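Since every cell of a given anchor receives the same width, the same tensor can also be obtained by broadcasting. The sketch below checks that against the index_select and repeat construction, using batch_size = 2 so that the batch ordering is exercised as well. The broadcasting variant and the name `anchor_w_b` are an illustrative alternative, not part of the original code.

```python
import torch

scaled_anchors = [(3.625, 2.8125), (4.875, 6.1875), (11.65625, 10.1875)]
batch_size, input_height, input_width = 2, 13, 13
shape = (batch_size, len(scaled_anchors), input_height, input_width)

anchors = torch.tensor(scaled_anchors)             # shape (3, 2)

# The index_select + repeat construction from decode_box.
anchor_w = anchors.index_select(1, torch.tensor([0]))
anchor_w = anchor_w.repeat(batch_size, 1).repeat(1, 1, input_height * input_width).view(shape)

# Broadcasting alternative: reshape to (1, 3, 1, 1) and expand without copying.
anchor_w_b = anchors[:, 0].view(1, -1, 1, 1).expand(shape)

print(torch.equal(anchor_w, anchor_w_b))   # True
print(anchor_w[1, 2, 0, 0].item())         # 11.65625
```

The second print reads the third anchor of the second image, confirming that each image in the batch sees the anchors in the same order.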

5 Acknowledgements

https://www.bilibili.com/video/BV1Hp4y1y788?p=6&spm_id_from=pageDriver
