普通3d稀疏卷积RuleBook构建
我们继续看普通稀疏卷积 RuleBook 的建立过程。返回 src/spconv/spconv_ops.cc,看 getIndicePairs 函数中普通 3D 稀疏卷积的部分。
// --- Regular (non-submanifold) sparse-conv rulebook construction ---
// indicePairs.numel() / 2 equals kernelVolume * numActIn: one slot for every
// possible (kernel offset, active input) pair. Slots are initialized to
// INT_MAX so unused entries collapse into a single trailing value after
// torch::_unique below; the "+ 1" reserves one extra sentinel slot.
auto indicePairUnique = torch::full({indicePairs.numel() / 2 + 1}, std::numeric_limits<int>::max(),torch::dtype(torch::kInt32).device(indices.device()));
// Worst case: each of the numAct active inputs produces kernelVolume outputs.
torch::Tensor outInds = torch::zeros({numAct * kernelVolume, coorDim + 1},torch::dtype(torch::kInt32).device(indices.device()));
if (indices.device().type() == torch::kCPU) {
// CPU path builds the whole rulebook in a single call.
numActOut = create_conv_indice_pair_cpu(indices, outInds, gridOut, indicePairs, indiceNum, kernelSize, stride,padding, dilation, outSpatialShape, transpose, false, useHash);
}
#ifdef TV_CUDA
else if (indices.device().type() == torch::kCUDA) {
// GPU phase 1: for each active input, record (input idx, flat output idx)
// pairs and collect the flat output indices into indicePairUnique.
numActOut = create_conv_indice_pair_p1_cuda(indices,
indicePairs,
indiceNum,
indicePairUnique,
kernelSize,
stride,
padding,
dilation,
outSpatialShape,
transpose
);
if (numActOut > 0) {
// Deduplicate the flat output indices; the INT_MAX fillers merge into one
// trailing entry.
auto res = torch::_unique(indicePairUnique);
indicePairUnique = std::get<0>(res);
// GPU phase 2: decode the unique flat indices into output coordinates
// (outInds) and rewrite indicePairs row 1 to compact output row numbers.
numActOut = create_conv_indice_pair_p2_cuda(indices,
outInds,
gridOut,
indicePairs,
indiceNum,
indicePairUnique,
outSpatialShape,
transpose,
false,
useHash);
// -1 signals a GPU-side failure: retry on the CPU, then move everything
// back to the original device.
if (numActOut == -1) {
auto device = indices.device();
outInds = outInds.to({torch::kCPU});
indicePairs = indicePairs.to({torch::kCPU});
indiceNum = indiceNum.to({torch::kCPU});
indices = indices.to({torch::kCPU});
numActOut = create_conv_indice_pair_cpu(indices, outInds, gridOut, indicePairs, indiceNum, kernelSize,stride, padding, dilation, outSpatialShape, transpose, false,useHash);
return {outInds.to(device).slice(0, 0, numActOut),indicePairs.to(device), indiceNum.to(device)};
}
}
}
普通 3D 稀疏卷积依次调用 create_conv_indice_pair_p1_cuda 和 create_conv_indice_pair_p2_cuda。我们先看 create_conv_indice_pair_p1_cuda 函数,位于 src/spconv/indice.cu。
// Phase 1 of regular sparse-conv rulebook construction (GPU).
// For every active input voxel it computes the valid (kernel offset, output
// position) pairs and writes:
//   - indicePairs(0, offset, k): the input row index,
//   - indicePairs(1, offset, k): the flattened output spatial index,
//   - indiceNum[offset]:         per-kernel-position pair counts (atomics),
//   - indicePairUnique:          flattened output indices, deduplicated later.
// Returns 1 on success, 0 when there are no active inputs. Coordinate
// reconstruction happens afterwards in create_conv_indice_pair_p2_cuda.
int create_conv_indice_pair_p1_cuda(
    torch::Tensor indicesIn,        // [numActIn, NDim+1]; column 0 is batch idx
    torch::Tensor indicePairs,      // [2, kernelVolume, numActIn]
    torch::Tensor indiceNum,        // [kernelVolume]
    torch::Tensor indicePairUnique, // [kernelVolume * numActIn + 1]
    std::vector<int64_t> kernelSize,
    std::vector<int64_t> stride,
    std::vector<int64_t> padding,
    std::vector<int64_t> dilation,
    std::vector<int64_t> outSpatialShape,
    bool transpose) {
  auto stream = at::cuda::getCurrentCUDAStream();
  auto ndim = kernelSize.size();
  auto numActIn = indicesIn.size(0);
  auto kernelVolume = indiceNum.size(0);
  if (numActIn == 0)
    return 0;
  // Lift the runtime index dtype, dimensionality, and kernel volume to
  // compile-time template parameters before launching the kernel.
  tv::dispatch_torch<int32_t>(indicesIn.scalar_type(), [&](auto IndexValue) {
    using Index = TV_DECLTYPE(IndexValue);
    using IndexGrid = int32_t;
    tv::dispatch_int<2, 3, 4>(ndim, [&](auto I) {
      constexpr int NDim = TV_DECLTYPE(I)::value;
      tv::SimpleVector<Index, NDim> ks(kernelSize.begin(), kernelSize.end());
      tv::SimpleVector<Index, NDim> st(stride.begin(), stride.end());
      tv::SimpleVector<Index, NDim> pa(padding.begin(), padding.end());
      tv::SimpleVector<Index, NDim> di(dilation.begin(), dilation.end());
      tv::SimpleVector<Index, NDim> ou(outSpatialShape.begin(),
                                       outSpatialShape.end());
      // Select the smallest compile-time MaxKernelVolume >= kernelVolume.
      tv::DispatchInt<max_kernel_vol_t>()(
          kernelVolume, std::less_equal<int>(), [&](auto I2) {
            constexpr int MaxKernelVolume = TV_DECLTYPE(I2)::value;
            if (transpose) {
              prepareDeConvIndicePairsKernel<Index, NDim, MaxKernelVolume>
                  <<<tv::cuda::getBlocks(numActIn), tv::cuda::CUDA_NUM_THREADS,
                     0, stream>>>(tv::torch2tv<Index>(indicesIn),
                                  tv::torch2tv<Index>(indicePairs),
                                  tv::torch2tv<Index>(indiceNum),
                                  tv::torch2tv<Index>(indicePairUnique), ks, st,
                                  pa, di, ou);
              TV_CHECK_CUDA_ERR_V2("prepareDeConvIndicePairsKernel failed");
            } else {
              prepareIndicePairsKernel<Index, NDim, MaxKernelVolume>
                  <<<tv::cuda::getBlocks(numActIn), tv::cuda::CUDA_NUM_THREADS,
                     0, stream>>>(tv::torch2tv<Index>(indicesIn),
                                  tv::torch2tv<Index>(indicePairs),
                                  tv::torch2tv<Index>(indiceNum),
                                  tv::torch2tv<Index>(indicePairUnique), ks, st,
                                  pa, di, ou);
              TV_CHECK_CUDA_ERR_V2("prepareIndicePairsKernel failed");
            }
#ifdef TV_LOG_KERNEL_INFO
            // BUGFIX: the original always queried the attributes of
            // prepareDeConvIndicePairsKernel (even when the non-transpose
            // kernel was launched) while labelling the output
            // "prepareIndicePairsKernel". Query whichever kernel actually ran.
            cudaFuncAttributes attr;
            if (transpose) {
              checkCudaErrors(cudaFuncGetAttributes(
                  &attr,
                  prepareDeConvIndicePairsKernel<Index, NDim, MaxKernelVolume>));
              tv::ssprint("prepareDeConvIndicePairsKernel<", tv::type_s<Index>,
                          NDim, MaxKernelVolume, ">", attr.numRegs);
            } else {
              checkCudaErrors(cudaFuncGetAttributes(
                  &attr,
                  prepareIndicePairsKernel<Index, NDim, MaxKernelVolume>));
              tv::ssprint("prepareIndicePairsKernel<", tv::type_s<Index>, NDim,
                          MaxKernelVolume, ">", attr.numRegs);
            }
#endif
          });
    });
  });
  return 1;
}
重点看 prepareIndicePairsKernel 核函数。
// One thread per active input voxel: for input `ix`, enumerate every kernel
// offset that yields an in-bounds output and record the (input, output) pair
// in the rulebook.
//   indicesIn:        [numActIn, NDim+1]; column 0 is the batch index.
//   indicePairs:      [2, kernelVolume, numActIn]; row 0 <- input index,
//                     row 1 <- flattened output spatial index.
//   indiceNum:        [kernelVolume] pair counters, bumped with atomicAdd.
//   indicePairUnique: flattened output indices, deduplicated later on host.
template <typename Index, unsigned NDim, int KernelMaxVolume = 256,typename Index1D = int>
__global__ void prepareIndicePairsKernel(
    tv::TensorView<const Index> indicesIn,
    tv::TensorView<Index> indicePairs,
    tv::TensorView<Index> indiceNum,
    tv::TensorView<Index1D> indicePairUnique,
    const tv::SimpleVector<Index, NDim> kernelSize,
    const tv::SimpleVector<Index, NDim> stride,
    const tv::SimpleVector<Index, NDim> padding,
    const tv::SimpleVector<Index, NDim> dilation,
    const tv::SimpleVector<Index, NDim> outSpatialShape
) {
  auto numActIn = indicesIn.dim(0);
  // Number of cells in one output feature map (product of spatial dims);
  // used to offset the flattened index by the batch component.
  Index spatialVolume = 1;
#pragma unroll
  for (int i = 0; i < NDim; ++i) {
    spatialVolume *= outSpatialShape[i];
  }
  Index kernelVolume = 1;
#pragma unroll
  for (int i = 0; i < NDim; ++i) {
    kernelVolume *= kernelSize[i];
  }
  Index numValidPoints = 0;
  // Per-thread scratch: up to KernelMaxVolume candidate outputs, each stored
  // as NDim coordinates plus the linear kernel-weight offset.
  Index validPoints[KernelMaxVolume * (NDim + 1)];
  Index *pointPtr = nullptr;
  auto indicePairsDim2 = indicePairs.dim(2);
  Index index;
  for (int ix : tv::KernelLoopX<int>(numActIn)) {
    // All in-bounds output positions reachable from this input, plus the
    // kernel weight offset each one uses. "+ 1" skips the batch column.
    numValidPoints = getValidOutPos<Index, NDim>(
        indicesIn.data() + ix * (NDim + 1) + 1,
        kernelSize.data(),
        stride.data(),
        padding.data(),
        dilation.data(),
        outSpatialShape.data(),
        validPoints
    );
    for (Index i = 0; i < numValidPoints; ++i) {
      pointPtr = validPoints + i * (NDim + 1);
      auto offset = pointPtr[NDim]; // linear kernel-weight offset
      // Reserve a slot for this kernel position; atomic because many threads
      // append to the same kernel offset concurrently.
      Index oldNum = atomicAdd(indiceNum.data() + offset, Index(1));
      indicePairs(0, offset, oldNum) = ix;
      // Flatten (batch, spatial...) into a single output index.
      index = tv::ArrayIndexRowMajor<NDim, NDim>::runPtrs(
                  pointPtr, outSpatialShape.data(), 0) +
              spatialVolume * indicesIn(ix, 0);
      indicePairs(1, offset, oldNum) = index;
      indicePairUnique[offset * indicePairsDim2 + oldNum] = index;
    }
  }
}
getValidOutPos 的作用是:根据输入点计算输出哈希表以及输出所用到的卷积核权重的位置,同时返回有效输出的个数。直接看下列代码,注释比较详细。
// For one input position, enumerate every output position it can contribute
// to under (kernelSize, stride, padding, dilation), clipped to
// outSpatialShape. Each candidate is written to `out` as NDim coordinates
// followed by the linear index of the kernel weight connecting input and
// output; returns the number of valid (in-bounds) points.
template <typename Index, unsigned NDim>
TV_HOST_DEVICE Index getValidOutPos(const Index *input_pos,
                                    const Index *kernelSize,
                                    const Index *stride,
                                    const Index *padding,
                                    const Index *dilation,
                                    const Index *outSpatialShape,
                                    Index *out
) {
  Index lowers[NDim];      // smallest candidate output coordinate per dim
  Index uppers[NDim];      // largest candidate output coordinate per dim
  Index counter[NDim];     // odometer over the candidate grid
  Index counterSize[NDim]; // number of candidates per dim
  Index pointCounter = 0;
  Index val;
  Index numPoints = 1;
  Index m, offset;
  bool valid = false;
// Output-coordinate bounds, derived (see article text) from
//   lower: s*(n-1) + (k-1)*d + 1 = x + p
//   upper: s*n = x + p
#pragma unroll
  for (int i = 0; i < NDim; ++i) {
    lowers[i] = (input_pos[i] - (kernelSize[i] - 1) * dilation[i] - 1 + stride[i] + padding[i]) / stride[i];
    uppers[i] = (input_pos[i] + padding[i]) / stride[i];
  }
// Candidate outputs form a grid stepping by `dilation` along each dimension.
#pragma unroll
  for (unsigned i = 0; i < NDim; ++i) {
    counterSize[i] = ((uppers[i] - lowers[i]) / dilation[i] + 1);
    numPoints *= counterSize[i];
  }
#pragma unroll
  for (int i = 0; i < NDim; ++i) {
    counter[i] = 0;
  }
  for (int i = 0; i < numPoints; ++i) {
    valid = true;
    m = 1;
    offset = 0;
// Walk dims innermost-to-outermost, computing the output coordinate and,
// simultaneously, the row-major linear kernel-weight offset.
#pragma unroll
    for (int j = NDim - 1; j >= 0; --j) {
      val = uppers[j] - counter[j] * dilation[j];
      out[pointCounter * (NDim + 1) + j] = val;
      if (val < 0 || (val > outSpatialShape[j] - 1)) {
        valid = false; // outside the output feature map: discard this point
      }
      // (x - n*s + p) / d is the kernel tap (0..k-1) used along dim j.
      offset += m * (input_pos[j] - val * stride[j] + padding[j]) / dilation[j];
      m *= kernelSize[j];
    }
    out[pointCounter * (NDim + 1) + NDim] = offset;
    if (valid)
      ++pointCounter; // keep the slot only if every dim was in bounds
    // Advance the multi-dimensional odometer (innermost dim fastest).
    counter[NDim - 1] += 1;
#pragma unroll
    for (int c = NDim - 1; c >= 0; --c) {
      if (counter[c] == counterSize[c] && c > 0) {
        counter[c - 1] += 1;
        counter[c] = 0;
      }
    }
  }
  return pointCounter;
}
关于输出上下限如何得出,计算过程如下。以 1 维卷积为例:给定输入点,对应的输出点取决于内核大小 $k$、步长 $s$、膨胀率 $d$ 和填充 $p$。对于输入位置 $x$,它到特征图边界的距离为 $x+p$。
假设输出点的最小值为 $n$,有以下关系:$s(n-1)+k'=x+p$
其中 $k'$ 是有效内核大小,它取决于内核大小和膨胀率:$k'=(k-1)(d-1)+k$
代入 $k'$,等式变为:$s(n-1)+(k-1)(d-1)+k=x+p$
重新排列,计算 lowers 为:$n=\dfrac{x-d(k-1)-1+s+p}{s}$
同理,假设输出点的最大值为 $n$,则有如下关系:$s\cdot n=x+p$
则计算 uppers 为:$n=\dfrac{x+p}{s}$
参考:https://github.com/traveller59/spconv/issues/224
对于 counter 变量的含义可以参考注释代码,如有哪些地方理解有误,也麻烦大家指出来。
create_conv_indice_pair_p2_cuda
位于:src/spconv/indice.cu
// Phase 2 of regular sparse-conv rulebook construction (GPU): decode the
// deduplicated flat output indices into output coordinates and rewrite
// indicePairs row 1 from flat spatial indices to compact output row numbers.
// Returns the number of active outputs, or -1 on failure (the caller then
// falls back to the CPU implementation).
int create_conv_indice_pair_p2_cuda(
    torch::Tensor indicesIn,        // torch.Size([N, 4]) input indices
    torch::Tensor indicesOut,       // torch.Size([N*27, 4]) written here
    torch::Tensor gridsOut,         // [4, 21*720*720] dense flat-index -> row lookup
    torch::Tensor indicePairs,      // torch.Size([2, 27, N])
    torch::Tensor indiceNum,        // torch.Size([27]); pair count per kernel position
    torch::Tensor indicePairUnique, // N*27+1 deduplicated flat output indices
    std::vector<int64_t> outSpatialShape, // [21, 720, 720]
    bool transpose,                 // false in this trace
    bool resetGrid,                 // false in this trace
    bool useHash                    // false in this trace
) {
  auto stream = at::cuda::getCurrentCUDAStream();
  auto ndim = outSpatialShape.size(); // 3
  auto numActIn = indicesIn.size(0);  // N
  int batchSize = gridsOut.size(0);   // 4
  // The last unique entry is the INT_MAX filler, hence the -1.
  int numAct = indicePairUnique.size(0) - 1;
  auto kernelVolume = indiceNum.size(0);
  if (numActIn == 0)
    return 0;
  bool failed = false;
  tv::dispatch_torch<int32_t>(indicesIn.scalar_type(), [&](auto IndexValue) {
    using Index = TV_DECLTYPE(IndexValue);
    using IndexGrid = int32_t;
    tv::dispatch_int<2, 3, 4>(ndim, [&](auto I) {
      constexpr int NDim = TV_DECLTYPE(I)::value;
      using IndexGrid = int32_t;
      tv::SimpleVector<Index, NDim> ou(outSpatialShape.begin(),outSpatialShape.end());
      if (useHash) { // not taken in this trace
        ...... // omitted in the article
      } else { // taken in this trace (useHash == false)
        // Scatter: gridsOut[flatIdx] = compact output row; also decode each
        // flat index into (batch, z, y, x) coordinates in indicesOut.
        assignGridAndIndiceOutKernel<Index, IndexGrid, NDim>
            <<<tv::cuda::getBlocks(numAct), tv::cuda::CUDA_NUM_THREADS, 0,stream>>>(
                tv::torch2tv<Index>(indicesOut),       // torch.Size([N*27, 4])
                tv::torch2tv<IndexGrid>(gridsOut),     // [4, 21*720*720]
                numAct,                                // number of unique outputs
                tv::torch2tv<Index>(indicePairs),      // torch.Size([2, 27, N])
                tv::torch2tv<Index>(indicePairUnique), // unique flat output indices
                ou,                                    // output spatial shape
                batchSize                              // 4
            );
        TV_CHECK_CUDA_ERR_V2("assignGridAndIndiceOutKernel failed");
        // Gather: replace each flat output index in indicePairs row 1 with
        // the compact row number looked up from gridsOut.
        assignIndicePairsKernel<Index, IndexGrid, NDim>
            <<<tv::cuda::getBlocks(numActIn), tv::cuda::CUDA_NUM_THREADS, 0,
               stream>>>(tv::torch2tv<Index>(indicesOut),
                         tv::torch2tv<IndexGrid>(gridsOut), numActIn,
                         tv::torch2tv<Index>(indicePairs),
                         tv::torch2tv<Index>(indicePairUnique), ou);
        TV_CHECK_CUDA_ERR_V2("assignIndicePairsKernel failed");
#ifdef TV_LOG_KERNEL_INFO
        ...... // logging omitted in the article
#endif
      }
      if (resetGrid && (!useHash)) { // not taken in this trace
        resetGridKernel<Index, IndexGrid, NDim>
            <<<tv::cuda::getBlocks(numAct), tv::cuda::CUDA_NUM_THREADS, 0,stream>>>(indicePairUnique.data_ptr<Index>(),tv::torch2tv<IndexGrid>(gridsOut), numAct);
        TV_CHECK_CUDA_ERR_V2("resetGridKernel failed");
      }
    });
  });
  if (failed){
    return -1;
  }
  return numAct;
}
assignGridAndIndiceOutKernel
位于:include/spconv/indice.cu.h
// One thread per unique output: record the output row number in the dense
// grid (flat index -> row) and decode the flat index back into coordinates.
template <typename Index, typename IndexGrid, unsigned NDim>
__global__ void assignGridAndIndiceOutKernel(
    tv::TensorView<Index> indicesOut,       // [N*27, 4] output coordinates (written here)
    tv::TensorView<IndexGrid> gridsOut,     // [4, 21*720*720] flat-index -> row lookup (written here)
    int numAct,                             // number of unique output indices
    tv::TensorView<Index> indicePairs,      // [2, 27, N] (not read in this kernel)
    tv::TensorView<Index> indicePairUnique, // unique flat output indices
    const tv::SimpleVector<Index, NDim> outSpatialShape, // output spatial shape
    int batchSize                           // 4
) {
  Index index;
  auto indicesOutPtr = indicesOut.data();
  for (int ix : tv::KernelLoopX<int>(numAct)) {
    index = indicePairUnique[ix];
    // Dense lookup table: flat output index -> compact row number.
    gridsOut[index] = ix;
    // Decode the spatial coordinates into indicesOut row `ix` (cols 1..NDim);
    // rowArrayIdxInv returns what remains after all divisions, i.e. the
    // batch component.
    index = tv::rowArrayIdxInv<Index, NDim>(index, indicesOutPtr + ix * (NDim + 1) + 1, outSpatialShape.data());
    // Column 0 stores the batch index (% batchSize keeps it in range).
    indicesOut[ix * (NDim + 1)] = index % batchSize;
  }
}
rowArrayIdxInv
位于:include/tensorview/tensorview.h
// Inverse of row-major flattening: decompose `index` into NDim coordinates
// (written to `output`, innermost dimension handled first) and return whatever
// remains after all NDim divisions — the leading (e.g. batch) component.
template <typename Index, unsigned NDim>
TV_HOST_DEVICE_INLINE Index rowArrayIdxInv(Index index, Index *output,
                                           const Index *shape) {
  Index remaining = index;
#pragma unroll
  for (int dim = NDim - 1; dim >= 0; --dim) {
    const Index coord = remaining % shape[dim];
    output[dim] = coord;
    remaining = (remaining - coord) / shape[dim];
  }
  return remaining;
}
继续看 assignIndicePairsKernel:
// One thread per active input: rewrite the output row of the rulebook from
// flattened spatial indices to compact output row numbers via gridsOut.
template <typename Index, typename IndexGrid, unsigned NDim>
__global__ void
assignIndicePairsKernel(tv::TensorView<Index> indicesOut,
                        tv::TensorView<IndexGrid> gridsOut, int numActIn,
                        tv::TensorView<Index> indicePairs,
                        tv::TensorView<Index> indicePairUnique,
                        const tv::SimpleVector<Index, NDim> outSpatialShape) {
  Index index;
  int kernelVolume = indicePairs.dim(1);
  // View of indicePairs[1]: the output side of the rulebook (the map from
  // output tensor positions to output row numbers).
  auto indicePairsOut = indicePairs.subview(1);
  for (int ix : tv::KernelLoopX<int>(numActIn)) {
    for (int i = 0; i < kernelVolume; ++i) {
      index = indicePairsOut(i, ix);
      // Negative entries are skipped — presumably slots left unfilled by
      // phase 1; confirm the initialization value upstream.
      if (index > -1) {
        // Replace the flat spatial index with the compact output row number.
        indicePairsOut(i, ix) = gridsOut[index];
      }
    }
  }
}
subview 位于 include/tensorview/tensorview.h,其作用是获取张量的子视图(sub-view):
// Returns a non-owning view of this tensor with the first ids.size()
// dimensions fixed at the given indices (i.e. tensor[ids[0], ids[1], ...]).
// NOTE(review): the declared return type uses rank -1 while the body
// constructs TensorView<T, Rank, ...> — presumably an implicit conversion;
// confirm against the full TensorView definition.
TV_HOST_DEVICE_INLINE TensorView<T, -1, PtrTraits, Tindex>
subview(SimpleVector<int> ids) const {
  // Pad the index prefix with zeros for the remaining dims so rowArrayIdx
  // yields the flat offset of the sub-block's first element.
  Shape start = ids;
  for (int i = ids.size(); i < ndim(); ++i) {
    start.push_back(0);
  }
  return TensorView<T, Rank, PtrTraits, Tindex>(
      ptr_ + rowArrayIdx(shape_, start), shape_.subshape(ids.size()));
}
本文内容由网友自发贡献,版权归原作者所有,本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容,请联系:hwhale#tublm.com(使用前将#替换为@)