[Instance Segmentation | AdaptIS] Dataset Creation | Customizing the Backbone

2023-05-16

Table of Contents

  • Related Blog Notes
  • Dataset Creation
  • AdaIN
  • Related GitHub Projects
    • Fork
    • Improvements
  • Customizing the Backbone
    • Replacing the Backbone with a PyTorch UNet
      • Code Implementation
        • Step 1
        • Step 2
        • Step 3

Related Blog Notes

  • Paper notes: AdaptIS
  • AdaptIS: Adaptive Instance Selection Network paper walkthrough

Dataset Creation

'''
Author: zth
Date: 2022-08-28 10:44:06
LastEditTime: 2022-08-28 11:21:07
Description: Convert labelme instance-segmentation JSON annotations into the dataset format expected by AdaptIS.
'''
import json
import os
import numpy as np
import PIL.Image
from labelme import utils


def json_to_label(json_path):
    data = json.load(open(json_path))
    imageData = data['imageData']  # base64-encoded image embedded by labelme
    img = utils.img_b64_to_arr(imageData)
    label_name_to_value = {'_background_': 0, '1': 1}
    lbl, ins = utils.shapes_to_label(img.shape, data['shapes'],
                                     label_name_to_value)
    img_rgb = PIL.Image.fromarray(img)
    # PIL mode 'I' stores 32-bit signed integers, so cast to int32
    ins_pil = PIL.Image.fromarray(ins.astype(np.int32), mode='I')

    return img_rgb, lbl, ins_pil


def main():
    jsons_path = "/Users/zth/Projects/0.线虫项目/2.制作COCO数据集/before"
    rgb_save_path = ""
    lbl_save_path = ""
    ins_pil_save_path = ""

    for root, sub_dirs, files in os.walk(jsons_path):
        for file in files:
            if not file.endswith(".json"):
                continue
            # pass the full path, not just the file name
            img_rgb, lbl, ins_pil = json_to_label(os.path.join(root, file))
            name = os.path.splitext(file)[0]
            img_rgb_path = os.path.join(rgb_save_path, name + "_rgb.jpg")
            lbl_path = os.path.join(lbl_save_path, name + ".png")
            ins_pil_path = os.path.join(ins_pil_save_path,
                                        name + "_im.png")
            img_rgb.save(img_rgb_path)  # save the RGB image as JPEG
            utils.lblsave(lbl_path, lbl)  # save the semantic segmentation mask
            ins_pil.save(ins_pil_path)  # save the instance label map


if __name__ == "__main__":
    main()
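
A quick sanity check on the generated labels: the instance IDs written to the *_im.png files can be inspected directly. The snippet below is a minimal sketch, and the file name is a placeholder:

import numpy as np
import PIL.Image

# open one of the *_im.png instance masks produced by the script above
ins = np.array(PIL.Image.open("sample_im.png"))   # mode 'I' -> int32 array
ids = np.unique(ins)
print("instance ids:", ids)                       # 0 is the background
print("number of instances:", int((ids != 0).sum()))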

AdaIN

url: https://zhuanlan.zhihu.com/p/158657861
title: "AdaIN Notes"
description: "Reading notes on the paper Arbitrary Style Transfer in Real-time with Adaptive Instance Normalization (ICCV 2017). A somewhat old but excellent paper with very thorough experiments; solid, reliable, and very helpful for my own research. …"
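
AdaptIS conditions its instance-selection head on a chosen point via AdaIN-style feature modulation, which is why the note above is relevant here. For reference, a minimal PyTorch sketch of the plain AdaIN operation (not taken from the AdaptIS code base):

import torch


def adain(content: torch.Tensor, style: torch.Tensor, eps: float = 1e-5) -> torch.Tensor:
    # content, style: [N, C, H, W]; statistics are computed per sample and per channel
    c_mean = content.mean(dim=(2, 3), keepdim=True)
    c_std = content.std(dim=(2, 3), keepdim=True) + eps
    s_mean = style.mean(dim=(2, 3), keepdim=True)
    s_std = style.std(dim=(2, 3), keepdim=True) + eps
    # re-normalize the content features and shift them to the style statistics
    return s_std * (content - c_mean) / c_std + s_mean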

Related GitHub Projects

Fork

  • MyAdaptis
  • AdaptIS.CustomDataset

Improvements

  • RoboTec
  • iterdet

Customizing the Backbone

Replacing the Backbone with a PyTorch UNet

  • The official AdaptIS PyTorch code uses the UNet variant from the original UNet paper, whose skip connections crop the encoder feature maps and therefore lose border pixels. We replace it with a UNet written in PyTorch that uses padded convolutions, so no cropping is needed.
  • Steps:
    • Step 1: add the UNet network file adaptis/model/toy/unet_pytorch.py
    • Step 2: modify adaptis/model/toy/models.py, making sure that input_channel=3 and output_channel=32
    • Step 3: modify the init_model() function in train.py

Code Implementation

Step 1

'''
Author: zth
Date: 2022-09-24 14:01:10
LastEditTime: 2022-09-24 15:18:23
Description: Padded-convolution UNet backbone (adaptis/model/toy/unet_pytorch.py) used to replace the original AdaptIS toy backbone.
'''
from typing import Dict
import torch
import torch.nn as nn
import torch.nn.functional as F


class DoubleConv(nn.Sequential):
    def __init__(self, in_channels, out_channels, mid_channels=None):
        if mid_channels is None:
            mid_channels = out_channels
        super(DoubleConv, self).__init__(
            nn.Conv2d(in_channels, mid_channels, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(mid_channels),
            nn.ReLU(inplace=True),
            nn.Conv2d(mid_channels, out_channels, kernel_size=3, padding=1, bias=False),
            nn.BatchNorm2d(out_channels),
            nn.ReLU(inplace=True)
        )


class Down(nn.Sequential):
    def __init__(self, in_channels, out_channels):
        super(Down, self).__init__(
            nn.MaxPool2d(2, stride=2),
            DoubleConv(in_channels, out_channels)
        )


class Up(nn.Module):
    def __init__(self, in_channels, out_channels, bilinear=True):
        super(Up, self).__init__()
        if bilinear:
            self.up = nn.Upsample(scale_factor=2, mode='bilinear', align_corners=True)
            self.conv = DoubleConv(in_channels, out_channels, in_channels // 2)
        else:
            self.up = nn.ConvTranspose2d(in_channels, in_channels // 2, kernel_size=2, stride=2)
            self.conv = DoubleConv(in_channels, out_channels)

    def forward(self, x1: torch.Tensor, x2: torch.Tensor) -> torch.Tensor:
        x1 = self.up(x1)
        # [N, C, H, W]
        diff_y = x2.size()[2] - x1.size()[2]
        diff_x = x2.size()[3] - x1.size()[3]

        # padding_left, padding_right, padding_top, padding_bottom
        x1 = F.pad(x1, [diff_x // 2, diff_x - diff_x // 2,
                        diff_y // 2, diff_y - diff_y // 2])

        x = torch.cat([x2, x1], dim=1)
        x = self.conv(x)
        return x


class OutConv(nn.Sequential):
    def __init__(self, in_channels, num_classes):
        super(OutConv, self).__init__(
            nn.Conv2d(in_channels, num_classes, kernel_size=1)
        )


class UNetP(nn.Module):
    def __init__(self,
                 in_channels: int = 1,
                 num_classes: int = 2,
                 bilinear: bool = True,
                 base_c: int = 64):
        super(UNetP, self).__init__()
        self.in_channels = in_channels
        self.num_classes = num_classes
        self.bilinear = bilinear
        self.feature_channels = base_c

        self.in_conv = DoubleConv(in_channels, base_c)
        self.down1 = Down(base_c, base_c * 2)
        self.down2 = Down(base_c * 2, base_c * 4)
        self.down3 = Down(base_c * 4, base_c * 8)
        factor = 2 if bilinear else 1
        self.down4 = Down(base_c * 8, base_c * 16 // factor)
        self.up1 = Up(base_c * 16, base_c * 8 // factor, bilinear)
        self.up2 = Up(base_c * 8, base_c * 4 // factor, bilinear)
        self.up3 = Up(base_c * 4, base_c * 2 // factor, bilinear)
        self.up4 = Up(base_c * 2, base_c, bilinear)
        self.out_conv = OutConv(base_c, num_classes)

    def get_feature_channels(self):
        return self.feature_channels

    def forward(self, x: torch.Tensor) -> Dict[str, torch.Tensor]:
        x1 = self.in_conv(x)
        x2 = self.down1(x1)
        x3 = self.down2(x2)
        x4 = self.down3(x3)
        x5 = self.down4(x4)
        x = self.up1(x5, x4)
        x = self.up2(x, x3)
        x = self.up3(x, x2)
        x = self.up4(x, x1)
        logits = self.out_conv(x)

        # the AdaptIS heads consume a single feature tensor, so return it
        # directly instead of wrapping it in a dict
        # return {"out": logits}
        return logits
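
A quick shape check for the new backbone (a sketch; the import path assumes the file was added as adaptis/model/toy/unet_pytorch.py in step 1):

import torch
from adaptis.model.toy.unet_pytorch import UNetP

net = UNetP(in_channels=3, num_classes=32, bilinear=True, base_c=32)
x = torch.randn(1, 3, 96, 96)
features = net(x)
print(features.shape)               # torch.Size([1, 32, 96, 96]); padded convs keep the input resolution
print(net.get_feature_channels())   # 32, read by get_unetp_model() in step 2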

Step 2

# add this import near the top of adaptis/model/toy/models.py; the other names
# used below (nn, AdaptIS, ToyAdaptISHead, basic_blocks) come from the imports
# already present in that file
from adaptis.model.toy.unet_pytorch import UNetP


def get_unetp_model(channel_width=32, max_width=512, with_proposals=False,
                    rescale_output=(0.2, -1.7), norm_layer=nn.BatchNorm2d):
    # max_width is not used by the UNetP backbone
    unet = UNetP(in_channels=3, num_classes=channel_width, bilinear=True,
                 base_c=channel_width)
    in_channels = unet.get_feature_channels()

    return AdaptIS(
        backbone=unet,
        adaptis_head=ToyAdaptISHead(
            basic_blocks.SimpleConvController(3, in_channels, channel_width, norm_layer=norm_layer),
            in_channels,
            channels=channel_width,
            norm_radius=42,
            with_coord_features=True,
            rescale_output=rescale_output,
            norm_layer=norm_layer
        ),
        segmentation_head=basic_blocks.ConvHead(2, in_channels=in_channels, num_layers=3, norm_layer=norm_layer),
        proposal_head=basic_blocks.ConvHead(1, in_channels=in_channels, num_layers=2, norm_layer=norm_layer),
        with_proposals=with_proposals
    )
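
With the function in place, a quick way to confirm the wiring is to build the model and count its parameters (a sketch run from the repository root; the printed number is not asserted here):

import torch.nn as nn
from adaptis.model.toy.models import get_unetp_model

model = get_unetp_model(norm_layer=nn.BatchNorm2d)
n_params = sum(p.numel() for p in model.parameters() if p.requires_grad)
print(f"trainable parameters: {n_params / 1e6:.2f}M")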

Step 3

def init_model():
    model_cfg = edict()
    model_cfg.syncbn = True

    model_cfg.input_normalization = {
        'mean': [0.5, 0.5, 0.5],
        'std': [0.5, 0.5, 0.5]
    }

    model_cfg.input_transform = transforms.Compose([
        transforms.ToTensor(),
        transforms.Normalize(model_cfg.input_normalization['mean'],
                             model_cfg.input_normalization['std']),
    ])

    # training using DataParallel is not implemented
    norm_layer = torch.nn.BatchNorm2d

    # model = get_unet_model(norm_layer=norm_layer)  # original AdaptIS toy backbone
    model = get_unetp_model(norm_layer=norm_layer)  # new padded UNet backbone from step 2
    model.apply(initializer.XavierGluon(rnd_type='gaussian', magnitude=1.0))

    return model, model_cfg
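
Note that model_cfg.syncbn is set to True while the norm layer above is plain torch.nn.BatchNorm2d, which is fine for single-GPU training. A hedged variant for multi-GPU training (not part of the original train.py; ngpus is a hypothetical config value) could select the norm layer like this:

import torch

ngpus = 1  # hypothetical: number of GPUs used for training
norm_layer = torch.nn.SyncBatchNorm if ngpus > 1 else torch.nn.BatchNorm2d
# SyncBatchNorm only synchronizes statistics when the model is wrapped in
# torch.nn.parallel.DistributedDataParallel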