【数据集｜COCO】COCO格式数据集制作与数据集参数计算

2023-05-16

文章目录

1. 批量修改 JSON 文件中的参数
- 1.1 问题背景
- 1.2 代码实现
2. 划分训练集和测试集
- 2.1 问题背景
- 2.2 环境配置
- 2.3 代码实现
3. 生成 JSON 标签文件
- 3.1 环境配置
- 3.2 代码实现
4. 计算训练集三通道均值
- 4.1 问题背景
- 4.2 代码实现
5. bbox和seg标签可视化
- 5.1 问题背景
- 5.2 代码实现

1. 批量修改 JSON 文件中的参数

1.1 问题背景

在不同的电脑上进行标注，生成的标签文件中的图像路径名与 json 文件名不一致，故需要进行修改。

1.2 代码实现

import os
import json

folder_path = "/home/zth/HardDisk/Datasets/nematode/datasets/before"
for json_name in os.listdir(folder_path):
    modified_data = {}

    # 修改
    with open(os.path.join(folder_path, json_name),'rb') as f:
        data = json.load(f)
        print(json_name)
        data['imagePath'] = json_name
        modified_data = data
    f.close()

    # 写入
    with open(os.path.join(folder_path, json_name),'w') as r:
        json.dump(modified_data, r, indent=4, ensure_ascii=False) # 格式化并写入
    r.close()

2. 划分训练集和测试集

2.1 问题背景

按照 8:2 的比例随机划分训练集和测试集。

2.2 环境配置

pip install pytest-shutil
pip install scikit-learn

2.3 代码实现

import os
import shutil
from sklearn.model_selection import train_test_split

# 创建文件夹
def mkdir(path):
    folder = os.path.exists(path)
    if not folder:
        os.makedirs(path)
        print(f'-- new folder "{path}" --')
    else:
        print(f'-- the folder "{path}" is already here --')

# 设置原图和原标签路经及目标文件夹路径
image_format = ".jpg"
image_path = "images"
label_path = "labels"
train_set_save_path = "coco/train"
test_set_save_path = "coco/test"
mkdir(train_set_save_path)
mkdir(test_set_save_path)

file_pathes = os.listdir(image_path)
# 获取文件夹下所有指定格式的图像的名称（不包含后缀名）
img_names = []
for file_path in file_pathes:
    if os.path.splitext(file_path)[1] == image_format:
        file_name = os.path.splitext(file_path)[0]
        img_names.append(file_name)

# 划分训练集和验证集
train_set, test_set = train_test_split(img_names, test_size=0.2, random_state=42)
print(f"train_set size: {len(train_set)}, val_set size: {len(test_set)}")

# 训练集处理：将图像和标签文件移动到目标文件夹
for file_name in train_set:
    img_src_path = os.path.join(image_path, file_name+image_format)
    img_dst_path = os.path.join(train_set_save_path, file_name+image_format)
    shutil.copyfile(img_src_path, img_dst_path)

    json_src_path = os.path.join(label_path, file_name+".json")
    json_dst_path = os.path.join(train_set_save_path, file_name+".json")
    shutil.copyfile(json_src_path, json_dst_path)

# 验证集处理：将图像和标签文件移动到目标文件夹
for file_name in test_set:
    img_src_path = os.path.join(image_path, file_name+image_format)
    img_dst_path = os.path.join(test_set_save_path, file_name+image_format)
    shutil.copyfile(img_src_path, img_dst_path)

    json_src_path = os.path.join(label_path, file_name+".json")
    json_dst_path = os.path.join(test_set_save_path, file_name+".json")
    shutil.copyfile(json_src_path, json_dst_path)

3. 生成 JSON 标签文件

3.1 环境配置

pip install scikit-image

3.2 代码实现

# -*- coding:utf-8 -*-

import os
import argparse
import json
import matplotlib.pyplot as plt
import skimage.io as io
from labelme import utils
import numpy as np
import glob
import PIL.Image

def mkdir(path):
    folder = os.path.exists(path)
    if not folder:
        os.makedirs(path)
        print(f'-- new folder "{path}" --')
    else:
        print(f'-- the folder "{path}" is already here --')

class MyEncoder(json.JSONEncoder):
    def default(self, obj):
        if isinstance(obj, np.integer):
            return int(obj)
        elif isinstance(obj, np.floating):
            return float(obj)
        elif isinstance(obj, np.ndarray):
            return obj.tolist()
        else:
            return super(MyEncoder, self).default(obj)


class labelme2coco(object):
    def __init__(self, labelme_json=[], save_json_path='./train.json'):
        self.labelme_json = labelme_json
        self.save_json_path = save_json_path
        self.images = []
        self.categories = []
        self.annotations = []
        self.label = []
        self.annID = 1
        self.height = 0
        self.width = 0

        self.save_json()

    def data_transfer(self):

        for num, json_file in enumerate(self.labelme_json):
            print(json_file)
            with open(json_file, 'r', encoding="utf8", errors='ignore') as fp:
                data = json.load(fp)  # 加载json文件
                self.images.append(self.image(data, num))
                for shapes in data['shapes']:
                    label = shapes['label']
                    if label not in self.label:
                        self.categories.append(self.categorie(label))
                        self.label.append(label)
                    points = shapes['points']  # 这里的point是用rectangle标注得到的，只有两个点，需要转成四个点
                    points.append([points[0][0], points[1][1]])
                    points.append([points[1][0], points[0][1]])
                    self.annotations.append(self.annotation(points, label, num))
                    self.annID += 1

    def image(self, data, num):
        image = {}
        img = utils.img_b64_to_arr(data['imageData'])  # 解析原图片数据
        height, width = img.shape[:2]
        img = None
        image['height'] = height
        image['width'] = width
        image['id'] = num + 1
        image['file_name'] = data['imagePath'].split('/')[-1]

        self.height = height
        self.width = width

        return image

    def categorie(self, label):
        categorie = {}
        categorie['supercategory'] = 'Cancer'
        categorie['id'] = len(self.label) + 1  # 0 默认为背景
        categorie['name'] = label
        return categorie

    def annotation(self, points, label, num):
        annotation = {}
        annotation['segmentation'] = [list(np.asarray(points).flatten())]
        annotation['iscrowd'] = 0
        annotation['image_id'] = num + 1
        # annotation['bbox'] = str(self.getbbox(points)) # 使用list保存json文件时报错（不知道为什么）
        # list(map(int,a[1:-1].split(','))) a=annotation['bbox'] 使用该方式转成list
        annotation['bbox'] = list(map(float, self.getbbox(points)))
        annotation['area'] = annotation['bbox'][2] * annotation['bbox'][3]
        # annotation['category_id'] = self.getcatid(label)
        annotation['category_id'] = self.getcatid(label)  # 注意，源代码默认为1
        annotation['id'] = self.annID
        return annotation

    def getcatid(self, label):
        for categorie in self.categories:
            if label == categorie['name']:
                return categorie['id']
        return 1

    def getbbox(self, points):
        polygons = points

        mask = self.polygons_to_mask([self.height, self.width], polygons)
        return self.mask2box(mask)

    def mask2box(self, mask):
        '''从mask反算出其边框
        mask：[h,w]  0、1组成的图片
        1对应对象，只需计算1对应的行列号（左上角行列号，右下角行列号，就可以算出其边框）
        '''
        # np.where(mask==1)
        index = np.argwhere(mask == 1)
        rows = index[:, 0]
        clos = index[:, 1]

        # 解析左上角行列号
        left_top_r = np.min(rows)  # y
        left_top_c = np.min(clos)  # x

        # 解析右下角行列号
        right_bottom_r = np.max(rows)
        right_bottom_c = np.max(clos)

        return [left_top_c, left_top_r, right_bottom_c - left_top_c,
                right_bottom_r - left_top_r]  # [x1,y1,w,h] 对应COCO的bbox格式

    def polygons_to_mask(self, img_shape, polygons):
        mask = np.zeros(img_shape, dtype=np.uint8)
        mask = PIL.Image.fromarray(mask)
        xy = list(map(tuple, polygons))
        PIL.ImageDraw.Draw(mask).polygon(xy=xy, outline=1, fill=1)
        mask = np.array(mask, dtype=bool)
        return mask

    def data2coco(self):
        data_coco = {}
        data_coco['images'] = self.images
        data_coco['categories'] = self.categories
        data_coco['annotations'] = self.annotations
        return data_coco

    def save_json(self):
        self.data_transfer()
        self.data_coco = self.data2coco()
        # 保存json文件
        json.dump(self.data_coco, open(self.save_json_path, 'w'), indent=4, cls=MyEncoder)  # indent=4 更加美观显示


if __name__ == "__main__":

    mkdir("coco/annotations")

    train_labelme_json = glob.glob(r'coco/train/*.json')
    labelme2coco(train_labelme_json, 'coco/annotations/instances_train.json')

    test_labelme_json = glob.glob(r'coco/test/*.json')
    labelme2coco(test_labelme_json, 'coco/annotations/instances_test.json')

4. 计算训练集三通道均值

4.1 问题背景

配置文件中有如下参数，这是 IMAGENET 数据集的均值和方差：

MODEL:
  PIXEL_MEAN: [123.675, 116.280, 103.530]
  PIXEL_STD: [58.395, 57.120, 57.375]

除以 255.0 就得到 mean=(0.485, 0.456, 0.406)，std=(0.229, 0.224, 0.225) 。

Detectron2 代码中的注释如下：

pixel_mean : per-channel mean to normalize input image
pixel_std : per-channel stddev to normalize input image

在训练代码时，最好替换为自定义数据集的均值和标准差。

4.2 代码实现

"""
计算训练集的三通道均值和标准差
适用于训练集中存在不同尺寸的图像
"""

from importlib.resources import path
import os
from PIL import Image
import matplotlib.pyplot as plt
import numpy as np
import imageio.v2 as imageio
from tqdm import trange

def get_mean_std(pathDir: list):

    # 计算三通道的均值
    R_channel = 0
    G_channel = 0
    B_channel = 0
    all_num = 0 # 像素点总数量
    print("计算三通道均值：")
    for idx in trange(len(pathDir)):
        filename = pathDir[idx]
        img = imageio.imread(os.path.join(filepath, filename))# / 255.0
        R_channel = R_channel + np.sum(img[:, :, 0])
        G_channel = G_channel + np.sum(img[:, :, 1])
        B_channel = B_channel + np.sum(img[:, :, 2])

        all_num = img.shape[0] * img.shape[1] + all_num

    R_mean = R_channel / all_num
    G_mean = G_channel / all_num
    B_mean = B_channel / all_num

    # 计算三通道的标准差
    R_channel = 0
    G_channel = 0
    B_channel = 0
    print("计算三通道标准差：")
    for idx in trange(len(pathDir)):
        filename = pathDir[idx]
        img = imageio.imread(os.path.join(filepath, filename))# / 255.0
        R_channel = R_channel + np.sum((img[:, :, 0] - R_mean) ** 2)
        G_channel = G_channel + np.sum((img[:, :, 1] - G_mean) ** 2)
        B_channel = B_channel + np.sum((img[:, :, 2] - B_mean) ** 2)

    R_std = np.sqrt(R_channel / all_num)
    G_std = np.sqrt(G_channel / all_num)
    B_std = np.sqrt(B_channel / all_num)
    
    return [R_mean, G_mean, B_mean], [R_std, G_std, B_std]

if __name__ == "__main__":
    filepath = 'coco/train'  # 数据集目录

    # 对目录下的 jpg 图像进行处理
    image_paths = []
    for filename in os.listdir(filepath):
        if os.path.splitext(filename)[1] == ".jpg":
            image_paths.append(filename)

    # 计算均值和标准差        
    mean, std = get_mean_std(image_paths)

    # 打印结果（保留三位小数）
    print("PIXEL_MEAN: ", [round(i,3) for i in mean])
    print("PIXEL_STD: ", [round(i,3) for i in std])

5. bbox和seg标签可视化

5.1 问题背景

从已经制作的 COCO 的标签中可视化 bbox 和分割标注，对比 labelme 中每个图像标签的标注结果，检查图像和标签是否对应的上，否则在训练时容易出现 loss 为 nan 值的情况。

5.2 代码实现

'''
Auther: zth
Date: 2022-08-16 00:22:45
LastEditTime: 2022-08-16 00:38:41
Description: 
'''
import cv2
import random
import json, os
from pycocotools.coco import COCO
from skimage import io
from matplotlib import pyplot as plt

train_json = 'coco/annotations/instances_train.json'
train_path = 'coco/train/'


def visualization_bbox_seg(num_image, json_path, img_path,
                           *str):  # 需要画图的是第num副图片， 对应的json路径和图片路径

    coco = COCO(json_path)

    if len(str) == 0:
        catIds = []
    else:
        catIds = coco.getCatIds(
            catNms=[str[0]])  # 获取给定类别对应的id 的dict（单个内嵌字典的类别[{}]）
        catIds = coco.loadCats(catIds)[0]['id']  # 获取给定类别对应的id 的dict中的具体id

    list_imgIds = coco.getImgIds(catIds=catIds)  # 获取含有该给定类别的所有图片的id
    img = coco.loadImgs(
        list_imgIds[num_image - 1])[0]  # 获取满足上述要求，并给定显示第num幅image对应的dict
    image = io.imread(img_path + img['file_name'])  # 读取图像
    image_name = img['file_name']  # 读取图像名字
    image_id = img['id']  # 读取图像id

    img_annIds = coco.getAnnIds(
        imgIds=img['id'], catIds=catIds, iscrowd=None)  # 读取这张图片的所有seg_id
    img_anns = coco.loadAnns(img_annIds)

    for i in range(len(img_annIds)):
        x, y, w, h = img_anns[i - 1]['bbox']  # 读取边框
        image = cv2.rectangle(image, (int(x), int(y)),
                              (int(x + w), int(y + h)), (0, 255, 255), 2)

    plt.rcParams['figure.figsize'] = (20.0, 20.0)
    plt.imshow(image)
    coco.showAnns(img_anns)
    plt.show()


if __name__ == "__main__":
    visualization_bbox_seg(30, train_json, train_path,
                           '1')  # 最后一个参数不写就是画出一张图中的所有类别

本文内容由网友自发贡献，版权归原作者所有，本站不承担相应法律责任。如您发现有涉嫌抄袭侵权的内容，请联系:hwhale#tublm.com(使用前将#替换为@)

COCO

格式数据集制作与数据集参数计算

【数据集｜COCO】COCO格式数据集制作与数据集参数计算的相关文章

COCO Dataset person_keypoints.json 解析

DataSet COCO json person keypointsperson keypoins json 结构one imageperson keypoint jsonkeypointDisplay above image ID 61
MS COCO数据集人体关键点评估（Keypoint Evaluation）（来自官网）

COCO系列文章 xff1a MS COCO数据集目标检测评估 xff08 Detection Evaluation xff09 xff08 来自官网 xff09 MS COCO数据集人体关键点评估 xff08 Keypoint Evalu
Dataset之COCO数据集：COCO数据集的简介、下载、使用方法之详细攻略

COCO数据集的简介 MS COCO的全称是Microsoft Common Objects in Context xff0c 起源于微软于2014年出资标注的Microsoft COCO数据集 xff0c 与ImageNet竞赛一样 xf
COCO数据集解析

1 简介官方网站 xff1a http cocodataset org 全称 xff1a Microsoft Common Objects in Context xff08 MS COCO xff09 支持任务 xff1a Detecti
detectron2训练自己的数据集和转coco格式

参考关于coco的格式 https detectron2 readthedocs io en latest tutorials datasets html register a dataset 注册并训练自己的数据集合https blog
COCO数据集介绍

转载自 xff1a https zhuanlan zhihu com p 29393415 COCO的全称是Common Objects in COntext xff0c 是微软团队提供的一个可以用来进行图像识别的数据集 MS COCO数
COCO格式数据集可视化为框

使用pycocotools读取和opencv绘制 xff0c 实现COCO格式数据边框显示的可视化 xff0c 可视化前后的示例为 xff1a 代码 xff1a coding utf 8 import os import sys getop
将visdrone数据集转化为coco格式并在mmdetection上训练,附上转好的json文件

visdrone是一个无人机的目标检测数据集 xff0c 在很多目标检测的论文中都能看到它的身影标签从0到11分别为 ignored regions pedestrian people bicycle car van truck tric
win10安装pycocotools遇到的问题(gcc.exe failed with exit status 1)

背景安装pycocotools一直过不去一直报错 PS C Users peter gt pip install git https github com philferriere cocoapi git subdirectory Pyt
在 Windows 下安装 COCO API（pycocotools）

本内容将介绍在 Windows 下安装 COCO API pycocotools 本来 COCO 对 Windows 是不支持的不过为了支持 Windows 有人对 COCO 做了一些修改下面是 COCO 在 GitHub 上源码地址信
Paperreading之三Simple Baselines for Human Pose Estimation

本次paper是coco2018关键点检测项目的亚军方案方法非常的简洁明了但是效果很惊艳达到了state of the art paper的标题也是写了simple baseline 整篇paper包含一个sota的姿态估计和姿态跟踪
coco数据集的评价指标

Average Precision AP IoU 0 50 0 95 area all maxDets 100 0 000 Average Precision AP IoU 0 50 area all maxDets 100 0 000 A
手把手实战教学！语义分割从0到1：一、数据集制作

本篇博客是手把手实战教学语义分割从0到1 系列的第一篇实战教学将重点介绍语义分割相关数据集以及如何制作自己的数据集本系列总的介绍以及其他章节的汇总见 https blog csdn net oYeZhou article d
COCO数据集转VOC（提取自己需要的类）

github https github com zcc720 COCO2VOC git 接上篇VOC数据集提取自己需要的类这次我们依然从coco数据集中提取我们想要的类并转为voc格式用于目标检测一去官网下载数据集 train20
深度学习目标检测工具箱mmdetection，训练自己的数据

文章目录一简介二安装教程 1 使用conda创建Python虚拟环境可选 2 安装PyTorch 1 1 3 安装Cython 4 安装mmcv 5 安装mmdetection 6 测试Demo 7 准备自己的数据 8 训练一
coco数据集

1 win10安装cocoapi pip install git https github com philferriere cocoapi git subdirectory PythonAPI win10安装cocoapi 君莫笑 CSD
MS COCO数据集输出数据的结果格式（result format）和如何参加比赛（participate）（来自官网）

COCO系列文章 MS COCO数据集目标检测评估 Detection Evaluation 来自官网 MS COCO数据集人体关键点评估 Keypoint Evaluation 来自官网 MS COCO数据集输出数据的结果格式 resul
COCO数据处理(二)根据自己提取的类的json文件生成对应的mask二值图并画在原图上

文章目录 COCO数据集根据json文件生成mask二值图文件目录目录说明代码一生成mask图代码二将mask图画在原图上效果图 COCO数据集根据json文件生成mask二值图文件目录目录说明 data coco a
COCO数据集的使用笔记

一简介官方网站 http cocodataset org 全称 Microsoft Common Objects in Context MS COCO 支持任务 Detection Keypoints Stuff Panoptic Ca
COCO数据集格式（详解）及COCO标注可视化。json转COCO等代码

coco数据集JSON文件格式分为一下几个字段 info info dict licenses license list 内部是dict images image list 内部是dict annotations annotation li

随机推荐

【已解决】kex_exchange_identification: Connection closed by remote host fatal: Could not read from

文章目录报错及效果图报错代码成功效果图解决方案必要的解决方法可能有用的解决方法报错及效果图报错代码 kex exchange identification Connection closed by remote span class
【已解决】VMware Player 无法与 VMware Workstation 一起安装。请先卸载 VMware Workstation，再尝试安装VMware Player

文章目录报错本解决方案适用情境解决方案必要的解决方法可能有用的解决方法报错 VMware Player 无法与 VMware Workstation 一起安装请先卸载 VMware Workstation xff0c 再尝试安装VM
基于蓝牙智能家庭影音控制系统---粤嵌GEC6818嵌入式系统实训

版本介绍普通版完整版至尊版版本介绍分为普通版完整版至尊版三个版本普通版可以满足实训要求 xff0c 提供代码 xff0c 不提供技术指导实现功能 xff1a 1所有界面自行设计 xff0c 要求尽可能好看 2 执行程序 xff
【已解决】Flask当中render_template函数使用过程当中css文件无法正常渲染

文章目录报错可能原因解决方案必要的解决方法可能有用的解决方法报错 Flask当中render template函数使用过程当中css文件无法正常渲染 xff0c 直接显示的html 可能原因当在Flask应用程序中使用render
【已解决】License checkout failed. License Manager Error -8 Make sure the HostlD of the license

文章目录报错图解决方案报错图安装matlab2020b xff0c 双击matlab exe报错解决方案下载对应的破解包 xff0c 一般安装教程里面都有 1 将破解文件中 34 Crack R2020a bin win64 ma
【已解决】AttributeError: module ‘nmap‘ has no attribute ‘PortScanner‘

文章目录报错解决方案必要的解决方法下载安装nmap代码中添加exe路径可能有用的解决方法报错 AttributeError module nmap has no attribute PortScanner 解决方案必要的解决方法抛
Trajectory Forecasting：TrajNet++

概述由于自动驾驶和服务机器人等人工智能新兴应用的需求不断增长 xff0c 拥挤场景中的轨迹预测已成为近年来的一个重要话题轨迹预测的一项重要挑战是有效地建模社交互动在过去的几年中 xff0c 已经提出了几种新颖的方法然而 xff0c
【已解决】AttributeError: ‘Index‘ object has no attribute ‘to_list‘

文章目录报错及效果图报错代码效果图解决方案必要的解决方法报错及效果图报错代码 AttributeError span class token punctuation span span class token string 39 I
【代码】读取图像，计算面宽比，并保存至表格

计算面宽比读取某一文件夹下的图片并计算面宽比 xff0c 并保存至表格安装dlib报错怎么办计算面宽比此处计算 xff08 第一个点和第17个点之间的距离 xff09 xff08 第28个点和第52个点之间的距离 xff09 span
探究肺癌患者的CT图像的图像特征并构建一个诊断模型

目标效果图操作说明代码目标探究肺癌患者的CT图像的图像特征并构建一个诊断模型效果图操作说明代码中我以建立10张图为例 xff0c 多少你自己定准备工作 xff1a 1 准备肺癌或非肺癌每个各10张图 xff0c 在本地创建一个名
【已解决】Pygame无法显示中文

文章目录报错截图及效果图报错图效果图解决方案其他问题报错截图及效果图报错图效果图解决方案添加这行代码即可 font span class token operator 61 span pygame span class tok
【已解决】Resource wordnet not found. Please use the NLTK Downloader to obtain the resource

文章目录报错代码解决方案必要的解决方法可能有用的解决方法非常重要报错代码 Resource wordnet not found Please use the NLTK Downloader to obtain the resource
Launch启动文件的使用方法

Launch启动文件的使用方法案例一 xff1a 运行两个节点案例二 xff1a 加载参数与命名空间案例三 xff1a 小海龟跟随的launch启动方法案例四 xff1a remap修改节点名 Launch文件可以通过XML文件实现多节点
什么是死锁？死锁如何解决？

1 死锁是什么 xff1f 死锁是指两个或多个事务在同一资源上相互占用 xff0c 并请求锁定对方的资源 xff0c 从而导致恶性循环的现象当多个进程因竞争资源而造成的一种僵局 xff08 互相等待 xff09 xff0c 若无外力作用
Ubuntu20.04+ros+PX4学习第三天

激光slam学习 xff1a 激光slam所用到的传感器 xff1a 惯性测量单元 xff08 IMU xff09 43 轮式里程计 43 激光雷达轮式里程计算角度误差会很大 xff0c 一般用IMU计算角度 xff0c 轮式里程计用来算
C语言实现将彩色bmp图像转化为灰图、灰度图像反色

彩色图像转灰度图像彩色 xff08 24位 xff09 bmp图像结构 xff1a span class token keyword typedef span span class token keyword struct span sp
【实例分割｜Mask2Former】解决模型推理预测的代码中存在的一些问题

文章目录取消终端输出网络结构推理置信度设置预测实例存在多个轮廓预测模型返回筛选后实例取消终端输出网络结构在运行 demo py 时 xff0c 终端会输出大量网络结构信息 xff0c 影响调试代码需要在 Detectron2 中的
梯度消失与梯度爆炸

简介梯度消失问题和梯度爆炸问题 xff0c 总的来说可以称为梯度不稳定问题 ReLU激活函数 xff0c 用Batch Normal xff0c 用残差结构解决梯度消失问题正则化来限制梯度爆炸梯度消失梯度消失的原始是反向传播时的链式法
【Pytorch｜Bug】解决 RuntimeError: Error(s) in loading state_dict for Network: size mismatch

文章目录问题背景解决方法问题背景 Github开源项目 xff1a https github com zhang tao whu e2ec python train net py coco finetune bs span class
【数据集｜COCO】COCO格式数据集制作与数据集参数计算

文章目录 1 批量修改 JSON 文件中的参数1 1 问题背景1 2 代码实现 2 划分训练集和测试集2 1 问题背景2 2 环境配置2 3 代码实现 3 生成 JSON 标签文件3 1 环境配置3 2 代码实现 4 计算训练集三通道均值4