Converting VOC2007 annotation labels to the COCO2017 format

2023-05-16

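The code below converts VOC2007 annotation labels to the COCO2017 annotation format.
It can work directly on all of the VOC2007 .xml files, or the samples can first be listed in a .txt file and processed from that list. Here I work directly on the .xml files.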
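Before the full script, here is a minimal sketch of the core coordinate change it performs: VOC stores each box as corners (xmin, ymin, xmax, ymax), while a COCO bbox is [x, y, width, height]. The helper names `voc_box_to_coco` and `boxes_from_voc_xml` and the sample XML are illustrative assumptions, not part of the original script.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample annotation, trimmed to the tags the conversion needs.
SAMPLE_XML = """
<annotation>
    <filename>000001.jpg</filename>
    <size><width>353</width><height>500</height><depth>3</depth></size>
    <object>
        <name>dog</name>
        <bndbox><xmin>48</xmin><ymin>240</ymin><xmax>195</xmax><ymax>371</ymax></bndbox>
    </object>
</annotation>
"""


def voc_box_to_coco(xmin, ymin, xmax, ymax):
    """Convert VOC corner coordinates to a COCO-style [x, y, w, h] list."""
    return [xmin, ymin, xmax - xmin, ymax - ymin]


def boxes_from_voc_xml(xml_text):
    """Return (class_name, coco_bbox) pairs for every <object> in one VOC file."""
    root = ET.fromstring(xml_text.strip())
    results = []
    for obj in root.findall('object'):
        name = obj.findtext('name')
        box = obj.find('bndbox')
        corners = [int(box.findtext(tag)) for tag in ('xmin', 'ymin', 'xmax', 'ymax')]
        results.append((name, voc_box_to_coco(*corners)))
    return results


print(boxes_from_voc_xml(SAMPLE_XML))  # [('dog', [48, 240, 147, 131])]
```

The full script follows the same arithmetic, and additionally assigns image, category and annotation ids.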

# -*- coding: utf-8 -*-

import xml.etree.ElementTree as ET
import os
import json

coco = dict()
coco['images'] = []
coco['type'] = 'instances'
coco['annotations'] = []
coco['categories'] = []

category_set = dict()
image_set = set()

category_item_id = -1
image_id = 20180000000
annotation_id = 0


def addCatItem(name):
    global category_item_id
    category_item = dict()
    category_item['supercategory'] = 'none'
    category_item_id += 1
    category_item['id'] = category_item_id
    category_item['name'] = name
    coco['categories'].append(category_item)
    category_set[name] = category_item_id
    return category_item_id


def addImgItem(file_name, size):
    global image_id
    if file_name is None:
        raise Exception('Could not find filename tag in xml file.')
    if size['width'] is None:
        raise Exception('Could not find width tag in xml file.')
    if size['height'] is None:
        raise Exception('Could not find height tag in xml file.')
    image_id += 1
    image_item = dict()
    image_item['id'] = image_id
    image_item['file_name'] = file_name
    image_item['width'] = size['width']
    image_item['height'] = size['height']
    coco['images'].append(image_item)
    image_set.add(file_name)
    return image_id


def addAnnoItem(object_name, image_id, category_id, bbox):
    global annotation_id
    annotation_item = dict()
    annotation_item['segmentation'] = []
    seg = []
    # bbox[] is x,y,w,h
    # left_top
    seg.append(bbox[0])
    seg.append(bbox[1])
    # left_bottom
    seg.append(bbox[0])
    seg.append(bbox[1] + bbox[3])
    # right_bottom
    seg.append(bbox[0] + bbox[2])
    seg.append(bbox[1] + bbox[3])
    # right_top
    seg.append(bbox[0] + bbox[2])
    seg.append(bbox[1])

    annotation_item['segmentation'].append(seg)

    annotation_item['area'] = bbox[2] * bbox[3]
    annotation_item['iscrowd'] = 0
    annotation_item['ignore'] = 0
    annotation_item['image_id'] = image_id
    annotation_item['bbox'] = bbox
    annotation_item['category_id'] = category_id
    annotation_id += 1
    annotation_item['id'] = annotation_id
    coco['annotations'].append(annotation_item)


def _read_image_ids(image_sets_file):
    ids = []
    with open(image_sets_file) as f:
        for line in f:
            ids.append(line.rstrip())
    return ids


"""Generate from a .txt split file"""


# split: 'train', 'val', 'trainval' or 'test'
def parseXmlFiles_by_txt(data_dir, json_save_path, split='train'):
    labelfile = split + ".txt"
    image_sets_file = data_dir + "/ImageSets/Main/" + labelfile
    ids = _read_image_ids(image_sets_file)

    for _id in ids:
        xml_file = data_dir + f"/Annotations/{_id}.xml"

        bndbox = dict()
        size = dict()
        current_image_id = None
        current_category_id = None
        file_name = None
        size['width'] = None
        size['height'] = None
        size['depth'] = None

        tree = ET.parse(xml_file)
        root = tree.getroot()
        if root.tag != 'annotation':
            raise Exception('pascal voc xml root element should be annotation, rather than {}'.format(root.tag))

        # elem is <folder>, <filename>, <size>, <object>
        for elem in root:
            current_parent = elem.tag
            current_sub = None
            object_name = None

            if elem.tag == 'folder':
                continue

            if elem.tag == 'filename':
                file_name = elem.text
                if file_name in image_set:
                    raise Exception('file_name duplicated')

            # add img item only after parse <size> tag
            elif current_image_id is None and file_name is not None and size['width'] is not None:
                if file_name not in image_set:
                    current_image_id = addImgItem(file_name, size)
                    print('add image with {} and {}'.format(file_name, size))
                else:
                    raise Exception('duplicated image: {}'.format(file_name))
            # subelem is <width>, <height>, <depth>, <name>, <bndbox>
            for subelem in elem:
                bndbox['xmin'] = None
                bndbox['xmax'] = None
                bndbox['ymin'] = None
                bndbox['ymax'] = None

                current_sub = subelem.tag
                if current_parent == 'object' and subelem.tag == 'name':
                    object_name = subelem.text
                    if object_name not in category_set:
                        current_category_id = addCatItem(object_name)
                    else:
                        current_category_id = category_set[object_name]

                elif current_parent == 'size':
                    if size[subelem.tag] is not None:
                        raise Exception('xml structure broken at size tag.')
                    size[subelem.tag] = int(subelem.text)

                # option is <xmin>, <ymin>, <xmax>, <ymax>, when subelem is <bndbox>
                for option in subelem:
                    if current_sub == 'bndbox':
                        if bndbox[option.tag] is not None:
                            raise Exception('xml structure corrupted at bndbox tag.')
                        bndbox[option.tag] = int(option.text)

                # only after parse the <object> tag
                if bndbox['xmin'] is not None:
                    if object_name is None:
                        raise Exception('xml structure broken at bndbox tag')
                    if current_image_id is None:
                        raise Exception('xml structure broken at bndbox tag')
                    if current_category_id is None:
                        raise Exception('xml structure broken at bndbox tag')
                    bbox = []
                    # x
                    bbox.append(bndbox['xmin'])
                    # y
                    bbox.append(bndbox['ymin'])
                    # w
                    bbox.append(bndbox['xmax'] - bndbox['xmin'])
                    # h
                    bbox.append(bndbox['ymax'] - bndbox['ymin'])
                    print('add annotation with {},{},{},{}'.format(object_name, current_image_id, current_category_id,
                                                                   bbox))
                    addAnnoItem(object_name, current_image_id, current_category_id, bbox)
    # Use a context manager so the output file is flushed and closed.
    with open(json_save_path, 'w') as fp:
        json.dump(coco, fp)


"""Generate directly from a folder of .xml files"""


def parseXmlFiles(xml_path, json_save_path):
    for f in os.listdir(xml_path):
        if not f.endswith('.xml'):
            continue

        bndbox = dict()
        size = dict()
        current_image_id = None
        current_category_id = None
        file_name = None
        size['width'] = None
        size['height'] = None
        size['depth'] = None

        xml_file = os.path.join(xml_path, f)
        print(xml_file)

        tree = ET.parse(xml_file)
        root = tree.getroot()
        if root.tag != 'annotation':
            raise Exception('pascal voc xml root element should be annotation, rather than {}'.format(root.tag))

        # elem is <folder>, <filename>, <size>, <object>
        for elem in root:
            current_parent = elem.tag
            current_sub = None
            object_name = None

            if elem.tag == 'folder':
                continue

            if elem.tag == 'filename':
                file_name = elem.text
                if file_name in image_set:
                    raise Exception('file_name duplicated')

            # add img item only after parse <size> tag
            elif current_image_id is None and file_name is not None and size['width'] is not None:
                if file_name not in image_set:
                    current_image_id = addImgItem(file_name, size)
                    print('add image with {} and {}'.format(file_name, size))
                else:
                    raise Exception('duplicated image: {}'.format(file_name))
            # subelem is <width>, <height>, <depth>, <name>, <bndbox>
            for subelem in elem:
                bndbox['xmin'] = None
                bndbox['xmax'] = None
                bndbox['ymin'] = None
                bndbox['ymax'] = None

                current_sub = subelem.tag
                if current_parent == 'object' and subelem.tag == 'name':
                    object_name = subelem.text
                    if object_name not in category_set:
                        current_category_id = addCatItem(object_name)
                    else:
                        current_category_id = category_set[object_name]

                elif current_parent == 'size':
                    if size[subelem.tag] is not None:
                        raise Exception('xml structure broken at size tag.')
                    size[subelem.tag] = int(subelem.text)

                # option is <xmin>, <ymin>, <xmax>, <ymax>, when subelem is <bndbox>
                for option in subelem:
                    if current_sub == 'bndbox':
                        if bndbox[option.tag] is not None:
                            raise Exception('xml structure corrupted at bndbox tag.')
                        bndbox[option.tag] = int(option.text)

                # only after parse the <object> tag
                if bndbox['xmin'] is not None:
                    if object_name is None:
                        raise Exception('xml structure broken at bndbox tag')
                    if current_image_id is None:
                        raise Exception('xml structure broken at bndbox tag')
                    if current_category_id is None:
                        raise Exception('xml structure broken at bndbox tag')
                    bbox = []
                    # x
                    bbox.append(bndbox['xmin'])
                    # y
                    bbox.append(bndbox['ymin'])
                    # w
                    bbox.append(bndbox['xmax'] - bndbox['xmin'])
                    # h
                    bbox.append(bndbox['ymax'] - bndbox['ymin'])
                    print('add annotation with {},{},{},{}'.format(object_name, current_image_id, current_category_id,
                                                                   bbox))
                    addAnnoItem(object_name, current_image_id, current_category_id, bbox)
    # Use a context manager so the output file is flushed and closed.
    with open(json_save_path, 'w') as fp:
        json.dump(coco, fp)


if __name__ == '__main__':
    # # Generate from a .txt split file
    # voc_data_dir="D:/github/FireAndSmoke/VOC2007"
    # json_save_path="D:/github/FireAndSmoke/VOCdevkit/voc2007trainval.json"
    # parseXmlFiles_by_txt(voc_data_dir,json_save_path,"trainval")

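    # Generate from a folder of .xml files
    ## Path to the folder of .xml files
    ann_path = "/home/wsy/data/06_dataset_transform/01_trafficlight/voc_data1/xml/train"   # .xml file directory
    ## Name of the generated COCO2017 .json label file. When converting, remember to switch the folder and file name between the training and evaluation sets.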
    json_save_path = "/home/wsy/data/06_dataset_transform/01_trafficlight/voc_data1/transform_coco2017/Annotations/bdd100k_labels_images_det_coco_train.json"
    parseXmlFiles(ann_path, json_save_path)
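After a conversion it can be worth checking that the generated file is internally consistent. The following is a rough sketch only, assuming the `images` / `annotations` / `categories` layout that the script above writes. `check_coco` is a hypothetical helper and the sample dict is made-up data.

```python
def check_coco(coco):
    """Return a list of problems found in a COCO-style dict (empty if none)."""
    problems = []
    image_ids = {img['id'] for img in coco['images']}
    category_ids = {cat['id'] for cat in coco['categories']}
    for ann in coco['annotations']:
        # Every annotation must point at an image and a category that exist.
        if ann['image_id'] not in image_ids:
            problems.append('annotation {} has unknown image_id {}'.format(
                ann['id'], ann['image_id']))
        if ann['category_id'] not in category_ids:
            problems.append('annotation {} has unknown category_id {}'.format(
                ann['id'], ann['category_id']))
        # bbox is [x, y, w, h]; width and height should be positive.
        _, _, w, h = ann['bbox']
        if w <= 0 or h <= 0:
            problems.append('annotation {} has a non-positive box size'.format(ann['id']))
    return problems


# Made-up example in the same shape as the script's output.
sample = {
    'images': [{'id': 1, 'file_name': 'a.jpg', 'width': 100, 'height': 80}],
    'categories': [{'id': 0, 'name': 'red', 'supercategory': 'none'}],
    'annotations': [
        {'id': 1, 'image_id': 1, 'category_id': 0, 'bbox': [10, 20, 30, 40]},
    ],
}
print(check_coco(sample))  # []
```

To check a real output file, load it with `json.load` and pass the resulting dict to the function.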

Reference: https://www.cnblogs.com/cyssmile/p/15371392.html
