Baidu EasyDL Data Annotation

2023-05-16

I. Baidu EasyDL data annotation script

1 The official annotation tool, linked below, is adapted from labelme

GitHub - Baidu-AIP/Easyyibiao

2 The official site accepts three data import formats:

The layout is shown in the figure.

2.1 Generic JSON format (.json)

{"labels": [{"y1": 579, "x2": 466, "x1": 328, "y2": 718, "name": "other","meta":{"points":[{"y": 718,"x": 400},{"y": 626,"x": 328},{"y": 579,"x": 393},{"y": 672,"x": 466}]}}]}
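In the sample above, x1/y1/x2/y2 match the axis-aligned bounding box of the four points under "meta". A minimal sketch of building such a record from a quadrilateral follows, assuming that relationship holds in general (it is inferred from the sample, not from the EasyDL documentation). The helper name quad_to_label is hypothetical.

```python
import json


def quad_to_label(points, name):
    """Build one entry of the "labels" list from four (x, y) corner points.

    Assumption: x1/y1/x2/y2 are the min/max of the corner coordinates,
    as they are in the sample record above.
    """
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    return {
        "x1": min(xs), "y1": min(ys),
        "x2": max(xs), "y2": max(ys),
        "name": name,
        "meta": {"points": [{"x": x, "y": y} for x, y in points]},
    }


# Corner points taken from the sample record.
quad = [(400, 718), (328, 626), (393, 579), (466, 672)]
annotation = {"labels": [quad_to_label(quad, "other")]}
print(json.dumps(annotation))
```

The printed JSON carries the same values as the sample record, only with a different key order.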

2.2 XML. This format is easy to extend: to add another annotation, copy an object node and edit it.

<?xml version="1.0" encoding="utf-8"?>
<annotation>
    <filename>00036.jpg</filename>
    <segmented>0</segmented>
    <owner>
        <name>Lmars, Wuhan University</name>
        <flickrid>I do not know</flickrid>
    </owner>
    <folder>RSDS2016</folder>
    <object>
        <name>other</name>
        <pose>Left</pose>
        <truncated>1</truncated>
        <difficult>0</difficult>
        <quad>
            <x1>400</x1>
            <y1>718</y1>
            <x2>328</x2>
            <y2>626</y2>
            <x3>393</x3>
            <y3>579</y3>
            <x4>466</x4>
            <y4>672</y4>
        </quad>
        <bbox>
            <x1>328</x1>
            <y1>579</y1>
            <x2>466</x2>
            <y2>718</y2>
        </bbox>
    </object>
</annotation>
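To check a file in this layout, the quadrilaterals can be read back with the standard library. The sketch below uses a shortened copy of the sample (only the fields it reads) and a hypothetical helper read_quads. It assumes every object node has a quad child with x1..x4 and y1..y4.

```python
import xml.etree.ElementTree as ET

# Shortened copy of the sample annotation, keeping only the fields read below.
SAMPLE = """<annotation>
    <filename>00036.jpg</filename>
    <object>
        <name>other</name>
        <quad>
            <x1>400</x1><y1>718</y1>
            <x2>328</x2><y2>626</y2>
            <x3>393</x3><y3>579</y3>
            <x4>466</x4><y4>672</y4>
        </quad>
    </object>
</annotation>"""


def read_quads(xml_text):
    """Return a list of (name, [(x, y), ...]) for every object node."""
    root = ET.fromstring(xml_text)
    result = []
    for obj in root.iter("object"):
        quad = obj.find("quad")
        points = [
            (int(quad.findtext(f"x{i}")), int(quad.findtext(f"y{i}")))
            for i in range(1, 5)
        ]
        result.append((obj.findtext("name"), points))
    return result


quads = read_quads(SAMPLE)
print(quads)
```

Because each annotation is its own object node, a file with several copied object nodes simply yields several entries in the returned list.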

2.3 COCO JSON

{"info": {"contributor": "nihao", "data_created": "2021", "version": "1.0", "year": 2021}, "licenses": "licenses", "image_nums": 1, "images": [{"file_name": "00036.jpg", "id": 1, "width": 1024, "height": 768}], "categories": [{"id": 1, "name": "other", "supercategory": "other"}], "annotations": [{"category_id": 1, "bbox":[328, 579, 138, 139],"area": 9430, "segmentation": [[400, 718, 328, 626, 393, 579, 466, 672]], "iscrowd": 0, "image_id": 1, "id": 1, "shape": "quad"}]}
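The bbox and area values in this sample can be derived from the segmentation polygon: bbox is [x_min, y_min, width, height], and area matches the polygon area given by the shoelace formula. A small sketch that reproduces both numbers follows. The function name quad_to_coco is hypothetical, and whether EasyDL validates these fields is not stated in the source.

```python
def quad_to_coco(segmentation):
    """Derive COCO-style bbox and area from a flat [x1, y1, x2, y2, ...] polygon."""
    xs = segmentation[0::2]
    ys = segmentation[1::2]
    x_min, y_min = min(xs), min(ys)
    # COCO bounding boxes are [x, y, width, height].
    bbox = [x_min, y_min, max(xs) - x_min, max(ys) - y_min]
    # Shoelace formula: half the absolute sum of cross products of neighbours.
    n = len(xs)
    twice_area = sum(
        xs[i] * ys[(i + 1) % n] - xs[(i + 1) % n] * ys[i] for i in range(n)
    )
    return bbox, abs(twice_area) / 2


# Polygon taken from the "segmentation" field of the sample.
bbox, area = quad_to_coco([400, 718, 328, 626, 393, 579, 466, 672])
print(bbox, area)
```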

3 Next, generate the XML files automatically with a script

txt2xml.py

import os
from lxml.etree import Element, SubElement, tostring

def txt_xml(img_name, txt_path, img_xml, xml_path):
    # Read the annotations from the txt file
    clas=[]
    imh, imw = 800, 800
    txt_img=os.path.join(txt_path,img_name)
    with open(txt_img,"r") as f:
        for line in f.readlines():
            line = line.strip()
            if not line:
                continue            # skip blank lines
            fields = line.split(" ")
            clas.append(fields)     # [label, x1, y1, x2, y2, x3, y3, x4, y4]

    node_root = Element('annotation')
    node_folder = SubElement(node_root, 'folder')
    node_folder.text = '1'
    # filename
    node_filename = SubElement(node_root, 'filename')
    node_filename.text = os.path.splitext(img_name)[0]+".jpg"
    # path
    node_path = SubElement(node_root, 'path')
    node_path.text = os.path.splitext(txt_img)[0] + '.jpg'
    # source
    node_source = SubElement(node_root, 'source')
    node_database = SubElement(node_source, 'database')
    node_database.text = 'Unknown'
    # size
    # node_size = SubElement(node_root, 'size')
    # node_width = SubElement(node_size, 'width')
    # node_width.text = str(imw)
    # node_height = SubElement(node_size, 'height')
    # node_height.text = str(imh)
    # node_depth = SubElement(node_size, 'depth')
    # node_depth.text = '3'
    # segmented
    node_segmented = SubElement(node_root, 'segmented')
    node_segmented.text = '0'
    # object
    for i in range(len(clas)):
        node_object = SubElement(node_root, 'object')
        node_name = SubElement(node_object, 'name')
        node_name.text = 'other'
        node_pose=SubElement(node_object, 'pose')
        node_pose.text="Left"
        node_truncated=SubElement(node_object, 'truncated')
        node_truncated.text="1"
        node_difficult = SubElement(node_object, 'difficult')
        node_difficult.text = '0'
        # quad: the four corner points
        node_bndbox = SubElement(node_object, 'quad')
        x1 = SubElement(node_bndbox, 'x1')
        x1.text = str(clas[i][1])
        y1 = SubElement(node_bndbox, 'y1')
        y1.text = str(clas[i][2])
        
        x2 = SubElement(node_bndbox, 'x2')
        x2.text = str(clas[i][3])
        y2 = SubElement(node_bndbox, 'y2')
        y2.text = str(clas[i][4])
        x3 = SubElement(node_bndbox, 'x3')
        x3.text = str(clas[i][5])
        y3 = SubElement(node_bndbox, 'y3')
        y3.text = str(clas[i][6])
        x4 = SubElement(node_bndbox, 'x4')
        x4.text = str(clas[i][7])
        y4 = SubElement(node_bndbox, 'y4')
        y4.text = str(clas[i][8])  
    xml = tostring(node_root, pretty_print=True)  # pretty-print with line breaks
    img_newxml = os.path.join(xml_path, img_xml)
    with open(img_newxml, 'wb') as file_object:
        file_object.write(xml)

if __name__ == "__main__":
    # Folder containing the txt annotation files
    txt_path=r"temp"
    # Folder where the converted xml files are stored
    xml_path=r"temp1"
    if not os.path.exists(xml_path):
        os.mkdir(xml_path)
    for img_name in os.listdir(txt_path):
        print(img_name)
        img_xml=os.path.splitext(img_name)[0]+".xml"
        txt_xml(img_name, txt_path, img_xml, xml_path)
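The script writes a quad node for each object, while the XML sample in 2.2 also carries a bbox node. The sketch below shows one way to derive it, assuming each txt line has the form "label x1 y1 x2 y2 x3 y3 x4 y4" (inferred from the indices the script reads) and that bbox is the axis-aligned box around the quad. The helpers parse_line and add_bbox are hypothetical, and the standard-library ElementTree is used here in place of lxml so the sketch runs on its own.

```python
import xml.etree.ElementTree as ET


def parse_line(line):
    """Split 'label x1 y1 x2 y2 x3 y3 x4 y4' into (label, [(x, y), ...]).

    Returns None for blank or malformed lines instead of raising.
    """
    fields = line.split()
    if len(fields) != 9:
        return None
    coords = [int(float(value)) for value in fields[1:]]
    return fields[0], list(zip(coords[0::2], coords[1::2]))


def add_bbox(node_object, points):
    """Append a <bbox> child holding the axis-aligned box around the quad."""
    xs = [x for x, _ in points]
    ys = [y for _, y in points]
    node_bbox = ET.SubElement(node_object, "bbox")
    box = (("x1", min(xs)), ("y1", min(ys)), ("x2", max(xs)), ("y2", max(ys)))
    for tag, value in box:
        ET.SubElement(node_bbox, tag).text = str(value)
    return node_bbox


# Example line using the coordinates from the samples above.
label, points = parse_line("0 400 718 328 626 393 579 466 672")
node_object = ET.Element("object")
add_bbox(node_object, points)
bbox_xml = ET.tostring(node_object, encoding="unicode")
print(bbox_xml)
```

Whether the import actually requires the bbox node is not confirmed by the source, so treat this as an optional addition.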

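Final result (figure)

Reference: YOLO image-detection dataset format conversion: converting between xml and txt formats (uncle_ll's blog, CSDN)

II. Generating text images with the PIL module

Reference links: https://github.com/mpcabd/python-arabic-reshaper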

https://github.com/MichalBusta/E2E-MLT

Result:

pip install --upgrade arabic-reshaper

conda install -c mpcabd arabic-reshaper

pip install --upgrade arabic-reshaper python-bidi pillow

Code

# Generate an Arabic text image
import arabic_reshaper

text_to_be_reshaped = '2023 06 14/2022 06 16'
text_to_be_reshaped1='JXG'
text_to_be_reshaped2='14 06 2023/16 06 2022 X21'

reshaped_text = arabic_reshaper.reshape(text_to_be_reshaped)

'''
At this stage the text is reshaped, all letters are in their correct form
based on their surroundings, but if you are going to print the text in a
left-to-right context, which usually happens in libraries/apps that do not
support Arabic and/or right-to-left text rendering, then you need to use
get_display from python-bidi.
Note that this is optional and depends on your usage of the reshaped text.
'''
from bidi.algorithm import get_display
bidi_text = get_display(reshaped_text)

# At this stage the text in bidi_text can be easily rendered in any library
# that doesn't support Arabic and/or right-to-left, so use it as you'd use
# any other string. For example if you're using PIL.ImageDraw.text to draw
# text over an image you'd just use it like this...

from PIL import Image, ImageDraw, ImageFont

# The upstream example loads Arial, a well known font that supports Arabic
# Unicode; here the fonts shipped with PaddleOCR StyleText are used instead.
# font = ImageFont.truetype('Arial', 40)
font = ImageFont.truetype('/PaddleOCR/StyleText/fonts/arabic.ttf', 50)
font1= ImageFont.truetype('PaddleOCR/StyleText/fonts/en_standard.ttf',40)
image = Image.new('RGBA', (800, 600), (255,255,255,0))
image_draw = ImageDraw.Draw(image)
image_draw.text((350,10), text_to_be_reshaped1, fill=(255,255,255,200), font=font1)
image_draw.text((10,10), bidi_text, fill=(255,255,255,200), font=font)
image_draw.text((10,70), text_to_be_reshaped2, fill=(255,255,255,200), font=font1)


# image.show()
image.save("temp.png")
