How to detect the horizontal and vertical lines of a table and eliminate the noise?

2023-12-21

I am trying to get the horizontal and vertical lines of a table in an image in order to extract the text in its cells. Here's the picture I'm using:

I use the code below to extract the vertical and horizontal lines:

import cv2

# img_for_box_extraction_path holds the path to the input image
img = cv2.imread(img_for_box_extraction_path, 0)  # Read the image as grayscale
# Note: with THRESH_OTSU the threshold is chosen automatically, so the 200 passed here is ignored
(thresh, img_bin) = cv2.threshold(img, 200, 255,
                                  cv2.THRESH_BINARY | cv2.THRESH_OTSU)  # Threshold the image
img_bin = 255 - img_bin  # Invert the image
cv2.imwrite("Image_bin_2.jpg", img_bin)

# Kernel length, derived from the image width
kernel_length = img.shape[1] // 140

# A vertical kernel (1 wide, kernel_length tall), which detects the vertical lines in the image.
verticle_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (1, kernel_length))

# A horizontal kernel (kernel_length wide, 1 tall), which detects the horizontal lines in the image.
hori_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (kernel_length, 1))

# A 3 x 3 kernel of ones.
kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (3, 3))

# Morphological operation to detect vertical lines in the image
img_temp1 = cv2.erode(img_bin, verticle_kernel, iterations=3)
verticle_lines_img = cv2.dilate(img_temp1, verticle_kernel, iterations=3)
cv2.imwrite("verticle_lines_2.jpg", verticle_lines_img)

# Morphological operation to detect horizontal lines in the image
img_temp2 = cv2.erode(img_bin, hori_kernel, iterations=3)
horizontal_lines_img = cv2.dilate(img_temp2, hori_kernel, iterations=3)
cv2.imwrite("horizontal_lines_2.jpg", horizontal_lines_img)

The pictures below are the horizontal lines and vertical lines:

I use the code below to add the two images together:

# Weighting parameters: these decide how much each image contributes to the combined image.
alpha = 0.5
beta = 1.0 - alpha

# Blend the two images with the given weights to get a third image.
img_final_bin = cv2.addWeighted(verticle_lines_img, alpha, horizontal_lines_img, beta, 0.0)
img_final_bin = cv2.erode(~img_final_bin, kernel, iterations=2)
(thresh, img_final_bin) = cv2.threshold(img_final_bin, 128, 255, cv2.THRESH_BINARY | cv2.THRESH_OTSU)

# For debugging:
# write out the combined vertical and horizontal lines that are used to find boxes
cv2.imwrite("img_final_bin_2.jpg", img_final_bin)

However, I get a picture like this: How do I remove the noise and get a better result? Thanks in advance.

Here's a simple approach:

Binary image

Detected horizontal lines

Detected vertical lines

Combined mask

Lines to be removed, in green

Result

import cv2
import numpy as np

# Load image, grayscale, Gaussian blur, Otsu's threshold
image = cv2.imread('1.jpg')
gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
blur = cv2.GaussianBlur(gray, (3,3), 0)
thresh = cv2.threshold(blur, 0, 255, cv2.THRESH_BINARY_INV + cv2.THRESH_OTSU)[1]

# Detect horizontal lines
horizontal_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (50,1))
horizontal_mask = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, horizontal_kernel, iterations=1)

# Detect vertical lines
vertical_kernel = cv2.getStructuringElement(cv2.MORPH_RECT, (1,50))
vertical_mask = cv2.morphologyEx(thresh, cv2.MORPH_OPEN, vertical_kernel, iterations=1)

# Combine masks and remove lines
table_mask = cv2.bitwise_or(horizontal_mask, vertical_mask)
image[np.where(table_mask==255)] = [255,255,255]

cv2.imshow('thresh', thresh)
cv2.imshow('horizontal_mask', horizontal_mask)
cv2.imshow('vertical_mask', vertical_mask)
cv2.imshow('table_mask', table_mask)
cv2.imshow('image', image)
cv2.waitKey()


