Computer Vision Papers - 2021-07-19

2023-11-18

This column collects computer vision papers. Date: July 19, 2021. Source: Paper Digest.

1, TITLE: Optical Inspection of The Silicon Micro-strip Sensors for The CBM Experiment Employing Artificial Intelligence
AUTHORS: E. Lavrik ; M. Shiroya ; H. R. Schmidt ; A. Toia ; J. M. Heuser
CATEGORY: physics.ins-det [physics.ins-det, cs.CV, hep-ex]
HIGHLIGHT: In this manuscript, we present the analysis of various sensor surface defects.

2, TITLE: Progressive Deep Video Dehazing Without Explicit Alignment Estimation
AUTHORS: Runde Li
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we propose a progressive alignment and restoration method for video dehazing.

3, TITLE: Contrastive Predictive Coding for Anomaly Detection
AUTHORS: Puck de Haan ; Sindy Löwe
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: In this paper, we close this gap by making use of the Contrastive Predictive Coding model (arXiv:1807.03748).

4, TITLE: A Theoretical Analysis of Granulometry-based Roughness Measures on Cartosat DEMs
AUTHORS: Nagajothi Kannan ; Sravan Danda ; Aditya Challa ; Daya Sagar B S
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this article, we revisit the roughness measure on DEM data adapted from multiscale granulometries in mathematical morphology, namely multiscale directional granulometric index (MDGI).

5, TITLE: Efficient Automated U-Net Based Tree Crown Delineation Using UAV Multi-spectral Imagery on Embedded Devices
AUTHORS: Kostas Blekos ; Stavros Nousias ; Aris S Lalos
CATEGORY: cs.CV [cs.CV, eess.IV]
HIGHLIGHT: In this work, we propose a U-Net based tree delineation method, which is effectively trained using multi-spectral imagery but can then delineate single-spectrum images.

6, TITLE: Conditional Directed Graph Convolution for 3D Human Pose Estimation
AUTHORS: Wenbo Hu ; Changgong Zhang ; Fangneng Zhan ; Lei Zhang ; Tien-Tsin Wong
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we propose to represent the human skeleton as a directed graph with the joints as nodes and bones as edges that are directed from parent joints to child joints.

7, TITLE: Real-Time Violence Detection Using CNN-LSTM
AUTHORS: Mann Patel
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: To address the butterfly effects impact in our setting, I made a unique model and a theorized system to handle the issue utilizing deep learning.

8, TITLE: Real-Time Face Recognition System for Remote Employee Tracking
AUTHORS: Mohammad Sabik Irbaz ; MD Abdullah Al Nasim ; Refat E Ferdous
CATEGORY: cs.CV [cs.CV, cs.LG, eess.IV]
HIGHLIGHT: In this paper, we discuss in brief the system we have been experimenting with and the pros and cons of the system.

9, TITLE: Multi-Level Contrastive Learning for Few-Shot Problems
AUTHORS: Qing Chen ; Jian Zhang
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Most current applications of contrastive learning benefit only a single representation from the last layer of an encoder. In this paper, we propose a multi-level contrastive learning approach which applies contrastive losses at different layers of an encoder to learn multiple representations from the encoder.

10, TITLE: Controlled AutoEncoders to Generate Faces from Voices
AUTHORS: Hao Liang ; Lulan Yu ; Guikang Xu ; Bhiksha Raj ; Rita Singh
CATEGORY: cs.CV [cs.CV, cs.LG, cs.SD, eess.AS, eess.IV]
HIGHLIGHT: With this in perspective, we propose a framework to morph a target face in response to a given voice in a way that facial features are implicitly guided by learned voice-face correlation in this paper.

11, TITLE: Rectifying The Shortcut Learning of Background: Shared Object Concentration for Few-Shot Image Recognition
AUTHORS: Xu Luo et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we observe that image background serves as a source of domain-specific knowledge, which is a shortcut for models to learn in the source dataset, but is harmful when adapting to brand-new classes.

12, TITLE: Deep Learning to Ternary Hash Codes By Continuation
AUTHORS: Mingrui Chen ; Weiyu Li ; Weizhi Lu
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To obtain better ternary codes, we for the first time propose to jointly learn the features with the codes by appending a smoothed function to the networks.

13, TITLE: A Survey on Deep Domain Adaptation and Tiny Object Detection Challenges, Techniques and Datasets
AUTHORS: Muhammed Muzammul ; Xi Li
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: This survey paper specially analyzed computer vision-based object detection challenges and solutions by different techniques.

14, TITLE: All The Attention You Need: Global-local, Spatial-channel Attention for Image Retrieval
AUTHORS: Chull Hwan Song ; Hye Joo Han ; Yannis Avrithis
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: We present global-local attention module (GLAM), which is attached at the end of a backbone network and incorporates all four forms of attention: local and global, spatial and channel.

15, TITLE: An Energy-Efficient Edge Computing Paradigm for Convolution-based Image Upsampling
AUTHORS: Ian Colbert ; Ken Kreutz-Delgado ; Srinjoy Das
CATEGORY: cs.CV [cs.CV, cs.AR, cs.DC, cs.LG]
HIGHLIGHT: These kernel transformations, intended as a one-time cost when shifting from training to inference, enable a systems designer to use each algorithm in their optimal context by preserving the image fidelity learned when training in the cloud while minimizing data transfer penalties during inference at the edge.

16, TITLE: Is Attention to Bounding Boxes All You Need for Pedestrian Action Prediction?
AUTHORS: Lina Achaji ; Julien Moreau ; Thibault Fouqueray ; Francois Aioun ; Francois Charpillet
CATEGORY: cs.CV [cs.CV, cs.LG, cs.RO]
HIGHLIGHT: In this paper, we present a framework based on multiple variations of the Transformer models to reason attentively about the dynamic evolution of the pedestrians' past trajectory and predict its future actions of crossing or not crossing the street. In addition, we introduced a large-size simulated dataset (CP2A) using CARLA for action prediction.

17, TITLE: Pose Normalization of Indoor Mapping Datasets Partially Compliant to The Manhattan World Assumption
AUTHORS: Patrick Hübner ; Martin Weinmann ; Sven Wursthorn ; Stefan Hinz
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we present a novel pose normalization method for indoor mapping point clouds and triangle meshes that is robust against large fractions of the indoor mapping geometries deviating from an ideal Manhattan World structure.

18, TITLE: Unsupervised 3D Human Mesh Recovery from Noisy Point Clouds
AUTHORS: Xinxin Zuo ; Sen Wang ; Minglun Gong ; Li Cheng
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: This paper presents a novel unsupervised approach to reconstruct human shape and pose from noisy point cloud.

19, TITLE: Representation Consolidation for Training Expert Students
AUTHORS: Zhizhong Li et al.
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: Traditionally, distillation has been used to train a student model to emulate the input/output functionality of a teacher.

20, TITLE: CCVS: Context-aware Controllable Video Synthesis
AUTHORS: Guillaume Le Moing ; Jean Ponce ; Cordelia Schmid
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: This presentation introduces a self-supervised learning approach to the synthesis of new video clips from old ones, with several new key elements for improved spatial resolution and realism: It conditions the synthesis process on contextual information for temporal continuity and ancillary information for fine control.

21, TITLE: Painting Style-Aware Manga Colorization Based on Generative Adversarial Networks
AUTHORS: Yugo Shimizu et al.
CATEGORY: cs.CV [cs.CV, eess.IV]
HIGHLIGHT: To realize consistent colorization, we propose here a semi-automatic colorization method based on generative adversarial networks (GAN); the method learns the painting style of a specific comic from small amount of training data.

22, TITLE: DANCE: DAta-Network Co-optimization for Efficient Segmentation Model Training and Inference
AUTHORS: Chaojian Li et al.
CATEGORY: cs.CV [cs.CV, cs.LG]
HIGHLIGHT: To this end, we propose DANCE, general automated DAta-Network Co-optimization for Efficient segmentation model training and inference.

23, TITLE: Align Before Fuse: Vision and Language Representation Learning with Momentum Distillation
AUTHORS: Junnan Li et al.
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: In this paper, we introduce a contrastive loss to ALign the image and text representations BEfore Fusing (ALBEF) them through cross-modal attention, which enables more grounded vision and language representation learning.

24, TITLE: CutDepth: Edge-aware Data Augmentation in Depth Estimation
AUTHORS: Yasunori Ishii ; Takayoshi Yamashita
CATEGORY: cs.CV [cs.CV, cs.AI, cs.LG]
HIGHLIGHT: In this paper, we propose a data augmentation method, called CutDepth.

25, TITLE: Multiple Instance Learning with Auxiliary Task Weighting for Multiple Myeloma Classification
AUTHORS: Talha Qaiser et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To aid radiological reading, we propose an auxiliary task-based multiple instance learning approach (ATMIL) for MM classification with the ability to localize sites of disease.

26, TITLE: Semi-supervised 3D Hand-Object Pose Estimation Via Pose Dictionary Learning
AUTHORS: Zida Cheng ; Siheng Chen ; Ya Zhang
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To tackle the problem of data collection, we propose a semi-supervised 3D hand-object pose estimation method with two key techniques: pose dictionary learning and an object-oriented coordinate system.

27, TITLE: A Comparison of Deep Learning Classification Methods on Small-scale Image Data Set: from Convolutional Neural Networks to Visual Transformers
AUTHORS: Peng Zhao et al.
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: Through the comparison of experimental results, the recommended deep learning model is given according to the model application environment.

28, TITLE: OdoViz: A 3D Odometry Visualization and Processing Tool
AUTHORS: Saravanabalagi Ramachandran ; John McDonald
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: This significantly reduces the effort required in creating subsets of data from existing datasets for machine learning tasks.

29, TITLE: Self-Supervised Learning Framework for Remote Heart Rate Estimation Using Spatiotemporal Augmentation
AUTHORS: Hao Wang ; Euijoon Ahn ; Jinman Kim
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To solve this problem, we present a novel 3D self-supervised spatiotemporal learning framework for remote HR estimation on facial videos.

30, TITLE: Attention-based Vehicle Self-Localization with HD Feature Maps
AUTHORS: Nico Engel ; Vasileios Belagiannis ; Klaus Dietmayer
CATEGORY: cs.CV [cs.CV, cs.RO]
HIGHLIGHT: We present a vehicle self-localization method using point-based deep neural networks.

31, TITLE: A Survey on Bias in Visual Datasets
AUTHORS: Simone Fabbrizzi ; Symeon Papadopoulos ; Eirini Ntoutsi ; Ioannis Kompatsiaris
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: To this end, we propose a checklist that can be used to spot different types of bias during visual dataset collection.

32, TITLE: Unsupervised Discovery of Object Radiance Fields
AUTHORS: Hong-Xing Yu ; Leonidas J. Guibas ; Jiajun Wu
CATEGORY: cs.CV [cs.CV, cs.AI]
HIGHLIGHT: In this paper, we propose unsupervised discovery of Object Radiance Fields (uORF), integrating recent progresses in neural 3D scene representations and rendering with deep inference networks for unsupervised 3D scene decomposition.

33, TITLE: Panoptic Segmentation of Satellite Image Time Series with Convolutional Temporal Attention Networks
AUTHORS: Vivien Sainte Fare Garnot ; Loic Landrieu
CATEGORY: cs.CV [cs.CV]
HIGHLIGHT: In this paper, we present the first end-to-end, single-stage method for panoptic segmentation of Satellite Image Time Series (SITS).

34, TITLE: Wasserstein Distances, Geodesics and Barycenters of Merge Trees
AUTHORS: Mathieu Pont ; Jules Vidal ; Julie Delon ; Julien Tierny
CATEGORY: cs.GR [cs.GR, cs.CG, cs.CV, eess.IV]
HIGHLIGHT: This paper presents a unified computational framework for the estimation of distances, geodesics and barycenters of merge trees.

35, TITLE: In-Bed Person Monitoring Using Thermal Infrared Sensors
AUTHORS: Elias Josse ; Amanda Nerborg ; Kevin Hernandez-Diaz ; Fernando Alonso-Fernandez
CATEGORY: cs.HC [cs.HC, cs.CV, eess.SP]
HIGHLIGHT: Technological solutions involving cameras can contribute to safety, comfort and efficient emergency responses, but they are invasive of privacy.

36, TITLE: DoReMi: First Glance at A Universal OMR Dataset
AUTHORS: Elona Shatri ; György Fazekas
CATEGORY: cs.IR [cs.IR, cs.CV, cs.MM]
HIGHLIGHT: This paper provides a first look at DoReMi, an OMR dataset that addresses these challenges, and a baseline object detection model to assess its utility.

37, TITLE: Graph Representation Learning for Road Type Classification
AUTHORS: Zahra Gharaee ; Shreyas Kowshik ; Oliver Stromann ; Michael Felsberg
CATEGORY: cs.LG [cs.LG, cs.AI, cs.CV]
HIGHLIGHT: We present a novel learning-based approach to graph representations of road networks employing state-of-the-art graph convolutional neural networks.

38, TITLE: Measuring and Explaining The Inter-Cluster Reliability of Multidimensional Projections
AUTHORS: Hyeon Jeon ; Hyung-Kwon Ko ; Jaemin Jo ; Youngtaek Kim ; Jinwook Seo
CATEGORY: cs.LG [cs.LG, cs.CV, cs.HC]
HIGHLIGHT: We propose Steadiness and Cohesiveness, two novel metrics to measure the inter-cluster reliability of multidimensional projection (MDP), specifically how well the inter-cluster structures are preserved between the original high-dimensional space and the low-dimensional projection space.

39, TITLE: Probabilistic Appearance-Invariant Topometric Localization with New Place Awareness
AUTHORS: Ming Xu ; Tobias Fischer ; Niko Sünderhauf ; Michael Milford
CATEGORY: cs.RO [cs.RO, cs.CV]
HIGHLIGHT: To address these shortcomings, we present a new probabilistic topometric localization system which incorporates full 3-dof odometry into the motion model and furthermore, adds an "off-map" state within the state-estimation framework, allowing query traverses which feature significant route detours from the reference map to be successfully localized.

40, TITLE: Exploiting Generative Self-supervised Learning for The Assessment of Biological Images with Lack of Annotations: A COVID-19 Case-study
AUTHORS: Alessio Mascolini ; Dario Cardamone ; Francesco Ponzio ; Santa Di Cataldo ; Elisa Ficarra
CATEGORY: eess.IV [eess.IV, cs.CV]
HIGHLIGHT: In this paper we present GAN-DL, a Discriminator Learner based on the StyleGAN2 architecture, which we employ for self-supervised image representation learning in the case of fluorescent biological images.

41, TITLE: Unpaired Cross-modality Educed Distillation (CMEDL) Applied to CT Lung Tumor Segmentation
AUTHORS: Jue Jiang ; Andreas Rimner ; Joseph O. Deasy ; Harini Veeraraghavan
CATEGORY: eess.IV [eess.IV, cs.CV]
HIGHLIGHT: Our contribution eliminates two requirements of distillation methods: (i) paired image sets by using an image to image (I2I) translation and (ii) pre-training of the teacher network with a large training set by using concurrent training of all networks.

42, TITLE: Depth Estimation from Monocular Images and Sparse Radar Using Deep Ordinal Regression Network
AUTHORS: Chen-Chou Lo ; Patrick Vandewalle
CATEGORY: eess.IV [eess.IV, cs.AI, cs.CV, eess.SP]
HIGHLIGHT: We integrate sparse radar data into a monocular depth estimation model and introduce a novel preprocessing method for reducing the sparseness and limited field of view provided by radar.

43, TITLE: NeXtQSM -- A Complete Deep Learning Pipeline for Data-consistent Quantitative Susceptibility Mapping Trained with Hybrid Data
AUTHORS: Francesco Cognolato et al.
CATEGORY: eess.IV [eess.IV, cs.CV, cs.LG]
HIGHLIGHT: Here we aim to overcome these limitations and developed a framework to solve the QSM processing steps jointly.

44, TITLE: Joint Semi-supervised 3D Super-Resolution and Segmentation with Mixed Adversarial Gaussian Domain Adaptation
AUTHORS: Nicolo Savioli et al.
CATEGORY: eess.IV [eess.IV, cs.CV]
HIGHLIGHT: Here we propose a semi-supervised multi-task generative adversarial network (Gemini-GAN) that performs joint super-resolution of the images and their labels using a ground truth of high resolution 3D cines and segmentations, while an unsupervised variational adversarial mixture autoencoder (V-AMA) is used for continuous domain adaptation.

45, TITLE: Lightness Modulated Deep Inverse Tone Mapping
AUTHORS: Kanglin Liu ; Gaofeng Cao ; Jiang Duan ; Guoping Qiu
CATEGORY: eess.IV [eess.IV, cs.CV, cs.MM]
HIGHLIGHT: In this paper, we present a deep learning based iTM method that takes advantage of the feature extraction and mapping power of deep convolutional neural networks (CNNs) and uses a lightness prior to modulate the CNN to better exploit observations in the surrounding areas of the over-exposed regions to enhance the quality of HDR image reconstruction.
