Latest 15 Papers - March 10, 2025

by ADMIN 34 views

Latest 15 Papers - March 10, 2025

6d Object Pose Estimation

6D object pose estimation is a crucial task in computer vision, which involves estimating the 3D pose of an object from a 2D image. This task has numerous applications in robotics, computer-aided design, and augmented reality. In recent years, significant progress has been made in this field, with the development of various deep learning-based methods.

Title Date Comment
Active 6D Pose Estimation for Textureless Objects using Multi-View RGB Frames 2025-03-05
Improving 6D Object Pose Estimation of metallic Household and Industry Objects 2025-03-05
Increasing the Task Flexibility of Heavy-Duty Manipulators Using Visual 6D Pose Estimation of Objects 2025-02-26
EfficientPose 6D: Scalable and Efficient 6D Object Pose Estimation 2025-02-19
Enhancing Transparent Object Pose Estimation: A Fusion of GDR-Net and Edge Detection 2025-02-17
accep...

accepted at First Austrian Symposium on AI, Robotics, and Vision (AIROV 2024)

HIPPo: Harnessing Image-to-3D Priors for Model-free Zero-shot 6D Pose Estimation 2025-02-14
CordViP: Correspondence-based Visuomotor Policy for Dexterous Manipulation in Real-World 2025-02-12
Advanced Object Detection and Pose Estimation with Hybrid Task Cascade and High-Resolution Networks 2025-02-06
FreeZe: Training-free zero-shot 6D pose estimation with geometric and vision foundation models 2025-01-08
Accep...

Accepted to ECCV 2024. Project page: https://andreacaraffa.github.io/freeze

A Novel Convolution and Attention Mechanism-based Model for 6D Object Pose Estimation 2024-12-31
8 pag...

8 pages, 4 figures, 6 tables

ReFlow6D: Refraction-Guided Transparent Object 6D Pose Estimation via Intermediate Representation Learning 2024-12-30
Level-Set Parameters: Novel Representation for 3D Shape Analysis 2024-12-18
KITchen: A Real-World Benchmark and Dataset for 6D Object Pose Estimation in Kitchen Environments 2024-12-17
This ...

This work has been accepted for publishing at The 2024 IEEE-RAS International Conference on Humanoid Robots

COBRA -- COnfidence score Based on shape Regression Analysis for method-independent quality assessment of object pose estimation from single images 2024-12-16
Category Level 6D Object Pose Estimation from a Single RGB Image using Diffusion 2024-12-16

Human Pose Estimation

Human pose estimation is a fundamental task in computer vision, which involves estimating the 3D pose of a human from a 2D image. This task has numerous applications in robotics, computer-aided design, and augmented reality. In recent years, significant progress has been made in this field, with the development of various deep learning-based methods.

Title Date Comment
Refinement Module based on Parse Graph of Feature Map for Human Pose Estimation 2025-03-02
BGM2Pose: Active 3D Human Pose Estimation with Non-Stationary Sounds 2025-03-01
Transformers with Joint Tokens and Local-Global Attention for Efficient Human Pose Estimation 2025-02-28
This ...

This work has been submitted to the IEEE for possible publication

BST: Badminton Stroke-type Transformer for Skeleton-based Action Recognition in Racket Sports 2025-02-28
8 pag...

8 pages (excluding references). The code will be released in a few months

Sixth-Sense: Self-Supervised Learning of Spatial Awareness of Humans from a Planar Lidar 2025-02-28
STGFormer: Spatio-Temporal GraphFormer for 3D Human Pose Estimation in Video 2025-02-26
Pose Magic: Efficient and Temporally Consistent Human Pose Estimation with a Hybrid Mamba-GCN Network 2025-02-26
This ...

This work has been accepted by AAAI 2025

EgoSim: An Egocentric Multi-view Simulator and Real Dataset for Body-worn Cameras during Motion and Activity 2025-02-25
Leveraging 2D Masked Reconstruction for Domain Adaptation of 3D Pose Estimation 2025-02-25 16 pages, 7 figures
CHAMP: Conformalized 3D Human Multi-Hypothesis Pose Estimators 2025-02-23
DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion 2025-02-23
The s...

The source code will be available at https://github.com/WUJINHUAN/DeProPose

Pose Prior Learner: Unsupervised Categorical Prior Learning for Pose Estimation 2025-02-20
VarGes: Improving Variation in Co-Speech 3D Gesture Generation via StyleCLIPS 2025-02-18
Spatiotemporal Multi-Camera Calibration using Freely Moving People 2025-02-18 8 pages, 4 figures
X-Fi: A Modality-Invariant Foundation Model for Multimodal Human Sensing 2025-02-17

Gaussian Splatting

Gaussian splatting is a technique used in computer vision to represent 3D scenes as a collection of Gaussian distributions. This technique has numerous applications in robotics, computer-aided design, and augmented reality. In recent years, significant progress has been made in this field, with the development of various deep learning-based methods.

| Title | **Date
Q&A: Latest 15 Papers - March 10, 2025

Q: What is 6D object pose estimation?

A: 6D object pose estimation is a task in computer vision that involves estimating the 3D pose of an object from a 2D image. This task has numerous applications in robotics, computer-aided design, and augmented reality.

Q: What are some recent advancements in 6D object pose estimation?

A: Recent advancements in 6D object pose estimation include the development of various deep learning-based methods, such as active 6D pose estimation, multi-view RGB frames, and hybrid task cascade and high-resolution networks.

Q: What is human pose estimation?

A: Human pose estimation is a fundamental task in computer vision that involves estimating the 3D pose of a human from a 2D image. This task has numerous applications in robotics, computer-aided design, and augmented reality.

Q: What are some recent advancements in human pose estimation?

A: Recent advancements in human pose estimation include the development of various deep learning-based methods, such as refinement modules, transformers with joint tokens and local-global attention, and spatio-temporal graph formers.

Q: What is Gaussian splatting?

A: Gaussian splatting is a technique used in computer vision to represent 3D scenes as a collection of Gaussian distributions. This technique has numerous applications in robotics, computer-aided design, and augmented reality.

Q: What are some recent advancements in Gaussian splatting?

A: Recent advancements in Gaussian splatting include the development of various deep learning-based methods, such as multimodal place recognition, efficient video representation and compression, and sparse-view super-resolution 3D Gaussian splatting.

Q: What is diffusion?

A: Diffusion is a technique used in computer vision to represent 3D scenes as a collection of Gaussian distributions. This technique has numerous applications in robotics, computer-aided design, and augmented reality.

Q: What are some recent advancements in diffusion?

A: Recent advancements in diffusion include the development of various deep learning-based methods, such as fluid reconstruction and prediction, coarse graining and reduced order models, and compositional world knowledge.

Q: How can I stay up-to-date with the latest advancements in these fields?

A: You can stay up-to-date with the latest advancements in these fields by following reputable sources, such as arXiv, ResearchGate, and Academia.edu. You can also attend conferences and workshops, and participate in online forums and discussions.

Q: What are some potential applications of these techniques?

A: These techniques have numerous potential applications in robotics, computer-aided design, and augmented reality. They can be used to improve the accuracy and efficiency of object recognition, pose estimation, and scene understanding.

Q: What are some potential challenges and limitations of these techniques?

A: These techniques have several potential challenges and limitations, including the need for large amounts of training data, the risk of overfitting, and the difficulty of generalizing to new scenarios.

Q: How can I get started with implementing these techniques?

A: You can get started with implementing these techniques by following online tutorials and documentation, and by participating in online forums and discussions. You can also join research groups and collaborate with other researchers to gain experience and expertise.

Q: What are some potential future directions for these techniques?

A: Some potential future directions for these techniques include the development of more efficient and accurate methods, the integration of multiple modalities, and the application of these techniques to new domains and scenarios.

Q: How can I contribute to the development of these techniques?

A: You can contribute to the development of these techniques by participating in online forums and discussions, by sharing your own research and ideas, and by collaborating with other researchers to advance the state-of-the-art.