CKD-LQPOSE: Towards a Real-World Low-quality Cross-Task Distilled Pose Estimation Architecture
dc.contributor.author | Liu, Tao | en_US |
dc.contributor.author | Yao, Beiji | en_US |
dc.contributor.author | Huang, Jun | en_US |
dc.contributor.author | Wang, Ya | en_US |
dc.contributor.editor | Chen, Renjie | en_US |
dc.contributor.editor | Ritschel, Tobias | en_US |
dc.contributor.editor | Whiting, Emily | en_US |
dc.date.accessioned | 2024-10-13T18:05:44Z | |
dc.date.available | 2024-10-13T18:05:44Z | |
dc.date.issued | 2024 | |
dc.description.abstract | Although human pose estimation (HPE) methods have achieved promising results, they remain challenging in real-world lowquality (LQ) scenarios. Moreover, due to the general lack of modeling of LQ information in currently public HPE datasets, it is difficult to accurately evaluate the performance of the HPE methods in LQ scenarios. Hence, we propose a novel CKD-LQPose architecture, which is the first architecture fusing cross-task feature information in HPE that uses a cross-task distillation method to merge HPE information and well-quality (WQ) information. The CKD-LQPose architecture effectively enables adaptive feature learning from LQ images and improves their quality to enhance HPE performance. Additionally, we introduce the PatchWQ-Gan module to obtain WQ information and the refined transformer decoder (RTD) module to refine the features further. In the inference stage, CKD-LQPose removes the PatchWQ-Gan and RTD modules to reduce the computational burden. Furthermore, to accurately evaluate the HPE methods in LQ scenarios, we develop an RLQPose-DS test benchmark. Extensive experiments on RLQPose-DS, real-world images, and LQ versions of well-known datasets such as COCO, MPII, and CrowdPose demonstrate CKD-LQPose outperforms state-of-the-art approaches by a large margin, demonstrating its effectiveness in realworld LQ scenarios. | en_US |
dc.description.sectionheaders | Human II | |
dc.description.seriesinformation | Pacific Graphics Conference Papers and Posters | |
dc.identifier.doi | 10.2312/pg.20241321 | |
dc.identifier.isbn | 978-3-03868-250-9 | |
dc.identifier.pages | 12 pages | |
dc.identifier.uri | https://doi.org/10.2312/pg.20241321 | |
dc.identifier.uri | https://diglib.eg.org/handle/10.2312/pg20241321 | |
dc.publisher | The Eurographics Association | en_US |
dc.rights | Attribution 4.0 International License | |
dc.rights.uri | https://creativecommons.org/licenses/by/4.0/ | |
dc.subject | CCS Concepts: Computing methodologies → Image processing; Interest point and salient region detections | |
dc.subject | Computing methodologies → Image processing | |
dc.subject | Interest point and salient region detections | |
dc.title | CKD-LQPOSE: Towards a Real-World Low-quality Cross-Task Distilled Pose Estimation Architecture | en_US |
Files
Original bundle
1 - 1 of 1