University of Lincoln
Browse

EquiCaps: Predictor-Free Pose-Aware Pre-Trained Capsule Networks

Download (4.54 MB)
conference contribution
posted on 2025-10-23, 08:04 authored by Athinoulla Konstantinou, Georgios Leontidis, Mamatha ThotaMamatha Thota, Aiden Durrant
<p dir="ltr">Learning self-supervised representations that are invariant and equivariant to transformations is crucial for advancing beyond traditional visual classification tasks. However, many methods rely on predictor architectures to encode equivariance, despite evidence that architectural choices, such as capsule networks, inherently excel at learning interpretable pose-aware representations. To explore this, we introduce EquiCaps (Equivariant Capsule Network), a capsule-based approach to pose-aware self-supervision that eliminates the need for a specialised predictor for enforcing equivariance. Instead, we leverage the intrinsic pose-awareness capabilities of capsules to improve performance in pose estimation tasks. To further challenge our assumptions, we increase task complexity via multi-geometric transformations to enable a more thorough evaluation of invariance and equivariance by introducing 3DIEBench-T, an extension of a 3D object-rendering benchmark dataset. Empirical results demonstrate that EquiCaps outperforms prior state-of-the-art equivariant methods on rotation prediction, achieving a supervised-level R2 of 0.78 on the 3DIEBench rotation prediction benchmark and improving upon SIE and CapsIE by 0.05 and 0.04 R2 , respectively. Moreover, in contrast to non-capsule-based equivariant approaches, EquiCaps maintains robust equivariant performance under combined geometric transformations, underscoring its generalisation capabilities and the promise of predictor-free capsule architectures.</p>

History

School affiliated with

  • School of Engineering and Physical Sciences (Research Outputs)

Publication Title

Proceedings of the IEEE/CVF International Conference on Computer Vision (ICCV)

Pages/Article Number

7947-7957

Publisher

IEEE Computer Society, in collaboration with the Computer Vision Foundation (CVF)

ISSN

2380-7504

Date Submitted

2025-06-11

Date Accepted

2025-06-25

Date of First Publication

2025-10-19

Date of Final Publication

2025-10-23

Event Name

IEEE/CVF International Conference on Computer Vision (ICCV)

Event Dates

19th - 23th of October 2025

Event Organiser

Computer Vision Foundation (CVF)

Open Access Status

  • Open Access

Will your conference paper be published in proceedings?

  • Yes