Leveraging SE(3) Equivariance for Learning 3D Geometric Shape Assembly. (arXiv:2309.06810v1 [cs.CV])
Click here to flash read.
Shape assembly aims to reassemble parts (or fragments) into a complete
object, which is a common task in our daily life. Different from the semantic
part assembly (e.g., assembling a chair's semantic parts like legs into a whole
chair), geometric part assembly (e.g., assembling bowl fragments into a
complete bowl) is an emerging task in computer vision and robotics. Instead of
semantic information, this task focuses on geometric information of parts. As
the both geometric and pose space of fractured parts are exceptionally large,
shape pose disentanglement of part representations is beneficial to geometric
shape assembly. In our paper, we propose to leverage SE(3) equivariance for
such shape pose disentanglement. Moreover, while previous works in vision and
robotics only consider SE(3) equivariance for the representations of single
objects, we move a step forward and propose leveraging SE(3) equivariance for
representations considering multi-part correlations, which further boosts the
performance of the multi-part assembly. Experiments demonstrate the
significance of SE(3) equivariance and our proposed method for geometric shape
assembly. Project page: https://crtie.github.io/SE-3-part-assembly/
No creative common's license