arxiv:2310.05327

Provable Compositional Generalization for Object-Centric Learning

Published on Oct 9, 2023

Authors:

Thaddäus Wiedemer ,

Jack Brady ,

Alexander Panfilov ,

Attila Juhos ,

Abstract

Autoencoders with specific structural assumptions can learn object-centric representations that enable compositional generalization.

AI-generated summary

Learning representations that generalize to novel compositions of known concepts is crucial for bridging the gap between human and machine perception. One prominent effort is learning object-centric representations, which are widely conjectured to enable compositional generalization. Yet, it remains unclear when this conjecture will be true, as a principled theoretical or empirical understanding of compositional generalization is lacking. In this work, we investigate when compositional generalization is guaranteed for object-centric representations through the lens of identifiability theory. We show that autoencoders that satisfy structural assumptions on the decoder and enforce encoder-decoder consistency will learn object-centric representations that provably generalize compositionally. We validate our theoretical result and highlight the practical relevance of our assumptions through experiments on synthetic image data.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment

No model linking this paper

Cite arxiv.org/abs/2310.05327 in a model README.md to link it from this page.

No dataset linking this paper

Cite arxiv.org/abs/2310.05327 in a dataset README.md to link it from this page.

No Space linking this paper

Cite arxiv.org/abs/2310.05327 in a Space README.md to link it from this page.

No Collection including this paper

Add this paper to a collection to link it from this page.