Efficient Autoregressive Shape Generation via Octree-Based Adaptive Tokenization

  • 2025-04-03 18:57:52
  • Kangle Deng, Hsueh-Ti Derek Liu, Yiheng Zhu, Xiaoxia Sun, Chong Shang, Kiran Bhat, Deva Ramanan, Jun-Yan Zhu, Maneesh Agrawala, Tinghui Zhou
  • 0

Abstract

Many 3D generative models rely on variational autoencoders (VAEs) to learncompact shape representations. However, existing methods encode all shapes intoa fixed-size token, disregarding the inherent variations in scale andcomplexity across 3D data. This leads to inefficient latent representationsthat can compromise downstream generation. We address this challenge byintroducing Octree-based Adaptive Tokenization, a novel framework that adjuststhe dimension of latent representations according to shape complexity. Ourapproach constructs an adaptive octree structure guided by aquadric-error-based subdivision criterion and allocates a shape latent vectorto each octree cell using a query-based transformer. Building upon thistokenization, we develop an octree-based autoregressive generative model thateffectively leverages these variable-sized representations in shape generation.Extensive experiments demonstrate that our approach reduces token counts by 50%compared to fixed-size methods while maintaining comparable visual quality.When using a similar token length, our method produces significantlyhigher-quality shapes. When incorporated with our downstream generative model,our method creates more detailed and diverse 3D content than existingapproaches.

 

Quick Read (beta)

loading the full paper ...