
GPT4D is an autoregressive generative framework that reformulates 4D point cloud video understanding as next-token prediction, integrating long-range motion priors with local geometric details to achieve state-of-the-art performance on human action recognition benchmarks.
Mar 5, 2026
Progressive Multi-Granularity Autoregressive Pre-training for 4D Point Cloud Understanding
Dec 12, 2025