This will be a hybrid event with in-person attendance in Wu and Chen and virtual attendance on Zoom.
Recent years have witnessed remarkable progress in 3D reconstruction and generation. However, most existing methods primarily focus on modeling geometry and appearance. I believe the next generation of 3D reconstruction and generation should go further in two key directions. First, it should be well-aligned with other modalities—such as language and images—so that 3D representations can play an important role in the multi-modal era. Second, it should incorporate physical understanding to ensure reconstructions and generations are physically plausible, which will ultimately make them more applicable in robotics. In this talk, I will present our recent efforts toward these goals and discuss the challenges that lie ahead.