PoseGPT: Quantizing human motion for large scale generative modeling
Thomas Lucas, Fabien Baradel, Philippe Weinzaepfel, Gregory Rogez
Summary
Unlike existing work, we generate human motion conditioned on observations of arbitrary length, including none. To solve this generalized problem, we propose PoseGPT, an auto-regressive transformer-based approach that internally compresses human motion into quantized latent sequences. Inspired by the Generative Pre-trained Transformer (GPT), we train a GPT-like model for next-index prediction in that space; this allows PoseGPT to output distributions over possible futures, with or without conditioning on past motion. We mainly experiment on BABEL, a recent large-scale MoCap dataset.
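
To make the two-stage idea concrete, below is a minimal PyTorch sketch (not the authors' implementation): a toy vector-quantization encoder that maps a pose sequence to codebook indices, and a small causal transformer that predicts the next index, with or without an observed prefix. All module names, dimensions, and hyperparameters (MotionVQEncoder, IndexGPT, codebook size, etc.) are illustrative assumptions, and training details such as codebook losses and action conditioning are omitted.

```python
# Sketch of the approach described above: quantize motion, then model indices with a GPT-like prior.
import torch
import torch.nn as nn


class MotionVQEncoder(nn.Module):
    """Compress a pose sequence into a shorter sequence of codebook indices (inference-only sketch)."""
    def __init__(self, pose_dim=66, latent_dim=128, codebook_size=512, downsample=4):
        super().__init__()
        self.encoder = nn.Sequential(
            nn.Conv1d(pose_dim, latent_dim, kernel_size=downsample, stride=downsample),
            nn.ReLU(),
            nn.Conv1d(latent_dim, latent_dim, kernel_size=3, padding=1),
        )
        self.codebook = nn.Embedding(codebook_size, latent_dim)

    def forward(self, poses):                       # poses: (B, T, pose_dim)
        z = self.encoder(poses.transpose(1, 2))     # (B, latent_dim, T // downsample)
        z = z.transpose(1, 2)                       # (B, T', latent_dim)
        # Nearest-neighbour assignment to codebook entries -> discrete indices.
        dists = torch.cdist(z, self.codebook.weight.unsqueeze(0))  # (B, T', codebook_size)
        return dists.argmin(dim=-1)                 # (B, T') integer code indices


class IndexGPT(nn.Module):
    """Causal transformer over quantized indices, trained for next-index prediction."""
    def __init__(self, codebook_size=512, d_model=256, n_layers=4, n_heads=4, max_len=256):
        super().__init__()
        self.tok_emb = nn.Embedding(codebook_size + 1, d_model)   # +1 for a BOS token
        self.pos_emb = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, n_layers)
        self.head = nn.Linear(d_model, codebook_size)
        self.bos = codebook_size

    def forward(self, idx):                         # idx: (B, S); S may be 0 (no past observed)
        bos = torch.full((idx.size(0), 1), self.bos, dtype=torch.long, device=idx.device)
        x = torch.cat([bos, idx], dim=1)            # prepend BOS so unconditional generation works
        pos = torch.arange(x.size(1), device=idx.device)
        h = self.tok_emb(x) + self.pos_emb(pos)
        # Causal mask: each position attends only to previous positions.
        mask = torch.triu(torch.full((x.size(1), x.size(1)), float("-inf"), device=idx.device), diagonal=1)
        h = self.blocks(h, mask=mask)
        return self.head(h)                         # logits over the next index at each position


# Usage: quantize an observed motion (possibly of length zero), then sample a possible future index.
encoder = MotionVQEncoder()
gpt = IndexGPT()
observed = torch.randn(2, 32, 66)                   # batch of 32-frame pose sequences
indices = encoder(observed)                         # (2, 8) discrete codes
logits = gpt(indices)                               # distributions over the next index
next_idx = torch.distributions.Categorical(logits=logits[:, -1]).sample()
```

Because the prior operates on discrete indices rather than raw poses, conditioning on an observed past simply means prefixing the sampled sequence with the observed motion's indices; an empty prefix yields unconditional generation.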