Goal-Driven Human Motion Synthesis in Diverse Tasks

Inwoo Hwang1, Jinseok Bae1,

Donggeun Lim1, Young Min Kim1

Abstract

We propose a framework for goal-driven human motion generation, which can synthesize interaction-rich scenarios. Given target positions for key joints, our pipeline generates a natural full-body motion that approaches the goal in cluttered environments. Our pipeline solves the complex constraints in a tractable formulation by disentangling the process of motion generation into two stages. The first stage computes the trajectory of the key joints, such as hands and feet, to encourage the character to approach the target position while avoiding possible physical violation. We demonstrate that diffusion-based guidance sampling can flexibly adapt to the local scene context while satisfying the target-goal conditions. Then, the subsequent second stage can easily generate plausible full-body motion that traverses the key joint trajectories. The proposed pipeline applies to various scenarios that have to account for 3D scene geometry and body joint configurations concurrently.

GoalDriven teaser image.

We propose a motion generation pipeline where pre-defined keyjoints approach user-specified positional goals. The goals are shown as green spheres, and our pipeline can adapt to the customized conditions, including novel scenes and goal conditions. We can generate motions that reach for an object in cluttered scenes, climb a wall, or sit with specified hand positions.

Reaching an Object Goal in a Cluttered Indoor Scene Examples

Our method able to generate motions reaching a goal while avoiding collision.

Examples

Case #1

Case #2

Case #3

Case #4

Diverse Tasks Examples

Examples

Sitting with Suggested Contact Points.

Sitting with Suggested Contact Points.

Rock Climbing Guided by Multiple Goals.

Contact-Aware Motion Generation.

BibTeX


        @InProceedings{Hwang_2025_CVPR,
          author    = {Hwang, Inwoo and Bae, Jinseok and Lim, Donggeun and Kim, Young Min},
          title     = {Goal-Driven Human Motion Synthesis in Diverse Task},
          booktitle = {Proceedings of the Computer Vision and Pattern Recognition Conference (CVPR) Workshops},
          month     = {June},
          year      = {2025},
          pages     = {2920-2930}
      }