Current approaches to humanoid control generally fall into two paradigms: perceptive locomotion, which handles terrain well but is limited to pedal gaits, and general motion tracking, which reproduces complex skills but ignores environmental capabilities. This work unites these paradigms to achieve perceptive general motion control. We present a framework where exteroceptive sensing is integrated into whole-body motion tracking, permitting a humanoid to perform highly dynamic, non-locomotion tasks on uneven terrain. By training a single policy to perform multiple distinct motions across varied terrestrial features, we demonstrate the non-trivial benefit of integrating perception into the control loop. Our results show that this framework enables robust, highly dynamic multi-contact motions—such as vaulting and dive-rolling—on unstructured terrain, significantly expanding the robot's traversability beyond simple walking or running.
@misc{zhuang2026deepwholebodyparkour,
title={Deep Whole-body Parkour},
author={Ziwen Zhuang and Shaoting Zhu and Mengjie Zhao and Hang Zhao},
year={2026},
eprint={2601.07701},
archivePrefix={arXiv},
primaryClass={cs.RO},
url={https://arxiv.org/abs/2601.07701},
}