Overview
This system joins language conditioning, robot perception, and action chunking into a single control loop for the Unitree G1 humanoid. The work focused on making a research stack stable enough for repeated evaluation instead of optimizing only for a polished demo.
Notes
The project paired scripted demonstrations with ACT training, then layered in domain randomization to push policy robustness beyond the nominal simulation setup.