Adaptation-Aware Model Across Environmental Settings
The paper models split-belt locomotion with a two-level architecture: a low-level feedback controller that maintains stable gait step by step, and a high-level reinforcement learner that updates control policy over time. Two internal parameters are especially important: the learning rate, which controls how aggressively the policy is updated, and the symmetry weight, which determines how much symmetry is prioritized relative to metabolic efficiency. By varying the belt-speed difference, the authors create distinct environmental settings with different theoretical optima and different stability demands.
Figure 2. Adaptation-aware model of locomotor performance across environmental settings, with a low-level feedback controller, a high-level reinforcement learner, and the key learning parameters that govern adaptation.