Skip to content
RDL Network logo
Maximizing Alignment with Minimal Feedback: Efficiently Learning Rewards for Visuomotor Robot Policy Alignment — Ran Tian (2024) | RDL Network