Skip to content
RDL Network logo
OTTER: A Vision-Language-Action Model with Text-Aware Visual Feature Extraction — Huang Huang (2025) | RDL Network