Parametrized Hierarchical Procedures for Neural Programming

Roy Fox; Richard Shin; Sanjay Krishnan; Ken Goldberg; Dawn Song; Ion Stoica

Abstract

1 min read

Neural programs are highly accurate and structured policies that perform algorithmic tasks by controlling the behavior of a computation mechanism. Despite the potential to increase the interpretability and the compositionality of the behavior of artificial agents, it remains difficult to learn from demonstrations neural networks that represent computer programs. The main challenges that set algorithmic domains apart from other imitation learning domains are the need for high accuracy, the involvement of specific structures of data, and the extremely limited observability. To address these challenges, we propose to model programs as Parametrized Hierarchical Procedures (PHPs). A PHP is a sequence of conditional operations, that uses a program counter, along with the observation, to select between taking an elementary action, invoking another PHP as a sub-procedure, and returning to the caller. We develop an algorithm for training PHPs from a mixture of annotated and unannotated demonstrations, and apply it to efficient level-wise training of multi-level PHPs. We show in two benchmarks, NanoCraft and long-hand addition, that PHPs can learn neural programs more accurately from smaller amounts of strong and weak supervision.

Parametrized Hierarchical Procedures for Neural Programming

Abstract

Discussion(0)

Related publications

Hierarchical Variational Imitation Learning of Control Programs

Multi-Task Hierarchical Imitation Learning for Home Automation

Parametrically Managed Activation Function for a Fitting a Neural Network Potential with Physical Behavior Enforced by a Low-Dimensional Potential

Parametrically Managed Activation Function for Fitting a Neural Network Potential with Physical Behavior Enforced by a Low-Dimensional Potential

State-Only Imitation Learning for Dexterous Manipulation

Related publications

Preprint2019
Hierarchical Variational Imitation Learning of Control Programs
Preprint2019

Article2019
Multi-Task Hierarchical Imitation Learning for Home Automation
Article2019

Preprint2023
Parametrically Managed Activation Function for a Fitting a Neural Network Potential with Physical Behavior Enforced by a Low-Dimensional Potential
Preprint2023

Article2023
Parametrically Managed Activation Function for Fitting a Neural Network Potential with Physical Behavior Enforced by a Low-Dimensional Potential
Article2023

Preprint2021
State-Only Imitation Learning for Dexterous Manipulation
Preprint2021