Adversarial Batch Inverse Reinforcement Learning: Learn to Reward from Imperfect Demonstration for Interactive Recommendation — Jialin Liu (2023) | RDL Network