Bi-Level World Model | Wadhwani School of Data Science and Artificial Intelligence

MABL: Bi-Level Latent-Variable World Model for Sample-Efficient Multi-Agent Reinforcement Learning

Publications

Multi-agent reinforcement learning (MARL) methods often suffer from high sample complexity, limiting their use in real-world problems where data is sparse or expensive to collect. Although latent-variable world models …

Tags: Bi-Level World Model