MATLAB Example: Train MBPO Agent to Balance Cart-Pole System

Mathworks

Intermediate

Example

Demonstration

Link to External Site

https://www.mathworks.com/help/reinforcement-learning/ug/train-mbpo-agent-to-ba…

This example shows how to train a model-based policy optimization (MBPO) agent to balance a cart-pole system modeled in MATLAB. For more information on MBPO agents, see Model-Based Policy Optimization Agents.

MBPO agents use an environment model to generate more experiences while training a base agent. In this example, the base agent is a soft actor-critic (SAC) agent.

Radioactivity

This resource is included in the following topics and journeys:

Resource

MATLAB Example: Train MBPO Agent to Balance Cart-Pole System

Reinforcement Learning

Companion resources to "Why Choose Model-Based Reinforcement Learning" ...