Back to Knowledge Base
ConceptsFeatured

Model-Based vs Model-Free Reinforcement Learning

Understanding the key differences between model-based and model-free approaches in RL.

reinforcement learningmodel-basedmodel-freecomparison

Model-Based vs Model-Free Reinforcement Learning

Model-Free RL

Characteristics

  • Directly maps states to actions or values
  • No explicit model of environment dynamics
  • Learns through trial and error

Examples

  • Q-Learning, DQN
  • Policy Gradient methods
  • Actor-Critic methods

Model-Based RL

Characteristics

  • Learns a model of environment dynamics
  • Uses model for planning or policy learning
  • Can train in imagination

Examples

  • World Models
  • Dreamer
  • MuZero

World Models as Model-Based RL

World Models represent a sophisticated model-based approach that:

  1. Learns a compressed representation (VAE)
  2. Models dynamics in latent space (MDN-RNN)
  3. Plans using the learned model (Controller)
References
Academic papers and resources

Model-Based Reinforcement Learning: A Survey

Thomas M. Moerland et al. (2023)

paper