Viet-Anh on Software Logo

What is: Policy Similarity Metric?

SourceContrastive Behavioral Similarity Embeddings for Generalization in Reinforcement Learning
Year2000
Data SourceCC BY-SA - https://paperswithcode.com

Policy Similarity Metric, or PSM, is a similarity metric for measuring behavioral similarity between states in reinforcement learning. It assigns high similarity to states for which the optimal policies in those states as well as in future states are similar. PSM is reward-agnostic, making it more robust for generalization compared to approaches that rely on reward information.