Tag
1 articles
This paper studies how to keep RL controllers safe while they adapt to non-stationary systems—and shows why existing methods still fall short.