--question--

--text--

What can happen if the agent does not strike a good balance between taking random actions (exploration) and using learned actions (exploitation)?

--answers--

The agent will always try to minimize its reward for the current state/action, leading to local minima.


The agent will always try to maximize its reward for the current state/action, leading to local maxima.

--video-solution--

2