--question--
--text--
What can happen if the agent does not have a good balance of taking random actions and using learned actions?
--answers--
The agent will always try to minimize its reward for the current state/action, leading to local minima.
The agent will always try to maximize its reward for the current state/action, leading to local maxima.
--video-solution--
2