In the multi-armed bandit problem, is the strategy “Always pull the machine with the highest chance of yielding a reward” an example of “explore” or “exploit” behaviour?
Explore
Exploit
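To make the quoted rule concrete, here is a minimal sketch of it in Python. It assumes the "chance of yielding a reward" is estimated from past pulls as successes divided by pulls. The function name `pick_machine` and the sample counts are hypothetical and not part of the question.

```python
def pick_machine(successes, pulls):
    """Return the index of the machine with the highest observed reward rate.

    Assumption: a machine that has never been pulled gets an estimated rate of 0.0.
    """
    rates = [s / n if n > 0 else 0.0 for s, n in zip(successes, pulls)]
    # max() returns the first index when several machines tie for the best rate.
    return max(range(len(rates)), key=lambda i: rates[i])


# Hypothetical history: three machines, ten pulls each.
successes = [3, 7, 5]
pulls = [10, 10, 10]
choice = pick_machine(successes, pulls)
print(choice)
```

Note that the rule only uses what has already been observed: it never pulls a machine in order to improve its estimate.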