In the context of the Multi-Armed Bandit problem: “Always pull the machine with the highest chance of yielding a reward”, is this a behaviour of “explore” or “exploit”?

Solving Multi-Armed Bandit Problems | by Hennie de Harder | Towards Data Science


Explore

Exploit

You can't submit an answer to this poll because you're not part of this course!

Back to top

COMP3142 24T3 (Software Testing and Quality Assurance) is powered by WebCMS3
CRICOS Provider No. 00098G