In the context of the Multi-Armed Bandit problem: “Always pull the machine with the highest chance of yielding a reward”, is this a behaviour of “explore” or “exploit”?


This poll has responses from 49 out of 339 course users - a 14.45% completion rate.

Legend (with response counts)


Back to top

COMP3142 25T2 (Software Testing and Quality Assurance) is powered by WebCMS3
CRICOS Provider No. 00098G