In the context of the Multi-Armed Bandit problem, is the strategy “Always pull the machine with the highest chance of yielding a reward” an example of “explore” or “exploit” behaviour?
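
To make the two terms concrete, here is a minimal sketch of an epsilon-greedy bandit agent in Python. It is an illustration only, not course material: the function names (`pull`, `choose_arm`, `run`), the reward probabilities, and the `epsilon` parameter are all assumptions chosen for the example. The greedy branch of `choose_arm` is the one that matches the quoted rule, since it picks the arm with the highest estimated reward; the random branch is the exploring one.

```python
import random


def pull(arm_probs, arm, rng):
    """Simulate one pull: reward 1 with the arm's (hypothetical) success probability."""
    return 1 if rng.random() < arm_probs[arm] else 0


def choose_arm(estimates, epsilon, rng):
    """Pick an arm. epsilon = 0 is pure exploitation; epsilon = 1 is pure exploration."""
    if rng.random() < epsilon:
        # Explore: try a uniformly random arm, regardless of current estimates.
        return rng.randrange(len(estimates))
    # Exploit: pull the arm with the highest estimated reward so far
    # (ties resolve to the lowest-numbered arm).
    return max(range(len(estimates)), key=lambda a: estimates[a])


def run(arm_probs, epsilon, steps, seed=0):
    """Play `steps` rounds and return (pull counts, reward estimates, total reward)."""
    rng = random.Random(seed)
    counts = [0] * len(arm_probs)
    estimates = [0.0] * len(arm_probs)
    total = 0
    for _ in range(steps):
        arm = choose_arm(estimates, epsilon, rng)
        reward = pull(arm_probs, arm, rng)
        counts[arm] += 1
        # Incremental running mean of the rewards observed for this arm.
        estimates[arm] += (reward - estimates[arm]) / counts[arm]
        total += reward
    return counts, estimates, total


if __name__ == "__main__":
    probs = [0.2, 0.8]  # assumed reward probabilities, for illustration only
    print("pure exploit:", run(probs, epsilon=0.0, steps=1000)[0])
    print("epsilon=0.1: ", run(probs, epsilon=0.1, steps=1000)[0])
```

In this sketch, setting `epsilon=0.0` means the agent never tries the second machine, because its estimate for an unpulled arm stays at zero and ties go to the first arm. That is the usual argument for mixing in some exploration.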


This poll has responses from 6 out of 89 course users - a 6.74% completion rate.




COMP3142 24T3 (Software Testing and Quality Assurance) is powered by WebCMS3
CRICOS Provider No. 00098G