TY - GEN
T1 - Learning to bluff
AU - Hurwitz, Evan
AU - Marwala, Tshilidzi
PY - 2007
Y1 - 2007
N2 - The act of bluffing confounds game designers to this day. The very nature of bluffing is even open for debate, adding further complication to the process of creating intelligent virtual players that can bluff, and hence play, realistically. Through the use of intelligent, learning agents, and carefully designed agent outlooks, an agent can in fact learn to predict its opponents' reactions based not only on its own cards, but on the actions of those around it. With this wider scope of understanding, an agent can in learn to bluff its opponents, with the action representing not an "illogical" action, as bluffing is often viewed, but rather as an act of maximising returns through an effective statistical optimisation. By using a TD(λ.) learning algorithm to continuously adapt neural network agent intelligence, agents have been shown to be able to learn to bluff without outside prompting, and even to learn to call each other's bluffs in free, competitive play.
AB - The act of bluffing confounds game designers to this day. The very nature of bluffing is even open for debate, adding further complication to the process of creating intelligent virtual players that can bluff, and hence play, realistically. Through the use of intelligent, learning agents, and carefully designed agent outlooks, an agent can in fact learn to predict its opponents' reactions based not only on its own cards, but on the actions of those around it. With this wider scope of understanding, an agent can in learn to bluff its opponents, with the action representing not an "illogical" action, as bluffing is often viewed, but rather as an act of maximising returns through an effective statistical optimisation. By using a TD(λ.) learning algorithm to continuously adapt neural network agent intelligence, agents have been shown to be able to learn to bluff without outside prompting, and even to learn to call each other's bluffs in free, competitive play.
UR - http://www.scopus.com/inward/record.url?scp=40949113197&partnerID=8YFLogxK
U2 - 10.1109/ICSMC.2007.4413589
DO - 10.1109/ICSMC.2007.4413589
M3 - Conference contribution
AN - SCOPUS:40949113197
SN - 1424409918
SN - 9781424409914
T3 - Conference Proceedings - IEEE International Conference on Systems, Man and Cybernetics
SP - 1188
EP - 1193
BT - 2007 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2007
T2 - 2007 IEEE International Conference on Systems, Man, and Cybernetics, SMC 2007
Y2 - 7 October 2007 through 10 October 2007
ER -