Hunter-prey model and Deep Boltzmann Machine diagrams for multi-agent cognitive policy learning (KIIS 2016)

Multi-Agent Cognitive Policy Learning through Competition

Undergraduate research at KAIST on multi-agent cognitive policy learning through competitive reinforcement learning, demonstrating emergence of complex behaviors. Won Best Paper Award at 2016 KIIS Conference.

November 15, 2016 · 1 min · Jungbae Park