Training replay
Agents must navigate a constrained environment, learning to break out of confined starting positions to reach the opposing team's flag.
Early generations struggle to find escape routes, while later generations efficiently identify and exploit gaps in the opposing team's defence.
A narrow chokepoint forces agents to develop sequential movement strategies, coordinating passage through a single exit.
Advanced generations learn to route defenders to hold the exit while attackers push through, avoiding the bottlenecks that plagued earlier runs.
In this scenario, agents learn to split into two groups and coordinate attacks from multiple directions simultaneously.
By generation 50, both teams exhibit a clear splitting strategy, with agents flanking the opposing team's flag position from opposite sides.