The StarCraft Multi-Agent Challenges+ requires agents to learn completion of multi-stage tasks and usage of environmental factors without precise reward functions. The previous challenges (SMAC) recognized as a standard benchmark of Multi-Agent Reinforcement Learning are mainly concerned with ensuring that all agents cooperatively eliminate approaching adversaries only through fine manipulation with obvious reward functions. This challenge, on the other hand, is interested in the exploration capability of MARL algorithms to efficiently learn implicit multi-stage tasks and environmental factors as well as micro-control. This study covers both offensive and defensive scenarios. In the offensive scenarios, agents must learn to first find opponents and then eliminate them. The defensive scenarios require agents to use topographic features. For example, agents need to position themselves behind protective structures to make it harder for enemies to attack.
11 PAPERS • 13 BENCHMARKS
SMAC+ defensive armored scenario with sequential episodic buffer
9 PAPERS • 1 BENCHMARK
smac+ defense infantry scenario with parallel episodic buffer
SMAC+ defensive infantry scenario with sequential episodic buffer
SMAC+ defensive outnumbered scenario with sequential episodic buffer
smac+ defense armored scenario with parallel episodic buffer
8 PAPERS • 1 BENCHMARK
smac+ defense outnumbered scenario with parallel episodic buffer
smac+ offensive hard scenario with 20 parallel episodic buffer.
smac+ offensive scenario with 20 parallel episodic buffer.
smac+ offensive complicated scenario with 20 parallel episodic buffer.
7 PAPERS • 1 BENCHMARK
SMAC+ offense distant scenario.
smac+ offensive near scenario with 20 parallel episodic buffer
SMAC+ offensive complicated scenario with sequential episodic buffer
5 PAPERS • 1 BENCHMARK
SMAC+ offensive distant scenario with sequential episodic buffer
SMAC+ offensive hard scenario with sequential episodic buffer
SMAC+ offensive near scenario with sequential episodic buffer
SMAC+ offensive superhard scenario with sequential episodic buffer