Alejandro Armenta's Blog: MARL Multi Agent Reinforcement Learning

I made MARL (Multi Agent Reinforcement Learning) with DDPG (Deep Deterministic Policy Gradient).

I made the DDPG with python and Pytorch. I used Unity Machine Learning Agents for the environment.

This model can be used for robots that need to learn collaboration with continuous action spaces.

Two agents need to learn to interact with each other to play Tennis. They have to pass the ball over the Net and work collaboratively.

I made one DDPG with two Noises.

I solved the environment. This is the learning by episode Score Table:

If you want to look at the code:

Alejandro Armenta's Blog