preprint

Adaptive reinforcement learning of multi-agent ethically-aligned behaviours: the QSOM and QDSOM algorithms

Preprint describing two reinforcement learning algorithms (Q-SOM and Q-DSOM) that I have developed. They focus on continuous, multi-dimensional observations and actions, and on adapting to changes in the environment.

Rémy Chaput, Olivier Boissier, Mathieu Guillermin