Remy Chaput
Remy Chaput
Home
About Me
Projects
Research
Teaching
Light
Dark
Automatic
preprint
Adaptive reinforcement learning of multi-agent ethically-aligned behaviours: the QSOM and QDSOM algorithms
Preprint describing two Reinforcement Learning algorithms (
Q-SOM
and
Q-DSOM
) I have developped. They focus on continuous and multi-dimensional observations and actions, and adaptation to changes in the environment.
Rémy Chaput
,
Olivier Boissier
,
Mathieu Guillermin
Cite
HAL
ArXiv
Cite
×