Antoine Barrier
multi-armed bandits
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
In Deep Reinforcement Learning models trained using gradient-based techniques, the choice of optimizer and its learning rate are …
Henrique DONANCIO, Antoine BARRIER, Leah SOUTH, Florence FORBES
17 October 2024