Antoine Barrier
Antoine Barrier
Home & Contact
Research
Teaching
Talks
Documents
Light
Dark
Automatic
English
Français
muti-armed bandits
Dynamic Learning Rate for Deep Reinforcement Learning: A Bandit Approach
In Deep Reinforcement Learning models trained using gradient-based techniques, the choice of optimizer and its learning rate are …
Henrique DONANCIO
,
Antoine BARRIER
,
Leah SOUTH
,
Florence FORBES
17 October 2024
PDF
arXiv
BibTeX