Deep Reinforcement Learning Engineer (Principal)
1 day ago
Santiago de Compostela
Tackle stability & sample-efficiency: GAE, normalization, entropy/KL control, distributional/value-loss tuning, curriculum learning and reward shaping, … Launch multi-GPU training, parallel rollouts, efficient replay/storage, and reproducible experiment tooling. Collaborate with the C-Level Team