Research Scientist, Science of Post-Training and Reinforcement Learning
DeepMind· London, UK· 2 weeks ago
Apply Now ↗About the Role
Snapshot
We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work.
Company
DeepMind →Job Details