Research Scientist, Science of Post-Training and Reinforcement Learning at DeepMind

Snapshot

We are starting a small team aimed at building a real science of post-training for agents. This involves reinforcement learning for LLM-based systems, rigorous experimentation, and a focus on scaling, evaluation, and the practical details that make methods work.

Company

DeepMind

Job Details

Location: London, UK
Work Type: On-site / Hybrid
Posted: 2 weeks ago

About the Role
