Rodeo
ResourcesPartnersSign in

Google DeepMind

Research Scientist, Agent Post-Training, DeepMind

London
Posted 1 day ago
Sign up to applySee more jobs like this

How your CV stacks up

1Upload CV
2Analyse CV
3Improve CV

Upload your CV to see how well it fits this job role

?%

Research Scientist, Agent Post-Training, DeepMind

Minimum qualifications:

PhD in Computer Science, Machine Learning, or a related quantitative field, or equivalent practical experience. 2 years of experience with machine learning frameworks such as JAX, Flax, or PyTorch. Experience conducting research in reinforcement learning, tool use, or agentic systems.

Preferred qualifications:

Experience publishing research in reinforcement learning, reinforcement learning from human feedback, or tool-use algorithms at machine learning venues. Experience working with large-scale distributed training infrastructure and scaling experiments. Research experience in systems design, code complexity, or working with large-codebase environments. Experience developing simple, scalable solutions for complex, open-ended research problems.

About The Job

At DeepMind, we have built a unique culture and work environment where long-term ambitious research can flourish. We are seeking a highly motivated Research Scientist to join our team and contribute to groundbreaking fundamental research and deployment in large-scale agent post-training.

About us the Artificial Intelligence could be one of humanity’s most useful inventions. At DeepMind, we’re a team of scientists, engineers, machine learning experts and more, working together to advance in artificial intelligence. We use our technologies for widespread public benefit and scientific discovery, and collaborate with others on critical challenges, ensuring safety and ethics are the highest priority.

Reasons to use Rodeo

I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?

Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.

Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.

Start with a chat, not a search bar

Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.

P

Graduate Consultant — 2026 Scheme

PwC·London, UK
£35,000/yr

Why you're a good match

Strong

Your economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.

See breakdown
Save jobNot relevant
View details

It searches the market for you

Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.

Why you're a good match

You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.

See breakdown
Strong

Experience fit

Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.

See breakdown
Strong

Only hits

No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.

In this role, we are looking for a Research Scientist or Research Engineer that is keen to push the frontier of agentic tasks in post-training large language models. As part of the Gemini post-training team, you will have the chance to drive the research that makes up for the foundation of upcoming releases.

Artificial intelligence will be one of humanity’s most transformative inventions. At Google DeepMind, we are a pioneering AI lab with exceptional interdisciplinary teams focused on advancing AI development to solve complex global challenges and accelerate high-quality product innovation for billions of users. We use our technologies for widespread public benefit and scientific discovery, ensuring safety and ethics are always our highest priority.

We are pushing the boundaries across multiple domains. Our global teams offer diverse learning opportunities and varied career pathways for those driven to achieve exceptional results through collective effort.

Get help with your application

Your very own career expert that helps elevate your application to the next level.

Get help applying for this job

Responsibilities

Drive the research process for large-scale agent post-training from hypothesis formulation to delivery in the Gemini model recipe. Design and execute ablation studies to validate research hypotheses and accelerate experimental feedback loops. Communicate research findings, progress, and outcomes to the broader team through visualizations and reports. Develop research infrastructure and utilities for data analysis and model evaluations using standard engineering practices. Collaborate with other research scientists and engineers to maintain a regular feedback and communication loop.

Google is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to equal employment opportunity regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity or Veteran status. We also consider qualified applicants regardless of criminal histories, consistent with legal requirements. See also Google's EEO Policy and EEO is the Law. If you have a disability or special need that requires accommodation, please let us know by completing our Accommodations for Applicants form .

Trusted by 25,000+ job seekers

“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”

Jessica, London

Get help applying for this job

Skills

Machine Learning
Reinforcement Learning
Tool Use
Agentic Systems
JAX
Flax
PyTorch
Research
Data Analysis
Model Evaluations
Systems Design
Code Complexity
Large-Scale Distributed Training
Experimental Feedback
Collaboration
Communication

Location

London, England, United Kingdom

Sign up to applySee more jobs like this