micro1
(Research Engineering) Member of Technical Staff

How your CV stacks up
Upload your CV to see how well it fits this job role
?%
(Research Engineering) Member of Technical Staff
Member of Technical Staff, Research Engineering
Job Type: Full-time
Location: Remote
The Role
We are seeking a Research Engineer to operate at the frontier of Reinforcement Learning (RL), developing novel environments, training pipelines, and evaluation systems that advance the capabilities of modern AI models. This role sits at the intersection of research and production, translating experimental ideas into scalable, high-performance systems.
What You’ll Work On
- Architect self-contained RL environments that capture complex, real-world tasks, including reward functions, verifiers, and evaluation logic.
- Design and scale episode pipelines and multi-component training processes (MCPs) to support reproducible experimentation.
- Build automated data generation systems, leveraging synthetic data to accelerate training cycles without compromising quality.
- Develop and integrate AI-driven evaluation and quality assurance systems for automated grading, validation, and feedback loops.
- Fine-tune and optimize open-source RL models using internally generated datasets and custom training strategies.
- Establish benchmarking frameworks to measure model capability, robustness, and data quality across tasks.
- Contribute to the release and analysis of evaluations on internal and external benchmark platforms (e.g., micro1 benchmarks).
Reasons to use Rodeo
I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?
Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.
Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.
Start with a chat, not a search bar
Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.
Graduate Consultant — 2026 Scheme
Why you're a good match
StrongYour economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.
See breakdownIt searches the market for you
Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.
Why you're a good match
You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.
Experience fit
Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.
Only hits
No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.
What We're Looking For
- Deep experience in Reinforcement Learning, including environment design and training dynamics.
- Strong track record of building and scaling RL systems, pipelines, or experimentation frameworks.
- Proficient in automation and data generation, including synthetic data pipelines.
- Familiar with automated evaluation systems, model validation, and quality assurance workflows.
- experienced in fine-tuning and evaluating open-source ML models.
- Clear, concise communicator with strong technical writing skills.
- Comfortable operating in fast-paced, research-driven, and highly collaborative environments.


Get help with your application
Your very own career expert that helps elevate your application to the next level.
Preferred
- Experience publishing benchmarks, evaluations, or research artifacts.
- Familiarity with evaluation ecosystems (e.g., micro1 benchmarks or similar frameworks).
- Background in scalable infrastructure for large-scale RL experimentation.
“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”
Jessica, London
Skills
Location