Rodeo
ResourcesPartnersSign in

Insight International (UK) Ltd

Chaos Engineering Tech Lead

Sheffield
Posted 2 days ago
Sign up to applySee more jobs like this

How your CV stacks up

1Upload CV
2Analyse CV
3Improve CV

Upload your CV to see how well it fits this job role

?%

Chaos Engineering Tech Lead

Chaos Engineering Tech Lead

About the Role

The Chaos Engineering Tech Lead will lead the development and execution of chaos engineering capabilities for the Treasury Technology team, aiming to enhance platform resilience, recoverability, and operational stability across critical services. This role entails providing technical leadership to:

  • Define chaos engineering practices
  • Design and govern experiments
  • Drive the remediation of resilience gaps across the Treasury technology estate

Responsibilities

  • Strategy and Roadmap

    • Define and lead the chaos engineering strategy, roadmap, and operating model for Treasury Technology
    • Establish best practices, guardrails, and standards for safe, effective chaos engineering
  • Experiment Design & Execution

    • Design, review, and execute chaos experiments to validate resilience across:
      • Infrastructure
      • Platforms
      • Applications
      • Service dependencies
    • Identify weaknesses in:
      • Resilience
      • Single points of failure
      • Recovery gaps
      • Operational risks
  • Remediation & Improvement

    • Drive remediation by collaborating with engineering and platform teams
    • Ensure experiments are:
      • Measurable
      • Controlled
      • Aligned with service resilience objectives and business criticality
  • Resilience Metrics & Reporting

    • Define resilience metrics, reporting, and evidence for tracking maturity
    • Demonstrate continuous improvement over time

Reasons to use Rodeo

I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?

Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.

Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.

Start with a chat, not a search bar

Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.

P

Graduate Consultant — 2026 Scheme

PwC·London, UK
£35,000/yr

Why you're a good match

Strong

Your economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.

See breakdown
Save jobNot relevant
View details

It searches the market for you

Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.

Why you're a good match

You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.

See breakdown
Strong

Experience fit

Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.

See breakdown
Strong

Only hits

No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.

  • Resilience-by-Design Principles

    • Embed proactive resilience into:
      • Engineering practices
      • Delivery processes
      • Operational readiness
    • Run Gamdays (game days) to rehearse failure scenarios and validate:
      • Runbooks
      • Alerting systems
      • On-call readiness
    • Foster a resilience-focused team culture
  • Cross-Team Collaboration

    • Partner with architecture, SRE, DevOps, infrastructure, security, and support teams
    • Align resilience activities with engineering priorities
  • Coaching & Mentorship

    • Coach engineers in:
      • Resilience thinking
      • Experiment design
      • Chaos engineering best practices
  • Stakeholder Communication

    • Clearly communicate:
      • Technical findings
      • Resilience risks
      • Improvement priorities
    • Target audience: senior stakeholders and decision-makers

Requirements

Essential

  • University Degree (or equivalent) in Computer Science, Software Engineering, or related discipline
  • Excellent written & spoken English
  • Strong experience in a technical leadership role within:
    • Platform engineering
    • Site Reliability Engineering (SRE)
    • Cloud engineering
    • Resilience engineering (or related discipline)
  • Proven track record in designing and implementing chaos engineering, fault injection, or resilience testing practices in complex enterprise environments
  • Deep hands-on expertise in:
    • Kubernetes (deployment, failure handling, scaling, networking, troubleshooting)
  • Strong experience with:
    • GCP (Google Cloud Platform)
    • Cloud-native platforms (including operational & resilience considerations)
  • Strong understanding of:
    • Distributed systems
    • Failure modes
    • System recovery
    • Service resilience patterns
  • Experience working with:
    • Microservices
    • APIs
    • Platform services
    • Cloud-based applications
  • Proficiency in observability (metrics, logging, tracing, alerting, service health monitoring)
  • Experience with:
    • Automation
    • CI/CD pipelines
    • Engineering tooling to support scalable resilience practices
  • Strong problem-solving capabilities (diagnose technical issues, drive practical solutions)
  • Excellent stakeholder management & communication skills in:
    • Engineering teams
    • Senior management
  • Resilience mindset with a focus on:
    • Proactive risk reduction
    • Continuous improvement

Get help with your application

Your very own career expert that helps elevate your application to the next level.

Get help applying for this job

Preferred

  • Experience in financial services or another highly regulated environment
  • Work with business-critical platforms, prioritising:
    • Stability
    • Recoverability
    • Availability
  • Familiarity with:
    • SRE principles
    • Incident management
    • Disaster recovery
    • Operational resilience practices
  • Experience introducing new engineering capabilities and driving adoption across multiple teams
  • Knowledge of the technology stack used within Treasury platforms would be advantageous
Trusted by 25,000+ job seekers

“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”

Jessica, London

Get help applying for this job

Skills

Chaos Engineering
Technical Leadership
Kubernetes
Cloud Engineering
Resilience Engineering
Distributed Systems
Observability
Automation
CI/CD Pipelines
Problem-Solving
Stakeholder Management
Resilience Mindset
Incident Management
Disaster Recovery
Operational Resilience
Microservices

Location

Sheffield, England, United Kingdom

Sign up to applySee more jobs like this