The Investigo Group

Senior Site Reliability Engineer (SRE)

United Kingdom

Posted 23 days ago

Senior Site Reliability Engineer (SRE) – Kubernetes / OKD

Location

Remote (UK-based) – occasional travel to TIG Secure site or customer locations

Job Type

Full-time, Permanent

Security Clearance Requirements

A Security Check (SC) clearance is required for this role. You do not need an existing clearance, but you must meet the following criteria:

Right to work in the UK
Lived in the UK continuously for the past 5 years
Not spent more than 6 months outside the UK in total during this period
Willing to undergo security vetting as part of the onboarding process

About The Investigo Group (TIG)

TIG is a coalition of technology businesses specialising in secure platforms, software, data, AI, and digital solutions for regulated, mission-critical environments. Our brands include:

Voixtel – Secure communications and voice platforms
IIS – Secure internet access for public and private organisations
Vestigo Consulting – Expert consultancy, training, and sector insights
Collaboraite – Secure, user-centred AI and data solutions

We build secure, customer-focused products and services for organisations in highly regulated environments, combining engineering excellence with deep technical and data expertise.

At TIG, we foster an inclusive culture where everyone is empowered to succeed.

About You

You are a senior SRE, Cloud/Platform Engineer, or Kubernetes Specialist with hands-on production experience beyond simply deploying to Kubernetes.

What We’re Looking For

Deep expertise in operating Kubernetes/OKD in production—you own it, not just use it.
Strong experience with Linux, infrastructure automation (Ansible, Terraform, etc.), GitOps, CI/CD, observability, and secure platform operations.
Treat infrastructure as a product—prioritise reliability, clean automation, and useful observability.
Calm under pressure during incidents: leads blameless post-mortems and drives systemic improvements.
Experience in regulated, secure, or government environments (defence, finance, telecoms, etc.).
A senior individual contributor who mentors, influences engineering practices, and serves as a technical authority without formal management.

Key Soft Skills

Structured incident handling – methodical debugging over shortcuts.
Clear communication – well-written runbooks, post-mortems, and emails.
Cross-team collaboration – works seamlessly across networking, security, and application engineering.
Blameless post-mortem culture – holds a firm line without finger-pointing.
Mentoring ability – elevates the expertise of peers.
Opinionated & evidence-based – influences practices through example, not hierarchy.

About The Role

We’re seeking a Senior Site Reliability Engineer (SRE) – Kubernetes/OKD to own, harden, and mature our production Kubernetes/OKD platforms, which are critical to TIG’s government and regulated customer engagements.

This is a hands-on senior role, not a ticket-based operational job. You’ll work across:

Bare metal, virtualisation, and Kubernetes control plane operations (OKD, OpenShift).
Ingress, CI/CD, GitOps, identity, observability, and developer platforms.
Cross-team collaboration with platform, security, AI, networking, and DevOps teams.

Key Responsibilities

✅ Operate and harden OKD/OpenShift/Kubernetes clusters (on-prem/hybrid).
✅ Support migrations and modernise infrastructure (compute, networking, storage).
✅ Own GitOps (Argo CD) and CI/CD pipelines for platform and app components.
✅ Automate cloud-native app deployment (Helm, Kustomize, SPAs, microservices).
✅ Maintain core platform services (Keycloak, ingress, SLOs, error budgets, observability).
✅ Improve security hardening in compliance with regulated environments.
✅ Build observability (logs, metrics, traces, alerting, contributions).
✅ Script automation (Python, Go, Ansible, Terraform, Helm).
✅ Lead incident response, conduct post-mortems, and implement systemic fixes.
✅ Collaborate with security & networking teams on load balancing, segmentation, and accreditation.
✅ Write and maintain technical docs, runbooks, and engineering guidance.
✅ Mentor engineers as a senior technical authority (no line management needed).
✅ Rotate on-call responsibility with compensation.

Success In This Role

🎯 More reliable, measurable production Kubernetes/OKD with active error budgets & SLOs.
🎯 Mature GitOps implementation (rollback, drift detection).
🎯 Improved CI/CD pipelines (security, QA, compliance earlier).
🎯 Hardened & well-documented platform services.
🎯 Reduced incident noise with clear observability and decision support.
🎯 Stronger incident culture via clearer runbooks and post-mortems.
🎯 Technical authority in Kubernetes, cloud, and platform engineering.

Requirements

Essential

Production Kubernetes experience – more than just knowledge.
Deep Linux skills (sysadmin, networking, storage, performance).
Experience with OpenShift, OKD, vanilla Kubernetes, Rancher, EKS, AKS, or GKE.
Infrastructure as code (Ansible + Terraform/equivalent).
GitOps & CI/CD automation (Argo CD, Flux, GitHub Actions).
Observability stack (Prometheus, Grafana, Fluentd, OpenTelemetry, LGTM).
Identity & access (OIDC, SAML, SCIM, Keycloak).
Virtualisation (KVM, libvirt, or VMware experience).
Scripting (Python, Go, shell) for automation.
Troubleshooting & problem-solving expertise.
Experience in secure or regulated environments (government, finance, defence, enterprise).
Documentation skills for runbooks, post-mortems, and design.
Eligible for UK Security Check (SC) clearance.

Desirable (Plus Points!)

OpenShift/OKD-specific expertise (operators, MachineConfig, SCCs).
Service mesh (Istio, Linkerd).
Policy engines (OPA, Gatekeeper, Kyverno).
Software supply chain security (Sigstore, SigVault, image signing).
Storage experience (Ceph, Longhorn, OpenShift Data Foundation).
Networking (BGP, VXLAN, Palo Alto, Juniper).
AI/GPU platform ops (NVIDIA DGX, ML workloads).
Kubernetes certifications (CKA, CKAD, CKS, or Red Hat).
UK SC clearance (active or recent).
Open-source contributions to Kubernetes or related ecosystems.

What The Role Offers You

🆓 Ownership of OKD/Kubernetes estate shipping into government-regulated environments
💻 Modern, self-hosted toolchain (OKD/OpenShift + NVIDIA DGX AI platform)
🏢 Small, senior engineering team (minimal bureaucracy, CTO-led leadership)
🌍 True remote-first culture (travel only for high-value initiatives)

The Team

Platform Engineering maintains the foundational Kubernetes/OKD estate, developer tools, and security infrastructure supporting:

Government and regulated commercial customers.
Accredited environments.

Benefits

Private medical insurance
Health cash plan
4x life assurance
Flexible, inclusive work culture
36 days’ holiday (increasing to 30+)
Continuous learning & development
Performance-based bonus potential
Discounts on products & services
Pension scheme contributions
Electric Vehicle (EV) car scheme
Regular pay reviews
And more

Equal Opportunities

At TIG, we commit to equal opportunities and value diversity, equity, and inclusion. We do not discriminate based on:

Race, religion, colour, national origin
Sex, gender identity, sexual orientation
Age, marital status, veteran status
Disability status

We’re happy to discuss reasonable accommodations throughout the hiring process.

Skills

Kubernetes
Linux
Infrastructure as Code
GitOps
CI/CD
Observability
Identity Management
Cloud Engineering
Scripting
Troubleshooting
Security
Networking
Virtualization
Automation
Platform Operations
Production Environments
