The Investigo Group
Senior Site Reliability Engineer (SRE)

How your CV stacks up
Upload your CV to see how well it fits this job role
?%
Senior Site Reliability Engineer (SRE)
Senior Site Reliability Engineer (SRE) – Kubernetes / OKD
Location
Remote (UK-based) – occasional travel to TIG Secure site or customer locations
Job Type
Full-time, Permanent
Security Clearance Requirements
A Security Check (SC) clearance is required for this role. You do not need an existing clearance, but you must meet the following criteria:
- Right to work in the UK
- Lived in the UK continuously for the past 5 years
- Not spent more than 6 months outside the UK in total during this period
- Willing to undergo security vetting as part of the onboarding process
About The Investigo Group (TIG)
TIG is a coalition of technology businesses specialising in secure platforms, software, data, AI, and digital solutions for regulated, mission-critical environments. Our brands include:
- Voixtel – Secure communications and voice platforms
- IIS – Secure internet access for public and private organisations
- Vestigo Consulting – Expert consultancy, training, and sector insights
- Collaboraite – Secure, user-centred AI and data solutions
We build secure, customer-focused products and services for organisations in highly regulated environments, combining engineering excellence with deep technical and data expertise.
At TIG, we foster an inclusive culture where everyone is empowered to succeed.
About You
You are a senior SRE, Cloud/Platform Engineer, or Kubernetes Specialist with hands-on production experience beyond simply deploying to Kubernetes.
What We’re Looking For
- Deep expertise in operating Kubernetes/OKD in production—you own it, not just use it.
- Strong experience with Linux, infrastructure automation (Ansible, Terraform, etc.), GitOps, CI/CD, observability, and secure platform operations.
- Treat infrastructure as a product—prioritise reliability, clean automation, and useful observability.
- Incident response calmness—leads blameless post-mortems and drives systemic improvements.
- Experience in regulated, secure, or government environments (defence, finance, telecoms, etc.).
- A senior individual contributor who mentors, influences engineering practices, and serves as a technical authority without formal management.
Key Soft Skills
- Structured incident handling – methodical debugging over shortcuts.
- Clear communication – well-written runbooks, post-mortems, and emails.
- Collaborative collaboration – works seamlessly across networking, security, and application engineering.
- Blameless post-mortem culture – holds a firm line without finger-pointing.
- Mentoring ability – elevates the expertise of peers.
- Opinionated & evidence-based – influences practices through example, not hierarchy.
Reasons to use Rodeo
I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?
Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.
Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.
Start with a chat, not a search bar
Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.
Graduate Consultant — 2026 Scheme
Why you're a good match
StrongYour economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.
See breakdownIt searches the market for you
Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.
Why you're a good match
You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.
Experience fit
Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.
Only hits
No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.
About The Role
We’re seeking a Senior Site Reliability Engineer (SRE) – Kubernetes/OKD to own, harden, and mature our production Kubernetes/OKD platforms, which are critical to TIG’s government and regulated customer engagements.
This is a hands-on senior role, not a ticket-based operational job. You’ll work across:
- Bare metal, virtualisation, and Kubernetes control plane operations (OKD, OpenShift).
- Ingress, CI/CD, GitOps, identity, observability, and developer platforms.
- Cross_team collaboration with platform, security, AI, networking, and DevOps teams.
Key Responsibilities
✅ Operate and harden OKD/OpenShift/Kubernetes clusters (on-prem/hybrid). ✅ Support migrations and modernise infrastructure (compute, networking, storage). ✅ Own GitOps (Argo CD) and CI/CD pipelines for platform and app components. ✅ Automate cloud-native app deployment (Helm, Kustomize, SPAs, microservices). ✅ Maintain core platform services (Keycloak, ingress, SLOs, errors budgets, observability). ✅ Improve security hardening in compliance with regulated environments. ✅ Build observability (logs, metrics, traces, alerting, contributions). ✅ Script automation (Python, Go, Ansible, Terraform, Helm). ✅ Lead incident response, conduct post-mortems, and implement systemic fixes. ✅ Collaborate with security & networking teams on load balancing, segmentation, and accreditation. ✅ Write and maintain technical docs, runbooks, and engineering guidance. ✅ Mentor engineers as a senior technical authority (no line management needed). ✅ Rotate on-call responsibility with compensation.
Success In This Role
🎯 More reliable, measurable prod Kubernetes/OKD with active error budgets & SLOs. 🎯 Mature GitOps implementation (rollback, drift detection). 🎯 Improved CI/CD pipelines (security, QA, compliance earlier). 🎯 Hardened & well-documented platform services. 🎯 Reduced incident noise with clear observability and decision support. 🎯 Stronger incident culture via clearer runbooks and post-mortems. 🎯 Technical authority in Kubernetes, cloud, and platform engineering.
Requirements
Essential
- Production Kubernetes experience – more than just knowledge.
- Deep Linux skills (sysadmin, networking, storage, performance).
- Experience with OpenShift, OKD, vanilla Kubernetes, Rancher, EKS, AKS, or GKE.
- Infrastructure as code (Ansible + Terraform/equivalent).
- GitOps & CI/CD automation (Argo CD, Flux, GitHub Actions).
- Observability stack (Prometheus, Grafana, Fluentd, OpenTelemetry, LGTM).
- Identity & access (OIDC, SAML, SCIM, Keycloak).
- Virtualisation (KVM, libvirt, or VMware experience).
- Scripting (Python, Go, shell) for automation.
- Troubleshooting & problem-solving expertise.
- Secure regime experience (government, finance, defence, enterprise).
- Documentation skills for runbooks, post-mortems, and design.
- Eligible for UK Security Check (SC) clearance.


Get help with your application
Your very own career expert that helps elevate your application to the next level.
Desirable (Plus Points!)
- OpenShift/OKD-specific expertise (operators, MachineConfig, SCCs).
- Service mesh (Istio, Linkerd).
- Policy engines (OPA, Gatekeeper, Kyverno).
- Software supply chain security (Sigstore, SigVault, image signing).
- Storage experience (Ceph, Longhorn, OpenShift Data Foundation).
- Networking (BGP, VXLAN, Palo Alto, Juniper).
- AI/GPU platform ops (NVIDIA DGX, ML workloads).
- Kubernetes certifications (CKA, CKAD, CKS, or Red Hat).
- UK SC clearance (active or recent).
- Open-source contributions to Kubernetes or related ecosystems.
What The Role Offers You
🆓 Ownership of OKD/Kubernetes estate shipping into government-regulated environments 💻 Modern, self-hosted toolchain (NKD/OpenShift + NVIDIA DGX AI platform) 🏢 Small, senior engineering team (minimal bureaucracy, CTO-led leadership) 🌍 True remote-first culture (travel only for high-value initiatives)
The Team
Platform Engineering maintains the foundational Kubernetes/OKD estate, developer tools, and security infrastructure supporting:
- Government and regulated commercial customers.
- Accredited environments.
Benefits
- Private medical insurance
- Health cash plan
- 4x life assurance
- Flexible, inclusive work culture
- 36 days’ holiday (increasing to 30+)
- Continuous learning & development
- Performance-based bonus potential
- Discounts on products & services
- Pension scheme contributions
- Electric Vehicle (EV) car scheme
- Regular pay reviews
- And more
Equal Opportunities
At TIG, we commit to equal opportunities and value diversity, equity, and inclusion. We do not discriminate based on:
- Race, religion, colour, national origin
- Sex, gender identity, sexual orientation
- Age, marital status, veteran status
- Disability status
We’re happy to discuss reasonable accommodations throughout the hiring process.
“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”
Jessica, London
Skills
Location