Rodeo
ResourcesPartnersSign in

Epsilon

Manager, System and Platform Operations

London
Posted 3 months ago
Sign up to applySee more jobs like this

How your CV stacks up

1Upload CV
2Analyse CV
3Improve CV

Upload your CV to see how well it fits this job role

?%

Manager, System and Platform Operations

Overview

How You’ll Make an Impact

A subsidiary of Publicis Groupe, Epsilon is a leading provider of multi-channel marketing services, technologies, and database solutions. We do more than collect and store data, and we might be the most important Internet company you’ve never heard of. Join our team for your chance to work in the digital marketing space and solve meaningful problems on a massive scale—and have fun doing it.

The System and Platform Operations Manager is a technical leadership role that is responsible for the support, reliability and stability of Epsilon Retail Media production systems, environments and offerings. The team owns the reliability vision for the company, driving continuous improvement through a combination of development and operations initiatives as well as process excellence. This position and their team has solid-line responsibility for operations including the deployment, management, monitoring, reporting, troubleshooting, and repair of production systems. Core to the success of the role is to provide a premium customer support experience focused on a “center of excellence” that allows for a full-service delivery support cycle.

This role is responsible for managing the Platform Operation Team centralized within a single geo-region, orchestrating the regional teamwork, serving with both technical and professional support, and championing the company values. The Platform Operations Engineer works closely with the Engineering team to ensure ongoing system stability and supports the Technical Account Managers from an environment's perspective.

The Platform Operations team is responsible for supporting all retailers once they are live. Critically important is how this team collaborates and liaises with other teams such as Customer Support, Technical Account Management, Engineering and Customer Success teams.

Responsibilities

What you’ll do:

Establish and manage operational practices and ensure we design, implement and operate a support model that is fit for purpose for our future. Adopt a “Measure Everything” approach to ensure that internal service level objectives and customer service levels agreements are exceeded including executive level reporting on operational health metrics such as SLAs, incident resolution, performance, availability, reliability, capacity etc. Take ownership of complex issues related to performance, reliability, and scalability and leading resolution of serious incidents and events including communications with customers and wider stakeholders. Provide insight and expertise on how customers will perceive the changes or impacts to customers to drive customer organization change management and communication. Empower the Delivery teams to release new products, features, updates and fixes quickly, while ensuring Platforms remain reliable and stable. Work with the wider Engineering, Product, Delivery and Security teams to ensure that appropriate attention is given to production/system reliability. Identify the capabilities needed to meet the current and emerging business needs of a significant function. As subject matter expert on the team, maintain understanding of current technology, database management, reliability practices, and future trends through ongoing education, conference attendance and industry press.

Reasons to use Rodeo

I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?

Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.

Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.

Start with a chat, not a search bar

Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.

P

Graduate Consultant — 2026 Scheme

PwC·London, UK
£35,000/yr

Why you're a good match

Strong

Your economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.

See breakdown
Save jobNot relevant
View details

It searches the market for you

Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.

Why you're a good match

You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.

See breakdown
Strong

Experience fit

Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.

See breakdown
Strong

Only hits

No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.

Qualifications

Who You Are:

At least 5 years of experience of hands-on experience in Site Reliability focused positions. Strong knowledge of containerization technologies (Docker, Kubernetes). Experience with infrastructure as code (Terraform). Solid understanding of networking, security, and system architecture. Proficient in scripting languages (Java, Golang, Python, Bash, or similar). Experience with monitoring and observability tools (DataDog, Prometheus, Grafana). Knowledge of database management systems (PostgreSQL, Bigtable). Understanding of API and microservices architecture. Strong people leadership skills with at least a year in leading and driving high-performance technical teams Operations teams within enterprise environments with knowledge of DevOps, ITIL, Cloud Services, IT Infrastructure and Operations supporting and maintaining production and development environments and building cloud services that are secure, reliable, scalable and observable Experience with establishing Service Delivery strategies that align to new ways of work methods, including Agile. Experience of establishing and delivering IT support services in a high availability (HA) environment such as 24/7 operations.

Click here to view how Epsilon transforms marketing with 1 View, 1 Vision, 1 Voice.

Additional Information

When You Join Us, We’ll Create Something EPIC Together

Epsilon is a global data, technology and services company that powers the marketing and advertising ecosystem. For decades, we’ve provided marketers from the world’s leading brands the data, technology and services they need to engage consumers with 1 View, 1 Vision and 1 Voice. 1 View of their universe of potential buyers. 1 Vision for engaging each individual. And 1 Voice to harmonize engagement across paid, owned and earned channels.

Get help with your application

Your very own career expert that helps elevate your application to the next level.

Get help applying for this job

Epsilon’s comprehensive portfolio of capabilities across our suite of digital media, messaging and loyalty solutions bridge the divide between marketing and advertising technology. We process 400+ billion consumer actions each day using advanced AI and hold many patents of proprietary technology, including real-time modeling languages and consumer privacy advancements. Thanks to the work of every employee, Epsilon has been consistently recognized as industry-leading by Forrester, Adweek and the MRC. Epsilon is a global company with more than 9,000 employees around the world.

Our pillars aren't just words. They're how we show up every day.

People centricity: We focus on employee well-being in an environment where colleagues truly care about each other. Collaboration: We work together, support one another, and collectively achieve goals. Growth: There are endless opportunities for growth through learning, development and career advancement. Innovation: We drive progress through cutting-edge solutions and forward-thinking approaches. Flexibility: We’ve created a balance between work and personal life, and we encourage adaptability to solve problems creatively.

Our values guide us to create value for our clients, our people and consumers.

Act with integrity Work together to win together Innovate with purpose Respect all voices Empower with accountability

These pillars and values are our foundation—shaping our culture, guiding our decisions, and uniting us in common purpose.

Because You Matter

We know that we have some of the brightest and most talented employees in the world, and we believe in rewarding them accordingly. If you work here, expect competitive compensation, a great benefits package and endless opportunities to advance your career.

We offer hybrid working opportunities, with our office space located in the Iconic Television Centre, White City.

As part of our dedication to enhance our inclusive and diverse workforce, Epsilon is committed to equal access to opportunity for people without regard to race, age, sex, disability, neurodiversity, sexual orientation, gender identity, pregnancy and maternity, marriage and civil partnership or religion or belief. We are committed to providing reasonable adjustments for candidates in our application process.

Trusted by 25,000+ job seekers

“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”

Jessica, London

Get help applying for this job

Skills

Site Reliability
Containerization
Docker
Kubernetes
Infrastructure as Code
Terraform
Networking
Security
System Architecture
Scripting Languages
Java
Golang
Python
Bash
Monitoring Tools
Database Management

Location

London, England, United Kingdom

Sign up to applySee more jobs like this