Rodeo
ResourcesPartnersSign in

AWTG Ltd

AI / ML Engineer

London
Posted 1 day ago
Sign up to applySee more jobs like this

How your CV stacks up

1Upload CV
2Analyse CV
3Improve CV

Upload your CV to see how well it fits this job role

?%

AI / ML Engineer

Job Title: AI/ML Engineer

We are looking for a goal-oriented and driven AI/ML Engineer with strong experience in building, deploying, and scaling AI/ML applications. The ideal candidate will have hands-on experience with generative AI, agentic AI systems, RAG applications, LLM platforms, APIs, cloud deployment, and production-ready AI architectures.

Key Responsibilities

AI/ML Model Development

  • Develop, train, fine-tune, and optimise machine learning, Generative AI and neural network models to meet specific business and functional requirements.

Generative AI and Agentic AI Development

  • Design and build generative AI applications, agentic AI workflows, and multi-agent architectures using modern AI frameworks and orchestration tools.

RAG and GraphRAG Applications

  • Build Retrieval-Augmented Generation applications, including GraphRAG solutions using knowledge graphs, Neo4j, Astra DB, vector databases, and related retrieval technologies.

LLM Application Development

  • Work with both open-source and closed-source large language models to build scalable AI applications, including model routing, prompt engineering, evaluation, and optimisation.

Voice-Based AI Implementation

  • Design and implement voice-based AI solutions, including speech-to-text, text-to-speech, conversational AI, and voice-enabled intelligent assistants.

API Development and Integration

  • Create robust API endpoints using tools such as FastAPI to enable seamless access to AI models and integration with external systems and applications.

AI Platform Development

  • Architect and develop a user-friendly AI platform where multiple AI models can be accessed, managed, and utilised through API calls.

Reasons to use Rodeo

I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?

Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.

Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.

Start with a chat, not a search bar

Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.

P

Graduate Consultant — 2026 Scheme

PwC·London, UK
£35,000/yr

Why you're a good match

Strong

Your economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.

See breakdown
Save jobNot relevant
View details

It searches the market for you

Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.

Why you're a good match

You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.

See breakdown
Strong

Experience fit

Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.

See breakdown
Strong

Only hits

No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.

System Design and Scalable Architecture

  • Contribute to the design of scalable, reliable AI systems, including queue-based processing, asynchronous workflows, distributed services, caching mechanisms, and production-grade backend architecture.

LLM Performance and Caching Optimisation

  • Optimise LLM performance and scalability using caching mechanisms such as KV cache, response caching, prompt caching, and efficient model-serving strategies.

Observability and Monitoring

  • Implement observability, logging, tracing, monitoring, and evaluation workflows using tools such as Langfuse and related platforms to track system performance, reliability, cost, and user interactions.

Cloud Deployment and Infrastructure

  • Deploy AI/ML applications across different cloud providers and server environments, ensuring scalability, reliability, security, and performance.

Continuous Improvement

  • Continuously monitor, update, and improve models, APIs, workflows, and platforms based on user feedback, system performance, and evolving AI technologies.

Skills And Qualifications

  • Minimum 3 years of experience in building AI/ML software and production-ready AI applications.
  • Strong expertise in machine learning, neural networks, deep learning, and generative AI applications.
  • Proficiency in Python and AI/ML frameworks such as TensorFlow, PyTorch, NumPy, LangChain, LangGraph, FastAPI, and related tools.
  • Experience with agentic AI, multi-agent architecture, RAG, GraphRAG, and LLM-based application development.
  • Hands-on experience with Langfuse, LiteLLM, observability tools, tracing, model monitoring, and AI evaluation workflows.
  • Experience working with queues, asynchronous processing, caching mechanisms, scalable system design, and backend architecture.
  • Strong understanding of knowledge graphs, vector databases, Neo4j, Astra DB, and graph-based retrieval systems.
  • Experience with both open-source and closed-source LLMs.
  • Experience deploying AI applications across different cloud providers and server environments.
  • Good understanding of software engineering best practices, including clean code, testing, documentation, CI/CD, version control, and maintainable system design.
  • Excellent problem-solving abilities with strong attention to detail.
  • Strong communication skills and the ability to collaborate effectively in a team-oriented environment.

Get help with your application

Your very own career expert that helps elevate your application to the next level.

Get help applying for this job

Bonus / Preferred Experience

  • Experience implementing voice-based AI applications, including conversational AI, speech-to-text, text-to-speech, and voice assistant technologies.
  • Experience scaling LLM applications using caching mechanisms such as KV cache, prompt caching, response caching, and efficient inference strategies.
  • Experience working across multiple cloud providers.
  • Experience integrating both open-source and closed-source LLMs into production applications.
  • Experience with advanced LLM operations, including model routing, cost optimisation, monitoring, and performance tuning.

Educational Requirements

  • Bachelor’s or Master’s degree in Computer Science, Engineering, Artificial Intelligence, Data Science, or a related field, with a focus on AI/ML.

Working Hours

  • Candidates must be available to work core UK business hours, 9:00–17:30 GMT/BST, Monday–Friday.
Trusted by 25,000+ job seekers

“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”

Jessica, London

Get help applying for this job

Skills

AI/ML Applications
Generative AI
Agentic AI
RAG Applications
LLM Platforms
APIs
Cloud Deployment
Production-Ready Architectures
Python
TensorFlow
PyTorch
Neo4j
Astra DB
Observability
Caching Mechanisms
Model Monitoring

Location

London, England, United Kingdom

Sign up to applySee more jobs like this