
How your CV stacks up
Upload your CV to see how well it fits this job role
?%
AI / ML Engineer
Job Title: AI/ML Engineer
We are looking for a goal-oriented and driven AI/ML Engineer with strong experience in building, deploying, and scaling AI/ML applications. The ideal candidate will have hands-on experience with generative AI, agentic AI systems, RAG applications, LLM platforms, APIs, cloud deployment, and production-ready AI architectures.
Key Responsibilities
AI/ML Model Development
- Develop, train, fine-tune, and optimise machine learning, Generative AI and neural network models to meet specific business and functional requirements.
Generative AI and Agentic AI Development
- Design and build generative AI applications, agentic AI workflows, and multi-agent architectures using modern AI frameworks and orchestration tools.
RAG and GraphRAG Applications
- Build Retrieval-Augmented Generation applications, including GraphRAG solutions using knowledge graphs, Neo4j, Astra DB, vector databases, and related retrieval technologies.
LLM Application Development
- Work with both open-source and closed-source large language models to build scalable AI applications, including model routing, prompt engineering, evaluation, and optimisation.
Voice-Based AI Implementation
- Design and implement voice-based AI solutions, including speech-to-text, text-to-speech, conversational AI, and voice-enabled intelligent assistants.
API Development and Integration
- Create robust API endpoints using tools such as FastAPI to enable seamless access to AI models and integration with external systems and applications.
AI Platform Development
- Architect and develop a user-friendly AI platform where multiple AI models can be accessed, managed, and utilised through API calls.
Reasons to use Rodeo
I’m in my final year doing Economics and I don’t know whether to apply for grad schemes now or do a masters first. What do you think?
Honest answer — it depends on where you want to end up. A lot of top grad schemes (Big 4, civil service, banking) don’t need a masters. Let’s look at the ones you’d be competitive for now, and we can decide if a masters actually adds anything.
Also worth knowing: most autumn 2026 applications are open now. Timing matters more than you think.
Start with a chat, not a search bar
Grad scheme, placement, apprenticeship? Not sure what you want yet — that's fine. Your agent talks it through with you and turns "I have no idea" into a shortlist.
Graduate Consultant — 2026 Scheme
Why you're a good match
StrongYour economics background and your summer at a regional bank line up with what PwC looks for on the consulting scheme. Applications close in four weeks.
See breakdownIt searches the market for you
Every day your agent scans the market matching roles against what actually matters to you, not just keywords on a CV.
Why you're a good match
You’ve got the grades and the economics background, and your bank internship is exactly the experience this scheme looks for. Apply soon — deadlines close within the month.
Experience fit
Your summer at the bank plus your econometrics coursework map directly to the day-one responsibilities on this scheme — client modelling, market briefings, and deal support.
Only hits
No noise. No "maybe this fits." Just roles with a clear explanation of why they're right — and where to focus when applying.
System Design and Scalable Architecture
- Contribute to the design of scalable, reliable AI systems, including queue-based processing, asynchronous workflows, distributed services, caching mechanisms, and production-grade backend architecture.
LLM Performance and Caching Optimisation
- Optimise LLM performance and scalability using caching mechanisms such as KV cache, response caching, prompt caching, and efficient model-serving strategies.
Observability and Monitoring
- Implement observability, logging, tracing, monitoring, and evaluation workflows using tools such as Langfuse and related platforms to track system performance, reliability, cost, and user interactions.
Cloud Deployment and Infrastructure
- Deploy AI/ML applications across different cloud providers and server environments, ensuring scalability, reliability, security, and performance.
Continuous Improvement
- Continuously monitor, update, and improve models, APIs, workflows, and platforms based on user feedback, system performance, and evolving AI technologies.
Skills And Qualifications
- Minimum 3 years of experience in building AI/ML software and production-ready AI applications.
- Strong expertise in machine learning, neural networks, deep learning, and generative AI applications.
- Proficiency in Python and AI/ML frameworks such as TensorFlow, PyTorch, NumPy, LangChain, LangGraph, FastAPI, and related tools.
- Experience with agentic AI, multi-agent architecture, RAG, GraphRAG, and LLM-based application development.
- Hands-on experience with Langfuse, LiteLLM, observability tools, tracing, model monitoring, and AI evaluation workflows.
- Experience working with queues, asynchronous processing, caching mechanisms, scalable system design, and backend architecture.
- Strong understanding of knowledge graphs, vector databases, Neo4j, Astra DB, and graph-based retrieval systems.
- Experience with both open-source and closed-source LLMs.
- Experience deploying AI applications across different cloud providers and server environments.
- Good understanding of software engineering best practices, including clean code, testing, documentation, CI/CD, version control, and maintainable system design.
- Excellent problem-solving abilities with strong attention to detail.
- Strong communication skills and the ability to collaborate effectively in a team-oriented environment.


Get help with your application
Your very own career expert that helps elevate your application to the next level.
Bonus / Preferred Experience
- Experience implementing voice-based AI applications, including conversational AI, speech-to-text, text-to-speech, and voice assistant technologies.
- Experience scaling LLM applications using caching mechanisms such as KV cache, prompt caching, response caching, and efficient inference strategies.
- Experience working across multiple cloud providers.
- Experience integrating both open-source and closed-source LLMs into production applications.
- Experience with advanced LLM operations, including model routing, cost optimisation, monitoring, and performance tuning.
Educational Requirements
- Bachelor’s or Master’s degree in Computer Science, Engineering, Artificial Intelligence, Data Science, or a related field, with a focus on AI/ML.
Working Hours
- Candidates must be available to work core UK business hours, 9:00–17:30 GMT/BST, Monday–Friday.
“It took my CV and asked me questions relevant to understanding what kind of jobs to suggest for me. Suggestions were almost perfect. Jobs were exactly what I’ve been looking for.”
Jessica, London
Skills
Location