MLOps Engineer (LLM/GenAI)

Company: HSBC
Job Type: Full Time


Job Description - MLOps Engineer (LLM/GenAI)


If you’re looking for a career that will help you stand out, join HSBC, and fulfil your potential - whether you want a career that could take you to the top, or an exciting new direction, we offer opportunities, support and rewards that will take you further.


We’re one of the largest banking and financial services organisations in the world, with a network that covers more than 50 countries and territories. We aim to be where the growth is, enabling businesses to thrive and economies to prosper, and, ultimately, helping people fulfil their hopes and realise their ambitions.


We are seeking an MLOps Engineer (LLM/GenAI).


In this fantastic role, you’ll engineer production-grade infrastructure for modern AI: hosting LLMs and speech/embedding models, pushing inference performance on real hardware, and building repeatable fine-tuning pipelines that ship domain-adapted models into production.


If you like hard performance problems, platform engineering, and seeing your work used broadly across a global organisation, this role is built for you.


As an HSBC employee in the UK, you’ll have access to tailored professional development opportunities and a competitive pay and benefits package. This includes private healthcare for all UK-based employees, enhanced maternity and adoption pay and support when you return to work, and a contributory pension scheme with a generous employer contribution.


In this role, you will:



  • Design, build, and operate scalable model hosting platforms for LLMs, embeddings, and STT/TTS across heterogeneous hardware

  • Optimise inference for latency, throughput, and cost (e.g., quantisation, KV-cache optimisation, dynamic/continuous batching)

  • Evaluate and integrate inference frameworks (e.g., vLLM, TensorRT-LLM, SGLang) to maximise performance on target hardware

  • Own inference health/performance monitoring (latency, throughput, TTFT, memory, availability) and troubleshoot bottlenecks/deployment issues

  • Build end-to-end fine-tuning pipelines (data prep → distributed training → validation) and integrate fine-tuned models into the hosting/inference stack


To be successful in this role you should have the following skills:



  • Extensive experience in building AI platforms covering model hosting/inference optimisation and fine-tuning pipelines (LLM experience strongly preferred)

  • Strong Python and CUDA engineering; solid understanding of GPU/CPU architecture and HPC fundamentals

  • Deep inference optimisation expertise: KV-cache, batching, quantisation (INT4/FP8/GPTQ/AWQ), operator optimisation, framework integration (vLLM/TensorRT-LLM/SGLang)

  • Production hosting experience with Docker/Kubernetes and cloud platforms (AWS/GCP/Azure)

  • End-to-end fine-tuning expertise: data preparation, distributed training, hyperparameter tuning, HF/Accelerate/LoRA/QLoRA, plus benchmarking/monitoring/troubleshooting


Opening up a world of opportunity.


Being open to different points of view is important for our business and the communities we serve. At HSBC, we’re dedicated to creating diverse and inclusive workplaces for everyone, no matter their gender, ethnicity, disability, religion, sexual orientation, socio-economic background or age. We are committed to removing barriers and ensuring careers at HSBC are inclusive and accessible so that everyone can be at their best. We take pride in being a Disability Confident Leader and will offer an interview to people with disabilities, long-term conditions or neurodivergent candidates who meet the minimum criteria for the role.


If you have a need that requires accommodations or changes during the recruitment process, please get in touch with our Recruitment Helpdesk via [email protected].



Original job MLOps Engineer (LLM/GenAI) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.