Logo-of-Qualcomm-hiring-for-jobs-in-Saudi-Arabia-on-GrabJobs

LLM Serving Engineer (Cloud AI Engineering) - Riyadh, KSA

icon building Company : Qualcomm
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - LLM Serving Engineer (Cloud AI Engineering) - Riyadh, KSA

## \nCompany:\n\nQualcomm Middle East Information Technology Company LLC\n\n## Job Area:\n\nEngineering Group, Engineering Group \u003e Systems Engineering\n\nGeneral Summary:\n\nAbout Us\n\nQualcomm is growing its presence in Riyadh and is hiring Data Centre Engineers to support our expanding infrastructure across the region.\n\nAs Saudi Arabia accelerates its digital transformation under Vision 2030, Qualcomm is investing in world\u2011class computing and data centre capabilities to power AI, cloud, and advanced connectivity at scale. This is a unique opportunity to work in a fast\u2011growing technology hub, supporting critical environments and helping shape the future of data centre operations in the Kingdom and beyond.\n\nQualcomm is utilizing its traditional strengths in digital wireless technologies to play a central role in the evolution of Cloud AI. We are investing in several supporting technologies including Deep Learning. The Qualcomm Cloud AI team is developing hardware and software solutions for Inference Acceleration. \n\nWe are hiring LLM Serving Engineers at multiple levels to join our dynamic, collaborative team.\n\nThis role spans the full product lifecycle\u2014from cutting-edge research and development to commercial deployment\u2014and demands strategic thinking, strong execution, and excellent communication skills. \n\nThis role involves the following activities: \n\n * Building a scalable LLM inference platform using inference techniques (e.g. disaggregated serving and KV-Cache management, advanced parallelism, speculative algorithms, model optimization, specialized kernels). \n * Contribute to the development of LLM Serving packages (e.g. vLLM, SGLang, TGI, Triton-Inference server, Dynamo, LLM-d). \n * Work closely with customers to drive solutions by collaborating with internal compiler, firmware and platform teams. \n * Work at the forefront of GenAI by understanding advanced algorithms (e.g. attention mechanisms, MoEs) and numerics to identify new optimization opportunities. \n * Drive efficient serving through smart autoscaling, load balancing and routing. \n * Engage with open-source serving communities to evolve the framework. \n\n\n\nYou will demonstrate the following: \n\n * Hands-on experience in one or more of the following LLM serving/Orchestration packages (Triton-Inference Server, vLLM, SGLang, Ollama, llm-d, KServe, LMCache, MoonCake) \n * Deep understanding of foundational LLMs, VLMs, SLMs, transformer-based architectures. \n * Strong experience in developing language models using PyTorch. \n * Strong computer science fundamentals - algorithms, data structures, parallel and distributed programming. \n * Understanding of computer architecture, ML accelerators, in-memory processing and distributed systems. \n * Strong Python development skills for large-scale projects with passion for software engineering. \n * Experience in analyzing, profiling, and optimizing deep learning workloads. \n * Proactive learning about the latest inference optimization techniques. \n * Excellent communication and problem-solving skills, with the ability to thrive in a fast-paced and collaborative environment. \n * MS in Computer Science, Machine Learning, Computer Engineering or Electrical Engineering. \n\n\n\nBonus Skills include: \n\n * Open-source contribution to any GenAI package. \n * Experience architecting and developing large-scale distributed systems. \n * High-level kernel design experience (PyTorch, CUDA, Triton). \n * Knowledge of torch.compile or torchDynamo \n * PhD in Computer Science, Computer Engineering or Machine Learning \n\n\n\nMinimum Qualifications:\n\n\u2022 Bachelor\u0027s degree in Computer Science, Electrical or Computer Engineering, Information Systems, or related field and 5+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. \nOR \nMaster\u0027s degree in Computer Science, Electrical or Computer Engineering, Information Systems, or related field and 4+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience. \nOR \nPhD in Computer Science, Engineering, Information Systems, or related field and 2+ years of Hardware Engineering, Software Engineering, Systems Engineering, or related work experience.\n\nWhat\u0027s on Offer\n\nApart from working with great people, we offer the below:\n\n * Salary including housing \u0026 transport allowance\n\n * Stock (RSU\u0027s) and performance related bonus\n\n * 16 weeks fully paid Maternity Leave\n\n * 6 weeks fully paid Paternity Leave\n\n * Employee stock purchase scheme\n\n * Child Education Allowance\n\n * Relocation and immigration support (if needed)\n\n * Life and Medical Insurance\n\n * Live+ Well Reimbursement for health and recreational membership fees\n\n\n\n\nMinimum Qualifications:\n\n\u2022 Bachelor\u0027s degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Systems Engineering or related work experience. \nOR \nMaster\u0027s degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Systems Engineering or related work experience. \nOR \nPhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Systems Engineering or related work experience.\n\n*References to a particular number of years experience are for indicative purposes only. Applications from candidates with equivalent experience will be considered, provided that the candidate can demonstrate an ability to fulfill the principal duties of the role and possesses the required competencies.\n\nQualcomm is an equal opportunity employer. If you are an individual with a disability and need an accommodation during the application/hiring process, rest assured that Qualcomm is committed to providing an accessible process. You may e-mail [email protected] or call Qualcomm\u0027s toll-free number found here. Upon request, Qualcomm will provide reasonable accommodations to support individuals with disabilities to be able participate in the hiring process. Qualcomm is also committed to making our workplace accessible for individuals with disabilities. (Keep in mind that this email address is used to provide reasonable accommodations for individuals with disabilities. We will not respond here to requests for updates on applications or resume inquiries).\n\nQualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.\n\nTo all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.\n\nIf you would like more information about this role, please contact Qualcomm Careers.\n
Original job LLM Serving Engineer (Cloud AI Engineering) - Riyadh, KSA posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to LLM Serving Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar LLM Serving Engineer Jobs in Saudi Arabia

GrabJobs is the no1 job portal in Saudi Arabia, connecting you to thousands of jobs fast! Find the best jobs in Saudi Arabia, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.