Machine Learning Ops & Infrastructure Engineer

Salary :

$160,000 - 300,000 yearly

Company : Noble Machines

Job Type : Full Time

Sunnyvale, United States

Number of Applicants

000+

Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Job Description - Machine Learning Ops & Infrastructure Engineer

About Noble Machines

Noble Machines (formerly Under Control Robotics) builds multipurpose robots to support human workers in the world's toughest jobs—turning dangerous work from a necessity into a choice. Our work demands reliability, robustness, and readiness for the unexpected—on time, every time. We're assembling a mission-driven team focused on delivering real impact in heavy industry, from construction and mining to energy. If you're driven to build rugged, reliable products that solve real-world problems, we'd love to talk.

Position Overview

At noble machines AI, we are pushing the boundaries of machine learning and artificial intelligence. To support our rapid pace of innovation, we are looking for an experienced ML Ops & Infrastructure Engineer to build the foundational systems that power our AI development.

In this role, you will sit at the critical intersection of our Research and Engineering teams. You won’t just be maintaining systems; you will be architecting the high-performance ML infrastructure that enables our researchers to seamlessly transition from data collection and model training to evaluation and production. If you are passionate about scalable compute, elegant data platforms, and robust deployment pipelines, we want you on our team.

Responsibilities

End-to-End ML Infrastructure: Design, build, and maintain a highly scalable and reliable machine learning infrastructure that accelerates the research and development lifecycle.

Data Platform & Management: Architect and manage robust data ingestion, collection, and processing pipelines. You will own the data platforms that ensure our models are trained on high-quality, perfectly versioned datasets.

Training & Evaluation Pipelines: Build and optimize the environments used for distributed model training, hyperparameter tuning, and automated model evaluation.

Cloud Compute Orchestration: Manage and orchestrate heavy compute workflows seamlessly across AWS and/or Google Cloud Platform (GCP), optimizing for both performance and cost.

Containerization & Kubernetes: Take full ownership of containerizing ML workloads and orchestrating them via Kubernetes (K8s) to ensure high availability, scalability, and reproducibility.

Cross-Functional Collaboration: Partner closely with ML Researchers and Software Engineers to understand their bottlenecks, gather requirements, and build tooling that makes their workflows frictionless.

Requirements

Proven Industry Experience: 3+ years of hands-on industry experience building scalable ML infrastructure, MLOps platforms, or data engineering systems.

Cloud & Orchestration Mastery: Deep expertise in cloud platforms (AWS or GCP) and modern orchestration tools, specifically Docker and Kubernetes (K8s).

Software Engineering Fundamentals: Strong programming skills in Python, alongside experience with bash scripting and version control (Git).

Data & Pipeline Expertise: Hands-on experience building large-scale data management pipelines and using workflow orchestration tools (e.g., Airflow, Argo, Kubeflow, or similar).

Relevant Domain Background: While explicit robotics experience is not required, we highly value candidates with backgrounds in hardware-interfacing AI, autonomous driving, computer vision, or other high-complexity ML fields.

Nice to Have

Experience with Infrastructure as Code (IaC) tools like Terraform.

Familiarity with distributed training frameworks (e.g., PyTorch DDP, Horovod, Ray).

Experience implementing model observability, monitoring, and data drift detection in production environments.

A background handling large volumes of unstructured data (video, sensor data, spatial data).

The base salary range for this full-time position is $160,000 - $300,000, in addition to bonus, equity and benefits.

To apply, submit your resume here or email [email protected]. To increase your chances of being selected for an interview, we encourage you to include up to TWO examples of your most representative work featuring hardware demonstrations.

Original job Machine Learning Ops & Infrastructure Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Apply Now

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to Machine Learning Ops & Infrastructure Engineer Jobs with your AI JobCopilot

Auto-Apply with AI

Similar Machine Learning Ops & Infrastructure Engineer Jobs in the US

Get your Resume Reviewed for Free

Email address

Why are you reporting this job?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.

Setup your job alert:

Frequency

By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime. Skip