S

Member of Technical Staff, AI Reliability & Monitoring Engineering Lead

salary Salary :

$256,000 - 276,000 yearly

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Member of Technical Staff, AI Reliability & Monitoring Engineering Lead









The Opportunity


Postman is seeking an experienced AI Systems Reliability Engineer to help define, build, and maintain the infrastructure and processes that ensure the reliability, scalability, and performance of Postman’s AI-powered API and agentic systems in production. This role focuses on monitoring, availability, incident response, and automation to support AI services and tools trusted by millions of developers globally.


What You’ll Do




  • Develop and manage reliability metrics (SLOs) for AI-driven API services and agentic AI platform features




  • Implement comprehensive observability and monitoring systems for real-time performance and fault detection




  • Design and drive automated failover, recovery, and incident response strategies for high-availability AI infrastructure




  • Optimize resource utilization, particularly GPU/accelerator efficiency, ensuring cost-effective AI system operation




  • Collaborate closely with engineering, platform, and product teams to align reliability efforts with broader organizational goals




  • Lead efforts to build internal tooling and automation focused on AI system stability and operational excellence




  • Drive continuous improvement in deployment practices, monitoring approaches, and incident management processes




About You




  • Have a strong background in AI reliability engineering, SRE, or DevOps for distributed systems




  • Understand the unique challenges of maintaining large-scale AI systems and integrating AI-specific metrics into reliability frameworks




  • Are experienced with cloud platforms, monitoring tools, and incident response automation




  • Are comfortable collaborating across teams to influence best practices for AI system reliability and operational health




  • Thrive in dynamic, fast-paced environments focusing on delivering reliable, safe AI-powered services




Bonus Skills and Experiences




  • Hands-on experience with AI/ML infrastructure, including GPU/xPU optimization and scaling




  • Familiarity with API platform operations and large-scale distributed services




  • Prior experience building or operating observability tools tailored for AI and agentic systems




  • Contribution to open-source projects or reliability engineering thought leadership




The reasonably estimated base salary for this role ranges from $256,000 to $276,000, plus a competitive equity package. Actual compensation is based on the candidate's skills, qualifications, and experience. 










Original job Member of Technical Staff, AI Reliability & Monitoring Engineering Lead posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

About the Company

Sign Up For Free

Unify API design, testing, documentation, and monitoring in one platform. Build, collaborate, and innovate faster with seamless Git and gateway integrations.

Read more about the company

Auto-Apply to Member of Technical Staff, AI Reliability & Monitoring Engineering Lead Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Member of Technical Staff, AI Reliability & Monitoring Engineering Lead Jobs in the US

GrabJobs is the no1 job portal in the US, connecting you to thousands of jobs fast! Find the best jobs in the US, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.