V

DevOps & Site Reliability Engineer

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - DevOps & Site Reliability Engineer

Position Title: DEVOPS & SRE ENGINEER
Location: HOUSTON, TX
FLSA Class: EXEMPT
Responsible to: Directo of Software Engineering


Position Summary: DevOps / Site Reliability Engineer to implement and evolve the infrastructure, deployment pipelines, and reliability posture of our systems. You'll work closely with engineering teams to build scalable, observable, and resilient infrastructure while driving a culture of operational excellence.


Essential Duties and Responsibilities:



  • Design, build, and maintain cloud infrastructure

  • Manage and optimize Kubernetes clusters and containerized workloads in production

  • Develop and maintain infrastructureascode using Terraform (or equivalent tooling)

  • Build and improve CI/CD pipelines to enable fast, safe, and reliable deployments

  • Implement and maintain monitoring, alerting, and observability systems (Prometheus, Grafana, Datadog, or similar)

  • Define and track SLIs/SLOs, participate in incident response, root cause analysis, and blameless postmortems

  • Identify and eliminate toil through automation and selfservice tooling

  • Configure and maintain onprem baremetal servers and Linuxbased infrastructure

  • Configure, maintain, and optimize virtualized assets

  • Collaborate with development teams on system design, capacity planning, and performance optimization

  • Participate in oncall rotations and ensure production readiness of new services


Other Requirements:



  • 4+ years of experience in DevOps, SRE, or infrastructure engineering roles

  • Strong experience with at least one major cloud provider (AWS, GCP, or Azure AWS preferred)

  • Deep hands-on experience with Kubernetes and Docker in production environments

  • Proficiency with infrastructureascode tools, particularly Terraform

  • Experience building and maintaining CI/CD pipelines (GitHub Actions, GitLab CI, Jenkins, or similar)

  • Solid understanding of monitoring and observability (metrics, logs, traces)

  • Strong scripting skills (Bash, Python, or Go)

  • Experience with incident management, SLObased reliability practices, and capacity planning

  • Strong Linux systems administration skills (Ubuntu, RHEL/CentOS, or similar)

  • Experience with virtualization platforms including VM provisioning, storage, networking, and cluster management

  • Solid understanding of networking, DNS, load balancing, and security fundamentals


Nice to Have:



  • Contributions to internal developer platforms or platform engineering initiatives

  • Proxmox VE experience

  • Certifications in cloud platforms (AWS SA, CKA, etc.)


The above statements are intended to describe the general nature and level of work being performed by employees assigned to this classification. All personnel may be required to perform duties outside of their normal responsibilities from time to time, as needed.


VoltaGrid is an Equal Opportunity Employer that does not discriminate on the basis of actual or perceived race, creed, color, religion, alienage or national origin, ancestry, citizenship status, age, disability or handicap, sex, marital status, veteran status, sexual orientation, genetic information, arrest record, or any other characteristic protected by applicable federal, state or local laws. 


Our management team is dedicated to this policy with respect to recruitment, hiring, placement, promotion, transfer, training, compensation, benefits, employee activities, and general treatment during employment.

Original job DevOps & Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to DevOps & Site Reliability Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar DevOps & Site Reliability Engineer Jobs in the US

GrabJobs is the no1 job portal in the US, connecting you to thousands of jobs fast! Find the best jobs in the US, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.