Site Reliability Engineer

Company: CloudSmiths
Job Type: Full Time


Job Description - Site Reliability Engineer

CloudSmiths is looking for a proactive Intermediate Site Reliability Engineer (GCP) to join our Managed Services team.



In this role, you will play a key part in ensuring the reliability, scalability, and performance of production environments for our diverse range of clients. You will bridge the gap between development and operations by implementing robust monitoring, automation, and DevOps practices on Google Cloud Platform.



Key Responsibilities:



  • Act as a technical resource for SRE practices on GCP, ensuring consistent uptime and performance across various environments.

  • Champion DevOps best practices by applying Infrastructure as Code (IaC) principles using tools like Terraform, Ansible, or Deployment Manager.

  • Drive monitoring initiatives using tools such as Grafana, Prometheus, and Stackdriver to ensure deep visibility into system health.

  • Design, maintain, and optimize CI/CD pipelines using GCP-native tools and industry standards.

  • Troubleshoot complex production incidents, perform root cause analysis, and foster a proactive, blameless post-mortem culture.

  • Manage your workload effectively while maintaining clear communication with internal and external stakeholders regarding project progress.



Requirements:



  • 3–5+ years of hands-on experience in a Site Reliability, DevOps, or Cloud Engineering role.

  • Strong experience working directly with GCP infrastructure, services, and security/cost optimization.

  • Containerization: Proven experience with Kubernetes (GKE), Docker, and container orchestration at scale.


Technical Skills:



  • Expertise in UNIX/Linux administration.

  • Strong scripting skills in Python, Bash, or Shell.

  • Familiarity with configuration management tools like Chef, Puppet, or Ansible.

  • A Degree or Diploma in IT, Computer Science, or equivalent experience.

  • Google Cloud Professional certifications (DevOps Engineer or Cloud Architect) are highly advantageous.



Why Join Us?


We are 100% remote. Enjoy the flexibility of working from anywhere. 


You’ll drive monitoring, observability, and CI/CD initiatives using the latest GCP-native tools.


We value innovative thinkers who aren't afraid to challenge the status quo to drive excellence.


Original job Site Reliability Engineer posted on GrabJobs.