Logo-of-66degrees-hiring-for-jobs-in-Canada-on-GrabJobs

Site Reliability Engineer

icon building Company : 66degrees
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Site Reliability Engineer


Overview of 66degrees


66degrees is an end-to-end AI transformation partner that guides enterprises from complex business challenges to clear, quantifiable outcomes. Our company is the culmination of several successful firms, each a leader in its own right in cloud, artificial intelligence, and data. This convergence of talent and expertise is how we help businesses reach their own "inflection point," where chaotic data becomes a strategic asset, complexity becomes clarity, and AI becomes an engine for growth. Our ultimate vision is to be the catalyst for a future where every business operates as an intelligent enterprise, with autonomous systems unlocking human potential.


At 66degrees, we believe in thriving through challenges and winning together. These values not only guide us in achieving our goals as a company but also for our people. We are dedicated to creating a significant impact for our employees by fostering a culture that sparks innovation and supports professional and personal growth along the way.

Overview of Role


66degrees’ Managed Cloud Optimization (MCO) team works with some of the largest cloud users in the world to help them transform their businesses with technology. Our Site Reliability Engineers (SREs) combine Google Cloud Platform expertise with a passion for devops methodologies to help our clients maintain, optimize, and scale their cloud implementations.


On a daily basis, our SREs work with varied and exciting customers on topics ranging from solving critical outages to designing and deploying new cloud workloads to building self-healing automation. Our SREs work with cutting-edge Google Cloud technologies like Google Kubernetes Engine (GKE), Anthos, BigQuery and data pipelines, as well as leading 3rd party tools like Prometheus, Datadog, and many others. Our SREs also work with languages like Python and Terraform to create automation, deploy infrastructure, and contribute to open-sourcing.


If you’re looking to continually build and apply your Google Cloud expertise to new and varied environments while acting as a key contributor to building the best Google consulting partner in the industry – let’s talk.


Note: Pacific and Mountain Time Zones preferred; This role has a weekend on-call rotation


Responsibilities



  • Ensuring near-zero downtime with monitoring and alerting, self-healing automation, and continuous improvement

  • Create highly automated, available and scalable systems by applying software and infrastructure principles

  • Employ and advise clients on DevOps and SRE principles and practices, covering deployment pipelines, HA, service reliability, technical debt, and operational toil for live services running at scale

  • Provide a proactive approach to our clients’ workloads, anticipating failures, automating tasks, ensuring availability, and providing a great customer experience

  • Work closely with clients, your team, and Google engineers to investigate and resolve infrastructure issues

  • Manage a Jira queue of inbound requests for numerous clients while effectively balancing and prioritizing projects

  • Contribute to ad-hoc initiatives such as writing documentation, open-sourcing, and improving operation, making a huge impact at a rapid-growth Google Premier Partner


Qualifications



  • Minimum 4+ years of cloud and infrastructure experience, including demonstrated expertise with Linux, Windows, k8s, databases, and networking services

  • 2+ years of full-time Google Cloud experience preferred

  • Proficiency with Python required. Other programming language experience is a plus

  • Strong provisioning and configuration skills using Terraform

  • Experience in troubleshooting that spans systems, network, and code

  • Microsoft Server and SQL Server experience is a plus but not required

  • Experience with 24x7x365 monitoring, incident response, and on-call support preferred

  • Experience determining & negotiating Error budgets, SLIs, SLOs, and SLAs with product owners

  • Demonstrate the ability to work independently and as a member of a greater team, including cross-team activities

  • Experience working in Agile Scrum, Kanban methodologies in SDLC

  • Proven experience balancing service reliability, metrics, sustainability, technical debt, and operational toil for live services running at scale

  • Strong communication skills, as this is a heavily customer-facing role

  • A Bachelor’s degree in Computer Science, Computer Engineering, or related or equivalent work experience required.

66degrees is an Equal Opportunity employer. All qualified applicants will receive consideration for employment without regard to actual or perceived race, color, religion, sex, gender, gender identity, national origin, age, weight, height, marital status, sexual orientation, veteran status, disability status or other legally protected class.


AI Transparency & Disclosure


As an AI transformation partner, 66degrees leverages intelligent solutions to enhance our recruitment experience. We utilize AI tools—including LinkedIn Recruiter’s Hiring Assistant and interview transcription technologies—to assist with sourcing, role analysis, and capturing interview highlights.


These tools augment our process, but we "Commit to Our Craft" by ensuring all final hiring decisions are made by our human Talent Team. By applying, you acknowledge the use of these technologies to help us "Win Together" in finding the best fit for our team.

Original job Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Site Reliability Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Site Reliability Engineer Jobs in Canada

GrabJobs is the no1 job portal in Canada, connecting you to thousands of jobs fast! Find the best jobs in Canada, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.