P

Sr. Site Reliability Engineer

Job Description - Sr. Site Reliability Engineer

The Site Reliability Engineering organization at Pinterest is accountable for ensuring overall Pinterest availability as well as enhancing Engineering teams’ capability to design, build and operate robust systems at scale.


Pinterest’s applications and infrastructure that handle billions of monthly page views and petabytes of data as Pinterest continues to grow and scale. As a Pinterest SRE, you will design and build systems, platforms, tools, frameworks and methodologies to assure the reliability of our large-scale distributed systems.


 


What you’ll do:



  • Develop software solutions to enable reliability and operability of large scale distributed systems handling petabytes of data and serving 

  • Build a deep understanding of how Pinterest’s systems behave, scale, interact and fail, and use that insight to identity risks and opportunities for remediation

  • Build tools and automation to eliminate toil and reduce operational overhead. Create frameworks, processes and best practices to be used across Pinterest Engineering

  • Build meaningful, insightful and actionable SLIs

  • Automate critical portions of Pinterest’s engineering processes, to minimize risk and maximize the speed of innovation

  • Manage capacity and performance to help scale our infrastructure both on public and private clouds around the world


 


What we’re looking for:



  • Strong knowledge of Linux/Unix/BSD internals and experience working with open source software (e.g. MySQL, Hadoop, Envoy, HAProxy, Nginx)

  • Experience with technologies such as ElasticSearch, ZooKeeper, HBase, Hadoop, Memcache and Kafka with a focus on reliability, automation, operability and performance

  • 4+ years of experience with programming languages (Python, Java, Ruby, etc.)

  • Infrastructure as code a plus (e.g. Terraform, Puppet, Chef, Ansible, Salt, Fabric, Docker, etc)

  • Bonus points if experienced with deploying web apps to cloud infrastructure (AWS, etc.) and working with distributed, service-oriented architecture 

  • Bachelor’s or Master’s degree in a relevant field such as Computer Science, or equivalent experience


 


Relocation Statement:



  • This position is not eligible for relocation assistance. Visit our PinFlex page to learn more about our working model.


 


In-Office Requirement Statement:



  • We let the type of work you do guide the collaboration style. That means we're not always working in an office, but we continue to gather for key moments of collaboration and connection.

  • This role will need to be in the office for in-person collaboration 1-2 times per half and therefore can be situated anywhere in Ontario. 


 


#LI-CH1


#LI-HYBRID 

Original job Sr. Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Share Job
Share Job

Similar Site Reliability Engineer Jobs in Canada

GrabJobs is the no1 job portal in Canada, connecting you to thousands of jobs fast! Find the best jobs in Canada, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.