J

Staff Software Developer, Production Engineering

icon building Company : Jobgether
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Staff Software Developer, Production Engineering


This position is listed on behalf of a partner company, who manages all applications and next steps. Our partner is looking for a Staff Software Developer, Production Engineering based in Canada.


This is a high-impact engineering leadership role focused on improving the reliability, scalability, and operational excellence of a large-scale technology platform serving millions of users. Working at the intersection of platform and product engineering, you will tackle complex infrastructure challenges and help establish engineering practices that elevate system resilience across the organization. The role combines hands-on technical problem-solving with strategic influence, allowing you to drive initiatives that prevent incidents, improve system performance, and accelerate safe software delivery. You will collaborate with diverse engineering teams, guide reliability best practices, and build solutions that create lasting improvements across multiple services. This is an excellent opportunity for a senior technologist who enjoys solving systemic challenges, influencing technical direction, and shaping the future of production engineering in a fast-paced, innovation-driven environment.


Accountabilities:



  • Design and implement platform-level reliability improvements, including guardrails, engineering standards, and best practices that reduce service failures and operational risk.

  • Develop and enhance tools that improve incident detection, response, mitigation, and recovery, including support for AI-assisted operational workflows.

  • Lead investigations into performance, scalability, and load-testing outcomes, transforming findings into measurable reliability improvements across critical systems.

  • Partner with platform and product engineering teams to review architectures, assess production readiness, and promote resilient engineering practices.

  • Identify recurring operational issues and design long-term solutions that eliminate root causes rather than addressing individual incidents.

  • Influence engineering teams through technical leadership, mentorship, and collaboration, driving adoption of reliability-focused standards across the organization.

  • Contribute to reliability planning, risk assessment discussions, and cross-functional initiatives aimed at improving uptime and user experience.

  • Support continuous improvement efforts by helping define operational metrics, incident prevention strategies, and engineering excellence initiatives.


Requirements



  • 8+ years of software engineering experience, including substantial exposure to platform engineering, infrastructure, site reliability engineering (SRE), or related disciplines.

  • Proven track record of improving reliability at scale through operational standards, automation, incident reduction initiatives, or platform-wide engineering improvements.

  • Strong expertise in backend systems, distributed architectures, and diagnosing complex production issues across interconnected services.

  • Experience conducting load testing, performance analysis, capacity planning, and translating technical findings into actionable engineering solutions.

  • Deep understanding of modern cloud-native technologies and deployment ecosystems, including Kubernetes, Helm, Argo, and related tooling.

  • Demonstrated ability to influence technical direction and drive adoption of best practices across teams without direct managerial authority.

  • Strong communication and stakeholder management skills, with the ability to present recommendations to both technical and senior leadership audiences.

  • Systems-thinking mindset with a focus on root-cause analysis, long-term problem prevention, and scalable engineering solutions.

  • Comfortable navigating ambiguity, balancing competing priorities, and driving outcomes in a fast-moving environment.

  • Interest in emerging technologies, including AI-assisted engineering and operational tooling, is considered a strong advantage.


Benefits



  • Comprehensive health, medical, and life insurance coverage.

  • Employer-matched long-term savings and retirement programs.

  • 20 days of annual vacation plus additional wellness days.

  • Unlimited sick leave and mental health days.

  • Flexible remote work environment across Canada.

  • Opportunity to work internationally for up to 90 days per year.

  • Access to employee resource groups and inclusive community initiatives.

  • Collaborative culture focused on innovation, learning, and professional growth.

  • Exposure to large-scale systems, cutting-edge technologies, and high-impact engineering challenges.

  • Opportunity to work alongside highly skilled and motivated technology professionals across North America.


How Jobgether works:

We use an AI-powered matching process to ensure your application is reviewed quickly, objectively, and fairly against the role's core requirements. Our system identifies the top-fitting candidates, and this shortlist is then shared directly with the hiring company. The final decision and next steps (interviews, assessments) are managed by their internal team.

We appreciate your interest and wish you the best!


 

Data Privacy Notice: By submitting your application, you acknowledge that Jobgether will process your personal data to evaluate your candidacy and share relevant information with the hiring employer. This processing is based on legitimate interest and pre-contractual measures under applicable data protection laws (including GDPR). You may exercise your rights (access, rectification, erasure, objection) at any time.

 

 

#LI-CL1
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses and identifying potential inconsistencies or verification signals in application materials based on available information. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Original job Staff Software Developer, Production Engineering posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Staff Software Developer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Staff Software Developer Jobs in Canada

GrabJobs is the no1 job portal in Canada, connecting you to thousands of jobs fast! Find the best jobs in Canada, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.