K

Principal Software Engineer, Enterprise Scalability

icon building Company : Klaviyo
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Principal Software Engineer, Enterprise Scalability

At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair shot at success and appreciate the experiences each person brings beyond the traditional job requirements. If you’re a close but not exact match with the description, we hope you’ll still consider applying. Want to learn more about life at Klaviyo? Visit klaviyo.com/careers to see how we empower creators to own their own destiny.


Be Klaviyo’s senior IC for scale, you will report into a VP of Engineering and lead performance, reliability, multi‑region, and large‑tenant readiness. You’ll drive platform-wide architectural change, hunt bottlenecks and optimize systems, and partner across teams to productionize improvements. Given that this is an IC role with no direct reports; you will lead via technical depth, hands‑on impact, and crisp cross‑org alignment.


What You’ll Do



  • Define enterprise scalability fitness functions (latency/throughput/error rates) and a scorecard; align teams to SLOs and budgets.

  • Design/implement sharding and partitioning strategies, caching/back‑pressure, multi‑region readiness, and high‑volume migration paths.

  • Build lightweight enablement: benchmarks, profiling harnesses, reproducible testbeds; pair with teams to land fixes.

  • Lead scalability reviews and readiness gates that accelerate—not block—delivery; drive incident deep dives tied to systemic fixes.

  • Communicate clearly to execs and engineers, tying technical work to business impact and customer outcomes.

  • Integrate AI into scale and resiliency work—from proactive anomaly detection to synthetic load and guided runbooks—so performance improvements stick and incidents don’t repeat.


Who You Are



  • Experience: 12+ years scaling multi‑tenant SaaS with a reputation for removing major bottlenecks and proving impact with data.

  • Technical expertise: Performance engineering, capacity planning, sharding/partitioning, caching/back‑pressure, multi‑region readiness, and high‑volume migrations; you turn hotspots into robust patterns.

  • AI tools & automation: You apply AI to scale work—profiling assistance, workload modeling, synthetic traffic generation, anomaly detection, and runbook copilots—always with explicit guardrails and observability.

  • Cross‑org influence: You align teams through fitness functions, scorecards, and readiness gates that accelerate—not block—delivery; you communicate tradeoffs crisply to execs and engineers.

  • AI fluency: Curious, adaptable, and proactive in exploring AI that responsibly improves scale outcomes.


Nice to Haves



  • Scale scorecard: Company‑wide fitness functions (latency/throughput/error rates) are adopted and reviewed regularly.

  • High‑impact wins: 2–3 bottlenecks removed with documented, reproducible testbeds; pXX latencies and error rates improve on top enterprise workloads; repeat P0s trend down.

  • AI‑assisted scale engineering: AI‑driven anomaly detection reduces alert noise while improving signal; generative load testing and copilot runbooks are used in release/readiness checks for the top critical services; time‑to‑isolate regressions drops 20–30%.


Success in 6–12 Months



  • Company‑wide scale scorecard in place; 2–3 high‑impact bottlenecks removed; top enterprise workloads show improved pXX latencies and error rates; fewer repeat P0s.


 


 


We use Covey as part of our hiring and / or promotional process. For jobs or candidates in NYC, certain features may qualify it as an AEDT. As part of the evaluation process we provide Covey with job requirements and candidate submitted applications. We began using Covey Scout for Inbound on April 3, 2025.


Please see the independent bias audit report covering our use of Covey here

Original job Principal Software Engineer, Enterprise Scalability posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Principal Software Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Principal Software Engineer Jobs in the US

GrabJobs is the no1 job portal in the US, connecting you to thousands of jobs fast! Find the best jobs in the US, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.