Senior Systems Engineer - Network Infrastructure

Company: Nscale
Job Type: Full Time

Job Description - Senior Systems Engineer- Network Infrastructure


About Us


We are building next-generation AI infrastructure from the ground up. Our mission is to deliver highly performant, reliable, and scalable network clusters purpose-built for large-scale AI training and inference.


As a startup, we operate with urgency, ownership, and a bias toward action. We are assembling the foundational infrastructure that will power frontier AI workloads—and we’re looking for engineers who want to build it from zero to scale.


The Role


We are hiring a Senior Systems Engineer to lead hands-on bringup of network clusters across our data center environments. You will own the execution of node, rack, and network deployment, ensuring clusters are validated, performant, and production-ready.


This role is deeply technical and execution-focused. You will be in the details—cabling racks, validating firmware, tuning fabrics, debugging performance—and helping us build repeatable processes as we scale.


What You’ll Do


Cluster Deployment & Bringup



  • Execute end-to-end bringup of network nodes and racks from installation to production readiness.


  • Validate BIOS/BMC/firmware configurations and network health.


  • Perform rack-level integration including power, cabling, and airflow validation.


  • Bring up and validate high-speed network fabrics (InfiniBand, RoCE, 100–400G Ethernet).



Network & Performance Validation



  • Configure and validate leaf/spine network connectivity.


  • Run cluster-wide burn-in and stress testing.


  • Validate node-to-node performance (NCCL, RDMA, GPUDirect).


  • Troubleshoot hardware, firmware, and fabric-level issues.



Automation & Process



  • Contribute to automation for provisioning and cluster validation.


  • Improve deployment playbooks and documentation.


  • Identify reliability issues early and drive corrective actions.


  • Help turn ad hoc deployments into repeatable systems.



Cross-Functional Collaboration



  • Work closely with networking, systems software, and data center teams.


  • Coordinate with hardware vendors to resolve bringup issues.


  • Support rapid capacity expansion as we scale.



What We’re Looking For


Required



  • 5–8+ years in infrastructure engineering, hardware deployment, or data center operations.


  • Hands-on experience deploying network servers (HGX/DGX or similar platforms).


  • Experience with high-speed networking (InfiniBand, RoCE, Ethernet fabrics).


  • Strong Linux systems knowledge.


  • Experience troubleshooting distributed systems performance issues.


  • Comfortable working onsite in data center environments as needed.



Strongly Preferred



  • Experience in AI/ML infrastructure or HPC environments.


  • Familiarity with NCCL, CUDA, RDMA.


  • Automation experience (Python, Ansible, Terraform, Bash).


  • Experience in high-density power and cooling environments.



What Success Looks Like



  • Clusters are brought online quickly and correctly.


  • Performance baselines meet or exceed expectations.


  • Deployment processes become faster and more reliable over time.


  • You help build the foundation for scaled infrastructure growth.


For information on how Nscale handles candidate personal data, please see our Employee & Candidate Privacy Notice.

Original job "Senior Systems Engineer - Network Infrastructure" posted on GrabJobs.
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.