F

Hardware System Test Engineering Team Leader

icon building Company : Fractile
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Hardware System Test Engineering Team Leader






















Fractile is building silicon, systems and software which will redefine the frontier of AI: running the world’s most advanced models at radically higher speed and lower cost. We have an exceptional team across hardware and software capable of bringing about this change, and we are growing fast to meet demand and deliver our product at scale.


You will own the architecture and execution of system-level test strategies for an AI accelerator server platform, covering the manufacture of individual servers and multi-node racks, and field deployment environments. Lead the creation, automation, and integration of tests spanning hardware, firmware, OS, high-speed IO, cooling, and manageability subsystems. Ensure scalable, reliable test processes that support high-volume manufacturing, rack integration, and in-field diagnostics and repair.


Key Responsibilities



  • Define the system-level test strategy for an AI accelerator server platform with hundreds of PCIe Gen6 links, liquid cooling, and heterogeneous compute/management subsystems.

  • Lead the design and implementation of structural, functional, stress and reliability tests executed on servers and full-rack configurations.

  • Manage the deployment of system test suites in manufacturing environments, ensuring high throughput, repeatability, failure isolation, and secure automation workflows.

  • Define and deliver test solutions for field diagnostics, repair, and maintenance—including remote test suites, health-check routines, failure triage, and component-level isolation.

  • Collaborate with hardware, firmware, OS, and platform teams to embed system-level test and diagnostic functions and ensure testability across the server and rack.

  • Drive test integration into OpenBMC, Redfish, and platform-management frameworks; validate BMC/RMC test and maintenance behaviors, telemetry, and hardware-control functions.

  • Develop test automation infrastructure and tooling that spans multiple OS environments (embedded Linux on BMC/RMCs, server-class Linux on host CPUs).

  • Partner with rack integration teams to define tests executed at rack-level assembly.

  • Establish KPIs and dashboards for system test coverage, yield, throughput, failure rates, and field return correlation.

  • Manage internal test engineering team and relevant external partners.

  • Own system test documentation: test specifications, factory/field procedures, debug guides, and integration handbooks.


 


Required Qualifications 



  • 8+ years of experience in system-level or server platform validation, test engineering, or platform QA.

  • Strong understanding of server architectures, BMC/OpenBMC/Redfish, Linux-based systems, and hardware/firmware interactions.

  • Experience validating or testing high-speed IO subsystems (PCIe Gen4/5/6 or comparable serial fabrics).

  • Experience leading test development for manufacturing or field diagnostics in large-scale server, storage, or networking systems.

  • Ability to lead teams, manage external vendors, and drive cross-functional test initiatives.

  • Strong skills in system debug, log analysis, failure isolation, and multi-component testing.

  • Familiarity with liquid-cooled or thermally complex server designs.


 


Preferred Qualifications



  • Experience with accelerator-based compute (AI/ML, GPUs, custom ASICs).

  • Experience with rack-level architectures, DC power distribution, or cooling infrastructure.

  • Knowledge of reliability testing, environmental stress testing, and fleet-level monitoring.


How we work



  • Ownership and execution: you will have full agency to drive your work forward

  • Rapid iteration: we all work directly with top leadership to move from idea to hardware on ambitious timelines

  • Full-stack engagement: hardware, software, silicon, and modelling teams all work closely together to create a product with generational impact

  • Optimistic and pragmatic: we possess the will to win, and to do the hard work to get us there

  • Team player mentality: the mission is bigger than any of us, and we have the curiosity and technical focus to see the best idea shipped, no matter who’s it is


About us



  • Founded in 2022, team of 70+ which is expanding rapidly

  • Modern, open offices in London and Bristol

  • Collaborative, problem-solving culture built on deep curiosity, entrepreneurial initiative and technical fluency


Export control and security clearance


Certain roles may involve working on technologies subject to export restrictions. Applicants may be required to undergo additional eligibility checks to ensure compliance with applicable law.


 






















Original job Hardware System Test Engineering Team Leader posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Hardware System Test Engineering Team Leader Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Hardware System Test Engineering Team Leader Jobs in the UK

GrabJobs is the no1 job portal in the UK, connecting you to thousands of jobs fast! Find the best jobs in the UK, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.