B

Site Reliability Engineer - Technical Lead (initial 12 month FTC)

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Site Reliability Engineer - Technical Lead (initial 12 month FTC)


About Us


Founded in 1983, BTG Pactual is now the largest investment bank in Latin America. We're committed to a future where investing is dynamic and straightforward, which is why we're undergoing a digitization and expansion process across various fronts. Our entrepreneurial mindset allows us to empathize with our clients and understand their challenges, leading to swift, autonomous, and bureaucracy-free solutions. Renowned for our excellence, flexibility, and versatility, we serve clients from our offices in Brazil, Chile, Colombia, Peru, Mexico, Argentina, the United States, United Kingdom, Portugal, Spain, Luxembourg and Saudi Arabia.


 


The Role


BTG Pactual is looking for an experienced SRE Engineer to take on a hands-on technical lead role within our Technology Team in London. You will own and drive automation and infrastructure integration efforts across cloud and on-premises environments in a fast-paced, global organisation.


In this role, you will guide and develop a small team of engineers while remaining deeply involved in technical delivery yourself. You will collaborate with agile, cross-functional teams and contribute to DevOps practices including CI/CD, deployment automation, automated testing, and multi-cloud architecture. We offer the freedom to explore new languages, development standards, and methodologies.


This is a temp to perm opportunity for an initial period of 12 months, on an Fixed-Term Contract (FTC) basis. Due to ongoing growth within our Technology Team, the intent will be for this position to transition into a permanent role upon the successful completion of the initial 12 month period, subject to performance and business requirements. 


This is currently a hybrid role requiring a minimum of three days per week in the London office. The successful candidate must have the flexibility to increase office attendance to four or five days per week when business needs, project demands, or critical incidents require on-site presence. As the company continues to evolve, office attendance expectations may be adjusted over time to align with business needs.


 


Role Responsibilities 



  • Lead and develop a team of engineers, organising and prioritising workloads, tracking project plans, and communicating status to stakeholders.

  • Execute hands-on technical tasks alongside the team, acting as a senior individual contributor as well as a lead.

  • Define technical architectures for robust, resilient, high-performance, and scalable infrastructure systems.

  • Implement and maintain automation across AWS and Azure cloud environments and on-premises setups, using tools such as Terraform, Python, CloudFormation, and Ansible.

  • Build and maintain CI/CD pipelines to support continuous integration and deployment.

  • Leverage AI coding platforms and tools (e.g., GitHub Copilot, Claude Code) to accelerate development, improve code quality, and enhance team productivity.

  • Develop scripts and integrations using APIs to automate operational workflows.

  • Implement Disaster Recovery automation and lead regular HA/DR tests to minimise RTOs.

  • Build and maintain automated monitoring, alerting, ITSM, and capacity plan Sitening solutions.

  • Troubleshoot and resolve complex infrastructure and reliability challenges.

  • Document processes, procedures, and environment configurations to a high standard.

  • Provide frequent and organised status reports of the team’s activities, communicating both technical detail to engineering peers and clear summaries to management.

  • Develop and execute training and upskilling strategies for team members.

  • Work closely with IT project managers, developers, and business stakeholders to shape and deliver solutions.

  • Manage vendor relationships and escalate hardware, software, or service issues when required.

  • Participate in an on-call support rota, following a pre-defined schedule, to provide after-hours and weekend coverage for critical infrastructure and reliability incidents.

  • Execute planned production environment changes outside of standard business hours, including evenings and weekends.


 


Skills & Experience 


Required



  • Bachelor’s degree in Computer Science, Engineering, or a related field, or equivalent practical experience.

  • 6+ years of hands-on experience in cloud operations, infrastructure management, and automation, with at least 4 years focused on AWS.

  • Proven track record delivering scalable, reliable cloud-based solutions.

  • Demonstrated experience leading an engineering team of 5 or more people.

  • Strong proficiency in infrastructure-as-code tools (Terraform, Ansible, CloudFormation) and scripting languages (Python, Bash).

  • Solid understanding of CI/CD principles and tooling, with hands-on experience building and maintaining automated pipelines.

  • Experience with container orchestration and Kubernetes.

  • Familiarity with observability and monitoring tools (e.g. Datadog, Prometheus, Grafana, Zabbix).

  • Strong troubleshooting skills across cloud networking, Linux systems, and distributed services.

  • Excellent communication skills and the ability to collaborate with cross-functional and international teams.

  • Strong organisational skills with the ability to prioritise and manage competing demands in a fast-paced environment.

  • SMCR Category: Conduct Rules Staff.


Beneficial



  • Azure cloud experience.

  • Exposure to ITSM platforms and capacity planning tooling.

  • Experience with Disaster Recovery design and automation.


 


What We Look For



  • A proactive, self-directed approach — you take ownership and drive outcomes with minimal supervision.

  • A genuine commitment to continuous learning — actively developing your own skills and staying current with emerging technologies, tools, and SRE best practices.

  • Clear, confident communication — able to present ideas and translate technical concepts for non-technical audiences.

  • Attention to detail and a high bar for quality in both code and documentation.

  • Ability to work well under pressure, responding to critical incidents and requests calmly and efficiently.

  • Strong reporting and documentation skills — able to produce clear, structured updates on team activities for both technical and non-technical audiences.

  • Willingness and availability to participate in on-call support as per the defined rota.

  • Commitment to a hybrid working model with a minimum of three days per week in the office, and flexibility to increase attendance as business needs evolve.


 


Our Offer


BTG Pactual is a global financial institution that retains the culture, pace and agility of a young Company. As an expanding firm, we are committed to attracting, developing and retaining the very best talent, by offering a workplace where results are truly recognised and rewarded.  We offer a fantastic opportunity for you to grow including:



  • Professional, international working environment;

  • Challenging, rewarding career;

  • Collaborative environment;

  • Competitive compensation package.


Please note that candidates should not contact BTG Pactual members directly outside of the recruitment process (e.g., LinkedIn messages or emails). All interested applicants should apply through the official application channel, as all applications are reviewed equally. Thank you!


By submitting this application, I agree to share the information above. Your information will only be used to evaluate the application process and talent database for BTG Pactual and its subsidiaries in accordance with our privacy policy.

Original job Site Reliability Engineer - Technical Lead (initial 12 month FTC) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Site Reliability Engineer - Technical Lead Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Site Reliability Engineer - Technical Lead Jobs in the UK

GrabJobs is the no1 job portal in the UK, connecting you to thousands of jobs fast! Find the best jobs in the UK, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.