Senior Site Reliability Engineer

Company: Lula
Job Type: Full Time

Job Description - Senior Site Reliability Engineer

WHAT WE DO

We're Lula. We build innovative fintech products to help SMEs make cash flow. From instant access to funding to all-in-one business banking accounts and cutting-edge financial analysis tools, we're on it!

Our purpose is to help SMEs manage their business better, faster, simpler, Lula, so they can spend more time doing what they love.

Speaking of love, we're looking for Lulas who love to make a difference to join our team and change the game.

OUR VALUES

Collaborative - we're a clan and work together as a team, always towards a common goal

Committed - we're accountable and follow through no matter the challenge

Curious - we look for better ways to do things and make a positive difference

Connected - we stay close to, learn from and look to understand each other and our customers

Compassionate - we go out of our way to care about our colleagues, our customers and our community

OVERALL PURPOSE

We are seeking an experienced Site Reliability Engineer to join our team. The ideal candidate has a deep understanding of Microsoft Azure, cloud computing and distributed systems. As a Senior Site Reliability Engineer, you will be responsible for monitoring, maintaining and improving our Azure-based infrastructure and applications, ensuring their reliability, scalability and security. You will also act as the technical escalation point for both Junior and Intermediate Site Reliability Engineers and represent the Site Reliability team as an approver on the Change Advisory Board (CAB).

Responsibilities will include: 

  • Monitor Azure alerts and respond to incidents, including participation in root cause analysis and developing remediation plans
  • Respond to and resolve service requests related to the Azure infrastructure and applications
  • Build and maintain a comprehensive set of Azure alerts based on standard Azure monitoring tools, Logic Apps and Azure Log Analytics queries (KQL) to provide a holistic view of our infrastructure and application performance
  • Monitor and analyse performance and usage metrics to identify areas for improved alerting, platform optimisation and reliability
  • Partner with our internal Developers and DevOps teams to build, monitor and manage highly available, reliable, scalable and resilient architectures with high levels of visibility on Azure
  • Partner with Microsoft on complex remediation and improvement work in our Azure environment, as required
  • Partner with our internal SecOps team to ensure the security of the Azure infrastructure and applications by implementing and enforcing security policies and best practices
  • Develop and maintain automation scripts and tools to streamline deployment and management of Azure services
  • Continuously research and evaluate new Azure features and services to optimise our infrastructure and improve our application development workflows
  • Participate in on-call rotation to provide 24/7 support for critical systems
  • Act as a monitoring resource for all Changes and Releases that take place during your on-call rotation, as required

THE COMPETENCIES WE'RE AFTER

  • Strong written and verbal communication skills
  • Ability to communicate complex technical concepts to non-technical stakeholders
  • Ability to work independently and as part of a team
  • A proactive, collaborative and detail-oriented approach to issues
  • A quick and hungry learner
  • Highly credible and trustworthy with an open and honest approach
  • Strong planning skills and ability to prioritise
  • Adaptable and flexible with resilience to change and ambiguity
  • Adaptable between proactive and reactive support in real time
  • Ability to mentor and grow others

THE SKILLS AND EXPERIENCE WE'RE LOOKING FOR

  • Matric certificate or equivalent
  • 5+ years of experience in monitoring and maintaining Azure infrastructure and platforms
  • Strong understanding of Azure services such as Web Applications, Functions and Application Gateways
  • Experience in monitoring, logging and troubleshooting in Azure using App Insights, Azure Monitor, Log Analytics, Logic Apps and Query Performance measures in SQL Databases
  • Experience in monitoring and troubleshooting SQL Databases in Azure
  • Experience with automation tools such as PowerShell, Azure CLI and ARM templates
  • Strong troubleshooting and problem-solving skills
  • Excellent communication and collaboration skills to work with cross-functional teams
  • Microsoft Azure Administrator Associate (AZ-104) or above required; Azure Developer Associate (AZ-204) or above preferred
  • Microsoft 365 Fundamentals advantageous, but not required
  • ITIL Foundations or above preferred
  • Familiarity with DevOps practices and tools such as Azure DevOps
  • Experience with Grafana, Jira, OpsGenie and PRTG beneficial

ALL STAFF APPOINTMENTS WILL BE MADE WITH DUE CONSIDERATION OF THE COMPANY'S EE TARGETS

Original job Senior Site Reliability Engineer posted on GrabJobs.

Location: Cape Town, Western Cape

Copyright © 2024 Grabjobs Pte.Ltd. All Rights Reserved.