O

Site Reliability (Infrastructure) Engineer (AVP/VP)

icon building Company : Ocbc Bank
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Site Reliability (Infrastructure) Engineer (AVP/VP)

WHO WE ARE:

As Singapore’s longest established bank, we have been dedicated to enabling individuals and businesses to achieve their aspirations since 1932. How? By taking the time to truly understand people. From there, we provide support, services, solutions, and career paths that meet their individual needs and desires.

 Today, we’re on a journey of transformation. Leveraging technology and creativity to become a future-ready learning organisation. But for all that change, our strategic ambition is consistently clear and bold, which is to be Asia’s leading financial services partner for a sustainable future.

 We invite you to build the bank of the future. Innovate the way we deliver financial services. Work in friendly, supportive teams. Build lasting value in your community. Help people grow their assets, business, and investments. Take your learning as far as you can. Or simply enjoy a vibrant, future-ready career.

Your Opportunity Starts Here.

Why Join
Imagine being part of a team that powers the technology behind one of Singapore's longest established banks. As a Technology Infrastructure Specialist at OCBC, you'll play a critical role in ensuring our systems and infrastructure are secure, efficient, and always available. You'll be part of a team that's driving innovation and transformation in the banking industry.

How you succeed
As a Site Reliability Engineer you will be responsible for the operation, reliability, and performance of the company's platform infrastructure. You will work closely with infrastructure  teams to ensure high availability, scalability, and operational excellence across multiple environments.

What you do

  • Incident Response & RCA: Lead the response for complex virtualization, storage, or OS-level disruptions and conduct blameless post-mortems and Root Cause Analysis (RCA) to prevent systemic recurrence.

  • Systems Automation: Develop and maintain software tools (Python, PowerShell, Java) that  automation if infrastructure task via, CI/CD pipelines, and to improve efficiency and reduce operational risk.

  • Observability & Telemetry: Architect and manage AI-first monitoring systems (Grafana, ELK) to capture deep telemetry for predictive failure detection across hypervisors, storage arrays, and OS performance counters.

  • Availability Management: Define and measure infrastructure-specific SLIs and SLOs (e.g., IOPS, disk latency, OS uptime) and manage "error budgets" to balance rapid infrastructure changes with environment stability.

  • Infrastructure as Code (IaC): Adopt and maintain declarative configurations (e.g., Terraform, Ansible) to ensure consistency and speed across Windows and Linux deployments in multi-cloud and data center environments.

Who you are

  • A degree in Computer Science, Information Technology, Engineering related.

  • At least 5 years of relevant experiences.

  • OS Engineering: Intermediate-level administration of Windows Server (Active Directory, Clustering) and Redhat Linux. 

  • Virtualization & Storage: High proficiency in hypervisors (VMware) and enterprise storage architecture (SAN, NAS, S3).

  • Programming & Scripting: Proficiency in Python and PowerShell for developing automated management platforms and systems tooling.

  • Observability & Telemetry: Hands-on experience architecting monitoring solutions using Grafana and Elasticsearch (ELK) for predictive health analytics.

  • CI/CD & DevOps: Experience building automated pipelines (Jenkins, Bitbucket, Jira) 

  • Configuration management tools (e.g. Ansible, BigFix)


Who we are
As Singapore's longest established bank, we have been dedicated to enabling individuals and businesses to achieve their aspirations since 1932. How? By taking the time to truly understand people. From there, we provide support, services, solutions, and career paths that meet their individual needs and desires.

Today, we're on a journey of transformation. Leveraging technology and creativity to become a future-ready learning organisation.
But for all that change, our strategic ambition is consistently clear and bold, which is to be Asia's leading financial services partner for a sustainable future.

We invite you to build the bank of the future. Innovate the way we deliver financial services. Work in friendly, supportive teams. Build lasting value in your community. Help people grow their assets, business, and investments. Take your learning as far as you can. Or simply enjoy a vibrant, future-ready career. Your Opportunity Starts Here.

What we offer:


Competitive base salary. A suite of holistic, flexible benefits to suit every lifestyle. Community initiatives. Industry-leading learning and professional development opportunities. Your wellbeing, growth and aspirations are every bit as cared for as the needs of our customers.

Original job Site Reliability (Infrastructure) Engineer (AVP/VP) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Site Reliability Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Site Reliability Engineer Jobs in Singapore

GrabJobs is the no1 job portal in Singapore, connecting you to thousands of jobs fast! Find the best jobs in Singapore, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.