We are looking for a VibeOps Engineer to support mission-critical banking infrastructure across enterprise and IBM Z environments. This role combines infrastructure operations, SRE practices, and AI-powered operational assistants to improve reliability, accelerate incident resolution, and modernize enterprise operations. The ideal candidate has strong experience in production support, infrastructure engineering, troubleshooting, and highly available enterprise systems, preferably within banking or other regulated industries.
Engagement Duration: 6 months, with possible extension
Workload: Full-time
Start Date: July
Rate: $55-70 per hour
Location: USA
Must-have Skills:
Strong experience in infrastructure operations, systems administration, site reliability engineering (SRE), platform engineering, or related disciplines
Hands-on experience supporting highly available enterprise production environments
Practical expertise in Linux/Unix or Windows Server administration, cloud infrastructure operations, enterprise monitoring, or infrastructure automation
Strong knowledge of incident, change, and problem management processes
Advanced troubleshooting and root cause analysis skills
Nice-to-Have:
Experience working within banking, financial services, or other regulated industries
Exposure to IBM Z and related technologies (z/OS, CICS, DB2, IMS, RACF, JES, SDSF) is an advantage
Position Requirements
In this role, you will operate mission-critical banking infrastructure at the intersection of enterprise operations, AI, and legacy platforms. You will use AI-powered operational assistants to support IBM Z and surrounding enterprise environments, helping teams resolve incidents faster, improve reliability, and scale institutional knowledge in a modern way. This is an opportunity to shape a new operating model where human judgment and AI work together across complex, high-availability systems.
Responsibilities
Support production infrastructure powering critical banking operations across enterprise platforms and systems
Utilize AI-powered operational assistants to troubleshoot incidents, access platform knowledge, and execute operational procedures
Monitor system health, performance, availability, and reliability across both legacy and distributed technology environments
Investigate incidents, perform root cause analysis, and drive timely resolution of production issues
Participate in change management, release management, disaster recovery, and business continuity activities
Collaborate with infrastructure, application, cybersecurity, risk, and business teams to ensure service availability and stability
Identify and implement opportunities for automation, operational improvements, and increased platform efficiency
Contribute to operational excellence initiatives and the evolution of AI-augmented operations practices
Apply modern infrastructure engineering and SRE principles within complex, regulated enterprise environments
Requirements
Strong experience in infrastructure operations, systems administration, site reliability engineering, platform engineering, or related disciplines
Experience supporting highly available production systems in enterprise environments
Hands-on experience with one or more of the following: Linux or Unix administration, Windows Server administration, SRE, cloud infrastructure operations, middleware administration, enterprise monitoring, or infrastructure automation
Understanding of incident, change, and problem management processes
Strong troubleshooting and root cause analysis skills in complex operational environments
Familiarity with IT operations in regulated industries, preferably banking or financial services
Knowledge of banking operations such as payments, treasury, lending, wealth or asset management, securities, custody, or operational risk is an advantage
Exposure to IBM Z, z/OS, CICS, DB2, IMS, RACF, JES, SDSF, or related technologies is beneficial
Ability to learn quickly and adapt in complex technology ecosystems
Strong communication and collaboration skills for working across technical and business teams
Recruitment Process:
CV Screening: Applications are reviewed within 24 hours.
Pre-Screening: A short Q&A session (automated or with a recruiter) to assess your experience.
Shortlisting: Selected candidates are presented to the hiring manager.
Interview 1: Technical discussion with the project team.
Interview 2: Technical discussion with the end customer.
Offer & Onboarding: Successful candidates receive an offer and start the onboarding process.
Note: You can choose to complete the pre-screening via an automated session (recommended for faster feedback) or with a recruiter.
Information about the processing of your personal data is provided in our Privacy Policy, which is available online.