Number of Applicants
:000+
Let AI Supercharge Your Job Hunt!
JobCopilot scans 500,000+ company career sites daily to find jobs for you
Wissen Technology is Hiring for (SRE) – Cloud & DevOps
About Wissen Technology:
Wissen Technology is a globally recognized organization known for building solid technology teams, working with major financial institutions, and delivering high-quality solutions in IT services. With a strong presence in the financial industry, we provide cutting-edge solutions to address complex business challenges.
Role Overview:
We are looking for a proactive Site Reliability Engineer (SRE) with hands-on experience in cloud infrastructure, automation, monitoring, and incident management. This role will be critical in maintaining system reliability, scalability, and performance across distributed applications and services.
Key Responsibilities:
Monitor system health and proactively address issues across distributed applications.
Create detection strategies and implement automated systems for issue resolution and alerting.
Manage infrastructure and CI/CD pipelines across cloud environments (preferably AWS).
Collaborate with product and engineering teams to ensure adherence to DevOps best practices.
Participate in incident response, troubleshooting, and post-mortem analysis.
Administer and optimize monitoring, alerting, and logging tools for reliability and visibility.
Support containerized environments using Docker (experience with Kubernetes or Terraform is a plus).
Write basic scripts in Linux (Bash/sh/zsh), and optionally on Windows.
Manage and query databases using SQL for diagnostics and health checks.
Understand application performance analysis and tuning methodologies.
Contribute to management and optimization of data pipelines (Snowflake knowledge is a plus).
Required Skills:
Strong cloud experience (AWS preferred; GCP/Azure acceptable with core service knowledge).
Good grasp of Linux scripting (bash/sh/zsh); Windows scripting is a nice-to-have.
Basic programming/scripting in Python, Java, or JavaScript.
Hands-on experience with monitoring, alerting, and logging tools (e.g., Prometheus, Grafana, ELK, Splunk).
Familiarity with CI/CD processes and tools.
Experience with Docker and container orchestration (Kubernetes is a plus).
Solid understanding of distributed systems and application reliability.
Knowledge of incident management workflows and tooling.
Ability to write and execute SQL queries and work with data
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.