Site Reliability Engineer

Salary: €55,000 - 75,000 yearly

Job Type: Full Time

Job Description - Site Reliability Engineer

SRE with Kafka

Dublin/Hybrid

Permanent

Salary up to 75k

Notice period (NP): immediate to 1 month (official)

The Business Operations (Project) team is
seeking a Business Operations Site Reliability Engineer (SRE).

Overview

The role of the Business Operations
organization is to be the production readiness steward for Client products. As
Business Operations SREs, we are responsible for ensuring that our platform is
stable and healthy. We break down barriers to running our products by fostering
developer-run ownership and empowering developers to build resilient products.
We support our developers during the application build phase with software run
principles, including operational design, automation, capacity planning, and
monitoring, that lead to fault-tolerant, scalable products. We see the big
picture and help create and enforce operations standards while facilitating an
agile and learning culture.

We support daily operations with a hyper
focus on triage and then root cause by understanding the business impact of our
products, and we subsequently perform blameless post-mortems. The goal of every
Business Operations team is to engage early in the development lifecycle to be
more proactive and upfront in the development process, and to proactively
manage production and change activities to maximize customer experience and
increase the overall value of supported applications. Business Operations teams
also focus on risk management by tying all our activities together with an
overarching responsibility for compliance and risk mitigation across all our
environments.

Ultimately, the role of Business Operations
is to align Product and Customer Focused priorities with Operational needs by
providing continuous feedback throughout the lifecycle.

Business Operations is leading the DevOps
transformation at Client through our tooling and by being an advocate for
change & standards throughout the development, quality, release, and
product organizations. We need team members with an appetite for change and
pushing the boundaries of what can be done with automation. Experience in
working across development, operations, and product teams to prioritize needs
and to build relationships is a must.

Mission

The role of business operations is to be
the production readiness steward for the platform. This is accomplished by
closely partnering with developers to design, build, implement, and support
technology services. A business operations engineer will ensure operational
criteria like system availability, capacity, performance, monitoring,
self-healing, and deployment automation are implemented throughout the delivery
process. Business Operations plays a key role in leading the DevOps
transformation at Client through our tooling and by being an advocate for
change and standards throughout the development, quality, release, and product
organizations.

We accomplish this transformation through
supporting daily operations with a hyper focus on triage and then root cause by
understanding the business impact of our products. The goal of every Project
team is to shift left to be more proactive and upfront in the development
process, and to proactively manage production and change activities to maximize
customer experience, and increase the overall value of supported applications. Project
teams also focus on risk management by tying all our activities together with
an overarching responsibility for compliance and risk mitigation across all our
environments. A Project focus is also on streamlining and standardizing
traditional application specific support activities and centralizing points of
interaction for both internal and external partners by communicating
effectively with all key stakeholders.

• Operational Readiness Architect:

o Serve as the primary contact responsible
for the overall application health, performance, and capacity

o Support services before they go live
through activities such as system design consulting, capacity planning and
launch reviews.

o Partner with the development and product
team of a new application to establish the right monitoring and alerting
strategy and create the framework to achieve zero downtime during deployment.

• Site Reliability Engineering:

o Serve as the primary contact responsible
for ensuring application scalability, performance, and resilience.

o Practice sustainable incident response
and blameless post-mortems while taking a holistic approach to problem solving
and optimizing time to recover.

o Automate data-driven alerts to
proactively escalate issues. Work with development teams to establish SLOs and
improve reliability.

• DevOps/Automation:

o Tackle complex development, automation,
and business process problems. Engage in and improve the whole lifecycle of
services—from inception and design, through deployment, operation, and
refinement.

o Support the application CI/CD pipeline
for promoting software into higher environments through validation and
operational gating, and lead Client in DevOps automation and best practices.

o Increase automation and tooling to reduce
toil and manual intervention

• ITSM Practices:

Role Qualifications

The ideal candidate will have experience in
many of these areas:

• BS degree in Computer Science or related
technical field involving coding (e.g., physics or mathematics), or equivalent
practical experience.

• Coding and/or scripting exposure.

• Appetite for change and pushing the
boundaries of what can be done with automation. Be curious about new
technology, infrastructure, and practices to scale our architecture and prepare
for future growth.

• Experience with algorithms, data
structures, scripting, pipeline management, and software design

• Systematic problem-solving approach,
coupled with strong communication skills and a sense of ownership and drive.

• Interest in designing, analysing, and
troubleshooting large-scale distributed systems.

• Willingness and ability to learn and take
on challenging opportunities and to work as a member of a matrix-based, diverse,
and geographically distributed project team.

• Ability to balance doing things right
with fixing things quickly. Flexible and pragmatic, while working towards
improving the long-term health of the system.

• Comfortable collaborating with
cross-functional teams to ensure that expected system behavior is understood,
and monitoring exists to detect anomalies.

 

Kafka knowledge is a MUST

  • 3-5 years of experience working with Apache Kafka in a
    production environment.

  • Strong knowledge of Kafka architecture, including brokers,
    topics, partitions, and replicas.

  • Experience with Kafka security, including SSL, SASL, and ACLs.
  • Proficiency in configuring, deploying, and managing Kafka
    clusters in cloud and on-premises environments.

  • Experience with Kafka stream processing using tools like Kafka
    Streams, KSQL, or Apache Flink.

  • Solid understanding of distributed systems, data streaming, and
    messaging patterns.

  • Proficiency in Java, Scala, or Python for Kafka-related
    development tasks.

  • Familiarity with DevOps practices, including CI/CD pipelines,
    monitoring, and logging.

  • Experience with tools like ZooKeeper, Schema Registry, and
    Kafka Connect.

  • Strong problem-solving skills and the ability to troubleshoot
    complex issues in a distributed environment.

  • Excellent communication and collaboration skills to work
    effectively with cross-functional teams and stakeholders.

 



Original job Site Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
About the Company

Tiger Resourcing Group

Tiger Resourcing Group is a UK-based IT and engineering recruitment specialist with a global reach.

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.