Senior Site Reliability Engineer

Job Type: Full Time


This job is no longer accepting applications.


Job Description - Senior Site Reliability Engineer

Senior Site Reliability Engineer
Greenwich, CT – Must be willing to work onsite 3 days a week.

Must have strong troubleshooting and scripting skills.

• Our DevOps team is looking for a Site Reliability Engineer who can partner with engineering teams to triage issues, build and deploy new features, and maintain components in the software development toolchain.

What Will Be Your Responsibilities:

• Provide Level 2 support for incident management across our DevOps toolchain.
• Develop and improve instrumentation for monitoring and logging the health and availability of services.
• Educate and lead efforts to improve observability across all engineering teams.
• Proactively monitor systems and applications to provide input on improving their stability, security, efficiency, and scalability.
• Continuously improve hybrid, on-premises, and cloud infrastructure to support development teams throughout the full service lifecycle.
• Participate in system design consulting, platform management, and capacity planning.
• Innovate and improve system processes and tool sets to automate repetitive work and increase efficiency.
• Interact with engineering teams to provide solutions and resolve problems promptly and proactively.
• Balance innovation with reliability using well-defined service level objectives.

What Skills Are Required:
• Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field.
• Strong experience managing observability tools, specifically the ELK stack (Elasticsearch, Logstash, Kibana).
• Experience with modern DevOps tools like Jenkins, Git, Gradle, Nexus, Kubernetes, Kafka
• Strong programming skills: Java, Python, and/or Go
• Ability to debug, optimize code, and automate manual routine tasks
• Infrastructure configuration management on AWS using tools like Terraform and CloudWatch
• Willingness to learn other technologies as needed
• Proficient in developing and maintaining technical documentation, runbooks, and procedures.

To be successful in this position, you will have the following:
• Self-motivation and the ability to handle tasks with minimal supervision.
• Superb analytical and problem-solving skills.
• Excellent collaboration and communication (verbal and written) skills.
• Outstanding organizational and time management skills.

Original job posting: Senior Site Reliability Engineer, posted on GrabJobs.



Copyright © 2024 Grabjobs Pte.Ltd. All Rights Reserved.