About the Role
We are all working with the same purpose: To create a positive impact in our ecosystem by enabling commerce through technology.
Designing the future of fintech at Trendyol. From digital wallets to smart credit systems, we build scalable financial solutions that empower millions of users across our global ecosystem.
Responsibilities
- Accomplish pillars of Site Reliability Engineering
- Develop necessary platforms to observe and heal the health of the platform
- Be a part of incident commander role
- Take actions to reduce both incident rate and MTTR
- Guide domain teams to more reliable designs
- Run chaos experiments with domain teams and find the actions according to results
- Participate in oncall rotation
Expected Qualifications
- Be passionate about developing well-architected, innovative, and elegant platform software tools and services
- Experience with Go programming language is a plus
- Eager to find problems and making sure to take actions to make it not repeated
- Understanding of core distributed systems concepts, such as fault-tolerance, consistency, reliability and availability
- Experience working with Kubernetes and CNCF tools is a big plus
- Experience working with private cloud technologies (Openstack,vCloud) is a big plus
- Technical English proficiency is a must
- Being curious about how systems work and, more importantly, how they fail
- Eagerness for self-improvement, open-minded, future-oriented
Knowledge about OSI layers