What you’ll do:
- Active contributor in defining, designing, building, and operating our internal developer platform and tools with a focus on enabling other engineering teams to focus on feature work in their products.
- Focus on cloud architecture, infrastructure as code, monitoring and alerting and other operational excellence areas.
- Ensure the products we build exceed our customer’s expectations by:
- Actively participate in on-call rotation for incident response, primarily during working hours for the platform and tools
- Build monitoring that alerts on symptoms rather than on outages.
- Use metrics/data to improve our products continuously
- Ensure we have the right SLIs defined and measured, and continuously ensure we meet or exceed our SLOs.
- Identify and implement improvements in the platform architecture and/or tools to reduce toil and/or improve resilience and scalability.
- Use solid engineering practices to build long-term solutions to prevent issues from repeating.
- Utilize chaos engineering practices to stress and validate that the right monitoring, resilience, and HA/DR commitments are being met.
What you’ll need:
- 2+ years writing secure, scalable, and resilient infrastructure as code -preferably in Terraform - and using standard SDLC methodologies for building, testing, deploying, and supporting cloud infrastructure (preferably in AWS).
- 2+ years of experience with CI/CD tools like GitOps, Ansible, Jenkins, Github, Gitlab, etc. (GitHub, GitHub Actions/Workflows preferred)
- 2+ years experience writing automation and tools using various scripting or programming languages
- 1+ years of hands-on experience in DevOps/DevSecOps/SRE
- 1+ years of experience with Kubernetes (preferably EKS)
- Strong background and passion for building secure solutions.
- Working knowledge of defining and managing SLIs and SLOs and methods for using error budgets to ensure SLOs are met.
- Strong knowledge of the English language and clear and crisp communication - both verbal and written
- Solid emotional intelligence and ability to collaborate with others, especially under pressure.
Preferred Experience:
- Bachelor’s Degree in Software Engineering or related field or relevant work experience
- Production experience building, running, and operating cloud-native applications using Docker and one or more programming languages (NodeJS, PHP, or Go preferred)
- Experience using tools such as Flux and Helm
- Previous experience building and operating an internal developer platform and tools.
- Broad working knowledge of AWS services such as EC2, MKS, RDS, and S3.
- Experience with observability tools, preferably Datadog
- Experience working in an agile development environment
Working Hours:
- 1:00 PM - 10:00 PM (BD Time), Monday to Friday
Salary Range:
- BDT 100,000 - 140,000 (Monthly)
Other Benefits:
- Competitive salary based on experience and qualification.
- Mobile Bill
- Festival Bonus
- Leave Encashment
- Medical Insurance
- Gratuity Benefit
- Lunch/Dinner - Fully Subsidized
- Transportation Service - Drop Off
- Gym Membership
- Career Development Budget
- Annual performance evaluation and increment.
- Profit Sharing - Field Nation LLC Performance Reward.
- Sound work-life balance - Regular working hours: 8 Hours/Day, 5 Days/Week.
- Flexible leave/vacation policy.
- Friendly work environment.
- Opportunity to work with cross-cultural teams in the US.