Logo-of-Margo-Inc-hiring-for-jobs-in-Poland-on-GrabJobs

Network Reliability Engineer

icon building Company : Margo Inc
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Network Reliability Engineer


 

#HPC #AI #GPU #CLUSTERS

 

YOUR DAILY ROUTINE

- Build a large AI infrastructure with monitoring, diagnosis, and remediation of production incidents- Troubleshoot high-impact production issues in collaboration with other engineering teams

- Participate in an on-call rotation to handle incidents and ensure service continuity

- Implement and maintain observability solutions to monitor AI infrastructure and application health

- Contribute to AI infrastructure lifecycle management across different environments and countries

- Promote and apply best practices in terms of stability, resiliency, scalability, and security

- Maintain clear technical documentation for tools and procedures

- Contribute to system and tool evolution based on production feedback

- Collaborate closely with development teams to ensure infrastructure readiness- Participate in team rituals and knowledge-sharing initiatives

 

ABOUT YOU

 

 SOFTSKILLS : 

- Proactive and solution-oriented mindset

- Passion for automation and continuous improvement

- Strong collaboration and communication skills

- Ability to work independently and in a team

- Willingness to mentor and share knowledge

 

HARDSKILLS : 

- Experience with Go or Python 

- Strong scripting skills (Bash, Python)

- Hands-on experience with Linux systems (Ubuntu/Debian)

- Preferred hands-on experience with GPU & HPC infrastructure 

- Knowledge of networking (VLAN/LAN, TCP/IP, DNS, BGP, load-balancing, IPv6, etc.)

- Familiarity with monitoring and logging tools (Prometheus, Grafana, Elastic, etc.)

- Comfortable with Infrastructure-as-Code (Ansible, Salt, AWX, etc.)

- Experience managing relational databases (MariaDB)

- Understanding of CI/CD pipelines (GitLab)

- Comfortable with English (written and spoken)

 

200 zł - 250 zł an hour
We may use artificial intelligence (AI) tools to support parts of the hiring process, such as reviewing applications, analyzing resumes, or assessing responses. These tools assist our recruitment team but do not replace human judgment. Final hiring decisions are ultimately made by humans. If you would like more information about how your data is processed, please contact us.
Original job Network Reliability Engineer posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Network Reliability Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Network Reliability Engineer Jobs in Poland

GrabJobs is the no1 job portal in Poland, connecting you to thousands of jobs fast! Find the best jobs in Poland, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.