Your mission
As a Senior System Administrator (m/f/d), your mission is to ensure the reliability, security, and scalability of our cloud-based infrastructure while driving operational excellence through automation and modern infrastructure practices. You will play a key role in maintaining and evolving our Linux-based environments, ensuring that systems remain resilient, secure, and highly available.
In this role, you act as both a technical expert and a strategic contributor. You continuously identify opportunities to automate repetitive tasks, improve system observability, and enhance deployment reliability. By strengthening our containerized infrastructure and optimizing CI/CD processes, you help accelerate development workflows while maintaining strong operational standards.
You will work closely with engineering, product, and data teams to ensure that infrastructure supports the company’s growth and product delivery needs. At the same time, you champion infrastructure best practices, ranging from system hardening and patch management to monitoring, disaster recovery, and performance optimization.
Ultimately, your mission is to build and maintain a robust, secure, and automated platform that empowers teams to deliver reliable products efficiently and safely.
Key Responsibilities
Infrastructure Reliability and Container Operations
Ensure the stability, availability, and performance of our Linux-based infrastructure, which runs mainly on AWS, and of our containerized environments. Manage and optimize Docker deployments and maintain CI/CD pipelines in TeamCity to support reliable, automated application delivery.
Automation and Configuration Management
Design, develop, and maintain infrastructure automation primarily using Ansible. Continuously improve configuration management, provisioning, and operational workflows to reduce manual processes and increase system reproducibility.
Security, Hardening, and Observability
Implement and maintain strong security standards across all systems. Perform Linux system hardening, deploy and manage security tools (including vulnerability scanning), and maintain monitoring, logging, and alerting solutions to ensure visibility, threat detection, and proactive issue resolution.
Public Key Infrastructure and Secrets Management
Operate and evolve our internal PKI, including the management of our internal Certificate Authority and certificate lifecycle processes. Maintain and improve our PGP/GPG-based email encryption setup, with the opportunity to design and roll out an internal key server. Consolidate and harden our diverse secrets management landscape – spanning AWS Secrets Manager, Ansible Vault, Docker Secrets, KeePassXC, and Passbolt – to establish consistent practices, clear ownership, and secure handling of credentials across the organization.
Change Management and System Optimization
Manage system changes including patching, upgrades, and infrastructure improvements. Continuously evaluate and optimize system and network performance while ensuring controlled, reliable change processes.
Incident Response and Business Continuity
Provide Tier 2/3 operational support for infrastructure incidents during working hours. Contribute to root cause analysis and continuously improve incident response procedures, disaster recovery plans, and business continuity strategies.
Cross-Team Collaboration
Collaborate closely with Product, Data, and Engineering teams to support data delivery, deployment needs, and infrastructure improvements. Contribute to internal initiatives that mature and streamline core services across the organization.
Mobile Device Management (MDM)
Design, implement, and maintain a Mobile Device Management system to support secure and scalable device administration. Plan and execute the rollout of managed devices, define security policies, and ensure compliance with company standards for mobile and endpoint devices.