Site Reliability Engineer for Linux administration

Job Type: Full Time

Job Description - Site Reliability Engineer for Linux administration

At HYVE Solutions, our mission is to help customers, business partners, and employees achieve success through shared goals, strategies, resources, and technology solutions.

Job Title

SRE1 (Linux System Administrator)

Job description

The Linux System Administrator is responsible for day-to-day administration, maintenance, and support of Linux-based systems. This role focuses on system stability, patching, monitoring, and incident resolution while following established standards and procedures.

1. Install, configure, and maintain Linux servers (RHEL/Ubuntu/Rocky/CentOS), both physical and virtual.
2. Manage core services: SSH, DNS, DHCP, NTP, LDAP/SSSD, Postfix, NFS/SMB, systemd, and cron.
3. Apply patches and kernel updates; maintain package repos (yum/dnf/apt).
4. Monitor system health and performance (top, iostat, sar, vmstat, netstat; or via Prometheus/Zabbix).
5. Implement backup/restore routines (e.g., Veeam/Bacula/NetBackup); test recoveries.
6. Manage storage (LVM, mdadm RAID, multipath, iSCSI/FC) and filesystems (ext4/xfs/btrfs).
7. Harden hosts per baseline (CIS/STIG); manage firewall rules (iptables/nftables/firewalld) and SELinux/AppArmor policies.
8. Write basic automation scripts (shell; optionally Python or Ansible playbooks) for routine tasks.
9. Respond to incidents; perform root cause analysis; document runbooks and SOPs.
10. Support users/dev teams with environments, permissions, and troubleshooting.

Required Skills

1. Solid understanding of Linux OS fundamentals.
2. Experience with systemd, networking, storage, and filesystems.
3. Package management and log analysis (journalctl, syslog).
4. Storage and filesystem management; backup fundamentals.
5. Familiarity with monitoring tools (e.g., Nagios).
6. Basic virtualization (VMware/KVM/Nutanix) and/or container basics (Docker).
7. Basic scripting skills (shell scripting); version control (Git).
8. Understanding of security best practices: SSH keys, PAM, auditing (auditd).
9. Ability to follow procedures and work under supervision.

Educational background and Working Experience

1. Bachelor's degree in Computer Science, IT, or related field (or equivalent experience).
2. 3-5 years administering Linux (Rocky/RHEL/Ubuntu) in production.
3. Certifications (nice-to-have): Red Hat RHCSA, CompTIA Linux+, LPIC-1.

At HYVE Solutions, we believe employees are our greatest asset, and we empower them to make a difference in our business. Diversity and inclusion make us all better. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability, or protected veteran status.

Original job Site Reliability Engineer for Linux administration posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.