Site Reliability Engineer Remote

Company : Paynearme

Job Type : Full Time

Santa Clara, California

Number of Applicants

:

000+

Apply Now

Job Description - Site Reliability Engineer Remote

Job Description

What you'll be responsible for:

Infrastructure Management: Design, implement, and maintain scalable and resilient infrastructure using Terraform for infrastructure as code, ensuring high availability and performance.

Kubernetes and Containers: Deploy, manage, and optimize Kubernetes clusters and containerized applications using Docker. Implement best practices for container orchestration and management.

Systems and Application Monitoring/Observability: Develop and maintain comprehensive monitoring and observability solutions using Datadog. Ensure detailed visibility into system performance and application health.

SLOs and SLA Management: Define, monitor, and maintain Service Level Objectives (SLOs) and Service Level Agreements (SLAs) to ensure reliable and consistent service delivery.

Incident Response and Troubleshooting: Respond to incidents, perform root cause analysis, and implement solutions to prevent recurrence. Participate in post-incident reviews and contribute to blameless postmortems.

Reliability and Production Environment Management: Ensure the reliability and stability of our production environments. Continuously assess and improve system reliability, identifying and addressing potential points of failure.

Automation and Scripting: Develop automation scripts and tools to reduce manual intervention and improve system reliability using Python, Bash, or Go. Implement and improve CI/CD pipelines.

CI/CD Pipeline Management: Enhance and maintain continuous integration and continuous deployment pipelines using GitLab CI. Ensure seamless and reliable deployment processes.

Capacity Planning and Scaling: Assist in capacity planning and ensure that systems are scalable to meet future demands. Implement auto-scaling strategies where applicable.

Security and Compliance: Implement security best practices and ensure compliance with industry standards. Regularly review and update security policies and procedures.

Collaboration and Support: Work closely with development teams to ensure reliability and scalability of new features and services. Provide technical support and guidance on infrastructure-related issues.

Software Engineering for Operations: Develop and maintain internal tools and services that enhance the efficiency and reliability of our operations.

On-Call Rotation: Participate in an on-call rotation to address production issues and collaborate in incident response efforts.

Qualifications:
Qualifications

We’re looking for someone with:

Experience: +3 years of experience in SRE, DevOps, or a related role.

Cloud Platform Experience: Proficient with cloud platforms such as AWS, GCP, or Azure. Experience with EC2, RDS, VPCs, and security groups is essential.

Kubernetes and Containers: Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applications.

Infrastructure as Code: Expert in using Terraform for infrastructure as code. Proficient with configuration management tools such as Ansible, Puppet, or Chef.

Monitoring and Observability: Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systems.

CI/CD Practices: Familiarity with GitLab CI or similar tool for continuous integration and deployment. Experience in setting up and managing pipelines.

Additional Information

Benefits

Base salary per year (paid semi-monthly)

Fast- paced and professional work culture

Stock options with standard startup vesting - 1 year cliff; 4 years total

$50 monthly communication expense stipend to go towards your phone/internet bill

$250 stipend to enhance your WFH setup

Reimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200)

Premium medical benefits including vision and dental (100% coverage for employees)

Company-sponsored life and disability insurance

Paid parental bonding leave

Paid sick leave, jury duty, bereavement

401k plan

Flexible Time Off (our team members typically take off ~3-4 weeks per year)

Volunteer Time Off

13 scheduled holidays

4-6x / year in-person team meet-ups

Salary Range:

$205,000 - $225,000

PayNearMe strives to create a workplace where all employees thrive. Our

core values

represent who we are today and we take pride in the way we work with each other as well as with our stakeholders.

We’re in this

together

to

do the right thing . We deliver

real results

we are proud of while remaining

respectful

,

transparent

, and

flexible .

PayNearMe is an equal opportunity employer. We are diligently and thoughtfully working towards cultivating a diverse workforce which in turn, enhances our products and services for the communities we serve. Applicants who represent all backgrounds are strongly encouraged to apply.

—

Candidate information will be treated in accordance with our job applicant privacy notice found at:

https://home.paynearme.com/ccpa-privacy-notice-jobs-employees/

Assistance for Disabled Applicants

Alternative formats of this Notice are available to individuals with a disability. Please let us know if you need assistance.

All your information will be kept confidential according to EEO guidelines.

Qualifications

**We’re looking for someone with:**

* Experience: +3 years of experience in SRE, DevOps, or a related role.
* Cloud Platform Experience: Proficient with cloud platforms such as AWS, GCP, or Azure. Experience with EC2, RDS, VPCs, and security groups is essential.
* Kubernetes and Containers: Strong experience with Kubernetes and Docker, including deployment, scaling, and management of containerized applications.
* Infrastructure as Code: Expert in using Terraform for infrastructure as code. Proficient with configuration management tools such as Ansible, Puppet, or Chef.
* Monitoring and Observability: Extensive experience with monitoring and observability tools like Datadog, Prometheus, Grafana, ELK stack, or Splunk. Skilled in setting up detailed monitoring and logging systems.
* CI/CD Practices: Familiarity with GitLab CI or similar tool for continuous integration and deployment. Experience in setting up and managing pipelines.

Additional Information

**Benefits**

* Base salary per year (paid semi-monthly)
* Fast- paced and professional work culture
* Stock options with standard startup vesting - 1 year cliff; 4 years total
* $50 monthly communication expense stipend to go towards your phone/internet bill
* $250 stipend to enhance your WFH setup
* Reimbursement for peripheral equipment: monitor (up to $400), keyboard and mouse (up to $200)
* Premium medical benefits including vision and dental (100% coverage for employees)
* Company-sponsored life and disability insurance
* Paid parental bonding leave
* Paid sick leave, jury duty, bereavement
* 401k plan
* Flexible Time Off (our team members typically take off ~3-4 weeks per year)
* Volunteer Time Off
* 13 scheduled holidays
* 4-6x / year in-person team meet-ups

**Salary Range:** $205,000 - $225,000

PayNearMe strives to create a workplace where all employees thrive. Our [core values](https://home.paynearme.com/about/careers/) represent who we are today and we take pride in the way we work with each other as well as with our stakeholders.

We’re in this **together** to **do the right thing**. We deliver **real results** we are proud of while remaining **respectful** , **transparent** , and **flexible**.

_PayNearMe is an equal opportunity employer. We are diligently and thoughtfully working towards cultivating a diverse workforce which in turn, enhances our products and services for the communities we serve. Applicants who represent all backgrounds are strongly encouraged to apply._

—

Candidate information will be treated in accordance with our job applicant privacy notice found at:

**Assistance for Disabled Applicants**

Alternative formats of this Notice are available to individuals with a disability. Please let us know if you need assistance.

All your information will be kept confidential according to EEO guidelines.

#J-18808-Ljbffr

Original job Site Reliability Engineer Remote posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Apply Now