Site Reliability Engineering (SRE)

Empresa : Ant Group

Tipo de empleo : Tiempo completo

Estepona, Spain

Número de solicitantes

000+

Solicite ya

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Descripción del trabajo - Site Reliability Engineering (SRE)

Description

Key Responsibilities

• Ensuring Payment System Stability and High Availability: Lead technical initiatives to strengthen the reliability of our payment systems. This includes designing

and implementing monitoring tools, logging frameworks, dashboards, diagnostic utilities, and disaster recovery plans. Conduct routine drills, develop contingency

strategies, and participate in on-call rotations to ensure rapid response and resolution of production issues across regions.

• Incident Handling and Emergency Response: Conduct routine drills, develop contingency strategies, and participate in on-call rotations to ensure rapid response

and resolution of production issues.

• Analyze and Optimize Production Issues: Investigate and analyze real-world production cases, such as performance bottlenecks or system inefficiencies, to derive

actionable insights and establish technical best practices. Contribute to the evolution of a highly available and resilient payment architecture.

• Design and Implement Infrastructure Solutions: Architect and set up new Internet Data Centers (IDCs) to meet scalability and performance requirements. Develop

and execute comprehensive data protection plans that adhere to industry standards and compliance requirements, ensuring data integrity and security.

Technical Requirements

• Solid knowledge of Computer Science, and familiar with the principles of Operating System (Unix/Linux), Computer Storage, Computer Networking and other

related principles.

• Proficient in at least one programming language, such as Java/Python/Shell with experience in developing operations and maintenance tools.

• The strong ability to resolve system problems, good communication skills and a sense of ownership.

• Experiences in operating Google Cloud Platform (GCP) / Oracle Cloud Infrastructure(OCI), OLAP platform (like DPDI, Flink, AntSpark), OcenBase (OB), Ant Trust-Native Service (ATS) is a plus.

Original job Site Reliability Engineering (SRE) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Solicite ya

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to Site Reliability Engineering (SRE) Jobs with your AI JobCopilot

Auto-Apply with AI

Similar Site Reliability Engineering (SRE) Jobs in Spain

Get your Resume Reviewed for Free

Dirección de correo electrónico

¿Por qué está informando sobre este trabajo?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

Todos los anuncios de empleo están sujetos a las :Condiciones de GrabJobs. Permitimos a los usuarios marcar los anuncios que puedan infringir dichas condiciones. Los anuncios de empleo también pueden ser marcados por el equipo de moderación de GrabJobs. Sin embargo, ningún sistema de moderación es perfecto, y marcar un anuncio no garantiza que vaya a ser eliminado.

Setup your job alert:

Frequency

Correo electrónico

Al activar las alertas de empleo, acepto los Terms & Privacy Policy de GrabJobs. Puedo darme de baja de las alertas de empleo en cualquier momento. Saltar

Site Reliability Engineering (SRE)

Descripción del trabajo - Site Reliability Engineering (SRE)

Description

Similar Site Reliability Engineering (SRE) Jobs in Spain

Aplicaciones móviles