Senior Data Engineer - Data Pipelines

Empresa : Astrazeneca

Tipo de empleo : Tiempo completo

Spain - Barcelona

Número de solicitantes

000+

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Descripción del trabajo - Senior Data Engineer - Data Pipelines

Are you ready to build high-performance data pipelines that turn complex science into real impact for patients? In this role, you will transform raw bioinformatics and scientific data into trusted, reusable assets that drive discovery and decision-making across our research programs.

You will join a team that fuses data engineering with cutting-edge science, using HPC and AWS to deliver reproducible workflows at scale. From first ingestion to consumption by scientists and AI models, you will set the standard for reliability, speed, and governance across our data foundation.

Do you thrive where learning is continuous and bold ideas are encouraged? You will have the freedom to experiment, the support to grow, and the opportunity to see your work influence breakthroughs as they take shape.

Accountabilities:

Pipeline Engineering: Design, implement, and operate fit-for-purpose data pipelines for bioinformatics and scientific data, from ingestion to consumption.
Workflow Orchestration: Build reproducible pipelines using frameworks such as Nextflow (preferred) or Snakemake; integrate with schedulers and HPC/cloud resources.
Data Platforms: Develop data models, warehousing layers, and metadata/lineage; ensure data quality, reliability, and governance.
Scalability and Performance: Optimize pipelines for throughput and cost across Unix/Linux HPC and cloud environments (AWS preferred); implement observability and reliability practices.
Collaboration: Translate scientific and business requirements into technical designs; partner with CPSS stakeholders, R&D IT, and DS&AI to co-create solutions.
Engineering Excellence: Establish and maintain version control, CI/CD, automated testing, code review, and design patterns to ensure maintainability and compliance.
Enablement: Produce documentation and reusable components; mentor peers and promote best practices in data engineering and scientific computing.

Essential Skills/Experience:

Pipeline engineering: Design, implement, and operate fit-for-purpose data pipelines for bioinformatics and scientific data, from ingestion to consumption.
Workflow orchestration: Build reproducible pipelines using frameworks such as Nextflow (preferred) or Snakemake; integrate with schedulers and HPC/cloud resources.
Data platforms: Develop data models, warehousing layers, and metadata/lineage; ensure data quality, reliability, and governance.
Scalability and performance: Optimize pipelines for throughput and cost across Unix/Linux HPC and cloud environments (AWS preferred); implement observability and reliability practices.
Collaboration: Translate scientific and business requirements into technical designs; partner with CPSS stakeholders, R&D IT, and DS&AI to co-create solutions.
Engineering excellence: Establish and maintain version control, CI/CD, automated testing, code review, and design patterns to ensure maintainability and compliance.
Enablement: Produce documentation and reusable components; mentor peers and promote best practices in data engineering and scientific computing.

Desirable Skills/Experience:

Strong programming in Python and Bash for workflow development and scientific computing.
Experience with containerization and packaging (Docker, Singularity, Conda) for reproducible pipelines.
Familiarity with data warehousing and analytics platforms (e.g., Redshift, Snowflake, Databricks) and data catalog/lineage tools.
Experience with observability and reliability tooling (Prometheus/Grafana, ELK, tracing) in HPC and cloud contexts.
Knowledge of infrastructure as code and cloud orchestration (Terraform, CloudFormation, Kubernetes).
Understanding of FAIR data principles and domain-specific bioinformatics formats and standards.
Track record of mentoring engineers and enabling cross-functional teams with reusable components and documentation.
Experience optimizing performance and cost on AWS, including spot strategies, autoscaling, and storage tiers.

When we put unexpected teams in the same room, we unleash bold thinking with the power to inspire life-changing medicines. In-person working gives us the platform we need to connect, work at pace and challenge perceptions. That's why we work, on average, a minimum of three days per week from the office. But that doesn't mean we're not flexible. We balance the expectation of being in the office while respecting individual flexibility. Join us in our unique and ambitious world.

Why AstraZeneca:
Your engineering craft will fuel science at the crossroads of biology, data, and technology. You will collaborate with researchers, data scientists, and technologists to tackle complex diseases, using modern platforms and inclusive ways of working to turn uncertainty into insight. We value kindness alongside ambition, nurture resilience and curiosity, and pair the resources of a global leader with the agility to move at pace—from hands-on experimentation to shared learning and tangible impact for patients.

Call to Action:
Step into this role and start building the data pipelines that turn bold science into real outcomes for patients.

Date Posted

15-ene-2026

Closing Date

04-feb-2026

AstraZeneca embraces diversity and equality of opportunity. We are committed to building an inclusive and diverse team representing all backgrounds, with as wide a range of perspectives as possible, and harnessing industry-leading skills. We believe that the more inclusive we are, the better our work will be. We welcome and consider applications to join our team from all qualified candidates, regardless of their characteristics. We comply with all applicable laws and regulations on non-discrimination in employment (and recruitment), as well as work authorization and employment eligibility verification requirements.

Original job Senior Data Engineer - Data Pipelines posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to Data Engineer Jobs with your AI JobCopilot

Auto-Apply with AI

Similar Data Engineer Jobs in Spain

Get your Resume Reviewed for Free

Dirección de correo electrónico

¿Por qué está informando sobre este trabajo?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

Todos los anuncios de empleo están sujetos a las :Condiciones de GrabJobs. Permitimos a los usuarios marcar los anuncios que puedan infringir dichas condiciones. Los anuncios de empleo también pueden ser marcados por el equipo de moderación de GrabJobs. Sin embargo, ningún sistema de moderación es perfecto, y marcar un anuncio no garantiza que vaya a ser eliminado.

Setup your job alert:

Frequency

Correo electrónico

Al activar las alertas de empleo, acepto los Terms & Privacy Policy de GrabJobs. Puedo darme de baja de las alertas de empleo en cualquier momento. Saltar

Senior Data Engineer - Data Pipelines

Descripción del trabajo - Senior Data Engineer - Data Pipelines

Accountabilities:

Similar Data Engineer Jobs in Spain

Aplicaciones móviles