Number of Applicants
:000+
Let AI Supercharge Your Job Hunt!
JobCopilot scans 500,000+ company career sites daily to find jobs for you
At Roche you can show up as yourself, embraced for the unique qualities you bring. Our culture encourages personal expression, open dialogue, and genuine connections, where you are valued, accepted and respected for who you are, allowing you to thrive both personally and professionally. This is how we aim to prevent, stop and cure diseases and ensure everyone has access to healthcare today and for generations to come. Join Roche, where every voice matters.
Data Engineer
Experience - 4 to 8 years
Location- Pune
Job description
The Senior IT Data Engineer is responsible for leading, designing, developing, and maintaining scalable and robust data pipelines and infrastructure. This role involves independently building ETL/ELT processes, optimizing data storage solutions (such as data warehouses and data lakes), ensuring data quality and reliability, and monitoring data systems. You will collaborate closely with data scientists and analysts to meet their specific data requirements, utilizing strong programming skills in Tableau, Snowflake, Talend, Python or Scala, expert SQL knowledge, and proficiency in big data technologies like Spark.
The ideal candidate will possess a strong background in the pharmaceutical or biotechnology industry, with experience working within Regulatory Affairs, Clinical Operations, or Pharmacovigilance / Safety team with a solid understanding on the E2E process flow across R&D. Additionally, a proven track record of navigating the stringent requirements of GxP environments (GCP, GMP, GVP) and managing complex, cross-functional data workflows is highly desirable.
Description of the area
Job Responsibilities
End-to-End Pipeline Delivery: Independently leads the design, build, and maintenance of scalable data pipelines, managing specific data engineering projects autonomously from inception to deployment.
Performance Optimization & Problem Solving: Solves complex data ingestion and processing challenges, actively optimizing data flows to enhance overall system performance and reliability.
Stakeholder Alignment & Integration: Partners directly with business units and data scientists to understand data requirements, effectively bridging technical execution with non-technical business needs.
Strategic Infrastructure Impact: Owns large-scale data engineering initiatives, implementing robust strategies that significantly modernize and strengthen the organization’s data infrastructure.
Complex Data Integration: Manages large, intricate data ecosystems by seamlessly integrating multiple diverse data sources to ensure efficient, secure cross-platform data flows.
Qualifications
Education / Experience
Large-Scale Data Systems Management: Demonstrated experience owning major data engineering initiatives and managing complex, high-volume enterprise data systems.
Autonomous ETL/ELT Pipeline Development: Proven track record of independently architecting, building, and maintaining automated ETL/ELT data ingestion and transformation processes.
Storage Solution Optimization: Hands-on experience designing and optimizing modern data storage environments, including data warehouses and data lakes, for peak performance and cost efficiency.
Data Quality & Reliability Assurance: Expert capability in implementing rigorous data quality checks, data cleansing rules, and reconciliation frameworks across all pipelines.
System Monitoring & Observability: Strong experience building robust monitoring, alerting, and logging systems to ensure continuous high availability and minimal downtime of data workflows.
Technical Skills
Programming & ETL/ELT Mastery: Advanced proficiency in SQL, Python, and Scala combined with expert use of tools like Talend to build, ingest, and process complex structured and unstructured data streams.
Cloud & Big Data Architecture: Deep expertise leveraging distributed computing frameworks (Spark, Hadoop) and cloud-native data platforms (Snowflake) to manage and scale high-volume, enterprise-level data systems.
Optimization & Performance Engineering: Proven capability to solve complex data processing bottlenecks, tune analytical environments for tools like Tableau, and continuously optimize end-to-end data flows for maximum efficiency and reliability.
Good to have : AI expertise
Additional Qualifications
Pharma & GxP Compliance: Extensive experience in pharma or biotech, architecting data pipelines that strictly comply with GxP frameworks (GCP/GMP/GVP), data integrity principles, and computer systems validation (CSV).
Compliant Data Delivery: Proven capability to build scalable data solutions within regulated R&D environments, aligning technical execution with critical Clinical, Regulatory, and Safety milestones.
Workflow & Data Optimization: Skilled at identifying data bottlenecks, eliminating operational silos, and optimizing fragmented workflows to ensure automated, streamlined cross-platform data transfers.
A healthier future drives us to innovate. Together, more than 100’000 employees across the globe are dedicated to advance science, ensuring everyone has access to healthcare today and for generations to come. Our efforts result in more than 26 million people treated with our medicines and over 30 billion tests conducted using our Diagnostics products. We empower each other to explore new possibilities, foster creativity, and keep our ambitions high, so we can deliver life-changing healthcare solutions that make a global impact.
Let’s build a healthier future, together.
Roche is an Equal Opportunity Employer.
Auto-Apply to Senior Data Engineer Jobs with your AI JobCopilot
Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.