Job Description - Data Engineer - Iceberg Migration
Req ID: 364925\n\nNTT DATA strives to hire exceptional, innovative and passionate individuals who want to grow with us. If you want to be part of an inclusive, adaptable, and forward-thinking organization, apply now.\n\nWe are currently seeking a Data Engineer - Iceberg Migration to join our team in Bangalore, Karn\u0101taka (IN-KA), India (IN).\n\nJob Duties: NTT DATA is seeking an experienced Data Migration Engineer to support the PNC Bank Hadoop-to-Iceberg POC engagement. This role sits at the heart of Workstream 1 and is responsible for executing the end-to-end migration of a representative set of Hive tables and ETL jobs to an Apache Iceberg-based Data Lakehouse architecture. \n\nThe engineer will work directly with NTT DATA\u0027s AI accelerators \u2014 KANO (for automated discovery and dependency mapping) and DaaP (for agentic code conversion and validation) \u2014 reviewing and refining AI-generated outputs, ensuring migration accuracy, and collaborating with PNC stakeholders to validate results against agreed POC acceptance criteria. \n\nThis is a hands-on technical role requiring deep expertise across the Hadoop ecosystem, Apache Spark, and the Apache Iceberg table format, with the ability to operate effectively in a regulated financial services environment.\n\nMinimum Skills Required: Hadoop \u0026 Legacy Data Platform \n\n5+ years of hands-on experience with the Hadoop ecosystem: HDFS, Hive, HiveQL, Cloudera/Hortonworks, Sqoop, Oozie, and YARN \n\nDeep understanding of Hive metastore, table formats, SerDes, and partitioning strategies \n\nExperience with Cloudera Data Platform (CDP) migrations or decommissions is strongly preferred \n\nApache Spark \n\nStrong PySpark and/or Scala Spark development skills including Structured Streaming, DataFrame API, and Spark SQL \n\nExperience optimizing Spark jobs: partitioning, broadcast joins, memory tuning, and handling data skew \n\nFamiliarity with CI/CD pipelines for Spark code promotion (Git, Jenkins, or equivalent) \n\nApache Iceberg \n\nPractical experience implementing Apache Iceberg tables including catalog configuration (Hive Metastore, Nessie, AWS Glue, or Polaris) \n\nSolid understanding of Iceberg internals: snapshot model, manifest files, metadata tables, hidden partitioning, and partition evolution \n\nExperience with Iceberg schema evolution, time travel queries, MOR vs COW merge strategies, and ACID-compliant upserts \n\nHands-on experience with Iceberg maintenance procedures: compaction (rewrite_data_files), snapshot expiry, and orphan file cleanup \n\nData Engineering \u0026 Cloud \n\nExperience with S3-compatible object storage (AWS S3, Azure ADLS, or on-prem MinIO/Ceph) as an Iceberg warehouse \n\nFamiliarity with query engines such as Trino and/or Flink for multi-engine Iceberg access \n\nUnderstanding of medallion (Bronze-Silver-Gold) architecture patterns for data lake organization\n\nAbout NTT DATA\n\nNTT DATA is a $30 billion business and technology services leader, serving 75% of the Fortune Global 100. We are committed to accelerating client success and positively impacting society through responsible innovation. We are one of the world\u0027s leading AI and digital infrastructure providers, with unmatched capabilities in enterprise-scale AI, cloud, security, connectivity, data centers and application services. our consulting and Industry solutions help organizations and society move confidently and sustainably into the digital future. As a Global Top Employer, we have experts in more than 50 countries. We also offer clients access to a robust ecosystem of innovation centers as well as established and start-up partners. NTT DATA is a part of NTT Group, which invests over $3 billion each year in R\u0026D.\n\nWhenever possible, we hire locally to NTT DATA offices or client sites. This ensures we can provide timely and effective support tailored to each client\u2019s needs. While many positions offer remote or hybrid work options, these arrangements are subject to change based on client requirements. For employees near an NTT DATA office or client site, in-office attendance may be required for meetings or events, depending on business needs. At NTT DATA, we are committed to staying flexible and meeting the evolving needs of both our clients and employees. NTT DATA recruiters will never ask for payment or banking information and will only use @nttdata.com and @talent.nttdataservices.com email addresses. If you are requested to provide payment or disclose banking information, please submit a contact us form, https://us.nttdata.com/en/contact-us.\n\nNTT DATA endeavors to make https://us.nttdata.com accessible to any and all users. If you would like to contact us regarding the accessibility of our website or need assistance completing the application process, please contact us at https://us.nttdata.com/en/contact-us. This contact information is for accommodation requests only and cannot be used to inquire about the status of applications. NTT DATA is an equal opportunity employer. Qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability or protected veteran status. For our EEO Policy Statement, please click here. If you\u0027d like more information on your EEO rights under the law, please click here. For Pay Transparency information, please click here.\n
All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.
Be the first to receive the latest Others Full-Time Jobs in India.
Setup your job alert:
By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime.
Skip