A

Data Engineer (Localization & Language Data - Team Lead)

icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Data Engineer (Localization & Language Data - Team Lead)


About Alexa Translations

Alexa Translations provides A.I.-powered translations for the largest and most prestigious legal, financial, and government institutions. Our unique combination of advanced technology and professionally certified translators deliver tailored solutions with unparalleled quality. Thanks to over two decades of award-winning client success, you can rely on us as a true extension of your team.

Our core values: 

  • Innovation
  • Dedication
  • Fanatical commitment to quality and service
  • Resourcefulness
  • Collaboration

Role Overview

As the Data Engineer (Team Lead), you will be the technical lead and will lead a specialized team at the intersection of Big Data, Global Communications, and Generative AI. You will oversee the development of our enterprise Data Warehouse, ensuring that our language assets (TMs, Glossaries, and Metadata) are structured, searchable, and optimized for human translators, machine translation searches and machine learning models.

Beyond traditional data engineering, you will collaborate with multiple teams on the design of the platform interface and the indexing strategies that power our next-generation localization workflows. Your unique value lies in bridging the gap between high-level data architecture, the nuanced translation domain, and the emerging requirements of Retrieval-Augmented Generation (RAG).

Key Responsibilities

1. Data Strategy & Warehousing

  • Architecture: Define the roadmap for our data warehouse, ensuring high availability and performance for massive multilingual datasets.
  • Data Cataloging & Governance: Implement robust cataloging solutions to ensure data lineage and "discoverability" across the organization.
  • Interface Development: Lead the creation of a user-centric interface that allows stakeholders to interact with, query, and extract data from the platform.

2. Translation & Localization Domain

  • Linguistic Asset Management: Manage the lifecycle of Translation Memories (TMs) and Terminology Databases.
  • Systems Expertise: Optimize integrations between our data platform and CAT tools and TMS systems (e.g., Phrase, Trados, MemoQ).
  • Domain Integration: Ensure data pipelines respect the nuances of translation metadata, XLIFF structures, and regional variants.

3. ML & GenAI Integration

  • RAG & Indexation: Oversee the creation and maintenance of Vector Databases and semantic search indexes to support Retrieval-Augmented Generation for automated translation and content creation.
  • Data Preparation for LLMs: Architect pipelines that clean, chunk, and format localization data for fine-tuning or prompting Large Language Models (LLMs).
  • Quality & Evaluation: Support the implementation of automated quality estimation (QE) and LLM-based evaluation metrics for translated content.

4. Leadership & Mentorship

  • Team Management: Lead a cross functional team of Linguists, Software Developers, Devops and Localization Engineers, providing technical guidance and mentorship.
  • Cross-functional Collaboration: Act as the liaison between Data, Localization, and AI/ML Research teams.

Required Qualifications

  • Experience: 5+ years in Data Engineering
  • Technical Stack: Proficiency in SQL, Python, ETL Pipelines, and cloud data platforms (e.g., AWS S3 Data Lakes, AWS Athena, AWS Redshift, AWS Glue).
  • AI/ML Fundamentals: Solid understanding of the GenAI lifecycle, specifically regarding how data is indexed for RAG (e.g., Pinecone, Milvus, or Qdrant).
  • Domain Knowledge: Understanding of the localization industry, including experience with TMX, TBX, and CAT tool workflows.
  • Product Mindset: Experience building and deploying production ready internal tools or interfaces (e.g., Streamlit, React) to democratize data access.

Preferred Skills

  • Familiarity with embedding models and semantic similarity scoring.
  • Knowledge of Data Privacy (ISO 27001, GDPR) specifically regarding PII in linguistic datasets.

Benefits & Perks You’ll Love:

  • Comprehensive Health Insurance: Including vision, dental, complementary therapies, and support for your overall well-being.
  • Your Birthday Off: We celebrate your special day!
  • 6 Personal/Sick Days: Take the time you need for your health or life’s unexpected moments.
  • Work-Ready Equipment: Get the tools you need to succeed, provided upon request.
  • Hybrid Work Model: Enjoy the best of both worlds with a mix of in-office collaboration and remote flexibility.
  • Learning & Growth Opportunities: Training and resources tailored to your role and department.
  • Supportive & Collaborative Team Culture: Work alongside team members who genuinely have your back
  • Team Recognition & Action Awards: Celebrate wins and contributions in meaningful ways.
  • Employee Referral Program: Earn rewards for bringing amazing talent to our team.
#Li-hybrid 

 
Original job Data Engineer (Localization & Language Data - Team Lead) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Data Engineer Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Data Engineer Jobs in Canada

GrabJobs is the no1 job portal in Canada, connecting you to thousands of jobs fast! Find the best jobs in Canada, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.