Data Engineer (Localization & Language Data - Team Lead)

Company : Alexa Translations

Job Type : Full Time

Montreal, Qc

Number of Applicants

000+

Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications

Activate JobCopilot

Job Description - Data Engineer (Localization & Language Data - Team Lead)

About Alexa Translations

Alexa Translations provides A.I.-powered translations for the largest and most prestigious legal, financial, and government institutions. Our unique combination of advanced technology and professionally certified translators deliver tailored solutions with unparalleled quality. Thanks to over two decades of award-winning client success, you can rely on us as a true extension of your team.

Our core values:

Innovation
Dedication
Fanatical commitment to quality and service
Resourcefulness
Collaboration

Role Overview

As the Data Engineer (Team Lead), you will be the technical lead and will lead a specialized team at the intersection of Big Data, Global Communications, and Generative AI. You will oversee the development of our enterprise Data Warehouse, ensuring that our language assets (TMs, Glossaries, and Metadata) are structured, searchable, and optimized for human translators, machine translation searches and machine learning models.

Beyond traditional data engineering, you will collaborate with multiple teams on the design of the platform interface and the indexing strategies that power our next-generation localization workflows. Your unique value lies in bridging the gap between high-level data architecture, the nuanced translation domain, and the emerging requirements of Retrieval-Augmented Generation (RAG).

Key Responsibilities

1. Data Strategy & Warehousing

Architecture: Define the roadmap for our data warehouse, ensuring high availability and performance for massive multilingual datasets.
Data Cataloging & Governance: Implement robust cataloging solutions to ensure data lineage and "discoverability" across the organization.
Interface Development: Lead the creation of a user-centric interface that allows stakeholders to interact with, query, and extract data from the platform.

2. Translation & Localization Domain

Linguistic Asset Management: Manage the lifecycle of Translation Memories (TMs) and Terminology Databases.
Systems Expertise: Optimize integrations between our data platform and CAT tools and TMS systems (e.g., Phrase, Trados, MemoQ).
Domain Integration: Ensure data pipelines respect the nuances of translation metadata, XLIFF structures, and regional variants.

3. ML & GenAI Integration

RAG & Indexation: Oversee the creation and maintenance of Vector Databases and semantic search indexes to support Retrieval-Augmented Generation for automated translation and content creation.
Data Preparation for LLMs: Architect pipelines that clean, chunk, and format localization data for fine-tuning or prompting Large Language Models (LLMs).
Quality & Evaluation: Support the implementation of automated quality estimation (QE) and LLM-based evaluation metrics for translated content.

4. Leadership & Mentorship

Team Management: Lead a cross functional team of Linguists, Software Developers, Devops and Localization Engineers, providing technical guidance and mentorship.
Cross-functional Collaboration: Act as the liaison between Data, Localization, and AI/ML Research teams.

Required Qualifications

Experience: 5+ years in Data Engineering
Technical Stack: Proficiency in SQL, Python, ETL Pipelines, and cloud data platforms (e.g., AWS S3 Data Lakes, AWS Athena, AWS Redshift, AWS Glue).
AI/ML Fundamentals: Solid understanding of the GenAI lifecycle, specifically regarding how data is indexed for RAG (e.g., Pinecone, Milvus, or Qdrant).
Domain Knowledge: Understanding of the localization industry, including experience with TMX, TBX, and CAT tool workflows.
Product Mindset: Experience building and deploying production ready internal tools or interfaces (e.g., Streamlit, React) to democratize data access.

Preferred Skills

Familiarity with embedding models and semantic similarity scoring.
Knowledge of Data Privacy (ISO 27001, GDPR) specifically regarding PII in linguistic datasets.

Benefits & Perks You’ll Love:

Comprehensive Health Insurance: Including vision, dental, complementary therapies, and support for your overall well-being.
Your Birthday Off: We celebrate your special day!
6 Personal/Sick Days: Take the time you need for your health or life’s unexpected moments.
Work-Ready Equipment: Get the tools you need to succeed, provided upon request.
Hybrid Work Model: Enjoy the best of both worlds with a mix of in-office collaboration and remote flexibility.
Learning & Growth Opportunities: Training and resources tailored to your role and department.
Supportive & Collaborative Team Culture: Work alongside team members who genuinely have your back
Team Recognition & Action Awards: Celebrate wins and contributions in meaningful ways.
Employee Referral Program: Earn rewards for bringing amazing talent to our team.

#Li-hybrid

Original job Data Engineer (Localization & Language Data - Team Lead) posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.

Apply Now

Auto-Apply to Similar Jobs

Share Job

Get your Resume Reviewed for Free

Automate Job Applications for Similar Jobs

Auto-Apply to Data Engineer Jobs with your AI JobCopilot

Auto-Apply with AI

Similar Data Engineer Jobs in Canada

Get your Resume Reviewed for Free

Email address

Why are you reporting this job?

I think it’s a discriminatory or offensive

I think it’s fraudulent or a scam

I think it’s trying to sell something unrelated to the job / it’s asking for money

I think it contains incorrect or broken information

Other

All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.

Setup your job alert:

Frequency

By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime. Skip