D

Staff Architect, Data & AI Platform

icon building Company : Dolby
icon briefcase Job Type : Full Time

Number of Applicants

 : 

000+

Click to reveal the number of candidates who applied for this job.
icon loader
Apply Now
icon loader Apply Now

Let AI Supercharge Your Job Hunt!

JobCopilot scans 500,000+ company career sites daily to find jobs for you

Never miss an opportunity Save hours by auto-filling applications forms Land more interviews with tailored applications
happy man
thunder iconActivate JobCopilot

Job Description - Staff Architect, Data & AI Platform

We're building a unified data and AI/ML model development and training platform—similar in ambition to those platforms at other media technology companies—to accelerate our entire AI/ML lifecycle from data preparation and feature computation to experimentation, large-scale distributed training, evaluation, deployment, and governance & observability. A major focus is creating horizontal, reusable components—feature pipelines, embedding services, training/evaluation frameworks, SDKs, and model management APIs—to simplify the end‑to‑end AI/ML lifecycle. You'll partner closely with AI/ML researchers/engineers, data engineers, and product teams to deliver a paved-path developer experience: from a laptop run to multi-node, GPU-accelerated training; experiment tracking & lineage; model packaging with evaluation hooks; and continuous delivery to production. Architect a cohesive, integrated platform that integrate underlying data and AI/ML compute orchestration systems (Kubernetes, Ray, Spark), feature/experiment/model stores, and enable batch, distributed training, and multi‑stage pipelines while ensuring consistent resource usage patterns. Establish unified pipeline definitions for dataset preparation flows, reliable, scalable training and inference routines, and evaluation workloads to enable reproducibility, repeatability and reuse Define stable, well‑versioned APIs and Python SDKs, which follow common architectural patterns and provide framework‑native integrations (PyTorch/Ray, etc.), to enable teams to build declarative AI/ML workflows spanning dataset selection, preparation, training, tuning, evaluation, and model registration/deployment/serving. Design and evolve the experimentation, model checkpointing/registration, and other AI/ML metadata systems to support lineage, observability, lineage, and evaluation to standardize how teams track, assess, compare, and promote models. Define best practices for AI/ML version control, build/test/evaluate/tune CI/CD, continuous training, and dependency management Collaborate with AI/ML researchers, engineers, platform engineering/MLOps and infrastructure, security, and product stakeholders to align roadmap priorities and architectural sequencing with workflow pain points. Partner with platform engineering/MLOps and infrastructure to define observability stacks for metrics, drift indicators, performance regressions, training/inference health signals, production reliability (SLIs/SLOs), monitoring, and incident response. Desired Background BS in Computer Science, Mathematics, Engineering, or equivalent technical field. Master's Highly preferred. Proven experience architecting large-scale distributed systems and integrated data and AI/ML platforms (e.g., training, serving, workflow orchestration, data pipelines). Expert-level proficiency in Python and one of Go/Java/C++ and building production-grade services/APIs/SDKs Deep knowledge and extensive experience with GPU infrastructure, distributed systems, orchestration engines, AI/ML toolchains, query engines, pipeline technologies i.e., Ray, Spark, Metaflow/Flyte/Airflow/Argo/Kubeflow, Kubernetes. Deep expertise with AI/ML ecosystem tooling such as PyTorch/TensorFlow/Hugging Face, MLOps stack: experiment tracking, feature stores, model registries/lineage (MLFlow, WnB, Feast, etc), and AI/ML-related AWS/GCP managed services (VertexAI, SageMaker, etc) Proven ability to scale AI/ML platforms emphasizing reproducibility and repeatability by integrating data preprocessing, feature access, and model training/evaluation routines through workflow orchestration/pipelines, developer-oriented SDKs, feature/model CI/CD, automated evaluation, safe rollouts, and monitoring Strong collaboration skills across research, engineering, MLOps, infrastructure, and product; ability to translate cross‑functional needs into scalable platform abstractions. Experience designing feature stores or embedding services tightly integrated with training pipelines.
Original job Staff Architect, Data & AI Platform posted on GrabJobs ©. To flag any issues with this job please use the Report Job button on GrabJobs.
Apply Now
Share Job
Share Job

Auto-Apply to Staff Architect Jobs with your AI JobCopilot

thunder icon Auto-Apply with AI

Similar Staff Architect Jobs in India

GrabJobs is the no1 job portal in India, connecting you to thousands of jobs fast! Find the best jobs in India, apply in 1 click and get a job today!

Mobile Apps

Copyright © 2026 Grabjobs Pte.Ltd. All Rights Reserved.