Architect solutions to facilitate ML training, inference, and other experimentation. Collaborate with research, development, and engineering to establish machine-learning and data-management workflows, along with supporting tools and processes, that maximize machine-learning throughput and resource utilization. Improve data set exploration, transformation, and overall data management for large to very large datasets. Collaborate with research and development to proactively iterate on and fine-tune model training for best performance and efficient use of machine-learning resources. Collaborate with infrastructure teams and with physical compute, storage, and network experts to improve on-premises and cloud infrastructure. Troubleshoot high-performance computing, storage, and networks for machine-learning workloads. Improve the use of cloud compute and storage for global research teams and manage it within budget.

BS or MS degree in Computer Science or equivalent experience. 4+ years of professional, hands-on experience in machine learning operations or equivalent. Comprehensive knowledge of AWS and infrastructure-as-code techniques. Advanced proficiency with Python, Terraform, CloudFormation, Ansible, Git, and related tools. Experience with machine learning and scaling workloads in both cloud and on-premises GPU server environments. Experience managing and coordinating storage of large machine-learning data sets. Proficiency in Kubernetes cluster design, deployment, and management. Interest in and understanding of industry trends in machine-learning development techniques, tools, and processes. Comprehensive knowledge of continuous integration and continuous release processes and tools. Experience with Conda and Python. Experience with Ray cluster design, setup, provisioning, and monitoring for high availability. Experience with MLflow or similar. Experience with high-performance file systems (Lustre, BeeGFS, Weka, or similar).