As a senior researcher, you will invent and develop the next generation of AI based multimodal image, audio, and speech technologies. With an in-depth understanding of the latest AI technologies and a good understanding of audio/video technologies, you will explore the applications of AI to the delivery, analysis, and creation of multimedia technologies including video and audio enhancement, analysis, classification, and separation. You will define and lead research initiatives that leverage cross-media intelligence and analytics, invent multimodal machine learning algorithms, and utilize a deep understanding of multimodal perception to develop the next generation of multimedia technologies. As a leader, you will direct and mentor a group of AI researchers working in the application of AI to multimodal analysis, processing, playback, and enhancement technologies. You will work with your team as a coach and mentor. You are passionate about developing junior, highly talented staff into researchers that work fully independently in a corporate environment. You work with ATG technology leaders to co-define projects and assign your staff to global R&D initiatives led by other technology initiative leads. Work jointly with upper management, lead resource and work allocation. Contribute to developing a dynamic, flexible, transparent, results-oriented and innovative working atmosphere. Ph.D. plus 8 years of corporate research experience with a degree in Physics, Electrical Engineering, Mathematics, Computer Science with a strong focus on AI. You are an absolute top expert in AI with a deep and thorough theoretical understanding of the latest state-of the art AI technologies. You have a detailed understanding of all main network architectures, deployment modes, data augmentation and preparation, and theoretical performance analysis of model architectures. Knowledge of NLP and/or multi-modal architectures is highly desired. Diffusion, autoregressive, or other generative models. Self-supervised, contrastive learning, auto-encoders. Audio, image, or text applications - Source separation, text-to-speech, music synthesis, image segmentation, image captioning, question answering, language models, etc. Multimodal architectures and algorithms. You have a track record of successfully applying AI technologies to multimodal problems including the combination of audio and video technologies. You have deep knowledge of GPU/CPU implementations, algorithm validation and testing, implementation of ML/AI algorithms. Strong track of inventing, developing and productizing novel AI based technologies in an industrial research environment. Strong publication record, with publications in major machine learning conferences (e.g. NeurIPS, ICLR, ICML, etc.). Strong background in statistical signal processing, decision theory, greedy algorithms, Bayesian modelling, random algorithms, time series, hypothesis testing, classification, clustering, hypothesis testing and multilinear regression analysis. Experience with audio and video processing is a plus. Highly skilled in C/C++, Python, TensorFlow or PyTorch. Experience in managing, guiding and mentoring younger researchers. Team-oriented work ethic and interest to work in cross-continental teams. Excellent communication, collaboration, and presentation skills in English.
All Job Ads are subject to GrabJobs’s Terms of Service. We allow users to flag postings that may be in violation of those terms. Job Ads may also be flagged by GrabJobs moderation team. However, no moderation system is perfect, and flagging a posting does not ensure that it will be removed.
Be the first to receive the latest Others Full-Time Jobs in India.
Setup your job alert:
By activating job alerts, I agree to GrabJobs Terms & Privacy Policy. I can unsubscribe to job alerts anytime.
Skip