Natural Language Generative Modeling Research Engineer - SIML, ISE

Cupertino, California, United States
Machine Learning and AI


Weekly Hours: 40
Role Number: 200489593
Are you excited about Generative AI and Large Language Models? Are you interested in working on cutting-edge generative modeling technologies that enrich the lives of billions of people? We are looking for senior research engineers experienced in training, adapting, and deploying large-scale ML models, with a focus on natural language and multimodal understanding and generation. You will invent and ship the next generation of core technologies that enable system-wide experiences.

The mission of the Robustness & Safety AI team is to develop core technologies that ensure safety on Apple’s devices and services. This includes technologies that ensure communication safety for our users and that ensure ML models and features built on top of generative AI are robust and safe. The team comprises domain experts in NLP, Computer Vision, and ML Fairness who contribute to significant system-wide safety experiences. Our team provides an opportunity to be part of an incredible research and engineering organization focusing on System Intelligent Machine Learning (SIML) within Apple.

An ideal candidate will have solid ML fundamentals and the ability to turn research contributions into products. Proven experience in Conversation Understanding and Synthesis, Reinforcement and Human Preference Learning, and Large Language Model Training is essential for this role.

SELECTED REFERENCES TO OUR TEAM’S WORK: Communication Safety & Sensitive Content Warning

Key Qualifications

  • 3 - 5+ years of experience in Machine Learning and NLP fundamentals
  • Hands-on experience training LLMs
  • Experience adapting pre-trained LLMs for downstream tasks & human alignment
  • Familiarity with distributed training
  • Proficiency in using ML toolkits, e.g., PyTorch
  • Strong programming skills in Python, C and C++
  • Awareness of the challenges associated with transitioning a prototype into a final product


We are seeking a candidate with a proven track record in applied ML research. Responsibilities in the role will include training large-scale language and multimodal models on distributed backends, deploying compact neural architectures such as transformers efficiently on device, and learning policies that can be personalized to the user in a privacy-preserving manner. Ensuring quality, with an emphasis on fairness and model robustness, will be an important part of the role. You will work closely and cross-functionally with ML researchers, software engineers, and hardware and design teams. Your primary responsibilities will center on enriching conversation understanding capabilities through LLMs and multimodal models. The user experience initiative will focus on enriching the system safety experience.

Education & Experience

M.S. or PhD in Electrical Engineering, Computer Science, or a related field (e.g., mathematics, physics, or computer engineering) with a focus on computer vision and/or machine learning, or comparable professional experience.

