Engineering Manager – Apple Intelligence and Siri Evaluation Tools
At Apple, new ideas have a way of becoming extraordinary products, services, and customer experiences very quickly. Apple is a place where extraordinary people come together to do their life’s best work. Together, we build technologies and experiences people once couldn’t have imagined, and now can’t imagine living without!
The Apple Intelligence team is looking for an Engineering Manager to lead a team focused on the evaluation of next-generation features for Apple Intelligence. This role is at the intersection of software engineering, machine learning, and human-centered design. You will be responsible for developing the tools and infrastructure that allow Apple to assess and optimize the intelligence, responsiveness, and quality of Siri and other Apple Intelligence features before they reach millions of users around the world!
As an Engineering Manager based in Yokohama, you will lead a small team of highly skilled engineers building scalable systems and frameworks for end-to-end (E2E) evaluation of Apple Intelligence products such as Siri. Your work will be critical in validating the performance and reliability of unreleased software and models, ensuring that Siri and Apple Intelligence continue to set the standard for intelligent voice assistants. You will collaborate closely with teams across Apple Intelligence, Siri, Machine Learning, QA, and Product to define evaluation strategies, integrate testing pipelines, and drive improvements in user experience. Your leadership will help shape the technical vision for the team while fostering a culture of collaboration, innovation, and technical excellence.
This role requires a hands-on leader who is comfortable contributing to the codebase and actively supporting the team with technical expertise, in addition to managing day-to-day operations and strategic initiatives.
- 10+ years of professional software engineering experience and 2+ years in a leadership role managing high-performing teams.
- Strong programming background in Python, Swift, or similar languages, with a focus on infrastructure, test automation, or data tooling.
- Demonstrated experience designing and scaling systems for software validation, quality assessment, or ML model evaluation.
- Bachelor’s degree in Computer Science, Engineering, or a related technical field.
- Experience working on voice assistants, NLP systems, or real-time AI-powered applications.
- Familiarity with continuous integration pipelines and E2E evaluation frameworks.
- Proven success collaborating with globally distributed teams in a fast-paced, cross-functional environment.
- Master’s degree or higher in Computer Science, Machine Learning, or a related field.