Computational support for Machine Learning and LSTM
Santa Clara Valley (Cupertino), California, United States
Software and Services
Are you a big-picture thinker who loves setting ambitious goals? Do you have a passion for understanding how each line of code affects all the others? In the Core Operating Systems group ensuring the OS is inseparable from each device’s identity as a whole. That’s because this group is committed to building fully integrated operating systems that combine hardware, software, and apps into a single Apple experience. Your dedication to cross-disciplinary collaboration will help develop groundbreaking technologies, such as iOS, macOS, watchOS, and tvOS. By crafting these distinct, holistic user experiences, you’ll continue to uphold and advance the excellence people expect from Apple devices. The Vector and Numerics Group is tasked with designing, enhancing and improving various subsystems running on iOS, macOS, watchOs and tvOS. Most of this support is encapsulated in the Accelerate framework, a well used library serving many technologies. The group is looking for an exceptional high performance programmer to complement the team and make a difference. As a member of our fast paced group, you will have the unique opportunity to delight and inspire millions of Apple’s customers every day. You will work in a cross functional team which is implementing innovative and state of the art routines to support the necessary computation for such things as machine learning and vision algorithms. Push the state of the art in low level computation and drive them towards energy efficient and high performance implementations by tightly integrating software and hardware. The successful candidate will have excellent understanding and knowledge of micro architecture of NEON on ARM and AVX on Intel CPU cores from a vector programming perspective. Team members are engaged in the design and optimization of low level computational support for machine learning and computational vision such as 2D Multi-layered convolution, LSTM in hand writing recognition, FFT and DFT for audio codecs, speech synthesis and encryption to mention a few technologies. Understanding and working knowledge of cryptographic algorithms such as AES and SHA is a desirable attribute. Being able to craft the fastest and the most energy efficient routines for a particular CPU core is a plus. Low level high performance programming experience is a must for this position. Being comfortable in vector assembly and low level C is a requirement. The ideal candidate would be at ease in developing both innovative and robust CPU core level algorithms derived from a particular technology’s need in a tight deadline .
- 5+ years low level algorithmic development in Machine Learning computation, LSTM, BLAS and DFT.
- Detailed knowledge of vector Instruction Set Architectures (ISA) of ARM and Intel.
- Strong understanding of computational efficiency.
- Excellent coding skills in ASM and C.
- Strong verbal and written communication skills.
- Ability to manage multiple tasks and self-prioritize.
- Ability to work with cross functional teams in compression related components.
Design and implement micro architecturally optimized pieces of Accelerate framework taking into consideration power and energy usage.
Education & Experience
BS/MSEE in mathematics, computer science or computer engineering required, advanced degree preferred.