Senior System engineer - Cloud infrastructure
Imagine what you could do here. At Apple, new ideas have a way of becoming extraordinary products very quickly. Bring passion and dedication to your job, and there's no telling what we can accomplish together. We're looking for a hardworking and passionate person to join this amazing team, and if you feel this is you, we'd love to hear from you!
The Apple Services Engineering (ASE) organisation is responsible for building powerful platforms that enable engineers to deliver incredible experiences to customers.
Our compute team is responsible for designing and building the foundational pieces of our in-house cloud technologies. In this role, you will collaborate with teams across Apple to deliver forward-looking high-performance distributed storage solutions for various cloud platforms. The successful candidate is highly motivated individual with strong technical and communication skills to create intuitive user experiences, who is passionate about quality, and pays attention to details that surprise and delight our customers.
Join this team and you will design, implement, and run services and tools for infrastructure management such as SaltStack, Postgres, Zookeeper, DHCP, DNS, etc, alongside internally developed services. At the same time, as a member of the Simcloud team, they will also engage with platform customers and will support platform issues in the local timezone.
In this role you will be responsible for developing, debugging and maintaining software to operate a large compute fleet. You will:
- Closely collaborate with teams within and across organizations to support their workflows or integrate their technology into our platform
- Automate operations processes via services and tools
- Develop within configuration management and fleet orchestration via SaltStack, Ansible, Puppet, or others
- Design, implement, and maintain robust, scalable, and highly available services that support infrastructure management
- Monitor on-server system performance, identify bottlenecks, and implement solutions to enhance efficiency
- Conduct root cause analysis for on-server system failures and implement preventive measures
- Write and review code, generate and review design documentation
- Participate in qualifications and rollouts of software to production clusters
- Participate in a business-hours rotation where engineers respond to platform issues for same-day resolution
- BS with 5+ years of experience in system software development or MS in Computer Science
- Strong systems programming skills and knowledge of operating system fundamentals
- Fluency in Go (Golang), Python, or similar languages in a systems context
- Familiarity with the mechanics behind infrastructure management
- Customer-focused thinking and strong problem solving with attention to detail
- Experience with large scale server provisioning and maintenance
- Familiarity with node management systems like SaltStack, Ansible, Puppet
- Strong proficiency in Linux/Unix internals, administration, and troubleshooting
- Operational knowledge of Kubernetes clusters
- Strong knowledge of networking protocols and security standard methodologies
- Experience with measuring, analyzing, and optimizing performance
- Ability to effectively communicate within a team and with project collaborators
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.