Site Reliability Engineering Manager, Storage - Apple Cloud Services

Seattle, Washington, United States
Software and Services

Summary

Posted:
Weekly Hours: 40
Role Number: 200548645
Are you a talented Site Reliability Engineering Manager with a passion for distributed storage systems? Ready to be part of a focused and lively team bringing distributed storage technologies to Apple's infrastructure? At Apple, the scale is huge and the impact is enormous. Join our team and be part of our mission: to power the storage behind many of Apple's most popular services. Bring passion and dedication to your job and there's no limit to what you can achieve!

Key Qualifications

  • Proven expertise in distributed systems
  • Demonstrable success leading engineering teams; ideally SRE or Production Engineering
  • Knowledge of distributed storage (object storage or block storage), or similar large-scale distributed databases
  • Deep knowledge of SRE principles, including monitoring, alerting, error budgets, fault analysis, and other common reliability engineering concepts
  • Experience with Kubernetes, Docker, and containerization
  • Proficiency in at least one of these programming languages: Golang, Java, or Rust

Description

The Storage SRE organization is seeking a strong engineering leader to manage Storage-focused SRE teams, working closely with peer SRE teams and development partners. You'll help build and optimize the Storage stack from the bare metal to the top of the application, helping design provisioning systems, code deployment, monitoring, alerting, and performance improvements. Together with the team, you'll help run the storage used by some of Apple's largest teams.

Education & Experience

BS or MS in Computer Science or equivalent industry experience
