Site Reliability Engineer - Build Services

San Diego, California, United States
Software and Services

Summary

Posted:
Role Number:200284646
The Build Services Site Reliability Engineer (SRE) role resides within the Software Delivery organization, which is at the core of the Apple software release process. This role is responsible for creating and maintaining services, which are used by software engineers to compile source and assemble Apple software for delivery to customers around the world. The Build Services team drives reliability and performance engineering, instruments observability of services, responds to incident alerts, and reports on SLI/SLO metrics for visibility across the organization. This SRE role is essential in maintaining cloud-based services that are responsible for delivering the state-of-the-art operating systems (macOS, iOS, watchOS, tvOS) and more to Apple customers.

Key Qualifications

  • Strong passion in building reliable, scalable, and performant cloud-based services
  • Provisioning and maintaining distributed systems using Kubernetes and containers
  • Strong understanding of distributed systems w.r.t. storage, networking, and security SRE or Dev/Ops experience in managing customer-facing systems in 24/7 environment
  • Excellent judgment and integrity with ability to make timely and sound decisions
  • Ability to anticipate the needs of others and adapt to changing conditions
  • Excellent verbal and written communication skills

Description

As a Build Services Site Reliability Engineer, responsibilities include: - Configuration management and integrating container platforms - Automating continuous integration (CI) and continuous delivery (CD) pipelines - Analyze requirements and migrate legacy workloads to containerized solutions - Develop and deploy reliable, large-scale, services across multiple technology stacks - Implement observability of system for monitoring, alerting, and metrics reporting - Generate reports regarding service metrics on performance, availability, and reliability - Provide feedback and insights of metrics and data analysis to engineering teams - Champion practices regarding change control management and incident response A successful Build Services Site Reliability Engineer will be expected to: - Proactive communication of status to stakeholders and follow through on time-sensitive tasks - Demonstrate willingness to ask for clarification and increase awareness of the larger context - Explore multiple solutions to a problem, evaluate tradeoffs, then execute best approach - Communicate asynchronously with a global team across multiple timezones - Document new processes or update existing documentation pages - Eager to learn about Apple software products and schedules

Education & Experience

Additional Requirements