SRE Engineer, ASE, London
Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish.
Join the Apple Services Engineering team as a site reliability engineer to help support and scale cloud services for thousands of development and operations engineers.
This is a hands-on role to maintain and enhance SRE practices for a private cloud service to accelerate our ability to reliably and consistently deliver thousands of applications.
As a Site Reliability Engineer you will be responsible for providing the platform for mission critical cloud systems to maintain constant uptime, scale seamlessly, and allow for new applications and services to flourish.
The successful candidate will be highly self-motivated with a passion for excellence, quality and detail. The SRE will not only support operations, but also work closely with the developers and architects within the team to aid in the design and assist with the implementation to improve stability, security and scalability.
As an SRE at Apple, you will:
Operate, monitor, and triage all aspects of our production and non-production environments.
Design, build and implement innovative solutions for previous, present and future issues.
Prepare alert handling procedures, runbooks, and collaborate with the off-shore SRE teams.
Automate deployment and orchestration of services into the cloud environment as well as other routine processes.
Actively participate in capacity planning, scale testing, and disaster recovery exercises.
Interact with and support partner teams, including engineering, QA, and program management.
Cultivate and maintain relationships with internal and external third-party vendors.
- In depth experience in a Site Reliability Engineering, DevOps, or Infrastructure focused role
- Must be an expert and have in-depth professional experience with cloud operations, with a focus on "infrastructure-as-a-service" (compute, storage, and network virtualization)
- Proficient in Java with a good understanding of GoLang and Python
- Experience operating large-scale multi-tenant Infrastructure as a Managed service
- Familiarity with cloud infrastructure concepts (zones, regions, VPCs, etc)
- Automation advocate - you truly believe in removing operational load via software.
- A strong sense of ownership. At the same time, you're a great teammate who communicates clearly and transparently - Self-motivated, inquisitive, and always looking to learn more.
- Experience managing, scaling, and troubleshooting Java and GoLang applications
- Be capable of collaborating and coordinating with multiple distinct engineering teams and mentoring others