Site Reliability Engineer, Apple Pay Payments

Santa Clara Valley (Cupertino), California, United States
Software and Services


Weekly Hours: 40
Role Number:200071731
Imagine what you could do here. At Apple, new ideas have a way of becoming phenomenal products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. Apple Payments Engineering is looking for an experienced Site Reliability Engineer who can contribute towards making strategic decisions about scaling, maintainability and reliability of the existing services and then work with the team on hands-on implementation. You would join a team responsible for providing scalability solutions for our ever growing business and data processing needs. Are you interested in solving the most complex and high scale challenges in the world today? Do you like the idea of running critical financial services that are used by millions of people all over the world? Do you want to help change how the world uses their wallet and money? If you love to solve internet scale challenges on critical financial systems then this is the right job for you.

Key Qualifications

  • Experience supporting and scaling large enterprise applications in a critical "Follow-the-sun" environment
  • Interest in designing, analyzing and troubleshooting large-scale distributed systems
  • Acute drive to automate, to constantly replace manual operations with automated solutions
  • Experience and proficiency in event-based and log-based monitoring systems, deployment tools, Citrix Netscalers, Unix, Cloud computing
  • Application development experience in programming languages like Java/J2EE
  • Excellent code-debugging/optimization, analytical problem solving skills
  • Ability to clearly and precisely communicate day-to-day operations to ensure thorough hand-off to the regional SRE teams
  • A desire to constantly learn new technologies and stay abreast of the distributed computing landscape and drive digital transformation
  • Drive high levels of engagement through discovery sessions, enablement and solution positioning with executives and technical partners
  • Computer science fundamentals in object-oriented design, data structures, algorithm design, problem solving, and complexity analysis
  • Consistent track record of taking ownership of challenging problems and successfully delivering results
  • Excellent communication and collaboration skills
  • Excellent problem solving and analytical thinking skills
  • Fast learner who is generous with their knowledge
  • Self-directed, demonstrates leadership potential, and a great teammate
  • Experience in getting the sizing projection and recommendation/Capacity planning
  • Expertise in Container Orchestration systems like Kubernetes and continuous delivery platform like Spinnaker is a plus
  • Experience with implementation of PCI/SOX Security is a plus


Site Reliability Engineering (SRE) is an engineering discipline that combines software and systems engineering to build and run large-scale, massively distributed, fault-tolerant systems. This position requires a highly motivated individual who likes large-scale challenges in a fast paced environment. You will support Payment services to ensure the services are up, work on improving Reliability and availability, will think creatively and come up with innovative solutions to automate the manual support activities.

Education & Experience

BS/MS in Computer Science or Equivalent, 5-15 years of IT experience

Additional Requirements