Siri - Site Reliability Engineering Manager / Lead
London, Greater London, United Kingdom
Software and Services
Siri is fast becoming the best way to get things done by talking to our devices. Billions of Siri requests are made each and every day to send a message, play your favourite song, or just ask a question. Now imagine what could you do here! We are now offering an amazing opportunity to work with the teams building the most scalable big-data systems in existence. We make sure Apple's most innovative services are always online. Build groundbreaking technology for algorithmic search, machine learning, natural language processing, and artificial intelligence.
- Linux systems in a large scale 24/7 production environment
- Proficiency programming in Python, Ruby or Go
- Working knowledge of multi-tier applications and their dependencies including load balancing, TCP/IP networking, web services, LDAP and DNS.
- Familiarity with Kubernetes and containerizers like docker, rkt, and crio.
- Monitoring (Nagios and Graphite or Prometheus highly preferred)
- Automation for systems and application management.
- Configuration managers such as Puppet, Chef or Ansible.
- Web server administration (Apache and Nginx preferred)
- Database support and administration (Postgres, MySQL and HBase)
- Experience with cloud providers such as AWS, GCP and Azure.
We are looking for a working manager, who will perform hands-on duties and ramp up our team in London. This position involves operating a global Internet service with many interdependent moving parts. The team is exposed to every aspect of the service, both internal and public facing where every change has the potential to affect hundreds of millions of Apple users. You must have a passion for automating tasks large and small, discern signal from the noise and have desire to squeeze every ounce of performance from available resources. Working closely with engineering and product managers, individuals in this role must excel under pressure and handle unpredictable interruptions. As part of this team, you will: - Monitor production, staging, test and development environments for a myriad of applications in an agile and dynamic organization. - Strong troubleshooting ability will be used daily; a successful Engineer will take steps on his or her own to isolate issues and resolve root cause through investigative analysis. - The Engineer should be an independent problem-solver who is focused and capable of exhibiting deftness to prioritize multiple simultaneous competing priorities and deliver solutions in a timely manner. - Provide incident resolution for all technical production issues. - Build and maintain accurate, up-to-date documentation reflecting configuration. - Be responsible for writing justifications, training users in complex topics, writing status reports and interacting with other Apple staff and management. - Strive to improve the stability, security, efficiency and scalability of all production systems. - Resolve future needs for capacity and investigate new products and/or features. - Design, implement, and extend automation tools to improve internal and external team efficiency. You will also be the point person for hiring several people to grow into a team, and be the front-line manager for our London SRE function. The position will require rotating day, night and weekend shifts.
Education & Experience
BS in Computer Science or equivalent program preferred