Lead Site Reliability Engineer - RAD BI
Imagine what you could do here. At Apple, great ideas have a way of becoming great products, services, and customer experiences very quickly. Bring passion and dedication to your job and there's no telling what you could accomplish. The people here at Apple don’t just craft products - they build the kind of wonder that’s revolutionized entire industries. It’s the diversity of those people and their ideas that inspires the innovation that runs through everything we do, from amazing technology to industry-leading environmental efforts. Join Apple and help us leave the world better than we found it.
WW Business Process Re-Engineering (BPR) RAD team is looking for an exceptional and talented Site Reliability Engineer (SRE) to join our team to bring passion for infrastructure and distributed systems to build world-class data platforms/products across multi-cloud environments.
You like to automate anything which you do and you document it for the benefit of others. You are an independent problem-solver who is self-directed and capable of exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner. Provide incident resolution for all technical production issues. Create and maintain accurate, up-to-date documentation reflecting configuration, and responsible for writing justifications, training users in complex topics, writing status reports, documenting procedures, and interacting with other Apple staff and management. Provide guidance to improve the stability, security, efficiency, and scalability of systems. Determine future needs for capacity and investigate new products and/or features.
Strong troubleshooting ability will be used daily; will take steps on their own to isolate issues and resolve root causes through investigative analysis in environments where the candidate has little knowledge/experience/documentation. Administer and ensure the proper execution of the backup systems. Provide 24x7 on-call support to handle urgent critical issues.
We are dedicated to the goal of building a culturally diverse and pluralistic team that reflects the multicultural variety of our customers
- Strong sense of ownership, customer service, and integrity demonstrated through clear communication.
- 7+ years experience with deploying, supporting and monitoring new and existing services, platforms, and application stacks.
- Proficiency with Python, Bash scripts, GO, REST APIS, and any object oriented programming a must.
- Proficiency in using monitoring and observability tools such as Prometheus, Grafana, Splunk, etc.
- Extensive knowledge with Enterprise Linux (RHEL).
- Understanding of AWS and Kubernetes concepts.
- Experience with automation and configuration management (Ansible, Puppet, SALT)
- Superb problem-solving skills, and ability to thrive in a fast-paced and dynamic environment
- Bachelor’s degree in computer science, mathematics or relatable field or equivalent work experience.
- 2+ years of people leadership or team lead experience is a plus
- Experience with Linux based server virtualization (KVM, containers).
- Experience with CI/CD, unit testing and version control systems (GIT).
- Deep understanding of MySQL server administration or similar relational databases.
- Knowledge of IPv6, DNS, DHCP is a plus.
- Good understanding of standard networking protocols and components such as HTTP, DNS, TCP/IP and load-balancing.
- Good understanding of basic security concepts and protocols - authentication, authorization, signing, encryption, SSL/TLS, SSH/SFTP, PKI, etc.
- Experience working closely with global teams.
- Experience in DevOps/SRE in production environment.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.