Senior Site Reliability Engineer, Insight BPR
Elk Grove, California, United States
Operations and Supply Chain
Do you want to be part of a group critical to the success of Apple? Are you a Senior Site Reliability Engineer who is passionate about solving hard problems, owning the entire solution and leveraging cutting edge technologies to enable business operations? Do you enjoy creating automation to eliminate toil? Do you excel under pressure? Can you summarize highly complex problems so that others can help you solve them? Do you have rock solid integrity and are the team member people trust and count on? Does everyone turn to you to brainstorm solutions? Do you like gathering evidence to base your decisions off of, but can use your gut, intuition and experience to make quick decisions when necessary? If you smile in the face of pressure, can work independently but are also great team player, we're looking for you!
We are seeking an experienced and highly skilled Senior Site Reliability Engineer (SRE) to join our dynamic team. As a Senior SRE, you will play a crucial role in ensuring the reliability, scalability, and performance of our systems and applications. The ideal candidate will have a deep understanding of cloud technologies, strong problem-solving skills, and a proven track record of implementing and maintaining robust infrastructure.
Description
Shape the next generation of big data solutions by working on the bleeding-edge technologies and solutions for the Insight BPR team. Insight BPR is looking for exceptional engineers to help run, optimize and scale our environment to the next level. Be a member of the team that is responsible for the data collection and reporting for all of Apple’s products around the world. You will operate and scale systems that every iPhone, iPad and Mac have interacted with. Apple’s engineering and operations teams will utilize your systems to build the next insanely great product.
In this role, you will be working with very large-scale, highly-available Big Data platform supporting multi-Petabytes of data with super-linear growth. You must have a “build-to-manage”, problem-solving and innovative mindset applied to the design, build, test, deploy, change and maintenance of enterprise class applications drawing from deep engineering expertise. Key measures of success will include platform stability, effective integration and delivery, instrumentation, release quality, technical debt(toil) reduction, development of automation, risk/security compliance, and sustained advancement of the SRE practice.
As a member of a cross-functional team, you'll have the opportunity to solve challenging big data engineering problems across a broad range of Apple manufacturing services.. You will have hands on experience operating and managing very large scale systems.
Minimum Qualifications
Key Qualifications
- Have a passion for Site Reliability Engineering and a flexible, creative approach to problem solving.
- 5+ years of hands-on experience with one or more programming languages: Java, Python, Node, Go or Ruby
- Full-stack experience. Frontends using Python or Javascript along with frameworks (Flask, ReactJS, Angular, etc) as well as backends using different stacks (PHP Symphony, NodeJS, Express, etc).
- Demonstrated experience with relational databases: MySQL, Postgres, etc
- 3+ years of hands-on experience with Kubernetes
- Experienced professional with a deep experience with cloud providers such as AWS or GCP
- Experience with at least one of these monitoring systems: AppDynamics, Grafana, Kibana, Prometheus, InfluxDB
- Experience with build automation, source control and CI/CD tools (ArgoCD, GitHub, Artifactory, Jenkins, Spinnaker, etc)
- Linux configuration, deployment and troubleshooting
- Excellent problem solving and programming skills; proven technical leadership and communication skills
- Flexibility for travel and work schedules
Preferred Qualifications
Education & Experience
Minimum 5 years in a lead / senior engineer role
BS or MS in Computer Science preferred, equivalent work experience will be considered.
Apple is an Equal Opportunity Employer that is committed to inclusion and diversity! We also take affirmative action to offer employment and advancement opportunities to all applicants, including minorities, women, protected veterans, and individuals with disabilities.
Additional Requirements
- Cloud infrastructure as code experience, e.g., Crossplane, Pulumi, Terraform, CloudFormation, etc.
- Experience with Open API and Microservice architecture
- Experience with configuration management tools such as: Ansible, Chef, Puppet, Salt
- Experience in helping to define service agreements such as: Error budgets, SLOs, SLIs and SLAs
- Excellent problem solving and programming skills; proven technical leadership and communication skills
- Ability to learn new technologies quickly
- Experience with Kafka, Elastic, Druid, Object Storage a strong plus
Pay & Benefits
Apple is an equal opportunity employer that is committed to inclusion and diversity. We take affirmative action to ensure equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.