AI/ML - Operations Engineer/Production Services Engineer

Santa Clara Valley (Cupertino), California, United States
Software and Services


Role Number:200185632
Play a part in ensuring the quality and delivery of groundbreaking technology for large scale systems, natural language, big data, and artificial intelligence to end users. You will be interacting with various teams of cross functional engineering and project management to help identify issues, troubleshoot build and infrastructure failures, ensure frictionless delivery of features and make lives of our Artificial Intelligence and Machine Learning organization better. Work with the people who created the intelligent assistant that helps millions of people get things done — just by asking. Join the Siri Pipeline Ops team at Apple.

Key Qualifications

  • 6+ years in roles as DevOps, SRE, SDET or TE/QE
  • Experience in one or more scripting language such as Python
  • Working familiarity with one or more object-oriented programming language (Java preferred)
  • Moderate meetings for root-cause analysis and post-mortems
  • Troubleshooting and debugging skills: Analyze and investigate test failures, errors, and build issues
  • Experience with troubleshooting tools or languages such as splunk, GitHub, troubleshooting java exceptions
  • Team City or Jenkins Experience deploying micro-architecture applications into cloud-based computing service
  • Experience working in a continuous delivery pipeline using job scheduling tools
  • Expert knowledge of Linux OS and shell languages such as BASH
  • An outgoing and positive attitude
  • Excellent verbal and written communications skills


Pipeline Operations team is tasked with ensuring keeping the pipelines flowing, troubleshooting automated tests and release of defect-free code to users. As a result we need to keep our server and client pipelines operating at full speed with minimal downtime. As a Pipeline Ops Engineer, you will be in charge of analyzing test failures and ensuring issues are resolved quickly. You will track code changes and communicate build pipeline status on a regular basis. You will run root cause analysis of failures which impacted developer agility by running post mortems. You will also create automation, monitor telemetry and address alerts to ensure smooth operation of the pipelines. You will serve as a full time, primary on-call, responding and mobilizing efforts to address outages. Excellent communication skills will be required to coordinate work across multiple teams to resolve issues. Your strategic goals will be to automate operational tasks and identify process or technology gaps.

Education & Experience

BS or MS Degree in Computer Science (or equivalent work experience - 6 years or 8 years respectively)

Additional Requirements