DevOps Engineer (Big Data Platform), Enterprise Systems

Sunnyvale, California, United States
Software and Services


Role Number:200515456
Apple’s Applied Machine Learning team has built platforms for a number of large-scale data science applications. We work on many high-impact projects that serve various Apple lines of business. We use the latest in open source technology and as committers on some of these projects, we are pushing the envelope. Working with multiple lines of business, we manage many streams of Apple-scale data. We bring it all together and extract the value. We do all this with an exceptional group of software engineers, data scientists, SRE/DevOps engineers, and managers. We are looking for a talented and dedicated senior engineer to join our team to bring passion for infrastructure and distributed systems to build world-class data platforms/products at a very large scale across cloud environments.

Key Qualifications

  • Experience architecting, building, monitoring, and operating large-scale complex data processing systems in Java/Scala like Spark, Flink, and Kafka in public clouds like AWS and GCP.
  • In-depth knowledge and experience in one or more large-scale distributed technologies including but not limited to: Hadoop ecosystem, Beam, Kafka, Samza, Flink, Storm, Flume, HBase, Cassandra, Redshift, Vertica, and Spark.
  • Experience in big data storage platforms and query engines with knowledge of cutting-edge technologies like Trino, Hive, Iceberg, Delta Lake, and Hudi.
  • Experience with containers and container orchestration platforms such as Docker and Kubernetes.
  • Strong proficiency with Helm and Kustomize for managing Kubernetes applications and configurations.
  • Passionate about operational excellence through proper automation and engineering processes using programming languages Go, Python, Java, or other JVM languages
  • Proficient in working with Linux or other POSIX operating systems, shell scripting, and networking technologies.
  • Proficient in best practices and enforcement for data security and data governance
  • Should be highly proactive with a keen focus on improving the uptime availability of our mission-critical services
  • Excellent verbal and written communication skills, able to collaborate cross-functionally with program managers and engineering partners
  • Comfortable working in a fast-paced environment while continuously evaluating new technologies


You like to automate anything which you do and you document it for the benefit of others. You are an independent problem-solver who is self-directed and capable of exhibiting deftness to handle multiple simultaneous competing priorities and deliver solutions in a timely manner. Provide incident resolution for all technical production issues. Create and maintain accurate, up-to-date documentation reflecting configuration, and responsible for writing justifications, training users in complex topics, writing status reports, documenting procedures, and interacting with other Apple staff and management. Provide guidance to improve the stability, security, efficiency, and scalability of systems. Determine future needs for capacity and investigate new products and/or features. Strong troubleshooting ability will be used daily; will take steps on their own to isolate issues and resolve root causes through investigative analysis in environments where the candidate has little knowledge/experience/documentation. Administer and ensure the proper execution of the backup systems. Provide 24x7 on-call support to handle urgent critical issues. We are dedicated to the goal of building a culturally diverse and pluralistic team that reflects the multicultural variety of our customers.

Education & Experience

BS in computer science with 7-10 years or MS plus 5-7 years experience or related experience.

Additional Requirements

  • Experience using data storage technologies such as Apache Parquet or Avro Experience in machine learning algorithms is a plus.
  • Validated software engineering experience and field in design, testing, source code management, and CI/CD practices.
  • Position yourself as a go-to consultative resource and solution expert for Data Engineers and analysts.
  • Adaptable to prioritizing multiple issues in a high-pressure environment
  • Bonus: Experience with technologies Redis, Solr
  • Bonus: Design, implementation, and benchmarking of ML/deep learning algorithms

Pay & Benefits