DevOps Engineer - Cassandra/Solr
Hyderabad, Telangana, India
Software and Services
Do you want to be part of a team that builds cutting-edge software services, a team that is continually innovating and is proud of making a difference? If so, bring your passion and talent and come join us to be part of something big and amazing. Apple's IS&T team is looking for highly motivated and talented DevOps/Site Reliability Engineers (SRE) to build the next generation of software services that power several mission-critical applications.
- Experience in managing large-scale Cassandra and Solr clusters
- Experience in managing data ingestion pipelines for large-scale big data infrastructure
- Expertise in configuration management (such as Ansible, Salt) for deploying, configuring, and managing servers and systems
- Passion for automation, demonstrated by creating tools using Python, Java, or other JVM languages
- Experience deploying and managing CI/CD pipelines
- Experience managing infrastructure in AWS
- Strong experience in managing distributed computing systems, e.g., NoSQL, Cassandra, Hadoop
- Strong expertise in troubleshooting complex production issues
- Expert understanding of Unix/Linux-based operating systems
- Excellent problem solving, critical thinking, and communication skills
- The candidate should be adept at prioritizing multiple issues in a high-pressure environment
- Should be able to understand complex architectures and be comfortable working with multiple teams
- Ability to conduct performance analysis and troubleshoot large-scale distributed systems
- Should be highly proactive with a keen focus on improving the uptime and availability of our mission-critical services
- Comfortable working in a fast-paced environment while continuously evaluating emerging technologies
- The position requires solid knowledge of secure coding practices and experience with open source technologies
- Monitor production, staging, test and development environments for a myriad of applications in an agile and dynamic organization.
You are an independent, self-directed problem-solver who can handle multiple competing priorities and deliver solutions in a timely manner. Provide incident resolution for all technical production issues. Create and maintain accurate, up-to-date documentation reflecting configuration. Write justifications, status reports, and procedures; train users in complex topics; and interact with other Apple staff and management. Provide guidance to improve the stability, security, efficiency, and scalability of systems. Determine future capacity needs and investigate new products and/or features. Strong troubleshooting ability will be used daily: you will take steps on your own to isolate issues and resolve root causes through investigative analysis, including in environments where you have little knowledge, experience, or documentation. Administer the backup systems and ensure they execute properly. Provide 24x7 on-call support to handle urgent critical issues.
Education & Experience
BS in Computer Science with 7-10 years of experience, or MS with 5-7 years of experience, or related experience.
- Experience with Kubernetes, Docker, or other container orchestration frameworks
- Experience with big data technologies - Hadoop, Hive, Spark
- Experience building and operating large-scale search infrastructure
- Experience in workflow and data pipeline orchestration (Oozie, Jenkins, etc.)