Application Reliability Engineer - GBI

Hyderabad, Telangana, India
Software and Services

Summary

Posted:
Weekly Hours: 40
Role Number: 200377057
Apple’s Application Reliability Engineering (ARE) team is looking for an extraordinary engineer to improve the health of Apple’s Business Intelligence (BI) systems. The role drives root cause analysis and permanent fixes for critical and repetitive production issues by bringing the Engineering and Support teams together, and helps build tools that improve the efficiency of the support teams.

Apple's BI landscape caters to a wide variety of real-time, near-real-time and batch analytical solutions. These solutions are an integral part of business functions like Sales, Operations, Finance, AppleCare, and Marketing, enabling business drivers to make critical decisions. The landscape uses a diverse technology stack that includes AWS, Snowflake, Teradata, HANA, Vertica, Hadoop, Kafka, Spark, Cassandra and beyond.

The ARE team is a focussed group that owns overall application stability. It is responsible for root cause analysis (RCA) and permanent fixes for critical issues, and for improving Support team productivity by building automation tools. This position interfaces regularly with Application teams, the Production Support team, System Engineers, Network Engineers and DBAs. The role also involves creating dashboards that analyse trends in system health, which help define the roadmap for ARE initiatives.

We are looking for an energetic and seasoned engineer to contribute to the support of several key systems. You should have excellent written and verbal communication skills, along with knowledge of and experience with incident and problem management. This is an extremely fast-paced and highly demanding environment. If you have the determination, we would like to talk to you.

Key Qualifications

  • 5-10 years of relevant experience in application development and support for on-premises and cloud-based enterprise data warehouse applications
  • Experience with two or more database technologies such as Snowflake, HANA, Oracle, Vertica, Cassandra, SingleStore, and Teradata
  • Experience building and supporting cloud-native applications (AWS preferred)
  • Good programming skills in at least two of the following languages: Java, Python, Scala
  • Experience with visualisation tools such as ThoughtSpot and Tableau
  • Ability to write complex SQL queries
  • Hands-on experience with Unix, Linux, shell scripting, AutoSys, and Splunk
  • Hands-on experience with development-to-production processes, including testing, version control tools like Git/SVN, continuous integration, and deployments
  • Knowledge of messaging systems such as Kafka, RabbitMQ, Solace, and Stratos
  • In-depth knowledge of Spark, Kubernetes, Docker, and Redis is a huge plus
  • Experience with Apache Airflow is desirable
  • Strong verbal and written communication skills and the ability to coordinate with multiple technical and functional users
  • Determination, the ability to multitask, and attention to detail are must-haves

Description

  • Identify patterns and opportunities to make applications more stable through automation, code changes, etc.
  • Build dashboards to provide insights on incident inflow trends
  • Conduct postmortem (follow-up) meetings for critical issues to engage relevant partners such as the Application team, Production Support, and Infrastructure teams
  • Perform root cause analysis of critical and repetitive production issues, then plan and implement the corresponding resolution steps
  • Perform code fixes for issue resolution and production deployments with minimum supervision
  • Present and communicate incident inflow trends and required actions clearly to the Application teams
  • Automate manual support processes to improve support efficiency

Education & Experience

B.E./B.Tech. degree or higher in a related field

Additional Requirements