Job Details
Sr System Reliability Engineer (Application Support) Job Location: - Pune Experience: - 4-6years Skillset required: Application Support Roles and Responsibilities: Provide L2 support to production system like application, database, middleware components, infrastructure and network components Manage productions incidents end-to-end within defined SLAs with focus on resolution rather than who caused it Interact with various stake holders such as Release managers, program leads, service managers, development and test leads Review operational readiness requirements such as monitoring and alerting, log rotation and resilience of the components and report the gaps Provide pre-implementation support with activities such as release notes review and implementation dry runs Protect production components by running health checks, monitoring latency and memory utilisation Automate day to day activities and propose changes that improves reliability Participate in CAB and provide feedback on change requests Support the DevOps team in testing the promote pipelines and suggest automation of configuration items Practice incident management best practises and perform RCA.
Participate in disaster recovery tests and operational acceptance tests Analyse the technology stack that makes up the product and optimize recovery time objective Work with team members spread across and time zones Share knowledge, document improvements and mentor junior resources Skills: ITIL certified Good exposure to change management, incident management and problem management processes is a must Experience of BMC Remedy is preferred Good understanding of deployment methodologies like Blue/Green and network configuration is a must Experience with supporting Java based applications is a must Experience with springboot is an added advantage.
Experience in supporting APIs, micro service based architecture and containerised environments is preferred Experience with any of the monitoring tools such as Dynatrace, Appdynamics, Splunk, ELK, Nagios or Grafana is a must Experience in basic SQL scripting and data extraction is a must Experience in running batch jobs using schedulers like Tivoli or Control M is required Experience with file systems is a must Have an automation mind set and look out for automation opportunities Experience with running promote pipelines on Jenkins is a must Experience of using artefact management tools like JFrog or Nexus is required Understanding of Ansible or Chef or Puppet is an added advantage.
Should have an understanding of release cycles Should have a systematic problem-solving approach Should have strong communication skills to deal with stake holders across the globe Ability to own the issues and drive it to closure Be a team player and work cordially with all the colleagues and vendor team members Ability to debug and optimize code and config along with day-to-day task automation Experience in handling challenging situations and make pragmatic decisions Experience supporting systems on cloud platforms like AWS or Azure is an added advantage Have keen interest in analyzing and troubleshooting large-scale distributed systems Ability to build cordial working relationship with all other functions