Job Description:
- 8+ years experience in application support, Middleware support and production support with strong troubleshooting capabilities
- Experience with using ITIL change, incident and problem management processes
- Resolve critical production issues by leading major incident calls , engaging proper teams and driving root cause analysis
- Solve and debug system components to resolve technical issues in complex and highly regulated environments comprised of on-prem and cloud applications and services
- Hands on experience with monitoring and alerting processes in distributed, cloud and microservices based environments
- Collaborate both within the team and across teams to resolve application issues and advance as needed
- Create, modify and monitor dashboards to better catch potential issues and aide in observability
- Support weekly and weekend on-call rotation as scheduled
- Good verbal and interpersonal skills , communicating openly with the team members and stakeholders
- Solid ability to gather and analyze project requirements and translate them into technical specifications and own the implementation
- Resolve critical application alerts in a timely fashion including production issues across the environment ingress/egress points
Technical qualifications:
- Hands on experience in Unix/Linux, windows
- Hands on experience with Middleware technologies (Apache, Client, WAS , MQ , Tomcat, Jboss etc)
- Hands on experience with Devops and tools
- Familiarity working with relational/non-relational databases
- Solid understanding and hands on experience with Azure cloud
- Hands on experience with Cloud technologies - Redhat Openshift , Kubernetes , AKS, EKS cluster administration and build
- Experience deploying applications on Websphere Application Server/Tomcat/Jboss or Cloud
- Experience with monitoring tools – Splunk, AppDynamics, Dynatrace
- Infrastructure build engineering experience – middleware , openshift , Azure cloud
- Good to have understanding and experience with Ansible , Terraform , Bitbucket
- Good understanding of network topologies and troubleshooting network related issues
- Hands on experience with triaging application issues, network issues and OS performance issues.